JP7208254B2

JP7208254B2 - Work optimization system and work optimization device

Info

Publication number: JP7208254B2
Application number: JP2020557092A
Authority: JP
Inventors: 誠也伊藤; 聡笹谷; 秀行粂; 亮祐三木
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2018-11-21
Filing date: 2018-11-21
Publication date: 2023-01-18
Anticipated expiration: 2038-11-21
Also published as: CN113016000A; JPWO2020105157A1; WO2020105157A1

Description

本発明は作業最適化システムおよび作業最適化装置に関し、例えば機器と作業者とが協働して行う作業（協働作業）を最適化するための作業最適化システムおよび作業最適化装置に適用して好適なものである。 The present invention relates to a work optimization system and a work optimization device, and is applied, for example, to a work optimization system and a work optimization device for optimizing work (collaborative work) performed by a device and a worker in cooperation. It is suitable for

機器と作業者とが協働する環境で作業しながら、所定のタスクを実行するケースは、ロボット技術等の発展、労働力不足による自動化のニーズなどによって増加傾向になる。このような状況においては、熟練度の高い作業者に合わせた機器の制御によって使いこなすという点と、最適化された機器に対して作業者が整合する行動を取るという点とが求められる。 The number of cases in which predetermined tasks are executed while working in an environment where equipment and workers cooperate is on the rise due to the development of robot technology and the need for automation due to labor shortages. In such a situation, it is required that the equipment should be mastered by a highly skilled worker by controlling the equipment, and that the operator should act in accordance with the optimized equipment.

前者については、熟練度の高い作業者の行動は、未熟者には習得が困難であることが多く、熟練者と同様の効率が期待できないことが考えられ得る。後者については、機器の動作に作業者の行動を合わせるということは、機器側に合わせることによる負荷が懸念され、集中力の低下などによる歩留まりの低下、体力面への負荷、精神面への負荷などが問題となる。 As for the former, the actions of highly skilled workers are often difficult for unskilled workers to master, and it is conceivable that the same efficiency as skilled workers cannot be expected. Regarding the latter, there is concern about the burden of matching the behavior of the operator to the operation of the equipment, and the decrease in yield due to the decrease in concentration, etc., the physical load, and the mental load. etc. is a problem.

この点、人に対して適切な補助を行うロボットの制御方法が開示されている（特許文献１参照）。 In this respect, a robot control method for appropriately assisting a person has been disclosed (see Patent Document 1).

特開２０１７－３０１３７号公報JP 2017-30137 A

特許文献１に記載の技術では、作業者に負荷のかからないようにロボットを制御することで、前述の問題の一部は解決されるものの、作業者の負荷が減るにとどまり、作業者の行動が最適とならない問題がある。 In the technique described in Patent Document 1, by controlling the robot so as not to impose a burden on the worker, a part of the above-mentioned problems can be solved, but the burden on the worker is reduced, and the worker's behavior is slowed down. There is a suboptimal problem.

本発明は以上の点を考慮してなされたもので、機器の効率を最大化しつつ、作業者の行動を制御し得る作業最適化システム等を提案しようとするものである。 The present invention has been made in consideration of the above points, and is intended to propose a work optimization system or the like capable of controlling the actions of workers while maximizing the efficiency of equipment.

かかる課題を解決するため本発明においては、機器の機器制御パタンと前記機器制御パタンのパラメータとに基づいて前記機器の制御を行う機器制御部と、前記機器制御部により前記機器の制御が行われている際、センサにより取得された前記機器と協働して作業を行う作業者のセンサ情報から、前記作業者の作業を計測する作業者計測部と、前記作業者計測部で計測された作業を所定の目標値に基づいて評価する評価部と、前記評価部による評価がしきい値より高いか否かを判定し、高いと判定した場合、高いと判定した前記機器制御パタンのパラメータを含む行動制御パタンを生成する行動制御パタン生成部と、を設けるようにした。 In order to solve such a problem, in the present invention, a device control unit that controls the device based on a device control pattern of the device and parameters of the device control pattern, and the device control unit controls the device. a worker measurement unit that measures the work of the worker from sensor information of a worker who works in cooperation with the device when the device is being used, and the work measured by the worker measurement unit. based on a predetermined target value; and an evaluation unit that determines whether the evaluation by the evaluation unit is higher than a threshold value. and a behavior control pattern generator for generating behavior control patterns.

上記構成によれば、機器制御パタンに基づいて行動制御パタンが生成される。つまり、機器制御に基づいて行動制御が行われるようになるので、例えば、熟練者の機器制御パタンを用いることで、機器の効率を最大化しつつ、作業者の行動を制御することができるようになる。 According to the above configuration, the action control pattern is generated based on the device control pattern. In other words, since action control is performed based on device control, for example, by using the device control pattern of an expert, it is possible to maximize the efficiency of the device and control the action of the worker. Become.

本発明によれば、機器の効率を最大化しつつ、作業者の行動を制御することができる。 According to the present invention, operator behavior can be controlled while maximizing equipment efficiency.

第１の実施の形態による作業最適化システムに係る構成の一例である。It is an example of the configuration related to the work optimization system according to the first embodiment. 第１の実施の形態による作業環境の一例を示す図である。1 is a diagram showing an example of a working environment according to a first embodiment; FIG. 第１の実施の形態による機器制御パタンの生成に係る構成の一例を示す図である。FIG. 2 is a diagram illustrating an example of a configuration related to generation of device control patterns according to the first embodiment; FIG. 第１の実施の形態による機器制御を最適化するモデルの一例を示す図である。FIG. 2 is a diagram showing an example of a model for optimizing device control according to the first embodiment; FIG. 第１の実施の形態による機器制御パタン生成処理に係るフローチャートの一例を示す図である。FIG. 7 is a diagram showing an example of a flowchart relating to device control pattern generation processing according to the first embodiment; 第１の実施の形態による機器制御に基づいて行動制御を最適化するモデルの一例を示す図である。FIG. 4 is a diagram showing an example of a model for optimizing behavior control based on device control according to the first embodiment; 第１の実施の形態による行動制御パタン生成処理に係るフローチャートの一例を示す図である。FIG. 7 is a diagram showing an example of a flowchart relating to action control pattern generation processing according to the first embodiment; 第１の実施の形態による作業者の作業に係る状態を示す指標の一例を示す図である。FIG. 10 is a diagram showing an example of an index indicating a state related to work performed by a worker according to the first embodiment; 第１の実施の形態によるパラメータ指標関連情報の一例を示す図である。FIG. 4 is a diagram showing an example of parameter index related information according to the first embodiment; FIG.

以下図面について、本発明の一実施の形態を詳述する。 One embodiment of the present invention will be described in detail below with reference to the drawings.

本発明は、機器の制御によって作業者の行動に変化を与えて最適化する技術に関するものである。より具体的には、機器と作業者とが共存する環境において、あるタスクを実行する際に、機器側の機器制御（最適な機器制御パタン）によって、関連する作業者を最適に行動制御（最適な行動制御パタンを生成して作業者を制御）する技術に関するものである。 TECHNICAL FIELD The present invention relates to a technology for optimizing a worker's behavior by controlling it. More specifically, in an environment where equipment and workers coexist, when a certain task is executed, the equipment control on the equipment side (optimal equipment control pattern) is used to optimally control the behavior of related workers (optimal control pattern). The present invention relates to a technology for generating an action control pattern to control a worker.

例えば、本実施の形態では、作業者の行動を計測したデータである行動計測データと目標値とに基づいて作業評価を実施し、作業評価の結果と行動制御パタンとから報酬評価を行い、作業評価が高くなる行動制御パタンを保存し、作業者の行動を最適化するための行動制御パタンを選択し、選択した行動制御パタンに基づいて機器を制御することを特徴とする。 For example, in the present embodiment, work evaluation is performed based on behavior measurement data, which is data obtained by measuring the behavior of a worker, and target values, reward evaluation is performed based on the results of work evaluation and behavior control patterns, and work It is characterized by storing behavior control patterns with high evaluation, selecting behavior control patterns for optimizing the behavior of workers, and controlling equipment based on the selected behavior control patterns.

また、例えば、本実施の形態では、規範とされる作業者（例えば、熟練者）の行動から、最適な機器制御パタンを学習することを特徴とする。 Further, for example, the present embodiment is characterized by learning an optimum device control pattern from behavior of a worker (for example, a skilled worker) who is regarded as a model.

また、例えば、本実施の形態では、予め取得した機器制御パタンを作業者に提示することによって、最適な作業が可能な熟練者の行動パタンに近づくことが可能であって、機器制御により作業者の行動制御ができること特徴とする。 Further, for example, in the present embodiment, by presenting equipment control patterns that have been acquired in advance to the worker, it is possible to approach the action pattern of an expert who is capable of performing optimal work. It is characterized by being able to control the behavior of

また、例えば、本実施の形態では、機器制御パタンと行動制御パタンとを個人ごとに定義することによって、個人に最適な作業を提示することができることを特徴とする。 Further, for example, the present embodiment is characterized in that by defining device control patterns and action control patterns for each individual, it is possible to present the optimum work for each individual.

本実施の形態においては、工場環境内での、ロボット装置と作業者との協働作業の形態における事例を例に挙げて説明する。以下、詳細について説明する。 In the present embodiment, an example of collaborative work between a robot device and a worker in a factory environment will be described. Details will be described below.

（１）第１の実施の形態
図１において、１００は全体として第１の実施の形態による作業最適化システムを示す。(1) First Embodiment In FIG. 1, 100 generally indicates a work optimization system according to the first embodiment.

図１は、作業最適化システム１００に係る構成の一例を示す図である。作業最適化システム１００では、作業者の行動、機器の動作などを計測し、機器と作業者との協働作業を最適化する。作業最適化システム１００は、撮影装置１１０と、処理部（作業者計測部１２０、評価部１３０、目標指示部１４０、機器計測部１５０、行動制御パタン生成部１６０、行動制御計画部１７０、機器制御部１８０など）と、各種の情報（機器制御パタン１９１、行動制御パタン１９２など）とを含んで構成される。なお、各構成要素の一部については、作業最適化システム１００外に設けられていてもよい。 FIG. 1 is a diagram showing an example of the configuration of a work optimization system 100. As shown in FIG. The work optimization system 100 measures the behavior of the worker, the operation of the equipment, and the like, and optimizes the collaborative work between the equipment and the worker. The work optimization system 100 includes an imaging device 110, processing units (worker measurement unit 120, evaluation unit 130, target instruction unit 140, equipment measurement unit 150, behavior control pattern generation unit 160, behavior control planning unit 170, equipment control 180, etc.) and various information (device control pattern 191, action control pattern 192, etc.). Note that part of each component may be provided outside the work optimization system 100 .

処理部は、例えば、作業最適化装置（コンピュータ）の機能として実現される、作業最適化装置は、ノートパソコン、サーバ装置等であり、図示は省略するＣＰＵ（Central Processing Unit）、ＲＡＭ（Random Access Memory）、ＲＯＭ（Read Only Memory）、ＨＤＤ（Hard Disk Drive）、通信装置などを含んで構成される。 The processing unit is, for example, realized as a function of a work optimization device (computer). Memory), ROM (Read Only Memory), HDD (Hard Disk Drive), communication device, and the like.

作業最適化装置の機能（処理部）は、例えば、ＣＰＵがＲＯＭに格納されたプログラムをＲＡＭに読み出して実行すること（ソフトウェア）により実現されてもよいし、専用の回路などのハードウェアにより実現されてもよいし、ソフトウェアとハードウェアとが組み合わされて実現されてもよい。また、作業最適化装置の機能の一部は、作業最適化装置と通信可能な他のコンピュータにより実現されてもよい。 The function (processing unit) of the work optimization device may be implemented by, for example, having the CPU read out a program stored in the ROM into the RAM and execute it (software), or by hardware such as a dedicated circuit. may be implemented, or may be realized by combining software and hardware. Also, part of the functions of the work optimization device may be implemented by another computer that can communicate with the work optimization device.

撮影装置１１０は、ビデオカメラ、ネットワークカメラ等である。撮影装置１１０は、レンズ、画像センサなどを含んで構成され、所定の画像（画像情報）が取得できる。画像センサは、ＣＭＯＳ（Complementary Metal Oxide Semiconductor）、ＣＣＤ（Charge Coupled Device）などの撮像素子（いずれも図示せず）を含む機構である。このような撮影装置１１０から取得した画像（映像）を用いることで、作業者などの作業環境内の状況をセンシングすることができる。なお、作業環境については、図２を用いて後述する。 The imaging device 110 is a video camera, a network camera, or the like. The photographing device 110 includes a lens, an image sensor, and the like, and can acquire a predetermined image (image information). The image sensor is a mechanism including an imaging element (none of which is shown) such as a CMOS (Complementary Metal Oxide Semiconductor) or a CCD (Charge Coupled Device). By using an image (video) acquired from such a photographing device 110, it is possible to sense the situation of a worker or the like in the work environment. Note that the work environment will be described later with reference to FIG.

作業者計測部１２０は、例えば、機器制御部１８０により機器の制御が行われている際、撮影装置１１０（センサの一例）により取得された機器と協働して作業を行う作業者の画像情報（センサ情報の一例）から、作業者の作業を計測する。 For example, when the device control unit 180 controls the device, the worker measurement unit 120 obtains image information of the worker working in cooperation with the device acquired by the imaging device 110 (an example of a sensor). (an example of sensor information), the worker's work is measured.

評価部１３０は、例えば、作業者計測部１２０で計測された作業を所定の目標値（目標指示部１４０により設定された目標値）に基づいて評価する。例えば、評価部１３０は、作業者の作業に係る状態を示す指標が目標値を超えるか否かを判定し、超えると判定した場合、超えないと判断したときと比べて高い評価を行う。なお、作業者の作業に係る状態を示す指標については、図８を用いて後述する。 The evaluation unit 130, for example, evaluates the work measured by the worker measurement unit 120 based on a predetermined target value (target value set by the target instruction unit 140). For example, the evaluation unit 130 determines whether or not the index indicating the state related to the worker's work exceeds the target value. In addition, the index indicating the state of the worker's work will be described later with reference to FIG. 8 .

目標指示部１４０は、例えば、目標値を設定する。なお、目標値は、所定の期間に達成しようとする作業量（例えば、一日の間に、第１の場所にある箱を第２の場所に移動して積み重ねるといったもの）である。 The target instruction unit 140 sets, for example, a target value. Note that the target value is the amount of work to be achieved in a predetermined period (for example, moving boxes from a first location to a second location and stacking them in one day).

機器計測部１５０は、例えば、機器制御部１８０により機器の制御が行われている際、撮影装置１１０により取得された機器の画像情報から、機器の動作を計測する。 For example, when the device control unit 180 is controlling the device, the device measurement unit 150 measures the operation of the device from the image information of the device acquired by the imaging device 110 .

行動制御パタン生成部１６０は、例えば、評価部１３０による評価がしきい値より高いか否かを判定し、高いと判定した場合、高いと判定した機器制御パタンのパラメータを含む行動制御パタンを生成する。 For example, the behavior control pattern generation unit 160 determines whether or not the evaluation by the evaluation unit 130 is higher than a threshold value, and if it is determined to be high, generates a behavior control pattern including parameters of the device control pattern determined to be high. do.

ここで、行動制御パタンを学習する段階（学習フェーズ）と、行動制御パタンを使用する段階（運用フェーズ）とに分けて考える場合、学習フェーズにおいては、例えば、行動制御計画部１７０設けられていなくてもよい。この場合、行動制御パタン生成部１６０は、例えば、学習中の機器制御パタンのパラメータを評価が高くなるように探索的に他のパラメータに変更し、当該機器制御パタンと、変更した他のパラメータとを用いて機器を制御するように機器制御部１８０に指示を与える。かかる構成によれば、最適な行動制御パタンを適切に得ることができるようになる。付言するならば、作業最適化システム１００は、学習フェーズと運用フェーズとに分けることなく、運用しながら学習を行うものであってもよい。この場合、運用しつつ、最適な行動制御パタンを得ることができるようになる。 Here, when considering the phase of learning the behavior control pattern (learning phase) and the phase of using the behavior control pattern (operation phase), in the learning phase, for example, the behavior control planning unit 170 is not provided. may In this case, the action control pattern generation unit 160, for example, exploratoryly changes the parameter of the device control pattern being learned to another parameter so that the evaluation becomes higher, and the device control pattern and the changed other parameter are combined. is used to instruct the device control unit 180 to control the device. According to such a configuration, it becomes possible to appropriately obtain an optimum action control pattern. In addition, the work optimization system 100 may learn while operating without dividing into a learning phase and an operating phase. In this case, it becomes possible to obtain an optimum action control pattern while operating.

行動制御計画部１７０は、例えば、行動制御パタン生成部１６０により生成された行動制御パタンの中から、一の行動制御パタンを選択する。また、例えば、行動制御計画部１７０は、現在の状態を示す指標がしきい値を超えるか否かを判定し、超えないと判定した場合、図９を用いて後述するパラメータ指標関連情報に基づいて現在の状態を示す指標がしきい値を超える行動制御パタンを特定し、選択している行動制御パタンを、特定した行動制御パタンに変更するものであってもよい。 The action control planner 170 selects one action control pattern from the action control patterns generated by the action control pattern generator 160, for example. Further, for example, the behavior control planning unit 170 determines whether or not the index indicating the current state exceeds the threshold. may be used to specify an action control pattern whose index indicating the current state exceeds a threshold value, and change the selected action control pattern to the specified action control pattern.

機器制御部１８０は、例えば、行動制御計画部１７０により選択された行動制御パタンに基づいて機器の機器制御パタンと当該機器制御パタンのパラメータとを特定して機器の制御を行う。 The device control unit 180 specifies, for example, the device control pattern of the device and the parameters of the device control pattern based on the behavior control pattern selected by the behavior control planning unit 170, and controls the device.

図２は、作業環境の一例を示す図である。ロボット装置２００と作業者２０１とが同一環境で作業している様子を俯瞰で示している。より具体的には、ロボット装置２００は、ワーク２０２をピックアップして目標位置まで移動させ、作業者２０１は、ロボット装置２００ができないような作業を事前に、途中で、または事後に適宜サポートし、協働しながらタスクをこなしている様子を示している。それらの外観を撮影装置１１０で観察しながら、ロボット装置２００と作業者２０１との行動、作業状況について、認識技術を活用して計測している。 FIG. 2 is a diagram showing an example of a working environment. It shows a bird's-eye view of a robot device 200 and a worker 201 working in the same environment. More specifically, the robot device 200 picks up the workpiece 202 and moves it to a target position, and the worker 201 appropriately supports the work that the robot device 200 cannot do before, during, or after the fact, It shows how they work together to complete a task. While observing their appearance with the photographing device 110, the actions and work situations between the robot device 200 and the worker 201 are measured using recognition technology.

例えば、ロボット装置２００としては、６軸のピッキングロボットなどが代表的であるが、双腕型のもの、自律移動するタイプのもの等でもよく、さらに搬送用の移動体のようなものでよく、作業者の作業をサポートするような機能をもったものであるなら、特に問わない。 For example, the robot device 200 is typically a 6-axis picking robot, but it may be a dual-arm type, an autonomously moving type, or a mobile body for transportation. There is no particular limitation as long as it has a function to support the work of the operator.

なお、本実施の形態のセンサとしては、様々な機器が適用できるが、撮影装置１１０を例に挙げて説明する。 Various devices can be applied as the sensor of this embodiment, but the imaging device 110 will be described as an example.

作業最適化システム１００では、撮影装置１１０から取得した画像情報を用いて、作業者計測部１２０は、作業者２０１の作業（作業状態）を計測する。評価部１３０は、その作業を評価する。評価指標としては、目標指示部１４０に入力された予め設定した目標値（例えば、１分間あたりのワーク２０２の移動数）に応じたものとする。 In the work optimization system 100 , the worker measurement unit 120 measures the work (work state) of the worker 201 using the image information acquired from the imaging device 110 . The evaluation unit 130 evaluates the work. The evaluation index corresponds to a preset target value (for example, the number of movements of the workpiece 202 per minute) input to the target instruction unit 140 .

また、機器計測部１５０は、撮影装置１１０から得られた画像情報に基づいて、ロボット装置２００の動作（動作状態）などを計測する。ここでは、ロボット装置２００から直接得たロボット装置２００の姿勢、位置などの情報を活用してもよい。前提として、機器制御部１８０は、事前に学習された機器制御パタン１９１に基づいて、ロボット制御信号（機器制御信号）をロボット装置２００に伝送し、ロボット装置２００を動作させる。 In addition, the device measurement unit 150 measures the motion (operation state) of the robot device 200 based on the image information obtained from the imaging device 110 . Here, information such as the posture and position of the robot device 200 directly obtained from the robot device 200 may be utilized. As a premise, the device control unit 180 transmits a robot control signal (device control signal) to the robot device 200 based on the device control pattern 191 learned in advance, and causes the robot device 200 to operate.

ここで、機器制御パタン１９１は、熟練者との作業、ティーチング等によって生成された最適な機器の制御パタンが格納されたものである。機器制御パタン１９１は、例えば、ロボット装置２００のアームの軌道といった機器の動作が規定された情報である。機器制御パタン１９１としては、ワーク２０２を１個つかみで移動する機器制御パタン、ワーク２０２を２個つかみで移動する機器制御パタン、ワーク２０２を作業者２０１の手前まで移動する機器制御パタン、ワーク２０２を最終目的地まで移動する機器制御パタンなどが挙げられる。なお、機器制御パタン１９１については、説明のために例示したものに限定されるものではない。 Here, the device control pattern 191 stores an optimum device control pattern generated by working with a skilled person, teaching, or the like. The device control pattern 191 is, for example, information that defines the operation of the device such as the trajectory of the arm of the robot device 200 . The machine control patterns 191 include a machine control pattern to move the work 202 by grasping one, a machine control pattern to move the work 202 by grasping two, a machine control pattern to move the work 202 to the front of the worker 201, and the work 202 to move. to the final destination. It should be noted that the device control pattern 191 is not limited to the one illustrated for the purpose of explanation.

図３は、機器制御パタン１９１の生成に係る構成の一例を示す図である。図３に示すように、作業最適化システム１００は、機器制御パタン生成部３００を処理部として含んで構成されてもよい。機器制御パタン生成部３００は、例えば、規範とする作業者の行動から最適な機器制御パタン１９１を生成する。 FIG. 3 is a diagram showing an example of a configuration related to generation of the device control pattern 191. As shown in FIG. As shown in FIG. 3, the work optimization system 100 may be configured including an equipment control pattern generation section 300 as a processing section. The device control pattern generation unit 300 generates an optimal device control pattern 191 from, for example, a behavior of a worker as a model.

機器制御パタン１９１の生成については、熟練者などによって、目標指示部１４０で提示される目標値に沿って、例えば、最も効率のよい作業について、熟練者と共に試行錯誤しながら決定していく仮定において、機械学習手法で取得する方法がある。 Regarding the generation of the device control pattern 191, it is assumed that an expert or the like decides, for example, the most efficient work in accordance with the target values presented by the target instruction unit 140 through trial and error with the expert. , there is a way to get it by machine learning method.

機械学習手法として、例えば、強化学習と呼ばれる手法を用いることができる。 As a machine learning method, for example, a method called reinforcement learning can be used.

図４は、機械学習手法（機器制御を最適化するモデル）の一例を示す図である。図４に示すように、機器４００と作業環境４０１というようなシンプルなモデルを定義した場合に、機器４００のアクションａによって作業環境４０１の変化（価値）を評価して、報酬ｒとしてフィードバックする。これにより、報酬ｒによって最適なアクションａを構築する方法である。このような方法は、様々に提案されているが、「Watkins, C.J.C.H. (1989). Learning from Delayed Rewards.」などに代表されるＱ学習などの方式を採用してよい。 FIG. 4 is a diagram showing an example of a machine learning method (a model for optimizing device control). As shown in FIG. 4, when a simple model such as a device 400 and a work environment 401 is defined, the change (value) of the work environment 401 is evaluated by the action a of the device 400 and fed back as a reward r. This is the method of constructing the optimal action a with the reward r. Various such methods have been proposed, and a method such as Q-learning typified by "Watkins, C.J.C.H. (1989). Learning from Delayed Rewards." may be employed.

これらの手法を駆使して求めることによれば、目標指示部１４０で設定している目標値に対しての作業評価を行い、この作業が評価として高い（例えば、目標値に近い、目標値より高い等の）機器制御パタンを、熟練度の高い作業者に対する、機器側の役割であるとして取得することが可能となる。次に、機器制御パタンを生成する方法について図５を用いて説明する。 According to obtaining by making full use of these methods, work evaluation is performed for the target value set by the target instruction unit 140, and this work is evaluated as high (for example, close to the target value, higher than the target value high, etc.) can be acquired as a role on the equipment side for a highly skilled worker. Next, a method of generating a device control pattern will be explained using FIG.

図５は、機器制御パタン生成部３００が機器制御パタン１９１を生成する処理（機器制御パタン生成処理）に係るフローチャートの一例を示す図である。 FIG. 5 is a diagram showing an example of a flowchart relating to processing (device control pattern generation processing) for the device control pattern generation unit 300 to generate the device control pattern 191 .

まず、評価部１３０は、撮影装置１１０から取得したデータを用いて作業者計測部１２０にて得た行動データ（作業）と、所定の目標値（目標値５００）とを取得する（ステップＳ５０１）。 First, the evaluation unit 130 acquires action data (work) obtained by the worker measurement unit 120 using data obtained from the photographing device 110 and a predetermined target value (target value 500) (step S501). .

続いて、評価部１３０は、作業者計測部１２０で計測された作業を目標値に基づいて評価（行動評価）を実行する（ステップＳ５０２）。 Subsequently, the evaluation unit 130 evaluates (behavior evaluation) the work measured by the worker measurement unit 120 based on the target value (step S502).

ここで、作業（行動）としては、作業者が移動した距離、身体に負荷の大きいしゃがみ、高い位置での保持姿勢、ストレスのかかる作業などが挙げられる。行動評価では、商品の詰め込み作業を仮定すると、１ケースあたりどの程度の時間を要したのか、身体への負荷（身体不可）がどれくらいあったのかを評価し、作業時間が短く、かつ、身体負荷が低い行動についての評価が高い等として評価基準（作業者の作業に係る状態を示す指標）を定義できる。評価基準については、設置環境等にもよるため、作業環境に応じた定義を行うことが好適である。なお、身体負荷の計測については、撮影装置１１０以外に、ストレス計測センサ、脳波センサなどの別のデバイスを活用することも可能である。さらには、アンケート評価など、感性評価も活用した形としてよい。付言するならば、作業については、説明のために例示したものに限定されるものではない。行動評価については、説明のために例示したものに限定されるものではない。 Here, the work (behavior) includes the distance traveled by the worker, squatting which puts a heavy load on the body, a holding posture in a high position, and stressful work. In behavioral evaluation, assuming product stuffing work, how much time was required for each case and how much physical load (physical impossibility) was applied was evaluated. It is possible to define an evaluation criterion (an index indicating the state of a worker's work) such that an action with a low evaluation is highly evaluated. Since the evaluation criteria depend on the installation environment, etc., it is preferable to define them according to the work environment. It should be noted that, in addition to the imaging device 110, other devices such as a stress measurement sensor and an electroencephalogram sensor can be used to measure the physical load. Furthermore, it is good to take the form which utilized sensitivity evaluation, such as questionnaire evaluation. In addition, the operations are not limited to those exemplified for explanation. Behavioral evaluation is not limited to the examples given for explanation.

続いて、機器制御パタン生成部３００は、機器制御パタン生成処理を終了するか否かを判定する（ステップＳ５０３）。機器制御パタン生成部３００は、終了すると判定した場合、ステップＳ５０５に処理を移し、終了しないと判定した場合、ステップＳ５０４に処理を移す。なお、終了の判断については、行動評価値に変化が見られない、所定の回数実行した、目標値との誤差がしきい値未満になったなど、事前に決めた基準に従って行われる。 Subsequently, the device control pattern generation unit 300 determines whether or not to end the device control pattern generation process (step S503). The device control pattern generation unit 300 shifts the process to step S505 when determining to end, and shifts the process to step S504 when determining not to end. Termination is determined according to predetermined criteria, such as no change in action evaluation value, execution of a predetermined number of times, error from target value less than a threshold value, and the like.

ステップＳ５０４では、機器制御パタン生成部３００は、行動評価の結果、しきい値より高い評価を得ることができた機器の状態を、理想的な機器制御パタンとして更新する。なお、機器制御パタン生成部３００は、目標値５００に向けて機器および／または作業者が試行錯誤で改善したものに対して、逐次評価を繰り返す。 In step S504, the device control pattern generation unit 300 updates, as an ideal device control pattern, the states of devices that have obtained evaluations higher than the threshold as a result of behavior evaluation. It should be noted that the equipment control pattern generation unit 300 repeats successive evaluations for improvements made by the equipment and/or the worker through trial and error toward the target value 500 .

ステップＳ５０５では、機器制御パタン生成部３００は、熟練者に最適な機器制御パタン（例えば、目標値および／またはしきい値を超える一または複数の機器制御パタン）を機器制御パタン１９１として保存する。 In step S<b>505 , the device control pattern generation unit 300 saves the device control pattern optimal for the expert (for example, one or more device control patterns exceeding the target value and/or threshold value) as the device control pattern 191 .

なお、機器制御パタン１９１については、制約条件（安全性の観点、保守の観点など）に基づいて、絞込みが行われてもよい。 Note that the device control patterns 191 may be narrowed down based on constraint conditions (safety viewpoint, maintenance viewpoint, etc.).

次に、機器制御に基づいて行動制御を実現する構成について図６および図７を用いて説明する。上述したように、機器制御パタン１９１の学習においては、作業効率のような目的に関して、どのように機器を動作させると、効率的な動きとなるかを学習する方法について説明した。これは、いわば熟練者に合わせた機器の制御方法を機器側が学ぶ作業となる。 Next, a configuration for implementing action control based on device control will be described with reference to FIGS. 6 and 7. FIG. As described above, in the learning of the device control pattern 191, the method of learning how to operate the device to achieve efficient movement has been described with respect to the purpose such as work efficiency. This is, so to speak, a task for the equipment side to learn the control method of the equipment suitable for the expert.

一方で、評価部１３０および行動制御パタン生成部１６０においては、学習された機器の動作に基づいて、作業を実施する作業者側がどのように効率的な行動を習得するのか、どのような機器の動作を実行すれば、作業者と機器との組み合わせが高効率な動きとなるのかを求める手続きとなる。 On the other hand, in the evaluation unit 130 and the behavior control pattern generation unit 160, based on the learned behavior of the equipment, how the worker who performs the work learns efficient behavior and what kind of equipment is used. If the movement is executed, it becomes a procedure to find out whether the combination of the worker and the equipment is a highly efficient movement.

図６は、機器制御に基づいて行動制御を最適化するモデルの一例を示す図である。まず、機器６００によって所定の動作としてアクションａ_ｍで動作する。するとそれに呼応して作業者６０２が動作をあわせて、アクションａ_ｗを実行する。それに対して作業環境６０１に状態の変化ｓが発生する。FIG. 6 is a diagram showing an example of a model for optimizing behavior control based on device control. First, the device ₆₀₀ performs an action am as a predetermined operation. Then, in response to this, the worker 602 performs the action _aw in concert. On the other hand, a state change s occurs in the work environment 601 .

作業者６０２を中心に考えると、どのような機器６００のアクションａ_ｍで、どのような作業者６０２のアクションａ_ｗが発生し、作業環境６０１に変化ｓが生じたのか、ということは、アクションａ_ｍを報酬ｒと考えると、どのような報酬ｒ（アクションａ_ｍ）で作業者６０２がどう理想の変化ｓに近づくかという、逆強化学習の枠組みで考えることができる。Considering the worker 602 as the center, what kind of action am of the device ₆₀₀ , what kind of action _aw of the worker 602 occurred, and what change s occurred in the work environment 601 is the action Assuming that a _m is a reward r, it can be considered in the framework of inverse reinforcement learning how the worker 602 approaches the ideal change s with what kind of reward r (action a _m ).

逆強化学習とは、エキスパートによる行動から報酬（どの状態がどのくらいよいのか）を推定する方法であり、代表的なものとして「Andrew Ng, et al :”Algorithms for Inverse Reinforcement Learning”」などが挙げられる。理想的な作業が可能な熟練者の行動からどのような報酬（行動の的確さ）を判断して動作しているか、どのような判断および行動が熟練者の成熟度に起因しているのか、を学習するものである。 Inverse Reinforcement Learning is a method of estimating rewards (which state is better) from the actions of an expert. . What kind of reward (accuracy of action) is judged from the actions of an expert who can perform ideal work and what kind of judgments and actions are caused by the maturity of the expert? is to learn.

本実施の形態では、熟練者によって生成された行動をそのまま、未熟者への行動指針にするような直接的な逆強化学習の活用とせず、熟練者により定義された機器側の最適な機器制御パタンを通じて、これらの機器制御パタンすなわちアクションａ_ｍを、作業者６０２（未熟者、非熟練者など）に与えた場合に、どのアクションａ_ｍが作業者６０２にとって最適な報酬となりうるかを推定するものであり、機器６００を通じて行動を制御すると言い換えてもよい。In the present embodiment, instead of directly utilizing inverse reinforcement learning such that actions generated by experts are used as action guidelines for inexperienced people, optimal device control on the device side defined by experts is performed. Predicting, through patterns, which action am can be the optimal _reward for the worker 602 when these machine control patterns, that is, action am, are given to the worker 602 (inexperienced worker, _unskilled worker, etc.). , which can be rephrased as controlling actions through the device 600 .

その効果としては、直接的な指南であった場合に、個々人に適合しない指示となり、かえって習熟時間がかかったり、個人にあった最適な行動とならなかったりするケースも多いなか、間接的な指示によって、理想の行動を促すことによって、作業者の行動が最適化に導かれると同時に、機器側の制御も進化して適合することが効果として考えられる。 As for the effect of direct instruction, there are many cases in which the instruction does not match the individual, and it takes time to learn and the behavior is not optimal for the individual. By encouraging ideal behavior, the operator's behavior is optimized, and at the same time, the control on the equipment side evolves and adapts.

さらには、機器側の機器制御の学習と、作業者側の行動制御の学習とを繰り返すことによる相互進化のような効果も得られるということがいえる。 Furthermore, it can be said that an effect such as mutual evolution can be obtained by repeating the learning of device control on the device side and the learning of action control on the worker side.

ここで、作業最適化システム１００では、機器制御パタン１９１のパラメータ（速度といった機器の動かし方をチューニングするためのパラメータ）を変更することで、アクションａ_ｍを変更し、そのときの機器制御パタン１９１とパラメータとの組合せを行動制御パタンとしている。次に、行動制御パタンの生成方法について図７を用いて説明する。Here, in the work optimization system 100, the action _am is changed by changing the parameter of the equipment control pattern 191 (the parameter for tuning the method of moving the equipment such as speed), and the equipment control pattern 191 at that time is changed. and parameters are used as behavior control patterns. Next, a method of generating an action control pattern will be described with reference to FIG.

図７は、行動制御パタン生成部１６０が行動制御パタン１９２を生成する処理（行動制御パタン生成処理）に係るフローチャートの一例を示す図である。 FIG. 7 is a diagram showing an example of a flowchart relating to processing (behavior control pattern generation processing) in which the behavior control pattern generation unit 160 generates the behavior control pattern 192 .

まず、評価部１３０は、撮影装置１１０から取得したデータを用いて作業者計測部１２０にて得た行動データ（作業）と、所定の目標値（目標値７００）とを取得する（ステップＳ７０１）。なお、目標値５００と目標値７００とは、同じであっていてもよいし、異なっていてもよい。 First, the evaluation unit 130 acquires action data (work) obtained by the worker measurement unit 120 using data obtained from the photographing device 110 and a predetermined target value (target value 700) (step S701). . Note that the target value 500 and the target value 700 may be the same or different.

続いて、評価部１３０は、作業者計測部１２０で計測された作業を目標値に基づいて評価（行動評価）を実行する（ステップＳ７０２）。例えば、評価部１３０は、ステップＳ５０２と同様に評価を行い、行動の優劣を比較する。 Subsequently, the evaluation unit 130 evaluates the work measured by the worker measurement unit 120 based on the target value (behavior evaluation) (step S702). For example, the evaluation unit 130 performs evaluation in the same manner as in step S502 and compares the superiority or inferiority of actions.

続いて、行動制御パタン生成部１６０は、行動制御パタン生成処理を終了するか否かを判定する（ステップＳ７０３）。行動制御パタン生成部１６０は、終了すると判定した場合、ステップＳ７０５に処理を移し、終了しないと判定した場合、ステップＳ７０４に処理を移す。なお、終了の判断については、行動評価値に変化が見られない、所定の回数実行した、目標値との誤差がしきい値未満になったなど、事前に決めた基準に従って行われる。 Subsequently, the action control pattern generation unit 160 determines whether or not to end the action control pattern generation process (step S703). If the action control pattern generation unit 160 determines to end the process, it moves the process to step S705, and if it determines not to end the process, it moves the process to step S704. Termination is determined according to predetermined criteria, such as no change in action evaluation value, execution of a predetermined number of times, error from target value less than a threshold value, and the like.

ステップＳ７０４では、行動制御パタン生成部１６０は、行動評価の結果、しきい値より高い評価を得ることができた行動制御パタンを、理想的な行動制御パタンとして更新する。例えば、行動制御パタン生成部１６０は、しきい値より高いと判定した機器制御パタン（機器制御パタンを識別可能な識別子であってもよい。）とパラメータとを含む行動制御パタンを生成する。 In step S704, the behavior control pattern generation unit 160 updates the behavior control pattern, which has been evaluated higher than the threshold as the result of the behavior evaluation, as an ideal behavior control pattern. For example, the behavior control pattern generation unit 160 generates a behavior control pattern including a device control pattern determined to be higher than a threshold value (which may be an identifier by which the device control pattern can be identified) and a parameter.

なお、行動制御パタン生成部１６０は、行動制御パタン（作業者の行動制御を可能とする機器制御パタンのパラメータ）を変更したものに対して、逐次評価を繰り返す。例えば、機器のアームの移動速度が第１の速度である行動制御パタンの場合、丁寧にワークを積み重ねるが移動に要する時間は長く、機器のアームの移動速度が第１の速度よりも速い第２の速度である行動制御パタンの場合、雑にワークを積み重ねるが移動に要する時間は短い。前者の場合には、作業者の待ち時間での十分なワークの確認や調整の時間を必要とするが効率が低くなり、単調な作業によるミスを誘発するなど、個々人によっての適合性によっては適さない場合もある。他方、後者の場合には、非常に早い速度であるが、そういった作業の適合性が高い作業者にとっては、効率よく作業することもできる。 Note that the behavior control pattern generation unit 160 repeats the evaluation sequentially for those with changed behavior control patterns (parameters of device control patterns that enable the behavior control of the worker). For example, in the case of a behavior control pattern in which the movement speed of the equipment arm is the first speed, the workpieces are carefully stacked, but the time required for movement is long, and the movement speed of the equipment arm is the second speed, which is faster than the first speed. In the case of the action control pattern with a speed of , the workpieces are piled up roughly, but the time required for movement is short. In the former case, sufficient work check and adjustment time is required while the worker is waiting, but efficiency is low, and monotonous work induces mistakes. Sometimes not. On the other hand, in the latter case, although the speed is very high, it is also possible to work efficiently for workers who are highly adaptable to such work.

上記は非常に単純なケースを記載した。ここで、行動制御パタンには、速度や正確さのような一般的な機器精度と考えられるものもあるが、異なる行動制御パタンも考えうる。一般的に、正確なリズムでの作業が工場機械にとって理想とされる傾向があるが、ランダムなゆらぎやノイズのような動きを入れることで、作業者に対して注意喚起や作業の多様性を与えることによる、ストレスや集中力の低下を防ぐことも期待できる。逆にそういった作業を望まないケースもある。 The above describes a very simple case. Here, some action control patterns are considered to be general device accuracy such as speed and accuracy, but different action control patterns are also conceivable. In general, there is a tendency to work in a precise rhythm ideal for factory machines, but by introducing random fluctuations and noise-like movements, it is possible to call attention to the workers and diversify the work. It can also be expected to prevent stress and loss of concentration by giving. On the other hand, there are cases where such work is not desired.

さらには、作業の時間帯や作業の継続時間、現在の環境や身体の状態など作業者に関わる情報を加味することによって、より適切な行動制御パタンを適用することができる。例えば、午前中に対して午後は疲労から作業の正確性が低下するという事前情報があった場合に、作業のスピードを適切にコントロールするなども考えられる。 Furthermore, a more appropriate action control pattern can be applied by taking into account information related to the worker, such as the time zone of the work, the duration of the work, the current environment, and the state of the body. For example, if there is advance information that the accuracy of work will be lower in the afternoon due to fatigue than in the morning, it is conceivable to appropriately control the work speed.

このように、どの行動制御パタンが作業者にとっての報酬（良い状態に導くパタン）になるのかという逆強化学習を行うこととなる。 In this way, inverse reinforcement learning is performed to determine which action control pattern is a reward (a pattern that leads to a good state) for the worker.

また、ステップＳ７０４では、行動制御パタン生成部１６０は、機器の制御に用いた機器制御パタンのパラメータと作業者の作業に係る状態を示す指標と関連付けた情報（パラメータ指標関連情報）を行動制御パタンに対応付けてＨＤＤに記憶してもよい。作業者の作業に係る状態を示す指標として、体力面な指標、精神面な指標などが設けられている。 Further, in step S704, the action control pattern generation unit 160 converts information (parameter index related information) associated with the parameter of the device control pattern used for controlling the device and the index indicating the state related to the work of the worker into the action control pattern. may be stored in the HDD in association with . A physical strength index, a mental index, and the like are provided as indices that indicate the worker's working condition.

図８は、作業者の作業に係る状態を示す指標の一例（指標テーブル８００）を示す図である。図８に示すように、ある機器制御パタンの場合、どういった行動計画に効果的であるのかといった情報が指標テーブル８００に記憶されている。これは、業種により異なっており、それぞれの機器制御パタンについて定義される。 FIG. 8 is a diagram showing an example of indicators (indicator table 800) indicating the status of work performed by a worker. As shown in FIG. 8, an index table 800 stores information about what kind of action plan is effective for a certain device control pattern. This differs depending on the type of business and is defined for each device control pattern.

より具体的には、指標テーブル８００には、機器制御パタン８０１ごとに、習熟レベル８０２、移動量８０３、集中力８０４、精密さ８０５、ストレス８０６、作業実績８０７といった指標に係る情報が格納されている。なお、指標テーブル８００には、上述したように、作業者に関わる情報が含まれていてもよい。 More specifically, the index table 800 stores information relating to indices such as a proficiency level 802, movement amount 803, concentration 804, precision 805, stress 806, and work performance 807 for each device control pattern 801. there is Note that the index table 800 may include information related to workers as described above.

また、パラメータ指標関連情報の一例（近似曲線９００）を図９に示す。なお、パラメータ指標関連情報は、指標およびパラメータごとに生成されている。 Also, FIG. 9 shows an example of the parameter index related information (approximate curve 900). Note that the parameter index related information is generated for each index and parameter.

例えば、作業者が疲れてきた場合、移動量８０３の近似曲線９００を用い、しきい値を満足する範囲内でパラメータを変更することができるようになる。また、例えば、作業者の作業が雑になってきた場合、精密さ８０５の近似曲線９００を用い、しきい値を満足する範囲内でパラメータを変更することができるようになる。また、例えば、一日のノルマが変更された場合、作業実績８０７の近似曲線９００を用い、しきい値を満足する範囲内でパラメータを変更することができるようになる。このように、パラメータ指標関連情報によれば、現在の状態に応じて、しきい値を満足する範囲内でパラメータを変更することができるようになる。 For example, when the worker gets tired, the approximate curve 900 of the movement amount 803 can be used to change the parameters within the range that satisfies the threshold value. Further, for example, when the work of the worker becomes rough, the approximate curve 900 of the precision 805 can be used to change the parameters within the range that satisfies the threshold value. Also, for example, when the daily quota is changed, the approximated curve 900 of the work record 807 can be used to change the parameters within a range that satisfies the threshold value. In this way, according to the parameter index related information, it becomes possible to change the parameter within a range that satisfies the threshold according to the current state.

ステップＳ７０５では、行動制御パタン生成部１６０は、作業者に最適な行動制御パタンを行動制御パタン１９２として保存する。 In step S<b>705 , the action control pattern generation unit 160 saves the optimum action control pattern for the worker as the action control pattern 192 .

ここでは、行動制御パタンが作業者ごとに異なっており、ある作業を実行する際にどういった機器の動きであれば、最適な作業となるのかという報酬が学習された状態となる。 Here, the action control pattern is different for each worker, and the reward is learned as to what kind of movement of the equipment will be the optimum work when executing a certain work.

上述した構成によれば、例えば、機器の最適な機器制御パタンを取得すると共に、その機器制御パタンを別の作業者への行動制御に活用することによって、機器の性能を十分に引き出すことができ、かつ、個人にあった最適な作業となり、全体の効率を最大化することができる。 According to the above-described configuration, for example, by acquiring the optimum device control pattern of the device and utilizing the device control pattern for behavior control of another worker, the performance of the device can be sufficiently brought out. In addition, the work can be optimized for each individual, and the overall efficiency can be maximized.

また、上述した構成によれば、例えば、機器側も機器制御を最適化できるので、機器と作業者との組み合わせで最適な行動制御パタンを生成し、効率を最大化することが可能である。 In addition, according to the above-described configuration, for example, the device side can also optimize the device control, so it is possible to generate the optimum action control pattern for the combination of the device and the worker and maximize the efficiency.

また、上述した構成によれば、例えば、最適な機器制御パタンを取得できると共に、人の行動制御も可能となるため、熟練者に近い形で作業を習熟させることができる。 In addition, according to the above-described configuration, for example, it is possible to acquire the optimum device control pattern and also to control human behavior, so that it is possible to make the worker master the work in a manner similar to that of an expert.

また、上述した構成によれば、例えば、行動制御パタンは、個人の嗜好や動作パタンに関連する項目であるため、ある機器で生成した行動制御パタンおよび／またはパラメータ指標関連情報は、別の機器への活用が可能であるといった、展開しやすさも効果として考え得る。 Further, according to the above-described configuration, for example, since the action control pattern is an item related to personal preference or action pattern, the action control pattern and/or the parameter index-related information generated by one device can be used by another device. Ease of deployment, such as being able to be used for

以上、本実施の形態によれば、規範の事例となる熟練者の行動と、作業効率などの目標が最も高い機器制御パタンとを事前に取得しておき、非熟練者のような最適な作業が即時には実行できない場合に、事前に取得した機器制御パタンを介して、非熟練者の作業を熟練者の作業に導くことで、結果として作業者の行動が最適化され、生産目標、高効率な作業などに近づけることの効果を得ることができる。また、例えば、作業者の状態を示す指標を設けることで、個人に合った行動制御を行うことの効果を得ることができる。また、例えば、現在の状態に応じた行動制御を行うことの効果を得ることができる。 As described above, according to the present embodiment, the action of an expert as an example of the norm and the device control pattern with the highest target such as work efficiency are acquired in advance, and the optimal work such as that of an unskilled person is obtained. can not be executed immediately, by guiding the work of unskilled workers to the work of skilled workers through equipment control patterns obtained in advance, as a result, the behavior of workers is optimized, and production targets and high efficiency are achieved. It is possible to obtain the effect of being close to a work such as Also, for example, by providing an index indicating the state of the worker, it is possible to obtain the effect of performing action control suitable for the individual. Also, for example, it is possible to obtain the effect of performing action control according to the current state.

このように、予め学習した機器制御パタンをもとに、作業者への行動評価と報酬学習とを行うことで、作業者の習熟度を機器側の変化によって促すことができる作業最適化システムおよび作業最適化装置を提供することが可能である。 In this way, based on machine control patterns that have been learned in advance, a work optimization system that can encourage the worker's proficiency level by changing the machine side by performing action evaluation and reward learning for the worker, and It is possible to provide a work optimization device.

なお、本実施の形態では、機器制御パタンの更新など、繰り返し相互に深化する形式については図示していないが、当然、繰り返す構成を採用し、相互進化が可能なようにしてもよい。 In the present embodiment, the form of repetitive mutual deepening, such as updating of device control patterns, is not illustrated, but a repetitive configuration may of course be employed to enable mutual evolution.

（２）他の実施の形態
なお上述の実施の形態においては、本発明を作業最適化システムに適用するようにした場合について述べたが、本発明はこれに限らず、この他種々のシステム、装置、方法、プログラムに広く適用することができる。(2) Other Embodiments In the above-described embodiments, the case where the present invention is applied to a work optimization system was described, but the present invention is not limited to this, and various other systems, It can be widely applied to devices, methods and programs.

また、上記の説明において、各機能を実現するプログラム、テーブル、ファイル等の情報は、メモリや、ハードディスク、ＳＳＤ（Solid State Drive）等の記憶装置、または、ＩＣカード、ＳＤカード、ＤＶＤ等の記録媒体に置くことができる。 In the above description, information such as programs, tables, and files that implement each function is stored in a memory, hard disk, SSD (Solid State Drive), or other storage device, or recorded on an IC card, SD card, DVD, or the like. You can put it on the medium.

また上述した構成については、本発明の要旨を超えない範囲において、適宜に、変更したり、組み替えたり、組み合わせたり、省略したりしてもよい。 Moreover, the above-described configurations may be appropriately changed, rearranged, combined, or omitted within the scope of the present invention.

上述した構成によれば、機器の効率を最大化しつつ、作業者の行動を制御することができる。 According to the configuration described above, it is possible to control the actions of the operator while maximizing the efficiency of the equipment.

１００……作業最適化システム、１１０……撮影装置、１２０……作業者計測部、１３０……評価部、１４０……目標指示部、１５０……機器計測部、１６０……行動制御パタン生成部、１７０……行動制御計画部、１８０……機器制御部、１９１……機器制御パタン、１９２……行動制御パタン。 100 work optimization system 110 imaging device 120 worker measurement unit 130 evaluation unit 140 target instruction unit 150 equipment measurement unit 160 action control pattern generation unit , 170... action control plan part, 180... device control part, 191... device control pattern, 192... action control pattern.

Claims

a device control unit that controls the device based on a device control pattern that controls the operation of the device and parameters of the device control pattern;
Worker measurement for measuring behavior of a worker from image information of a worker working in cooperation with the device acquired by a photographing device when the device is controlled by the device control unit. Department and
an evaluation unit that evaluates the behavior of the worker measured by the worker measurement unit based on a predetermined target value;
Determining whether or not the evaluation by the evaluation unit is higher than a threshold value, and generating an action control pattern for controlling the action of the worker, including the parameter of the device control pattern determined to be high when it is determined to be higher. a behavior control pattern generator that
An action control plan for selecting one action control pattern from among the action control patterns of the worker generated by the action control pattern generation unit based on the result of reward evaluation for evaluating the correctness of the action of the worker. Department and
with
The action control pattern generation unit changes the parameter of the device control pattern to another parameter based on a predetermined approximate curve, and controls the device using the device control pattern and the changed other parameter. giving an instruction to the device control unit to
The device control unit specifies a device control pattern of the device and a parameter of the device control pattern based on the behavior control pattern selected by the behavior control planning unit, and controls the device;
The action control pattern generation unit associates parameter index related information, which is information that associates a parameter of the device control pattern of the device with an index indicating a state related to the work of the worker, with the action control pattern including the parameter. and remember
The behavior control planning unit determines whether or not the index indicating the current state exceeds the threshold. identifying an action control pattern exceeding the value and changing the selected action control pattern to the identified action control pattern;
A work optimization system characterized by:

a device control unit that controls the device based on a device control pattern that controls the operation of the device and parameters of the device control pattern;
Worker measurement for measuring behavior of a worker from image information of a worker working in cooperation with the device acquired by a photographing device when the device is controlled by the device control unit. Department and
an evaluation unit that evaluates the behavior of the worker measured by the worker measurement unit based on a predetermined target value;
Determining whether or not the evaluation by the evaluation unit is higher than a threshold value, and generating an action control pattern for controlling the action of the worker, including the parameter of the device control pattern determined to be high when it is determined to be higher. a behavior control pattern generator that
An action control plan for selecting one action control pattern from among the action control patterns of the worker generated by the action control pattern generation unit based on the result of reward evaluation for evaluating the correctness of the action of the worker. Department and
with
The action control pattern generation unit changes the parameter of the device control pattern to another parameter based on a predetermined approximate curve, and controls the device using the device control pattern and the changed other parameter. giving an instruction to the device control unit to
The device control unit specifies a device control pattern of the device and a parameter of the device control pattern based on the behavior control pattern selected by the behavior control planning unit, and controls the device;
The action control pattern generation unit associates parameter index related information, which is information that associates a parameter of the device control pattern of the device with an index indicating a state related to the work of the worker, with the action control pattern including the parameter. and remember
The behavior control planning unit determines whether or not the index indicating the current state exceeds the threshold. identifying an action control pattern exceeding the value and changing the selected action control pattern to the identified action control pattern;
A work optimization device characterized by: