JP2021147112A

JP2021147112A - Method of feed processing

Info

Publication number: JP2021147112A
Application number: JP2020045073A
Authority: JP
Inventors: 充宏吉田; Mitsuhiro Yoshida
Original assignee: Screen Holdings Co Ltd
Current assignee: Screen Holdings Co Ltd
Priority date: 2020-03-16
Filing date: 2020-03-16
Publication date: 2021-09-27

Abstract

To provide a technology that reduces the time required for machine learning and prevents damage to an actual installation.

SOLUTION: A method of feed processing includes ejecting a processing substance onto a surface of a base material while feeding the base material. The method has a) a step of acquiring, while actually feeding the base material, plural pieces of information on a feed-related control value, including a rotational speed of a motor that feeds the base material, or on an ejection-related control value, including ejection timing of the processing substance, together with measured values of a tension or feed rate of the base material, b) a step of preparing, on a computer, a simplified simulation model indicating a relationship between the control value and the measured value, c) a step of preparing an in-learning model by conducting, on the computer, reinforcement learning that maintains an output value from the simplified simulation model in a predetermined range while updating the control value, and d) a step of conducting reinforcement learning that maintains the measured value in the predetermined range when the processing substance is ejected onto the surface while the base material is actually fed, while updating the control value output from the in-learning model.

SELECTED DRAWING: Figure 4

Description

The present invention relates to a transport processing method in which a processing substance is ejected onto the surface of a long strip-shaped base material while the base material is transported in its longitudinal direction.

Inkjet image recording apparatuses are known that record an image on printing paper by ejecting ink from a plurality of recording heads while transporting long strip-shaped printing paper in its longitudinal direction. Such an apparatus ejects ink of a different color from each of the heads and records a multicolor image on the surface of the printing paper by superimposing the single-color images formed by the respective inks.

In this type of image recording apparatus, a plurality of rollers must transport the printing paper at a constant transport speed. However, slip between the roller surfaces and the printing paper, or elongation of the printing paper caused by the ink, can make the transport speed of the printing paper below the recording heads deviate from the ideal transport speed. The ejection positions of the inks of the respective colors on the surface of the printing paper may then shift in the transport direction. A method for compensating for such a shift is described in, for example, Patent Document 1.

Patent Document 1: Japanese Unexamined Patent Publication No. 2019-166832

Patent Document 1 discloses a method of recording an image (9) by depositing ink droplets on a printing substrate while transporting the printing substrate by rotating a jetting drum. It discloses a step of calculating a compensation torque from the measured drive torque of the jetting drum and the grayscale value progression (10) of the image data recorded on the printing substrate, and of controlling the drive of the jetting drum with that compensation torque taken into account. It also states that machine learning is used in calculating the compensation torque.

However, when machine learning is performed from scratch on an actual device, reaching the desired level may take an enormous amount of time. In addition, in the initial stage of learning, control that deviates greatly from the ideal operation may damage the actual device.

The present invention has been made in view of these circumstances. One object is to provide a technique that shortens the time required for machine learning. Another object is to provide a technique that prevents damage to the actual device even in the initial stage of learning.

To solve the above problems, the first invention of the present application is a transport processing method for ejecting a processing substance onto the surface of a long strip-shaped base material while transporting the base material in its longitudinal direction, the method comprising: a) a step of acquiring, while actually transporting the base material, a plurality of sets of information on a transport-related control value, including the rotational speed of a motor that is the drive source for transporting the base material, or on an ejection-related control value, including the ejection timing of the processing substance, together with measured values of the tension or transport speed of the base material; b) a step of creating, on a computer, a simplified simulation model representing the relationship between the control value and the measured value, based on the results acquired in step a); c) a step of creating an in-learning model by performing, on the computer, reinforcement learning for keeping the output value of the simplified simulation model within a predetermined range while updating the control value; and d) a step of performing reinforcement learning for keeping the measured value within a predetermined range when the processing substance is ejected onto the surface of the base material while the base material is actually transported, while updating the control value output from the in-learning model.

The second invention of the present application is the transport processing method of the first invention, wherein the base material is stretched over a plurality of rollers and is transported when a drive roller, which is at least one of the plurality of rollers, is driven to rotate by the motor, and the transport-related control value includes the rotational speed of the drive roller.

The third invention of the present application is the transport processing method of the first or second invention, wherein the ejection-related control value includes the ejection amount of the processing substance.

The fourth invention of the present application is the transport processing method of any one of the first to third inventions, wherein in step a), information on the thickness or type of the base material is further acquired, and in step b), the simplified simulation model is created, based on the results acquired in step a), so as to represent the relationship between the measured value and both the control value and the information on the base material.

The fifth invention of the present application is the transport processing method of any one of the first to fourth inventions, wherein in step a), information on the characteristics of the processing substance is further acquired, and in step b), the simplified simulation model is created, based on the results acquired in step a), so as to represent the relationship between the measured value and both the control value and the information on the processing substance.

The sixth invention of the present application is the transport processing method of any one of the first to fifth inventions, wherein in step a), information on environmental conditions including the temperature or humidity around the base material is further acquired, and in step b), the simplified simulation model is created, based on the results acquired in step a), so as to represent the relationship between the measured value and both the control value and the information on the environmental conditions.

The seventh invention of the present application is the transport processing method of any one of the first to sixth inventions, wherein in step b), the simplified simulation model includes a decision tree, and the parameters included in the decision tree are adjusted.
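As an illustration of the seventh invention, the following is a minimal sketch, not the method of the application, of how a decision tree could serve as the simplified simulation model. It fits a depth-1 regression tree (a single split) to pairs of control value and measured value, and "adjusting the parameters" is taken here to mean choosing the split threshold and the two leaf values. The names `fit_stump` and `predict` and the sample data are hypothetical.

```python
from statistics import mean


def fit_stump(samples):
    """Fit a depth-1 regression tree to (control_value, measured_value) pairs.

    Returns (threshold, left_value, right_value), chosen to minimise the
    sum of squared errors of the two leaves.
    """
    samples = sorted(samples)
    best = None
    for i in range(1, len(samples)):
        if samples[i][0] == samples[i - 1][0]:
            continue  # no threshold can separate identical control values
        threshold = (samples[i - 1][0] + samples[i][0]) / 2
        left = [y for _, y in samples[:i]]
        right = [y for _, y in samples[i:]]
        left_value, right_value = mean(left), mean(right)
        sse = sum((y - left_value) ** 2 for y in left)
        sse += sum((y - right_value) ** 2 for y in right)
        if best is None or sse < best[0]:
            best = (sse, threshold, left_value, right_value)
    if best is None:
        raise ValueError("need at least two distinct control values")
    return best[1:]


def predict(stump, control_value):
    """Return the leaf value for a control value."""
    threshold, left_value, right_value = stump
    return left_value if control_value <= threshold else right_value


# Hypothetical data: (motor rotational speed, measured transport speed).
data = [(100, 10.0), (110, 10.2), (200, 19.8), (210, 20.0)]
stump = fit_stump(data)
```

A practical model would use a deeper tree or an ensemble and more input features (paper type, ink characteristics, temperature, humidity); the single split only shows the kind of parameters that are tuned.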

The eighth invention of the present application is the transport processing method of any one of the first to seventh inventions, wherein the reinforcement learning performed in step c) is machine learning carried out by the PPO (Proximal Policy Optimization) or DQN (Deep Q-Network) technique.

According to the first to eighth inventions of the present application, a simplified simulation model is created on a computer from data collected in advance on the actual device, and reinforcement learning is performed on the computer using that model. The in-learning model obtained by this reinforcement learning on the computer is then transferred back to the actual device, where reinforcement learning continues. Reinforcement learning on the actual device can thus start from a reasonably advanced state that matches the behavior of the actual device, and a considerable amount of reinforcement learning can be performed on the computer. As a result, the time required for machine learning can be greatly reduced. Moreover, even if a control value becomes excessive in the initial stage of learning, damage to the actual device can be prevented, since that stage is carried out on the computer. Finally, by performing reinforcement learning on the actual device again at the end, a trained model that outputs more accurate control values can be created.
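The two-stage flow described above (reinforcement learning on a simplified model, then continued learning on the actual device) can be sketched as follows. This is a toy illustration under stated assumptions: tabular Q-learning is used as a simple stand-in for PPO or DQN, the "actual device" is itself a stand-in function, and the command grid, speed range, gains, and all names (`simplified_model`, `real_device`, `train`, `settle`) are hypothetical.

```python
import random

ACTIONS = (-1, 0, 1)      # lower, hold, or raise the motor command
COMMANDS = range(11)      # hypothetical discretised motor commands 0..10
LOW, HIGH = 4.6, 5.6      # hypothetical allowed range of the measured speed
CENTER = (LOW + HIGH) / 2


def simplified_model(command):
    """Stand-in for the simplified simulation model of step b)."""
    return 1.0 * command


def real_device(command):
    """Stand-in for the actual device, which differs slightly from the model."""
    return 0.9 * command


def reward(speed):
    """Zero inside the allowed range, increasingly negative outside it."""
    return 0.0 if LOW <= speed <= HIGH else -abs(speed - CENTER)


def train(q, plant, episodes, rng, alpha=0.5, gamma=0.9, epsilon=0.2):
    """Tabular Q-learning against `plant`; updates `q` in place."""
    for _ in range(episodes):
        command = rng.choice(COMMANDS)
        for _ in range(20):
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)          # explore
            else:
                action = max(ACTIONS, key=lambda a: q[(command, a)])
            nxt = min(max(command + action, 0), 10)   # clamp to the grid
            best_next = max(q[(nxt, a)] for a in ACTIONS)
            target = reward(plant(nxt)) + gamma * best_next
            q[(command, action)] += alpha * (target - q[(command, action)])
            command = nxt
    return q


def settle(q, command=0, steps=20):
    """Follow the greedy policy and return the command it ends on."""
    for _ in range(steps):
        action = max(ACTIONS, key=lambda a: q[(command, a)])
        command = min(max(command + action, 0), 10)
    return command


rng = random.Random(0)
q_table = {(c, a): 0.0 for c in COMMANDS for a in ACTIONS}
train(q_table, simplified_model, episodes=3000, rng=rng)  # step c): on the computer
pretrained_command = settle(q_table)
train(q_table, real_device, episodes=500, rng=rng)        # step d): on the device
final_command = settle(q_table)
```

In this toy setup the simplified model places the in-range command at 5 while the stand-in device places it at 6, so the second stage only has to correct a small mismatch, and the bulk of the exploration, including poor early commands, happens against the model rather than the device.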

A diagram showing the configuration of the image recording apparatus.
A partial top view of the image recording apparatus in the vicinity of the image recording unit.
A diagram schematically showing the structure of the edge position detection unit.
A flowchart showing the flow of pre-learning.
A diagram conceptually showing an example of a data set.
A diagram conceptually showing an example of a decision tree included in the simplified simulation model.
A block diagram conceptually showing reinforcement learning performed on the computer.
A flowchart showing the detailed flow of reinforcement learning on the computer.
A block diagram conceptually showing reinforcement learning performed on the actual device.
A flowchart showing the detailed flow of reinforcement learning on the actual device.
A graph showing an example of the first edge signal and an example of the second edge signal.

Embodiments of the present invention are described below with reference to the drawings. In one embodiment, an image recording apparatus that records an image on transported printing paper is described as an example of a transport processing apparatus, together with a method for keeping the measured value of the transport speed or tension of the printing paper within a predetermined range.

<1. First Embodiment>
<1-1. Configuration of the Image Recording Apparatus>
First, the overall configuration of the image recording apparatus 1, which is an example of the transport processing apparatus of the present invention, is described with reference to FIG. 1. FIG. 1 is a diagram showing the configuration of the image recording apparatus 1. The image recording apparatus 1 is an inkjet printing apparatus that records an image on printing paper 9, a long strip-shaped base material, by ejecting ink from a plurality of recording heads 21 to 24 toward the printing paper 9 while transporting the printing paper 9. As shown in FIG. 1, the image recording apparatus 1 includes a transport mechanism 10, an image recording unit 20, two edge position detection units 30, an encoder 40, a tension detection unit 50, an information acquisition unit 60, a control unit 80, and a computer 90 for simulation.

The transport mechanism 10 transports the printing paper 9 in a transport direction along its longitudinal direction. The transport mechanism 10 of this embodiment has a plurality of rollers, including an unwinding roller 11, a plurality of transport rollers 12, and a take-up roller 13, and one or more (three in this embodiment) motors 14. The printing paper 9 is stretched over the plurality of rollers. In this embodiment, a motor 14 is connected to each of the unwinding roller 11, one of the transport rollers 12 (the transport roller 121 in FIG. 1), and the take-up roller 13. The motors 14 are the drive source for transporting the printing paper 9. The unwinding roller 11, the transport roller 121, and the take-up roller 13 are driven by the motors 14 and each rotate about their rotation axes. Each transport roller 12 guides the printing paper 9 toward the downstream side of the transport path by rotating about its rotation axis. The printing paper 9 is thereby unwound from the unwinding roller 11 and transported along the transport path formed by the transport rollers 12. After transport, the printing paper 9 is collected on the take-up roller 13.

That is, in this embodiment, the unwinding roller 11, the transport roller 121, and the take-up roller 13 among the plurality of rollers are drive rollers connected to the motors 14. The rollers to which the motors 14 are connected are not limited to these, however. It suffices that the printing paper 9 is transported when a drive roller, which is at least one of the plurality of rollers, is driven to rotate by a motor 14.

As shown in FIG. 1, the printing paper 9 moves below the recording heads 21 to 24, described later, substantially parallel to the direction in which the recording heads 21 to 24 are arranged. The surface (recording surface) of the printing paper 9 faces upward, toward the recording heads 21 to 24. The printing paper 9 is stretched over the transport rollers 12 under tension, which suppresses slack and wrinkles in the printing paper 9 during transport.

The image recording unit 20 is a processing unit that ejects droplets of ink (hereinafter referred to as "ink droplets") onto the printing paper 9 transported by the transport mechanism 10. The image recording unit 20 of this embodiment has a first recording head 21, a second recording head 22, a third recording head 23, and a fourth recording head 24, which are arranged along the transport path of the printing paper 9.

FIG. 2 is a partial top view of the image recording apparatus 1 in the vicinity of the image recording unit 20. Each of the four recording heads 21 to 24 spans the entire width of the printing paper 9. As shown by broken lines in FIG. 2, a plurality of nozzles 250 arranged parallel to the width direction of the printing paper 9 are provided on the lower surface of each of the recording heads 21 to 24. Each of the recording heads 21 to 24 ejects ink droplets of one of the colors K (black), C (cyan), M (magenta), and Y (yellow), which are the color components of the multicolor image, from the nozzles 250 toward the upper surface of the printing paper 9.

That is, the first recording head 21 ejects K ink droplets onto the upper surface of the printing paper 9 at a first processing position P1 on the transport path. The second recording head 22 ejects C ink droplets onto the upper surface of the printing paper 9 at a second processing position P2 downstream of the first processing position P1. The third recording head 23 ejects M ink droplets onto the upper surface of the printing paper 9 at a third processing position P3 downstream of the second processing position P2. The fourth recording head 24 ejects Y ink droplets onto the upper surface of the printing paper 9 at a fourth processing position P4 downstream of the third processing position P3. In this embodiment, the first processing position P1, the second processing position P2, the third processing position P3, and the fourth processing position P4 are arranged at equal intervals along the transport direction of the printing paper 9.
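Because the four processing positions are equally spaced along the transport direction, the ejection timing of each head relative to the first can be illustrated with simple arithmetic: a point on the paper reaches head i after traveling i times the head spacing. The sketch below is an illustration only; the spacing and speed values and the function names are hypothetical, and a constant transport speed between heads is assumed.

```python
def firing_delays(head_spacing_m, transport_speed_m_s, head_count=4):
    """Delay of each head relative to the first, for equally spaced heads."""
    if transport_speed_m_s <= 0:
        raise ValueError("transport speed must be positive")
    return [i * head_spacing_m / transport_speed_m_s for i in range(head_count)]


def landing_error_m(head_spacing_m, nominal_speed_m_s, actual_speed_m_s, head_index):
    """Shift in the transport direction when the delay was computed for the
    nominal speed but the paper actually moves at a different speed."""
    delay = head_index * head_spacing_m / nominal_speed_m_s
    return actual_speed_m_s * delay - head_index * head_spacing_m


# Hypothetical values: heads 0.5 m apart, nominal transport speed 2.0 m/s.
delays = firing_delays(0.5, 2.0)
error = landing_error_m(0.5, 2.0, 2.02, head_index=3)
```

With these assumed numbers, a 1 % speed deviation shifts the fourth head's droplets by about 15 mm relative to the first head's image, which is why keeping the transport speed within a narrow range matters for color registration.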

The four recording heads 21 to 24 each record a single-color image on the upper surface of the printing paper 9 by ejecting ink droplets. The four single-color images are superimposed to form a multicolor image on the upper surface of the printing paper 9. If the positions, in the transport direction, of the ink droplets ejected from the four recording heads 21 to 24 are shifted relative to one another on the printing paper 9, the image quality of the printed matter deteriorates. Keeping this positional error between the single-color images on the printing paper 9 within an allowable range is therefore an important factor in improving the print quality of the image recording apparatus 1.

A drying unit that dries the ink ejected onto the recording surface of the printing paper 9 may additionally be provided downstream of the recording heads 21 to 24 in the transport direction. The drying unit dries the ink by, for example, blowing heated gas toward the printing paper 9 to vaporize the solvent in the ink adhering to the printing paper 9. The drying unit may instead dry the ink by another method, such as heating with a heat roller or light irradiation.

Each of the two edge position detection units 30 detects the position, in the width direction, of an edge 91 (an end in the width direction) of the printing paper 9. In this embodiment, the edge position detection units 30 are arranged at a first detection position Pa, which is upstream of the first processing position P1 on the transport path, and at a second detection position Pb, which is spaced downstream from the first detection position Pa on the transport path and lies further downstream than the fourth processing position P4. The edge position detection units 30 need not necessarily be provided, however.

FIG. 3 is a diagram schematically showing the structure of the edge position detection unit 30. As shown in FIG. 3, the edge position detection unit 30 has a light projector 301 located above the edge 91 of the printing paper 9 and a line sensor 302 located below the edge 91. The light projector 301 emits parallel light downward. The line sensor 302 has a plurality of light receiving elements 321 arranged in the width direction. As shown in FIG. 3, outside the edge 91 of the printing paper 9, the light emitted from the light projector 301 reaches the light receiving elements 321, which detect it. Inside the edge 91, the light emitted from the light projector 301 is blocked by the printing paper 9, so the light receiving elements 321 do not detect it. The edge position detection unit 30 detects the position of the edge 91 of the printing paper 9 in the width direction based on which of the light receiving elements 321 detect light.
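The detection principle described above can be sketched as a search for the boundary between lit and shaded light receiving elements. The following is a minimal sketch under assumptions not stated in the source: the elements are listed from the outer side toward the center of the paper, they have a uniform pitch, and the readings are noise-free. The function name `edge_position` and the pitch value are hypothetical.

```python
def edge_position(lit, pitch_mm):
    """Estimate the edge position from per-element light detection.

    `lit` lists the light receiving elements from the outer side toward the
    paper center; True means the element detects light (not covered by paper).
    Returns the distance in mm from the outer end of the sensor to the edge,
    or None when the edge is not within the sensor's field.
    """
    lit_count = 0
    for is_lit in lit:
        if not is_lit:
            break
        lit_count += 1
    if lit_count == 0 or lit_count == len(lit):
        return None  # sensor fully covered or fully exposed
    return lit_count * pitch_mm


# Hypothetical reading: three exposed elements, then paper; 0.5 mm pitch.
position = edge_position([True, True, True, False, False], pitch_mm=0.5)
```

The resolution of this estimate is one element pitch; sampling it repeatedly over time yields the kind of time series that an edge signal represents.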

As shown in FIGS. 1 and 2, the edge position detection unit 30 arranged at the first detection position Pa is hereinafter referred to as the first edge position detection unit 31, and the edge position detection unit 30 arranged at the second detection position Pb is referred to as the second edge position detection unit 32. The first edge position detection unit 31 intermittently detects the position of the edge 91 of the printing paper 9 in the width direction at the first detection position Pa. It thereby obtains a detection result indicating how the width-direction position of the edge 91 at the first detection position Pa changes over time, and outputs a detection signal representing that result (hereinafter referred to as the "first edge signal Ed1") to the control unit 80. The second edge position detection unit 32 intermittently detects the position of the edge 91 of the printing paper 9 in the width direction at the second detection position Pb. It thereby obtains a detection result indicating how the width-direction position of the edge 91 at the second detection position Pb changes over time, and outputs a detection signal representing that result (hereinafter referred to as the "second edge signal Ed2") to the control unit 80. The first edge position detection unit 31 and the second edge position detection unit 32 may instead each detect the width-direction position of the edge 91 of the printing paper 9 continuously.

The encoder 40 is attached to the shaft of one of the transport rollers 12 (the transport roller 122 in FIG. 1). The roller to which the encoder 40 is attached is not limited to the transport roller 122, however. It suffices that the encoder 40 is attached to at least one of the rollers included in the transport mechanism 10. In this embodiment, the encoder 40 detects the amount of rotation of the transport roller 122 and outputs a continuous pulse signal En synchronized with the rotation of the transport roller 122 to the control unit 80. The continuous pulse signal En reflects the change over time in the transport speed of the printing paper 9 transported by the transport rollers 12, including the transport roller 122. Based on the input continuous pulse signal En, a speed calculation unit 81 of the control unit 80, described later, calculates a measured value Ms of the transport speed of the printing paper 9 transported by the transport rollers 12, including the transport roller 122.
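One common way to turn an encoder pulse train into a transport speed is to count pulses over a fixed interval and convert revolutions into surface travel. The source does not specify how the speed calculation unit 81 does this, so the following is only a sketch under assumptions: the paper does not slip on the roller, and the encoder resolution, roller diameter, and function name are hypothetical.

```python
import math


def transport_speed_m_s(pulse_count, interval_s, pulses_per_rev, roller_diameter_m):
    """Estimate paper transport speed from encoder pulses counted over an interval.

    Assumes the paper moves with the roller surface (no slip).
    """
    if interval_s <= 0 or pulses_per_rev <= 0:
        raise ValueError("interval and pulses per revolution must be positive")
    revolutions = pulse_count / pulses_per_rev
    surface_travel_m = revolutions * math.pi * roller_diameter_m
    return surface_travel_m / interval_s


# Hypothetical values: 1000 pulses per revolution, 0.1 m roller,
# 1000 pulses counted in one second (one revolution per second).
speed = transport_speed_m_s(1000, 1.0, 1000, 0.1)
```

Any slip between the roller and the paper makes this estimate differ from the true paper speed, which is one reason a learned compensation can be useful.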

The tension detection unit 50 is attached to one of the transport rollers 12 (the transport roller 123 in FIG. 1). The roller to which the tension detection unit 50 is attached is not limited to the transport roller 123, however. It suffices that the tension detection unit 50 is attached to at least one of the rollers included in the transport mechanism 10. In this embodiment, the tension detection unit 50 continuously or intermittently measures the force that the transport roller 123 receives from the printing paper 9. The tension detection unit 50 thereby detects the tension applied to the printing paper 9 and outputs a tension signal Te representing the detection result to the control unit 80. The tension signal Te reflects the change over time in the tension applied to the printing paper 9, which is transported by the transport rollers 12 while in contact with the transport roller 123. The tension detection unit 50 need not necessarily be provided, however.
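The source does not state how the force on the roller is converted into a tension value. As one possible illustration, for an ideal roller over which the paper wraps by an angle θ with equal tension T on both sides, the resultant force on the roller is F = 2T sin(θ/2), so T = F / (2 sin(θ/2)). The sketch below applies that relation; the wrap angle, the neglect of roller weight and bearing friction, and the function name are all assumptions.

```python
import math


def tension_from_roller_force(force_n, wrap_angle_deg):
    """Web tension from the resultant force on an ideal roller.

    Uses F = 2 * T * sin(theta / 2), neglecting roller weight and friction.
    """
    if not 0 < wrap_angle_deg <= 180:
        raise ValueError("wrap angle must be in (0, 180] degrees")
    half_angle = math.radians(wrap_angle_deg) / 2
    return force_n / (2 * math.sin(half_angle))


# Hypothetical reading: 100 N on a roller with a 180 degree wrap.
tension = tension_from_roller_force(100.0, 180.0)
```

With a 180 degree wrap the roller carries both spans of the paper, so the tension is half the measured force; smaller wrap angles give a smaller force for the same tension.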

The information acquisition unit 60 is a device that acquires information on various set values and conditions in the image recording apparatus 1. It includes an input interface such as a touch panel. Through this input interface, an operator enters information (hereinafter referred to as "information Sc") on, for example, the type or characteristics of the ink ejected from the recording heads 21 to 24 of the image recording unit 20, environmental conditions including the temperature or humidity around the printing paper 9, and the type, shape, or thickness of the printing paper 9. The information acquisition unit 60 thereby acquires the information Sc. The information acquisition unit 60 may, however, have its own sensors, such as a thermometer or hygrometer, and may acquire the information Sc directly through those sensors. It suffices that the information acquisition unit 60 acquires at least one of the items of information described above, and it may also acquire information other than those items. A signal representing the information Sc acquired by the information acquisition unit 60 is transmitted to the control unit 80.

制御部８０は、画像記録装置１内の各部を動作制御するための手段である。図１中に概念的に示したように、制御部８０は、ＣＰＵ等のプロセッサ８０１、ＲＡＭ等のメモリ８０２、およびハードディスクドライブ等の記憶部８０３を有する。また、記憶部８０３内には、画像記録装置１を動作制御するためのプログラム８０Ｐおよびデータ８０Ｄが、記憶されている。また、図１中に破線で示したように、制御部８０は、搬送機構１０、４つの記録ヘッド２１〜２４、２つのエッジ位置検出部３０、エンコーダ４０、張力検出部５０、および情報取得部６０と、それぞれ電気的に接続されている。また、制御部８０は、さらにコンピュータ９０と電気的に接続されてもよい。 The control unit 80 is a means for controlling the operation of each unit in the image recording device 1. As conceptually shown in FIG. 1, the control unit 80 includes a processor 801 such as a CPU, a memory 802 such as a RAM, and a storage unit 803 such as a hard disk drive. A program 80P and data 80D for controlling the operation of the image recording device 1 are stored in the storage unit 803. As shown by the broken lines in FIG. 1, the control unit 80 is electrically connected to each of the transport mechanism 10, the four recording heads 21 to 24, the two edge position detection units 30, the encoder 40, the tension detection unit 50, and the information acquisition unit 60. The control unit 80 may further be electrically connected to the computer 90.

制御部８０は、記憶部８０３に記憶されたプログラム８０Ｐやデータ８０Ｄをメモリ８０２に読み出し、プログラム８０Ｐおよびデータ８０Ｄに基づいて、プロセッサ８０１が演算処理を行うことにより、画像記録装置１内の上記各部を動作制御する。これにより、画像記録装置１における印刷処理が進行する。 The control unit 80 reads the program 80P and the data 80D stored in the storage unit 803 into the memory 802, and the processor 801 performs arithmetic processing based on the program 80P and the data 80D, thereby controlling the operation of each of the above units in the image recording device 1. As a result, the printing process in the image recording device 1 proceeds.

また、記憶部８０３には、後述する事前学習により取得された学習済みモデルＭ３が、記憶されている。制御部８０は、学習済みモデルＭ３から出力される制御値に基づき、３つのモータ１４をそれぞれ回転させて印刷用紙９を搬送し、かつ、記録ヘッド２１〜２４からインクを吐出して印刷を行う。これにより、画像記録装置１において印刷処理を行う際に、印刷用紙９の搬送速度が所定の範囲に維持される。 The storage unit 803 also stores the learned model M3 obtained by the pre-learning described later. Based on the control values output from the learned model M3, the control unit 80 rotates each of the three motors 14 to transport the printing paper 9 and ejects ink from the recording heads 21 to 24 to perform printing. As a result, the transport speed of the printing paper 9 is maintained within a predetermined range while the image recording device 1 performs the printing process.

コンピュータ９０は、上記の印刷用紙９の搬送経路とは別に設けられる。コンピュータ９０は、市販のパーソナルコンピュータ（パソコン）の形態を有している。また、コンピュータ９０は、図示を省略したモニタ、キーボード、ポインティングデバイス、またはタッチパネル等を有していてもよい。コンピュータ９０は、ＣＰＵ等のプロセッサ９０１、ＲＡＭ等のメモリ９０２、およびハードディスクドライブ等の記憶部９０３を有する。記憶部９０３内には、コンピュータ９０に所定の処理を実行させるためのプログラム９０Ｐおよびデータ９０Ｄが、記憶されている。ただし、コンピュータ９０は、上記の制御部８０に組み込まれていてもよい。 The computer 90 is provided separately from the transfer path of the printing paper 9. The computer 90 has the form of a commercially available personal computer (personal computer). Further, the computer 90 may have a monitor, a keyboard, a pointing device, a touch panel, or the like (not shown). The computer 90 has a processor 901 such as a CPU, a memory 902 such as a RAM, and a storage unit 903 such as a hard disk drive. The program 90P and the data 90D for causing the computer 90 to execute a predetermined process are stored in the storage unit 903. However, the computer 90 may be incorporated in the control unit 80 described above.

＜１−２．事前学習について＞ <1-2. Pre-learning>
続いて、画像記録装置１において印刷処理を行う前に予め実行される事前学習について、説明する。なお、事前学習のうち、後述するステップＳ１およびステップＳ４は、実際に画像記録装置１を駆動しつつ行う。ステップＳ１またはステップＳ４を行う際に用いられる、後述する速度算出部８１および判定部８００の機能は、記憶部８０３に記憶されたプログラム８０Ｐおよびデータ８０Ｄをメモリ８０２に一時的に読み出し、当該プログラム８０Ｐおよびデータ８０Ｄに基づいて、プロセッサ８０１が演算処理を行うことによって、実現される。また、後述するステップＳ２およびステップＳ３は、コンピュータ９０上で行う。ステップＳ２およびステップＳ３を行う際に用いられる、後述する判定部９００の機能は、記憶部９０３に記憶されたプログラム９０Ｐおよびデータ９０Ｄをメモリ９０２に一時的に読み出し、当該プログラム９０Ｐおよびデータ９０Ｄに基づいて、プロセッサ９０１が演算処理を行うことによって、実現される。 Next, the pre-learning that is executed before the printing process is performed in the image recording device 1 will be described. Of the pre-learning, steps S1 and S4, described later, are performed while actually driving the image recording device 1. The functions of the speed calculation unit 81 and the determination unit 800, described later, which are used in step S1 or step S4, are realized by temporarily reading the program 80P and the data 80D stored in the storage unit 803 into the memory 802 and having the processor 801 perform arithmetic processing based on them. Steps S2 and S3, described later, are performed on the computer 90. The function of the determination unit 900, described later, which is used in steps S2 and S3, is realized by temporarily reading the program 90P and the data 90D stored in the storage unit 903 into the memory 902 and having the processor 901 perform arithmetic processing based on them.

図４は、事前学習の流れを示すフローチャートである。図４に示すように、事前学習を行う際には、まず、画像記録装置１において実際に印刷用紙９を搬送しながら、印刷用紙９の表面にインクを吐出しつつ、画像記録装置１の各部における様々な計測値を複数取得する（ステップＳ１）。具体的には、搬送に係る制御値ＣＶｄを複数パターン変えながら、実際に印刷用紙９を搬送する。搬送に係る制御値ＣＶｄは、例えば、巻き出しローラ１１、搬送ローラ１２１、および巻き取りローラ１３にそれぞれ接続されるモータ１４の回転数（駆動ローラの回転速度）である。同時に、インクの吐出に係る制御値ＣＶｅを複数パターン変えながら、実際に印刷用紙９の表面にインクを吐出する。インクの吐出に係る制御値ＣＶｅは、例えば、記録ヘッド２１〜２４からのインクの吐出タイミング（各ノズル２５０からのインク滴の吐出タイミング）である。なお、搬送に係る制御値ＣＶｄは、制御部８０から各モータ１４に入力される。インクの吐出に係る制御値ＣＶｅは、制御部８０から記録ヘッド２１〜２４に入力される。 FIG. 4 is a flowchart showing the flow of pre-learning. As shown in FIG. 4, pre-learning begins by acquiring a plurality of various measured values at each part of the image recording device 1 while actually transporting the printing paper 9 in the image recording device 1 and ejecting ink onto its surface (step S1). Specifically, the printing paper 9 is actually transported while the control value CVd related to transport is varied over a plurality of patterns. The control value CVd is, for example, the rotation speed of the motors 14 connected to the unwinding roller 11, the transport roller 121, and the take-up roller 13, respectively (that is, the rotational speed of the drive rollers). At the same time, ink is actually ejected onto the surface of the printing paper 9 while the control value CVe related to ink ejection is varied over a plurality of patterns. The control value CVe is, for example, the timing of ink ejection from the recording heads 21 to 24 (the timing of ink droplet ejection from each nozzle 250). The control value CVd is input from the control unit 80 to each motor 14, and the control value CVe is input from the control unit 80 to the recording heads 21 to 24.

なお、ステップＳ１において、モータ１４の回転数は、時間に依らず一定であってもよいし、時間の経過とともに変化してもよい。また、インクの吐出タイミングは、規則的であってもよいし、不規則であってもよい。また、吐出に係る制御値ＣＶｅとして、インクの吐出タイミングの代わりに、またはインクの吐出タイミングに加えて、記録ヘッド２１〜２４からのインクの吐出量（各ノズル２５０からのインク滴の吐出量）を複数パターン変えながら、実際に印刷用紙９の表面にインクを吐出してもよい。また、モータ１４の回転数を過大にして複数のローラの表面と印刷用紙９との間のスリップを生じさせたり、インクの吐出量を過大にして印刷用紙９を大きく膨張させたりしてもよい。さらに、画像記録装置１の電源をオフにして、モータ１４を駆動させない場合や、インクを吐出しない場合を含めた、多岐に亘るパターンを実施してもよい。すなわち、搬送に係る制御値ＣＶｄは、少なくともモータ１４の回転数に係る情報を含んでいれば、自由に設定できる。また、インクの吐出に係る制御値ＣＶｅは、少なくとも記録ヘッド２１〜２４からのインクの吐出タイミングに係る情報を含んでいれば、自由に設定できる。さらに、搬送に係る制御値ＣＶｄおよび吐出に係る制御値ＣＶｅの一方は、制御値として扱わず、予め定めた所定の値としてもよい。 In step S1, the rotation speed of the motors 14 may be constant over time or may change over time. The ink ejection timing may be regular or irregular. As the control value CVe related to ejection, the amount of ink ejected from the recording heads 21 to 24 (the amount of each ink droplet ejected from each nozzle 250) may be varied over a plurality of patterns instead of, or in addition to, the ink ejection timing while ink is actually ejected onto the surface of the printing paper 9. The rotation speed of the motors 14 may also be made excessive so as to cause slip between the roller surfaces and the printing paper 9, or the ink ejection amount may be made excessive so as to swell the printing paper 9 significantly. Furthermore, a wide variety of patterns may be run, including cases in which the image recording device 1 is powered off so that the motors 14 are not driven, and cases in which no ink is ejected. In other words, the control value CVd related to transport can be set freely as long as it includes at least information on the rotation speed of the motors 14. Likewise, the control value CVe related to ink ejection can be set freely as long as it includes at least information on the timing of ink ejection from the recording heads 21 to 24. Furthermore, one of the control value CVd and the control value CVe may be fixed at a predetermined value rather than treated as a control value.

ステップＳ１では、実際に印刷用紙９を搬送しながら、印刷用紙９の表面にインクを吐出しつつ、エンコーダ４０から制御部８０へ連続パルス信号Ｅｎを入力する。制御部８０は、速度算出部８１（後述する図９参照）を有する。制御部８０は、入力された連続パルス信号Ｅｎに基づいて、速度算出部８１において、印刷用紙９の搬送速度の計測値Ｍｓを算出する。 In step S1, a continuous pulse signal En is input from the encoder 40 to the control unit 80 while actually conveying the printing paper 9 and ejecting ink to the surface of the printing paper 9. The control unit 80 has a speed calculation unit 81 (see FIG. 9 described later). The control unit 80 calculates the measured value Ms of the transport speed of the printing paper 9 in the speed calculation unit 81 based on the input continuous pulse signal En.
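The calculation performed by the speed calculation unit 81 can be illustrated with a minimal Python sketch. It assumes that the encoder 40 is a rotary encoder on a roller that turns without slipping against the paper; the function name, the pulses-per-revolution figure, and the roller diameter are hypothetical and do not come from the specification.

```python
import math

def transport_speed_mm_per_s(pulse_count, interval_s, pulses_per_rev, roller_diameter_mm):
    """Estimate the transport speed Ms from pulses counted in one sampling interval.

    Assumes a rotary encoder on a roller that rotates without slip (an assumption
    made for this sketch, not a statement about the encoder 40).
    """
    if interval_s <= 0 or pulses_per_rev <= 0:
        raise ValueError("interval_s and pulses_per_rev must be positive")
    revolutions = pulse_count / pulses_per_rev
    circumference_mm = math.pi * roller_diameter_mm
    return revolutions * circumference_mm / interval_s

# Hypothetical numbers: 500 pulses in 0.1 s, 1000 pulses per revolution, 100 mm roller.
ms = transport_speed_mm_per_s(500, 0.1, 1000, 100.0)
```

Counting pulses over a fixed interval is only one way to turn the signal En into a speed; measuring the time between successive pulses would be an equally plausible choice at low speeds.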

ただし、ステップＳ１では、印刷用紙９の搬送速度の計測値Ｍｓ以外の計測値をさらに複数取得してもよい。例えば、搬送される印刷用紙９の加速度や振動数の計測値を取得してもよい。 However, in step S1, a plurality of measured values other than the measured value Ms of the transport speed of the printing paper 9 may be acquired. For example, the measured values of the acceleration and the frequency of the printed paper 9 to be conveyed may be acquired.

また、ステップＳ１では、第１エッジ位置検出部３１が、第１検出位置Ｐａにおけるエッジ９１の幅方向の位置の経時変化を検出し、その検出結果を示す第１エッジ信号Ｅｄ１を、制御部８０へ出力する。本実施形態では、制御部８０は、第１エッジ信号Ｅｄ１を第１参照値ＣＦ１として取得する。また、第２エッジ位置検出部３２が、第２検出位置Ｐｂにおけるエッジ９１の幅方向の位置の経時変化を検出し、その検出結果を示す第２エッジ信号Ｅｄ２を、制御部８０へ出力する。本実施形態では、制御部８０は、第２エッジ信号Ｅｄ２を第２参照値ＣＦ２として取得する。また、ステップＳ１では、張力検出部５０が、印刷用紙９に加わる張力の経時変化を検出し、その検出結果を示す張力信号Ｔｅを、制御部８０へ出力する。本実施形態では、制御部８０は、張力信号Ｔｅを第３参照値ＣＦ３として取得する。ただし、制御部８０は、第１参照値ＣＦ１、第２参照値ＣＦ２、または第３参照値ＣＦ３を取得しなくてもよい。 Further, in step S1, the first edge position detection unit 31 detects a change with time in the position of the edge 91 in the width direction at the first detection position Pa, and outputs the first edge signal Ed1 indicating the detection result to the control unit 80. Output to. In the present embodiment, the control unit 80 acquires the first edge signal Ed1 as the first reference value CF1. Further, the second edge position detection unit 32 detects a change with time in the position of the edge 91 in the width direction at the second detection position Pb, and outputs a second edge signal Ed2 indicating the detection result to the control unit 80. In the present embodiment, the control unit 80 acquires the second edge signal Ed2 as the second reference value CF2. Further, in step S1, the tension detection unit 50 detects a change in tension applied to the printing paper 9 with time, and outputs a tension signal Te indicating the detection result to the control unit 80. In the present embodiment, the control unit 80 acquires the tension signal Te as the third reference value CF3. However, the control unit 80 does not have to acquire the first reference value CF1, the second reference value CF2, or the third reference value CF3.

さらに、ステップＳ１では、情報取得部６０によって、インクの種類または特性（インクの情報）、印刷用紙９の周囲の温度または湿度を含む環境条件、および印刷用紙９の種類、形状、または厚み（印刷用紙９の情報）に係る情報Ｓｃが取得される。情報取得部６０によって取得された情報Ｓｃに係る信号は、制御部８０へ送信される。 Further, in step S1, the information acquisition unit 60 acquires the information Sc on the type or characteristics of the ink (ink information), environmental conditions including the temperature or humidity around the printing paper 9, and the type, shape, or thickness of the printing paper 9 (information on the printing paper 9). A signal representing the information Sc acquired by the information acquisition unit 60 is transmitted to the control unit 80.

取得された印刷用紙９の搬送速度の計測値Ｍｓは、制御部８０の記憶部８０３において、搬送に係る制御値ＣＶｄおよび吐出に係る制御値ＣＶｅと、上記の第１参照値ＣＦ１、第２参照値ＣＦ２、または第３参照値ＣＦ３と、情報Ｓｃとに関連付けられ、データ集合体ＤＡとして蓄積される。図５は、データ集合体ＤＡの例を概念的に示した図である。ただし、データ集合体ＤＡは、作業員によってコンピュータ９０等において手動で記録されてもよい。 The acquired measured value Ms of the transport speed of the printing paper 9 is associated, in the storage unit 803 of the control unit 80, with the control value CVd related to transport and the control value CVe related to ejection, with the first reference value CF1, the second reference value CF2, or the third reference value CF3 described above, and with the information Sc, and is accumulated as a data aggregate DA. FIG. 5 is a diagram conceptually showing an example of the data aggregate DA. However, the data aggregate DA may instead be recorded manually by a worker on the computer 90 or the like.
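One possible in-memory layout for a row of the data aggregate DA is sketched below in Python. The class name, field names, and the sample numbers are illustrative assumptions; the specification only states which quantities are associated with each other, not how they are stored.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple

@dataclass(frozen=True)
class DARecord:
    """One row of the data aggregate DA (names are hypothetical)."""
    cvd_motor_speeds: Tuple[float, float, float]  # CVd: one value per motor 14
    cve_ejection_timing: float                    # CVe
    ms_transport_speed: float                     # Ms, later used as teacher data
    cf1_edge_first: Optional[float] = None        # CF1, may be absent
    cf2_edge_second: Optional[float] = None       # CF2, may be absent
    cf3_tension: Optional[float] = None           # CF3, may be absent
    sc: Optional[Dict[str, str]] = None           # Sc: ink, environment, paper

def to_features(record: DARecord) -> List[float]:
    """Flatten the numeric inputs; a missing reference value becomes 0.0
    (a simplification chosen for this sketch)."""
    refs = (record.cf1_edge_first, record.cf2_edge_second, record.cf3_tension)
    return [*record.cvd_motor_speeds, record.cve_ejection_timing,
            *(0.0 if r is None else r for r in refs)]

# Made-up sample rows, for illustration only.
data_aggregate: List[DARecord] = [
    DARecord((120.0, 118.0, 121.0), 0.5, 59.2, cf3_tension=14.8),
    DARecord((240.0, 236.0, 242.0), 0.5, 118.9),
]
features = [to_features(r) for r in data_aggregate]
```

Keeping the reference values optional mirrors the statement that the control unit 80 does not have to acquire CF1, CF2, or CF3.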

データ集合体ＤＡとして十分な数のデータが蓄積されると、作業員はコンピュータ９０上で、蓄積された制御値ＣＶｄ，ＣＶｅ、計測値Ｍｓ、参照値ＣＦ１，ＣＦ２，ＣＦ３、および情報Ｓｃに基づいて、制御値ＣＶｄ，ＣＶｅ、参照値ＣＦ１，ＣＦ２，ＣＦ３、および情報Ｓｃと、計測値Ｍｓとの関係を示す簡易シミュレーションモデルＳｉＭｏを作成する（ステップＳ２）。具体的には、印刷用紙９の搬送速度の計測値Ｍｓを教師データ（正解のデータ）としつつ、制御値ＣＶｄ，ＣＶｅ、参照値ＣＦ１，ＣＦ２，ＣＦ３、および情報Ｓｃから、印刷用紙９の搬送速度を高精度に算出するための簡易シミュレーションモデルＳｉＭｏを機械学習する。 Once a sufficient amount of data has been accumulated as the data aggregate DA, the worker creates, on the computer 90, a simple simulation model SiMo that represents the relationship between the control values CVd and CVe, the reference values CF1, CF2, and CF3, and the information Sc on the one hand and the measured value Ms on the other, based on the accumulated control values CVd and CVe, measured values Ms, reference values CF1, CF2, and CF3, and information Sc (step S2). Specifically, the simple simulation model SiMo is trained by machine learning to calculate the transport speed of the printing paper 9 with high accuracy from the control values CVd and CVe, the reference values CF1, CF2, and CF3, and the information Sc, using the measured value Ms of the transport speed of the printing paper 9 as teacher data (ground-truth data).

簡易シミュレーションモデルＳｉＭｏは、決定木Ｘ（ａ，ｂ，ｃ，ｆ（ＣＶｄ，ＣＶｅ，ＣＦ１，ＣＦ２，ＣＦ３，Ｓｃ）…）を含む。図６は、本実施形態の決定木Ｘ（ａ，ｂ，ｃ，ｆ（ＣＶｄ，ＣＶｅ，ＣＦ１，ＣＦ２，ＣＦ３，Ｓｃ）…）の例を概念的に示した図である。ステップＳ２では、機械学習において、入力された制御値ＣＶｄ，ＣＶｅ、計測値Ｍｓ、参照値ＣＦ１，ＣＦ２，ＣＦ３、および情報Ｓｃに基づいて算出した印刷用紙９の搬送速度の算出値ＣＡｓと、データ集合体ＤＡに蓄積された対応する計測値Ｍｓとの差異を最小化するように、決定木Ｘ（ａ，ｂ，ｃ，ｆ（ＣＶｄ，ＣＶｅ，ＣＦ１，ＣＦ２，ＣＦ３，Ｓｃ）…）に含まれる複数のパラメータ（ａ，ｂ，ｃ，ｆ（ＣＶｄ，ＣＶｅ，ＣＦ１，ＣＦ２，ＣＦ３，Ｓｃ）…）を調整しつつ更新保存していく。 The simple simulation model SiMo includes decision trees X (a, b, c, f (CVd, CVe, CF1, CF2, CF3, Sc) ...). FIG. 6 is a diagram conceptually showing an example of the decision tree X (a, b, c, f (CVd, CVe, CF1, CF2, CF3, Sc) ...) Of the present embodiment. In step S2, in machine learning, the calculated value CAs of the transfer speed of the printing paper 9 calculated based on the input control values CVd, CVe, measured value Ms, reference values CF1, CF2, CF3, and information Sc, and data. Included in the decision tree X (a, b, c, f (CVd, CVe, CF1, CF2, CF3, Sc) ...) so as to minimize the difference from the corresponding measured value Ms accumulated in the aggregate DA. The plurality of parameters (a, b, c, f (CVd, CVe, CF1, CF2, CF3, Sc) ...) Are adjusted and updated and saved.

簡易シミュレーションモデルＳｉＭｏから出力される算出値ＣＡｓとデータ集合体ＤＡに蓄積された対応する計測値Ｍｓとの差異が、所定値以下になると、機械学習が完了する。これにより、簡易シミュレーションモデルＳｉＭｏを用いて、入力された制御値ＣＶｄ，ＣＶｅ、計測値Ｍｓ、参照値ＣＦ１，ＣＦ２，ＣＦ３、および情報Ｓｃに基づいて、印刷用紙９の搬送速度を高精度に算出することが可能となる。ただし、簡易シミュレーションモデルＳｉＭｏに入力される値として、制御値ＣＶｄ，ＣＶｅ、計測値Ｍｓ、参照値ＣＦ１，ＣＦ２，ＣＦ３、および情報Ｓｃの一部が省略されてもよい。 Machine learning is completed when the difference between the calculated value CAs output from the simple simulation model SiMo and the corresponding measured value Ms stored in the data aggregate DA is equal to or less than a predetermined value. As a result, the transfer speed of the printing paper 9 is calculated with high accuracy based on the input control values CVd, CVe, measured values Ms, reference values CF1, CF2, CF3, and information Sc using the simple simulation model SiMo. It becomes possible to do. However, as the values input to the simple simulation model SiMo, the control values CVd, CVe, the measured values Ms, the reference values CF1, CF2, CF3, and a part of the information Sc may be omitted.
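Step S2 can be illustrated with a small regression tree written in plain Python. This is a generic greedy squared-error tree, used here as a stand-in; it is not the decision tree X of FIG. 6, and the function names, the two-column feature layout, and the sample values are assumptions made for the sketch.

```python
def _sse(values):
    """Sum of squared deviations from the mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)

def fit_tree(rows, targets, max_depth=3):
    """Fit a small regression tree by greedy squared-error splits."""
    if max_depth == 0 or len(set(targets)) == 1:
        return {"leaf": sum(targets) / len(targets)}
    best = None  # (error, feature index, threshold)
    for j in range(len(rows[0])):
        values = sorted(set(row[j] for row in rows))
        for lo, hi in zip(values, values[1:]):
            thr = (lo + hi) / 2
            left = [t for row, t in zip(rows, targets) if row[j] <= thr]
            right = [t for row, t in zip(rows, targets) if row[j] > thr]
            if not left or not right:
                continue
            err = _sse(left) + _sse(right)
            if best is None or err < best[0]:
                best = (err, j, thr)
    if best is None:  # no feature separates the rows
        return {"leaf": sum(targets) / len(targets)}
    _, j, thr = best
    left = [(row, t) for row, t in zip(rows, targets) if row[j] <= thr]
    right = [(row, t) for row, t in zip(rows, targets) if row[j] > thr]
    return {
        "feature": j,
        "threshold": thr,
        "left": fit_tree([r for r, _ in left], [t for _, t in left], max_depth - 1),
        "right": fit_tree([r for r, _ in right], [t for _, t in right], max_depth - 1),
    }

def predict(tree, row):
    """Return the calculated value CAs for one input row."""
    while "leaf" not in tree:
        branch = "left" if row[tree["feature"]] <= tree["threshold"] else "right"
        tree = tree[branch]
    return tree["leaf"]

# Made-up rows of [CVd, CVe] with measured speeds Ms as teacher data.
rows = [[100.0, 0.5], [200.0, 0.5], [300.0, 0.5], [400.0, 0.5]]
measured = [50.0, 100.0, 150.0, 200.0]
simo = fit_tree(rows, measured)
worst_error = max(abs(predict(simo, r) - m) for r, m in zip(rows, measured))
```

Comparing `worst_error` with a chosen tolerance corresponds to the completion criterion above, where learning ends once the difference between CAs and Ms is at or below a predetermined value.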

ただし、簡易シミュレーションモデルＳｉＭｏは、決定木以外の機械学習（例えば、ＧＢＭ（Gradient Boosting Machine））によって作成されたモデルを含むものであってもよい。また、簡易シミュレーションモデルＳｉＭｏは、機械学習によって作成されたモデルの代わりに、制御値ＣＶｄ，ＣＶｅ、参照値ＣＦ１，ＣＦ２，ＣＦ３、または情報Ｓｃと、印刷用紙９の搬送速度の算出値ＣＡｓとの関係を示す計算式（関数）を含むものであってもよい。また、簡易シミュレーションモデルＳｉＭｏは、例えば「画像記録装置１の電源がオンかオフか」等の条件式（例えば、電源がオフのときは０（ゼロ）を出力する式等）を含むものであってもよい。 However, the simple simulation model SiMo may include a model created by a machine learning method other than a decision tree (for example, a GBM (Gradient Boosting Machine)). Instead of a model created by machine learning, the simple simulation model SiMo may include a calculation formula (function) representing the relationship between the control values CVd and CVe, the reference values CF1, CF2, and CF3, or the information Sc and the calculated value CAs of the transport speed of the printing paper 9. The simple simulation model SiMo may also include a conditional expression such as "whether the image recording device 1 is powered on or off" (for example, an expression that outputs 0 (zero) when the power is off).

続いて、コンピュータ９０上で、簡易シミュレーションモデルＳｉＭｏを用いて、印刷用紙９の搬送速度を所定の範囲に維持するための制御ルールを、自律的に学習（機械学習）する（ステップＳ３）。ステップＳ３では、強化学習を行うことにより、後述する第２段階学習中モデルＭ２を作成する。図７は、ステップＳ３において強化学習を行う様子を概念的に示したブロック図である。また、図８は、ステップＳ３の詳細な流れを示すフローチャートである。 Subsequently, on the computer 90, the simple simulation model SiMo is used to autonomously learn (machine learning) the control rules for maintaining the transport speed of the printing paper 9 within a predetermined range (step S3). In step S3, the second-stage learning model M2, which will be described later, is created by performing reinforcement learning. FIG. 7 is a block diagram conceptually showing how reinforcement learning is performed in step S3. Further, FIG. 8 is a flowchart showing a detailed flow of step S3.

コンピュータ９０上で強化学習を行うときには、強化学習プログラムに基づいて、後述する学習済みモデルＭ３の原型となる学習前モデルＭ０を用意する。そして、用意された学習前モデルＭ０に、制御値ＣＶｄ，ＣＶｅと、簡易シミュレーションモデルＳｉＭｏからの出力を反映した報酬との関係を、学習させる。なお、本実施形態では、簡易シミュレーションモデルＳｉＭｏに入力される参照値ＣＦ１，ＣＦ２，ＣＦ３、および情報Ｓｃは、予め設定される。 When performing reinforcement learning on the computer 90, a pre-learning model M0, which is a prototype of the learned model M3 described later, is prepared based on the reinforcement learning program. Then, the prepared pre-learning model M0 is made to learn the relationship between the control values CVd and CVe and the reward reflecting the output from the simple simulation model SiMo. In the present embodiment, the reference values CF1, CF2, CF3, and the information Sc input to the simple simulation model SiMo are set in advance.

具体的には、まず、学習前モデルＭ０から、搬送に係る制御値ＣＶｄおよび吐出に係る制御値ＣＶｅとしてそれぞれ、ある値を出力して、簡易シミュレーションモデルＳｉＭｏへ入力する（ステップＳ３１）。ただし、上記のとおり、搬送に係る制御値ＣＶｄおよび吐出に係る制御値ＣＶｅの一方は、予め定めた所定の値としてもよい。また、搬送に係る制御値ＣＶｄについては、巻き出しローラ１１に接続されるモータ１４へ入力する値と、搬送ローラ１２１に接続されるモータ１４へ入力する値と、巻き取りローラ１３に接続されるモータ１４へ入力する値とが、互いに異なっていてもよい。 Specifically, the pre-learning model M0 first outputs a value for each of the control value CVd related to transport and the control value CVe related to ejection, and these values are input to the simple simulation model SiMo (step S31). However, as described above, one of the control value CVd and the control value CVe may be fixed at a predetermined value. As for the control value CVd related to transport, the value input to the motor 14 connected to the unwinding roller 11, the value input to the motor 14 connected to the transport roller 121, and the value input to the motor 14 connected to the take-up roller 13 may differ from one another.

ステップＳ３１の初期段階では、学習前モデルＭ０から出力される値は、ランダムな値となる。簡易シミュレーションモデルＳｉＭｏは、学習前モデルＭ０から入力された値と、上記の参照値ＣＦ１，ＣＦ２，ＣＦ３、および情報Ｓｃとに基づいて、印刷用紙９の搬送速度に係る算出値ＣＡｓを出力する（ステップＳ３２）。以下では、このように学習が開始された後の学習前モデルＭ０を、第１段階学習中モデルＭ１と呼ぶこととする。 In the initial stage of step S31, the values output from the pre-learning model M0 are random. The simple simulation model SiMo outputs the calculated value CAs of the transport speed of the printing paper 9 based on the values input from the pre-learning model M0, the reference values CF1, CF2, and CF3, and the information Sc (step S32). In the following, the pre-learning model M0 after learning has started in this way is referred to as the first-stage learning model M1.

コンピュータ９０は、判定部９００としての機能をさらに有する。判定部９００は、強化学習プログラムに基づいて、簡易シミュレーションモデルＳｉＭｏからの出力値である印刷用紙９の搬送速度に係る算出値ＣＡｓに応じた報酬ＲＥ１を付与する（ステップＳ３３）。このとき、判定部９００は、算出値ＣＡｓが予め設定された目標値に近づくほど、高い報酬ＲＥ１を付与する。一方、判定部９００は、算出値ＣＡｓが予め設定された目標値から離れるほど、低い報酬ＲＥ１を付与する。 The computer 90 further has a function as a determination unit 900. Based on the reinforcement learning program, the determination unit 900 grants the reward RE1 according to the calculated value CAs related to the transport speed of the printing paper 9, which is the output value from the simple simulation model SiMo (step S33). At this time, the determination unit 900 gives a higher reward RE1 as the calculated value CAs approaches the preset target value. On the other hand, the determination unit 900 gives a lower reward RE1 as the calculated value CAs deviates from the preset target value.
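The reward rule applied by the determination unit 900 can be sketched as follows. The specification only requires that the reward RE1 rise as the calculated value CAs approaches the target and fall as it moves away; the particular formula, the function name, and the `tolerance` parameter below are assumptions chosen for illustration.

```python
def reward_re1(calculated_speed, target_speed, tolerance):
    """Return a reward in (0, 1] that is largest when CAs equals the target.

    `tolerance` sets the error at which the reward drops to 0.5
    (a hypothetical scaling, not taken from the specification).
    """
    if tolerance <= 0:
        raise ValueError("tolerance must be positive")
    error = abs(calculated_speed - target_speed)
    return 1.0 / (1.0 + error / tolerance)

on_target = reward_re1(100.0, 100.0, 5.0)
near_target = reward_re1(101.0, 100.0, 5.0)
far_from_target = reward_re1(110.0, 100.0, 5.0)
```

Any function that decreases monotonically with the absolute error would satisfy the description equally well, for example a negative squared error.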

第１段階学習中モデルＭ１は、判定部９００から報酬ＲＥ１を付与されると、当該付与された報酬ＲＥ１を踏まえて、搬送に係る制御値ＣＶｄおよび吐出に係る制御値ＣＶｅとしての次の値を出力する。例えば、第１段階学習中モデルＭ１は、判定部９００から高い報酬ＲＥ１を付与されると、その高い報酬ＲＥ１を付与されるに至った制御値ＣＶｄ，ＣＶｅに近い値を、次の値として出力する。また、第１段階学習中モデルＭ１は、判定部９００から低い報酬ＲＥ１を付与されると、その低い報酬ＲＥ１を付与されるに至った制御値ＣＶｄ，ＣＶｅから離れた値を、次の値として出力する。 When given the reward RE1 by the determination unit 900, the first-stage learning model M1 outputs the next values of the control value CVd related to transport and the control value CVe related to ejection in light of that reward. For example, when given a high reward RE1, the first-stage learning model M1 outputs, as the next values, values close to the control values CVd and CVe that led to the high reward. When given a low reward RE1, it outputs, as the next values, values away from the control values CVd and CVe that led to the low reward.

このように、ステップＳ３では、第１段階学習中モデルＭ１から出力する値（搬送に係る制御値ＣＶｄおよび吐出に係る制御値ＣＶｅ）を更新しつつ、簡易シミュレーションモデルＳｉＭｏへ入力し、さらに簡易シミュレーションモデルＳｉＭｏからの算出値ＣＡｓを所定の範囲に維持するための強化学習を行う。また、本実施形態では、強化学習（深層強化学習）として、例えば、ＰＰＯ（Proximal Policy Optimization）の技法によって実行される機械学習を行う。ただし、強化学習として、ＤＱＮ（Deep Q Network）、Ｑ学習法、ＳＡＲＳＡ、モンテカルロ法、epsilon-greedy法、BoltzmannQPolicy法等の技法によって実行される機械学習を行ってもよい。 In this way, in step S3, while updating the values output from the first-stage learning model M1 (control value CVd related to transport and control value CVe related to discharge), they are input to the simple simulation model SiMo, and further simple simulation is performed. Reinforcement learning is performed to maintain the calculated value CAs from the model SiMo within a predetermined range. Further, in the present embodiment, as reinforcement learning (deep reinforcement learning), for example, machine learning executed by a technique of PPO (Proximal Policy Optimization) is performed. However, as reinforcement learning, machine learning executed by techniques such as DQN (Deep Q Network), Q-learning method, SARSA, Monte Carlo method, epsilon-greedy method, and Boltzmann QPolicy method may be performed.

上記のとおり、コンピュータ９０上での強化学習の初期段階では、第１段階学習中モデルＭ１は、制御値ＣＶｄ，ＣＶｅとしてランダムな値を出力する。しかしながら、コンピュータ９０は、ステップＳ３１〜Ｓ３３の処理を繰り返す過程で、判定部９００から付与される報酬ＲＥ１に応じて、制御値ＣＶｄ，ＣＶｅの増加、維持、減少等を試行する。これにより、制御値ＣＶｄ，ＣＶｅと、簡易シミュレーションモデルＳｉＭｏからの出力を反映した報酬との関係を、自動的に学習する。そして、第１段階学習中モデルＭ１は、次第に、高い報酬ＲＥ１を得ることができるようになる。すなわち、第１段階学習中モデルＭ１は、次第に、簡易シミュレーションモデルＳｉＭｏからの算出値ＣＡｓを所定の範囲内に維持するための制御値ＣＶｄ，ＣＶｅを、出力できるようになる。 As described above, in the initial stage of reinforcement learning on the computer 90, the first stage learning model M1 outputs random values as control values CVd and CVe. However, in the process of repeating the processes of steps S31 to S33, the computer 90 tries to increase, maintain, decrease, and the like the control values CVd and CVe according to the reward RE1 given by the determination unit 900. As a result, the relationship between the control values CVd and CVe and the reward reflecting the output from the simple simulation model SiMo is automatically learned. Then, the first-stage learning model M1 can gradually obtain a high reward RE1. That is, the first-stage learning model M1 can gradually output the control values CVd and CVe for maintaining the calculated values CAs from the simple simulation model SiMo within a predetermined range.

その後、コンピュータ９０は、強化学習プログラムに基づいて、強化学習を終了するか否かを判断する（ステップＳ３４）。判定部９００による報酬ＲＥ１が、所望のレベルに達していない場合には、引き続き、コンピュータ９０上での強化学習を継続する（ステップＳ３４：ｎｏ）。その場合、上述したステップＳ３１〜Ｓ３４の処理を、再度実行する。 After that, the computer 90 determines whether or not to end the reinforcement learning based on the reinforcement learning program (step S34). If the reward RE1 by the determination unit 900 does not reach the desired level, the reinforcement learning on the computer 90 is continued (step S34: no). In that case, the process of steps S31 to S34 described above is executed again.

一方、ステップＳ３４において、付与される報酬ＲＥ１が所望のレベルに達したと判断されると、強化学習プログラムに基づいて、コンピュータ９０上での強化学習を終了する（ステップＳ３４：ｙｅｓ）。そして、以上のコンピュータ９０上での強化学習が完了した第１段階学習中モデルＭ１は、第２段階学習中モデルＭ２となる。 On the other hand, in step S34, when it is determined that the reward RE1 to be given has reached a desired level, the reinforcement learning on the computer 90 is terminated based on the reinforcement learning program (step S34: yes). Then, the first-stage learning model M1 for which the reinforcement learning on the computer 90 is completed becomes the second-stage learning model M2.
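The loop of steps S31 to S34 can be sketched in Python as shown below. To stay short and dependency-free, the sketch replaces PPO with a simple adaptive random search, which is a different and much cruder technique; the linear `simulate_speed` function is a hypothetical stand-in for the simple simulation model SiMo, and all names and constants are assumptions.

```python
import random

def simulate_speed(cvd):
    """Hypothetical stand-in for SiMo: calculated speed CAs for a control value CVd."""
    return 0.5 * cvd

def reward_re1(cas, target):
    """Reward that rises (toward 0) as the calculated speed approaches the target."""
    return -abs(cas - target)

def learn_on_simulator(target, reward_goal=-0.5, max_iters=10_000, seed=0):
    """Adaptive random search standing in for the reinforcement learning of step S3."""
    rng = random.Random(seed)
    best_cvd = rng.uniform(0.0, 1000.0)  # early outputs are random
    best_reward = reward_re1(simulate_speed(best_cvd), target)
    step = 100.0
    iterations = 0
    # S34: stop once the reward has reached the desired level.
    while best_reward < reward_goal and iterations < max_iters:
        iterations += 1
        candidate = best_cvd + rng.uniform(-step, step)  # S31: output a control value
        cas = simulate_speed(candidate)                  # S32: simulator output
        reward = reward_re1(cas, target)                 # S33: reward from the judge
        if reward > best_reward:
            # High reward: stay near this value and explore a little wider.
            best_cvd, best_reward = candidate, reward
            step *= 1.1
        else:
            # Low reward: discard the candidate and narrow the search.
            step *= 0.95
    return best_cvd, best_reward, iterations

cvd, final_reward, iterations = learn_on_simulator(target=100.0)
```

Because every trial runs against the simulator, an excessive candidate value costs nothing here, which is the property the embodiment relies on to protect the actual device during the early, random phase of learning.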

その後、制御部８０の記憶部８０３に、第２段階学習中モデルＭ２がインストールされる。そして、図４に示すように、再度、実際に印刷用紙９を搬送しながら、印刷用紙９の表面にインクを吐出しつつ、印刷用紙９の搬送速度を所定の範囲に維持するための制御ルールを、自律的に強化学習（機械学習）する（ステップＳ４）。 After that, the second-stage learning model M2 is installed in the storage unit 803 of the control unit 80. Then, as shown in FIG. 4, control rules for maintaining the transport speed of the printing paper 9 within a predetermined range are again learned autonomously by reinforcement learning (machine learning), this time while the printing paper 9 is actually transported and ink is ejected onto its surface (step S4).

図９は、ステップＳ４において強化学習を行う様子を概念的に示したブロック図である。また、図１０は、ステップＳ４の詳細な流れを示すフローチャートである。図９および図１０に示すように、実装置での強化学習を行うときには、まず、第２段階学習中モデルＭ２から、搬送に係る制御値ＣＶｄ（モータ１４の回転数）として、ある値を出力し、この値に基づき、駆動ローラである巻き出しローラ１１、搬送ローラ１２１、および巻き取りローラ１３にそれぞれ接続されるモータ１４を回転させる。また、同時に、第２段階学習中モデルＭ２から、吐出に係る制御値ＣＶｅ（吐出タイミング）として、ある値を出力し、この値に基づき、記録ヘッド２１〜２４からインクを吐出する（ステップＳ４１）。 FIG. 9 is a block diagram conceptually showing how reinforcement learning is performed in step S4. FIG. 10 is a flowchart showing the detailed flow of step S4. As shown in FIGS. 9 and 10, when reinforcement learning is performed on the actual device, the second-stage learning model M2 first outputs a value as the control value CVd related to transport (the rotation speed of the motors 14), and the motors 14 connected to the drive rollers, namely the unwinding roller 11, the transport roller 121, and the take-up roller 13, are rotated based on this value. At the same time, the second-stage learning model M2 outputs a value as the control value CVe related to ejection (the ejection timing), and ink is ejected from the recording heads 21 to 24 based on this value (step S41).

次に、上記のとおり、印刷用紙９を搬送しながら、印刷用紙９の表面にインクを吐出しつつ、さらにエンコーダ４０から制御部８０へ連続パルス信号Ｅｎを入力する。制御部８０は、入力された連続パルス信号Ｅｎに基づいて、速度算出部８１において、印刷用紙９の搬送速度の計測値Ｍｓを取得する（ステップＳ４２）。 Next, as described above, while transporting the printing paper 9, ink is ejected to the surface of the printing paper 9, and a continuous pulse signal En is further input from the encoder 40 to the control unit 80. The control unit 80 acquires the measured value Ms of the transport speed of the printing paper 9 in the speed calculation unit 81 based on the input continuous pulse signal En (step S42).

制御部８０は、判定部８００としての機能をさらに有する。判定部８００は、強化学習プログラムに基づいて、上記の印刷用紙９の搬送速度の計測値Ｍｓに応じた報酬ＲＥ２を付与する（ステップＳ４３）。このとき、判定部８００は、計測値Ｍｓが予め設定された目標値に近づくほど、高い報酬ＲＥ２を付与する。一方、判定部８００は、計測値Ｍｓが予め設定された目標値から離れるほど、低い報酬ＲＥ２を付与する。 The control unit 80 further has a function as a determination unit 800. Based on the reinforcement learning program, the determination unit 800 grants the reward RE2 according to the measured value Ms of the transport speed of the printing paper 9 (step S43). At this time, the determination unit 800 gives a higher reward RE2 as the measured value Ms approaches the preset target value. On the other hand, the determination unit 800 gives a lower reward RE2 as the measured value Ms deviates from the preset target value.

第２段階学習中モデルＭ２は、判定部８００から報酬ＲＥ２を付与されると、当該付与された報酬ＲＥ２を踏まえて、搬送に係る制御値ＣＶｄおよび吐出に係る制御値ＣＶｅとしての次の値を出力する。例えば、第２段階学習中モデルＭ２は、判定部８００から高い報酬ＲＥ２を付与されると、その高い報酬ＲＥ２を付与されるに至った制御値ＣＶｄ，ＣＶｅに近い値を、次の値として出力する。また、第２段階学習中モデルＭ２は、判定部８００から低い報酬ＲＥ２を付与されると、その低い報酬ＲＥ２を付与されるに至った制御値ＣＶｄ，ＣＶｅから離れた値を、次の値として出力する。 When given the reward RE2 by the determination unit 800, the second-stage learning model M2 outputs the next values of the control value CVd related to transport and the control value CVe related to ejection in light of that reward. For example, when given a high reward RE2, the second-stage learning model M2 outputs, as the next values, values close to the control values CVd and CVe that led to the high reward. When given a low reward RE2, it outputs, as the next values, values away from the control values CVd and CVe that led to the low reward.

このように、ステップＳ４では、第２段階学習中モデルＭ２から出力する値（搬送に係る制御値ＣＶｄおよび吐出に係る制御値ＣＶｅ）を更新しつつ、モータ１４および記録ヘッド２１〜２４を駆動し、エンコーダ４０からの連続パルス信号Ｅｎに基づく印刷用紙９の搬送速度の計測値Ｍｓを所定の範囲に維持するための強化学習を行う。また、本実施形態では、強化学習（深層強化学習）として、例えば、ＰＰＯ（Proximal Policy Optimization）の技法によって実行される機械学習を行う。ただし、強化学習として、ＤＱＮ（Deep Q Network）、Ｑ学習法、ＳＡＲＳＡ、モンテカルロ法、epsilon-greedy法、BoltzmannQPolicy法等の技法によって実行される機械学習を行ってもよい。 In this way, in step S4, the motor 14 and the recording heads 21 to 24 are driven while updating the values output from the second-stage learning model M2 (control value CVd related to transport and control value CVe related to discharge). , Reinforcement learning is performed to maintain the measured value Ms of the transport speed of the printing paper 9 based on the continuous pulse signal En from the encoder 40 within a predetermined range. Further, in the present embodiment, as reinforcement learning (deep reinforcement learning), for example, machine learning executed by a technique of PPO (Proximal Policy Optimization) is performed. However, as reinforcement learning, machine learning executed by techniques such as DQN (Deep Q Network), Q-learning method, SARSA, Monte Carlo method, epsilon-greedy method, and Boltzmann QPolicy method may be performed.

上記のとおり、ステップＳ４では、予めコンピュータ９０上での強化学習が完了した第２段階学習中モデルＭ２を用いて、実装置においてさらに強化学習を行う。そして、ステップＳ４１〜Ｓ４３の処理を繰り返す過程で、判定部８００から付与される報酬ＲＥ２に応じて、実装置において制御値ＣＶｄ，ＣＶｅの増加、維持、減少等を試行する。これにより、制御値ＣＶｄ，ＣＶｅと、実際に搬送される印刷用紙９の搬送速度の計測値Ｍｓを反映した報酬との関係を、自動的に学習する。そして、第２段階学習中モデルＭ２は、次第に、高い報酬ＲＥ２を得ることができるようになる。すなわち、第２段階学習中モデルＭ２は、次第に、実装置において印刷用紙９の搬送速度の計測値Ｍｓを所定の範囲内に維持するための制御値ＣＶｄ，ＣＶｅを、出力できるようになる。 As described above, in step S4, further reinforcement learning is performed on the actual device using the second-stage learning model M2, for which reinforcement learning on the computer 90 has already been completed. In the process of repeating steps S41 to S43, increases, holds, and decreases of the control values CVd and CVe are tried on the actual device according to the reward RE2 given by the determination unit 800. In this way, the relationship between the control values CVd and CVe and the reward reflecting the measured value Ms of the transport speed of the printing paper 9 actually being transported is learned automatically. The second-stage learning model M2 thus gradually comes to obtain a high reward RE2. That is, it gradually becomes able to output control values CVd and CVe that keep the measured value Ms of the transport speed of the printing paper 9 within a predetermined range on the actual device.

After that, the control unit 80 determines, based on the reinforcement learning program, whether or not to end the reinforcement learning (step S44). If the reward RE2 given by the determination unit 800 has not reached the desired level, reinforcement learning using the actual device is continued (step S44: no). In that case, the processes of steps S41 to S44 described above are executed again.

On the other hand, when it is determined in step S44 that the reward RE2 has reached the desired level, the reinforcement learning on the actual device is ended based on the reinforcement learning program (step S44: yes). The second-stage learning model M2 for which reinforcement learning on the actual device has been completed then becomes the trained model M3.
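The loop of steps S41 to S44 can be sketched as follows. This is only a minimal illustration under stated assumptions: the embodiment uses PPO, whereas this sketch substitutes a simple accept-if-not-worse trial of "decrease, hold, increase" so that it stays self-contained. The function names, the reward shaping, the step size, the averaging window, and the stand-in device `fake_device` (a fixed 5% slip) are all hypothetical and do not come from the source.

```python
import random


def reward_in_range(measured, low, high):
    """Reward 1.0 inside [low, high]; otherwise a penalty equal to the
    distance from the range (hypothetical reward shaping)."""
    if low <= measured <= high:
        return 1.0
    return -(low - measured) if measured < low else -(measured - high)


def fake_device(control_value):
    """Hypothetical stand-in for the actual device: the measured speed is
    5% lower than the commanded value because of slip."""
    return 0.95 * control_value


def fine_tune(control_value, low, high, target=0.9, window=20,
              max_steps=2000, seed=0):
    """Repeat S41 to S43 until the average recent reward reaches `target` (S44)."""
    rng = random.Random(seed)
    best = reward_in_range(fake_device(control_value), low, high)
    recent = []
    for step in range(1, max_steps + 1):
        # S41: try decreasing, holding, or increasing the control value.
        candidate = control_value + rng.choice((-0.01, 0.0, 0.01))
        measured = fake_device(candidate)              # S42: measure
        reward = reward_in_range(measured, low, high)  # S43: reward
        if reward >= best:
            # Keep the trial value when it is at least as good as before.
            control_value, best = candidate, reward
        recent = (recent + [reward])[-window:]
        # S44: stop once the recent average reward reaches the desired level.
        if len(recent) == window and sum(recent) / window >= target:
            return control_value, step
    return control_value, max_steps


# Start from a control value that is too low for the range [0.95, 1.05].
tuned, steps = fine_tune(0.80, low=0.95, high=1.05)
```

Averaging the reward over a window is one possible reading of "the reward has reached the desired level"; the source does not say how that level is evaluated.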

As described above, in the present embodiment, a simple simulation model SiMo is created on the computer 90 (step S2) using data collected in advance on the actual device (step S1), and reinforcement learning is performed on the computer 90 using the simple simulation model SiMo (step S3). The second-stage learning model M2 obtained by this reinforcement learning on the computer 90 is then transferred back to the actual device, where reinforcement learning is continued (step S4). As a result, reinforcement learning on the actual device can start from a reasonably advanced state that already matches the behavior of the actual device. In addition, a considerable amount of reinforcement learning can be carried out on the computer 90. Consequently, the time required for machine learning can be significantly reduced. Furthermore, even if the control values CVd and CVe output from the first-stage learning model M1 become excessive in the initial stage of learning, this occurs on the computer 90, so damage to the actual device can be prevented. Finally, by performing reinforcement learning again on the actual device, a trained model M3 that outputs more accurate control values CVd and CVe can be created. This makes it possible to achieve better operation that reflects the characteristics of each individual device (for example, characteristics that appear as the device ages, differences in thickness due to minute irregularities in the printing paper 9, and slight differences in viscosity between inks).

After the reinforcement learning on the actual device in step S4 is completed, the printing process in the image recording device 1 is started. Based on the trained model M3, the control unit 80 rotates each of the three motors 14 to transport the printing paper 9 and ejects ink from the recording heads 21 to 24 to perform printing. As a result, the transport speed of the printing paper 9 is maintained within the predetermined range even when slip occurs between the roller surfaces of the transport mechanism 10 and the printing paper 9, or when the printing paper 9 is stretched by the ink. This suppresses errors in the transport direction in the ejection position of each color of ink on the printing paper 9. Consequently, the print quality of the image recording device 1 can be improved.

<2. Modification examples>
Although one embodiment of the present invention has been described above, the present invention is not limited to that embodiment. Various modification examples are described below, focusing on their differences from the above embodiment.

<2-1. First modification example>
In the above embodiment, the speed calculation unit 81 of the control unit 80 calculates the measured value Ms of the transport speed of the printing paper 9, which is transported by the plurality of transport rollers 12 including the transport roller 122, based on the continuous pulse signal En input from the encoder 40 connected to the transport roller 122. However, the control unit 80 may instead calculate the measured value Ms of the transport speed of the printing paper 9 based on the first edge signal Ed1 input from the first edge position detection unit 31 and the second edge signal Ed2 input from the second edge position detection unit 32. The details are described below.

FIG. 11 is a graph showing an example of the first edge signal Ed1 and an example of the second edge signal Ed2. In FIG. 11, the horizontal axis represents time and the vertical axis represents the position of the edge 91 in the width direction. On the horizontal axis, the left end is the current time, and the time becomes older toward the right. The data lines in FIG. 11 therefore move to the right with the passage of time, as indicated by the white arrow. The edge 91 of the printing paper 9 has fine irregularities. The first edge position detection unit 31 and the second edge position detection unit 32 detect the position of the edge 91 of the printing paper 9 in the width direction at preset minute time intervals (for example, every 50 microseconds). This yields data showing the change over time in the width-direction position of the edge 91 of the printing paper 9, as shown in FIG. 11. The first edge signal Ed1 is data that reflects the shape of the edge 91 of the printing paper 9 passing through the first detection position Pa. The second edge signal Ed2 is data that reflects the shape of the edge 91 of the printing paper 9 passing through the second detection position Pb.

The control unit 80 compares the first edge signal Ed1 with the second edge signal Ed2 and identifies the locations in the two signals where the same portion of the edge 91 of the printing paper 9 was detected. Specifically, for each data section (a fixed time range) included in the first edge signal Ed1, the control unit 80 identifies the most closely matching data section among the plurality of data sections (fixed time ranges) included in the second edge signal Ed2. Hereinafter, a data section included in the first edge signal Ed1 is referred to as a first data section DS1, and a data section included in the second edge signal Ed2 is referred to as a second data section DS2.

A matching method such as cross-correlation or the residual sum of squares is used, for example, to identify the closely matching data sections. For each first data section DS1, the control unit 80 selects a plurality of second data sections DS2 as candidates for the corresponding data section. For each of the selected candidate second data sections DS2, it calculates an evaluation value indicating the degree of match with the first data section DS1. The second data section DS2 with the highest evaluation value is then identified as the second data section DS2 corresponding to (that is, most closely matching) the first data section DS1.

After that, the control unit 80 calculates the actual transport time ΔT required to transport the printing paper 9 from the first detection position Pa to the second detection position Pb, as the time difference between the detection time TM1 of the first data section DS1 and the detection time TM2 of the second data section DS2 that most closely matches it. The transport speed of the printing paper 9 below the image recording unit 20 can then be calculated by dividing the distance from the first detection position Pa to the second detection position Pb by the transport time ΔT.
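The matching and speed calculation described above can be illustrated with the following minimal sketch, which uses the residual sum of squares as the matching measure (so the lowest value, rather than the highest, marks the best match). The function names, the synthetic edge profile, the 0.1 m sensor spacing, and the window and lag limits are assumptions made for illustration. Unlike FIG. 11, the sample lists here are ordered from oldest to newest.

```python
import random


def best_lag(first, second, window, max_lag):
    """Return the lag (in samples) at which a section of `second` best
    matches the first `window` samples of `first`, judged by the residual
    sum of squares (smaller is better)."""
    reference = first[:window]
    best, best_rss = None, float("inf")
    for lag in range(1, max_lag + 1):
        candidate = second[lag:lag + window]
        if len(candidate) < window:
            break  # ran out of samples in the second signal
        rss = sum((a - b) ** 2 for a, b in zip(reference, candidate))
        if rss < best_rss:
            best, best_rss = lag, rss
    return best


def transport_speed(first, second, sample_period_s, distance_m, window, max_lag):
    """Speed = sensor spacing / transport time, where the transport time
    is the best-matching lag multiplied by the sampling period."""
    lag = best_lag(first, second, window, max_lag)
    if lag is None:
        raise ValueError("signals are too short for the requested window")
    delta_t = lag * sample_period_s  # corresponds to TM2 - TM1
    return distance_m / delta_t


# Synthetic example: the same edge profile reaches the second sensor
# 400 samples (400 x 50 microseconds = 0.02 s) after the first sensor.
rng = random.Random(1)
profile = [rng.uniform(-1.0, 1.0) for _ in range(1000)]
ed1 = profile
ed2 = [0.0] * 400 + profile
speed = transport_speed(ed1, ed2, sample_period_s=50e-6,
                        distance_m=0.1, window=100, max_lag=600)
```

With these assumed numbers the recovered lag is 400 samples, so the speed is 0.1 m divided by 0.02 s. A real edge signal would contain noise, in which case the best match has a small but non-zero residual.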

<2-2. Second modification example>
In addition to the methods described in the above embodiment and modification example, a laser speedometer (not shown) for detecting the transport speed of the printing paper 9 may be used. In that case, the laser speedometer irradiates the printing paper 9 with two laser beams, and the transport speed of the printing paper 9 may be calculated based on the result of analyzing the wavelengths of the light reflected from the printing paper 9 for each beam.

<2-3. Third modification example>
In the above embodiment, reinforcement learning for maintaining the measured value Ms of the transport speed of the printing paper 9 within a predetermined range is performed during the pre-learning that is executed before the printing process in the image recording device 1. However, in the pre-learning, reinforcement learning for maintaining the measured value Mt of the tension of the printing paper 9 within a predetermined range may be performed instead.

Specifically, in step S1 described above, while the printing paper 9 is actually transported and ink is ejected onto its surface, a plurality of measured values of the tension applied to the printing paper 9 may be acquired based on the tension signal Te output from the tension detection unit 50 and accumulated in the data aggregate DA. Here, the printing paper 9 is transported by the plurality of transport rollers 12, including the transport roller 123 to which the tension detection unit 50 is connected, while in contact with the transport roller 123.

In step S2 described above, the simple simulation model SiMo for calculating the tension of the printing paper 9 with high accuracy from the control values CVd and CVe, the reference values CF1, CF2, and CF3, and the information Sc may be machine-learned using the measured value Mt of the tension of the printing paper 9 as teacher data (correct-answer data). The simple simulation model SiMo includes a decision tree X (a, b, c, f (CVd, CVe, CF1, CF2, CF3, Sc) ...). The plurality of parameters (a, b, c, f (CVd, CVe, CF1, CF2, CF3, Sc) ...) included in the decision tree X may be adjusted, updated, and saved so as to minimize the difference between the calculated value CAt of the tension of the printing paper 9, which is calculated from the input control values CVd and CVe, the measured value Ms, the reference values CF1, CF2, and CF3, and the information Sc, and the corresponding measured value Mt accumulated in the data aggregate DA. The machine learning is completed when the difference between the calculated value CAt of the tension of the printing paper 9, which is the output value of the simple simulation model SiMo, and the corresponding measured value Mt accumulated in the data aggregate DA becomes equal to or less than a predetermined value.
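As a rough illustration of adjusting decision-tree parameters so that the calculated tension approaches the measured tension, the following sketch fits a depth-1 regression tree (a "stump") to a single input. This is a deliberately reduced stand-in, not the decision tree X of the embodiment, which takes several inputs (CVd, CVe, CF1 to CF3, Sc). The function names and the sample data are hypothetical.

```python
def fit_stump(xs, ys):
    """Fit a depth-1 regression tree: choose the threshold and the two
    leaf values that minimize the sum of squared differences between the
    calculated values and the measured values."""
    pairs = sorted(zip(xs, ys))
    best = None
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # no threshold can separate identical inputs
        threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((y - left_mean) ** 2 for y in left)
               + sum((y - right_mean) ** 2 for y in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    if best is None:
        raise ValueError("need at least two distinct input values")
    return best[1:]  # (threshold, left leaf value, right leaf value)


def predict(stump, x):
    """Calculated tension for one control value."""
    threshold, left_value, right_value = stump
    return left_value if x <= threshold else right_value


# Hypothetical data aggregate: motor speed (control value) against
# measured tension Mt.
control_values = [100, 110, 120, 200, 210, 220]
measured_tension = [9.8, 10.0, 10.2, 19.9, 20.0, 20.1]
stump = fit_stump(control_values, measured_tension)
```

A practical model would use a deeper tree or an ensemble over all inputs, and would stop training once the remaining difference falls below the predetermined value mentioned above.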

In step S3 described above, a control rule for maintaining the tension of the printing paper 9 within a predetermined range may be machine-learned on the computer 90 using the simple simulation model SiMo. Specifically, in step S31, certain values may be output as the control value CVd related to transport and the control value CVe related to discharge, and input to the simple simulation model SiMo. In step S32, the simple simulation model SiMo may output the calculated value CAt of the tension of the printing paper 9 based on the values input from the pre-learning model M0, the reference values CF1, CF2, and CF3, and the information Sc.

In step S33, a reward RE1 corresponding to the calculated value CAt of the tension of the printing paper 9, which is the output value of the simple simulation model SiMo, may be given. Thus, in step S3, reinforcement learning may be performed to maintain the calculated value CAt from the simple simulation model SiMo within a predetermined range, while the values output from the first-stage learning model M1 (the control value CVd related to transport and the control value CVe related to discharge) are updated and input to the simple simulation model SiMo. Further, in step S34, the reinforcement learning on the computer 90 may be ended when it is determined that the reward RE1 has reached the desired level. The learning model created in step S3 may then be used as the second-stage learning model M2.

In step S4 described above, a certain value may be output from the second-stage learning model M2 created in step S3 as the control value CVd related to transport (the rotation speed of the motors 14), and the motors 14 connected to the drive rollers, namely the unwinding roller 11, the transport roller 121, and the winding roller 13, may be rotated based on this value. At the same time, a certain value may be output from the second-stage learning model M2 as the control value CVe related to discharge (the discharge timing), and ink may be ejected from the recording heads 21 to 24 based on this value (step S41). In step S42, the measured value Mt of the tension applied to the printing paper 9 at that time may be acquired based on the tension signal Te output from the tension detection unit 50.

In step S43, a reward RE2 corresponding to the measured value Mt of the tension of the printing paper 9 may be given. Then, in step S44, the reinforcement learning using the actual device may be ended when it is determined that the reward RE2 has reached the desired level.

Further, in the pre-learning, besides the approaches of the above embodiment and modification examples, reinforcement learning for maintaining a measured value of the elongation of the printing paper 9 within a predetermined range may be performed.

<2-4. Other modification examples>
The image recording device 1 described above records an image on the surface of the printing paper 9 by ejecting ink as the processing substance while transporting the printing paper 9. However, the image recording device 1 may record an image by ejecting a processing substance other than ink onto the surface of the printing paper 9. The transport processing device of the present invention may also be a device that records an image on the printing paper 9 by a method other than inkjet (for example, an electrophotographic method or exposure). In addition, the image recording device 1 described above performs a printing process on the printing paper 9 as the base material. However, the transport processing device of the present invention may perform a predetermined process on a long strip-shaped base material other than ordinary paper (for example, a resin film or a metal foil).

The elements appearing in the above embodiment and modification examples may be combined as appropriate, as long as no contradiction arises.

1 Image recording device
9 Printing paper
10 Transport mechanism
11 Unwinding roller
12 Transport roller
13 Winding roller
14 Motor
20 Image recording unit
21 First recording head
22 Second recording head
23 Third recording head
24 Fourth recording head
30 Edge position detection unit
31 First edge position detection unit
32 Second edge position detection unit
40 Encoder
50 Tension detection unit
60 Information acquisition unit
80 Control unit
81 Speed calculation unit
90 Computer
91 Edge
121 Transport roller
122 Transport roller
123 Transport roller
800 Determination unit
900 Determination unit
CAs Calculated value (of transport speed)
CAt Calculated value (of tension)
CF1, CF2, CF3 Reference values
CVd Control value related to transport
CVe Control value related to discharge
DA Data aggregate
M0 Pre-learning model
M1 First-stage learning model
M2 Second-stage learning model
M3 Trained model
Ms Measured value (of transport speed)
Mt Measured value (of tension)
RE1, RE2 Rewards
Re1, Re2 Reference values
Sc Information
SiMo Simple simulation model
Te Tension signal
X (a, b, c, f (CVd, CVe, CF1, CF2, CF3, Sc) ...) Decision tree

Claims

A transport processing method in which a processing substance is discharged onto a surface of a long strip-shaped base material while the base material is transported in its longitudinal direction, the method comprising:
a) a step of acquiring, while actually transporting the base material, a plurality of pieces of information on a control value related to transport, including a rotation speed of a motor that is a drive source for transporting the base material, or a control value related to discharge, including a discharge timing of the processing substance, and on a measured value related to a tension or a transport speed of the base material;
b) a step of creating, on a computer, a simple simulation model showing the relationship between the control value and the measured value, based on the acquisition result of step a);
c) a step of creating a learning model on the computer by performing reinforcement learning to maintain an output value from the simple simulation model within a predetermined range while updating the control value; and
d) a step of performing reinforcement learning to maintain, within a predetermined range, the measured value obtained when the processing substance is discharged onto the surface of the base material while the base material is actually transported, while updating the control value output from the learning model.

The transport processing method according to claim 1, wherein
the base material is stretched over a plurality of rollers and is transported by rotating a drive roller, which is at least one of the plurality of rollers, with the motor, and
the control value related to transport includes a rotation speed of the drive roller.

The transport processing method according to claim 1 or 2, wherein
the control value related to discharge includes a discharge amount of the processing substance.

The transport processing method according to any one of claims 1 to 3, wherein
in step a), information on the thickness or type of the base material is further acquired, and
in step b), the simple simulation model showing the relationship among the control value, the information on the base material, and the measured value is created based on the acquisition result of step a).

The transport processing method according to any one of claims 1 to 4, wherein
in step a), information on the characteristics of the processing substance is further acquired, and
in step b), the simple simulation model showing the relationship among the control value, the information on the processing substance, and the measured value is created based on the acquisition result of step a).

The transport processing method according to any one of claims 1 to 5, wherein
in step a), information on environmental conditions including the temperature or humidity around the base material is further acquired, and
in step b), the simple simulation model showing the relationship among the control value, the information on the environmental conditions, and the measured value is created based on the acquisition result of step a).

The transport processing method according to any one of claims 1 to 6, wherein
in step b), the simple simulation model includes a decision tree, and parameters included in the decision tree are adjusted.

The transport processing method according to any one of claims 1 to 7, wherein
the reinforcement learning performed in step c) is machine learning executed by the PPO or DQN technique.