WO2022176761A1

WO2022176761A1 - Control device, robot system, learning device, control method, and program

Info

Publication number: WO2022176761A1
Application number: PCT/JP2022/005292
Authority: WO
Inventors: 義浩細川; 弘記雨森; 沙織松永
Original assignee: 三菱電機株式会社
Priority date: 2021-02-18
Filing date: 2022-02-10
Publication date: 2022-08-25
Also published as: JP7462827B2; CN116917086A; JPWO2022176761A1

Abstract

This control device (1) comprises a drive condition storage unit (11) that stores drive conditions and reference intermediate points in association with each other, a reference trajectory determination unit (12) that determines a reference trajectory passing through the intermediate points, and an actual trajectory acquisition unit (13) that acquires the trajectory of a robot (10). The control device (1) furthermore comprises: a learning unit (14) that derives a trajectory error indicating deviation of the actual trajectory from an ideal trajectory for each of the reference intermediate points, learns a correspondence between the reference intermediate points and the trajectory error, and generates a trajectory model indicating, in accordance with the drive conditions, target intermediate points for which the trajectory error is minimized; a target trajectory determination unit (16) that determines a target trajectory passing through the target intermediate points obtained from the trajectory model; and a control unit (17) that controls the robot in accordance with the reference trajectory or the target trajectory.

Description

Control device, robot system, learning device, control method, and program

The present disclosure relates to control devices, robot systems, learning devices, control methods, and programs.

　Industrial robots are equipped with multiple links and multiple joints that connect the multiple links to enable various movements. The control device controls a plurality of servo motors that drive a plurality of joints of the robot, thereby enabling the robot to perform a desired motion. For example, if there is a difference in the delay of each servomotor, the actual trajectory of the tip link, which is the link located at the tip of the plurality of links, may deviate from the target trajectory. An example of technology for minimizing the error between the actual trajectory of the robot and the target trajectory is disclosed in Patent Document 1.

The robot position teaching device disclosed in Patent Document 1 measures the actual trajectory of the robot, and based on the trajectory error, which is the distance between the actual trajectory and the target trajectory, predictive teaching is performed to minimize the error. Calculate points. Trajectory errors are reduced by controlling the robot based on potential teaching points.

JP-A-11-048176

The robot position teaching device disclosed in Patent Document 1 calculates probable teaching points indicating a plurality of positions in the motion path of the robot and controls the robot based on the probable teaching points until the trajectory error becomes equal to or less than the allowable value. repeat. As the target trajectory becomes more complex, the number of potential teaching points increases, the time it takes for the trajectory error to converge below the allowable value increases, and it becomes difficult to bring the robot's actual trajectory closer to the target trajectory. Become. As a result, robot control accuracy is degraded.

The present disclosure has been made in view of the circumstances described above, and aims to provide a control device, a robot system, a learning device, a control method, and a program with high robot control accuracy.

To achieve the above object, a control device according to the present disclosure is a control device for controlling a robot, comprising a driving condition storage unit, a reference trajectory determination unit, an actual trajectory acquisition unit, a learning unit, and a target trajectory determination unit. and a controller. The drive condition storage unit stores drive conditions specifying at least the motion start point and the motion end point of the robot in association with each of the plurality of reference intermediate points. The reference trajectory determination unit acquires one of a plurality of reference intermediate points from the drive condition storage unit, and determines a reference trajectory passing through the reference intermediate point based on the drive condition for each of the acquired reference intermediate points. The actual trajectory acquisition unit acquires the actual trajectory of the robot. The learning unit calculates a trajectory error indicating a deviation from the ideal trajectory based on the driving conditions of the actual trajectory acquired by the actual trajectory acquisition unit when the robot is controlled according to the reference trajectory. Then, the correspondence between the reference waypoint and the trajectory error is learned to generate a trajectory model indicating the target waypoint that minimizes the trajectory error according to the driving conditions. The target trajectory determination unit determines a target trajectory passing through the target waypoint obtained from the trajectory model. The controller controls the robot according to the reference trajectory or the target trajectory.

The control device according to the present disclosure learns the correspondence between the reference waypoint and the trajectory error, generates a trajectory model indicating the target waypoint that minimizes the trajectory error according to the driving conditions, and generates a target waypoint passing through the target waypoint. Control the robot according to the trajectory. Therefore, the robot control accuracy of the control device according to the present disclosure is high.

A perspective view of a robot according to Embodiment 1 Side view of the robot according to Embodiment 1 Block diagram of the robot system according to Embodiment 1 2 is a block diagram showing the hardware configuration of the control device according to Embodiment 1; FIG. 3 is a flowchart showing an example of learning processing performed by the control device according to Embodiment 1; FIG. 4 shows an example of a drive condition table held by a drive condition storage unit according to the first embodiment; A diagram showing an example of a reference midpoint in Embodiment 1 FIG. 4 shows an example of a reference trajectory and an actual trajectory according to the first embodiment; FIG. 4 shows an example of a reference trajectory and an actual trajectory according to the first embodiment; FIG. 4 is a diagram showing an example of an error table held by a learned data storage unit according to Embodiment 1; FIG. 4 is a diagram showing an example of a target waypoint table held by a learned data storage unit according to Embodiment 1; FIG. Flowchart showing an example of operation processing performed by the control device according to Embodiment 1 FIG. 4 shows an example of a target trajectory and an actual trajectory according to the first embodiment; Perspective view of a robot according to Embodiment 2 Block diagram of a robot system according to Embodiment 2 10 is a flowchart showing an example of learning processing performed by the control device according to Embodiment 2; FIG. 10 is a diagram showing an example of an error table held by a learned data storage unit according to Embodiment 2; FIG. Block diagram of a robot system according to Embodiment 3 Flowchart showing an example of learning processing performed by the control device according to the third embodiment FIG. 11 is a diagram showing an example of a driving condition table held by a driving condition storage unit according to the third embodiment; FIG. FIG. 11 is a diagram showing an example of a driving condition table held by a driving condition storage unit according to the third embodiment; FIG. Block diagram of a robot system according to Embodiment 4 Flowchart showing an example of learning processing performed by the control device according to the fourth embodiment Block diagram of a robot system according to Embodiment 5 FIG. 5 is a diagram showing a modified example of the target waypoint table held by the learned data storage unit according to the embodiment; The figure which shows the 1st modification of the error table which the learned data memory|storage part which concerns on embodiment holds. The figure which shows the 2nd modification of the error table which the learned data storage part which concerns on embodiment holds. The figure which shows the 3rd modification of the error table which the learned data memory|storage part which concerns on embodiment holds. Block diagram of a modified example of the robot system according to the embodiment The perspective view of the 1st modification of the robot which concerns on embodiment The perspective view of the 2nd modification of the robot which concerns on embodiment

Hereinafter, a control device, a robot system, a learning device, a control method, and a program according to embodiments of the present disclosure will be described in detail with reference to the drawings. In the drawings, the same reference numerals are given to the same or equivalent parts.

(Embodiment 1)
A robot system including a robot and a control device for controlling the robot will be described in the first embodiment by taking a robot including a plurality of links and a plurality of joints as an example. The robot has multiple links, multiple joints that connect the multiple links, and multiple motors that are associated with the multiple joints and drive the corresponding joints.

As shown in FIGS. 1 and 2, the robot 10 is a vertically articulated robot. Specifically, the robot 10 includes a first arm 110, a second arm 120, a third arm 130, a fourth arm 140, a fifth arm 150, a flange 160, and a base 170 as a plurality of links. , provided. The flange 160 corresponds to a tip link, which is a link positioned at the tip of the plurality of links. The robot 10 further comprises a fixed part 180 fixed to a plane and supporting the base 170 . In Embodiment 1, fixing portion 180 is a plate-like member having a square main surface.

1 and 2, the X-axis and the Y-axis are set as axes included in a plane parallel to the main surface of the fixing portion 180 and perpendicular to each other, and the Z-axis is set as an axis perpendicular to each of the X-axis and the Y-axis. set. The X-axis and the Y-axis each extend along the end surface of the fixed portion 180 . If the fixed part 180 is fixed on the horizontal plane, the Z axis extends in the vertical direction. In Embodiment 1, the trajectory of the tip link, more specifically, the trajectory of the center of gravity of the tip surface 160a of the flange 160 is represented by an XYZ orthogonal coordinate system. The origin of the XYZ orthogonal coordinate system is, for example, the position of the center of gravity of the tip surface 160a when the robot 10 is at the initial position. When the robot 10 is at the initial position means when the rotational positions of motors M1, M2, M3, M4, M5 and M6, which will be described later, are at the initial positions.

The first arm 110 is attached to the base 170 so as to be rotatable around the rotation axis AX1 parallel to the Z axis. The second arm 120 is connected to the first arm 110 and is rotatable around the rotation axis AX2. The third arm 130 is connected to the second arm 120 and is rotatable around the rotation axis AX3. The fourth arm 140 is connected to the third arm 130 and is rotatable around the rotation axis AX4. The fifth arm 150 is connected to the fourth arm 140 and is rotatable around the rotation axis AX5. The flange 160 is connected to the fifth arm 150 and is rotatable around the rotation axis AX6. A machining tool can be attached to the flange 160 . In other words, the robot 10 has six joints corresponding to the rotation axes AX1, AX2, AX3, AX4, AX5 and AX6.

As shown in FIG. 3, the robot 10 includes motors M1, M2, M3, M4, M5 and M6 corresponding to the rotation axes AX1, AX2, AX3, AX4, AX5 and AX6. Motors M1, M2, M3, M4, M5 and M6 are servo motors. The control device 1 controls the motors M1, M2, M3, M4, M5, and M6 according to the driving conditions, so that the motors M1, M2, M3, M4, M5, and M6 rotate, and the first arm 110 and the second arm 110 rotate. Arm 120, third arm 130, fourth arm 140, fifth arm 150, and flange 160 rotate. As a result, the trajectory of the flange 160, which is the tip link of the robot 10, becomes a trajectory according to the drive conditions. The robot 10 further includes encoders E1, E2, E3, E4, E5 and E6 that detect the rotational positions of the motors M1, M2, M3, M4, M5 and M6.

The robot system 100 includes the robot 10 having the configuration described above and the control device 1 that controls the robot 10 . The control device 1 controls the trajectory of the robot 10, specifically, the trajectory of the flange 160, to approach the ideal trajectory according to the drive conditions specifying at least the motion start point and the motion end point of the robot 10. FIG. The motion starting point of the robot 10 means the motion starting point of the movable portion of the robot 10 , specifically, the motion starting point of the flange 160 of the robot 10 . The motion end point of the robot 10 means the motion end point of the movable portion of the robot 10 , specifically, the motion end point of the flange 160 of the robot 10 .

In the first embodiment, the control device 1 controls the motors M1, M2, M3, M4, M5, Control M6. The motors M1, M2, M3, M4, M5 and M6 operate with a delay from the command from the control device 1. FIG. If there is a difference in the operation delays of the motors M1, M2, M3, M4, M5, and M6, the actual trajectory indicating the actual trajectory of the robot 10 deviates from the ideal trajectory based on the drive conditions.

For each of the reference intermediate points corresponding to the drive conditions, the control device 1 obtains a trajectory error indicating the deviation of the actual trajectory from the ideal trajectory when the robot 10 is controlled based on the reference trajectory passing through the reference intermediate points. Learn the correspondence between waypoints and trajectory errors. The reference midpoint indicates a predetermined position through which the leading link can pass from the motion start point to the motion end point. The ideal trajectory is, for example, the shortest trajectory from the motion start point indicated by the drive condition to the motion end point indicated by the drive condition.

After learning the correspondence between the reference waypoint and the trajectory error as described above, the control device 1 generates a trajectory model indicating the target waypoint that minimizes the trajectory error for each drive condition, and calculates the target waypoint passing through the target waypoint. The robot 10 is controlled according to the trajectory. By controlling the robot 10 according to the target trajectory, it is possible to improve the control accuracy of the robot 10 .

Each part of the control device 1 will be described below.
The control device 1 includes a driving condition storage unit 11 that stores driving conditions and a plurality of reference intermediate points in association with each other, a reference trajectory determining unit 12 that determines a reference trajectory passing through the reference intermediate points, and an actual robot 10 . , more specifically, an actual trajectory acquisition unit 13 that acquires the actual movement trajectory of the flange 160 that is the tip link. The control device 1 further obtains a trajectory error that indicates the deviation of the actual trajectory from the ideal trajectory for each of the reference intermediate points, learns the correspondence between the reference intermediate points and the trajectory error, and determines a target intermediate point that minimizes the trajectory error. A learning unit 14 that generates a trajectory model indicated according to driving conditions, and a learned data storage unit 15 that stores the learning result of the learning unit 14, specifically, the correspondence between the reference intermediate point and the trajectory error. The control device 1 further includes a target trajectory determining unit 16 that determines a target trajectory passing through a target intermediate point according to the driving conditions based on the driving command including the trajectory model and the driving conditions; Specifically, it includes a control unit 17 that controls the motors M1, M2, M3, M4, M5, and M6.

As shown in FIG. 4, the control device 1 includes a processor 31, a memory 32, and an interface 33 as a hardware configuration. Processor 31 , memory 32 and interface 33 are connected to each other via bus 34 . Each function of the control device 1 is realized by executing a program stored in the memory 32 by the processor 31 . The interface 33 connects the control device 1 and an external device to enable communication with the external device. Specifically, the controller 1 is connected via an interface 33 to motors M1, M2, M3, M4, M5 and M6 and encoders E1, E2, E3, E4, E5 and E6. The interface 33 has multiple types of interface modules as required.

Although the control device 1 shown in FIG. 4 has one processor 31 and one memory 32 , the control device 1 may have multiple processors 31 and multiple memories 32 . In this case, each function of the control device 1 may be realized by cooperation of the plurality of processors 31 and the plurality of memories 32 .

The control device 1 having the above configuration repeats the control of the robot 10 according to the reference trajectory passing through the reference waypoint for different reference waypoints and different driving conditions, and sets the target waypoint that minimizes the trajectory error as the driving condition. , and operation processing for controlling the robot 10 according to the target trajectory passing through the target waypoint. Minimizing the trajectory error means bringing the trajectory error closer to a minimum value, a value near the minimum value, or a minimum value or a value near the minimum value.

An overview of the operation of the learning process performed by the control device 1 will be described using FIG. For example, the control device 1 starts the learning process of FIG. 5 when the control device 1 starts to operate for the first time after the robot 10 is installed or when a new driving condition is added.

A reference trajectory determination unit 12 acquires a driving condition and any one of a plurality of reference intermediate points corresponding to the driving condition from the driving condition storage unit 11 that stores the driving condition and a plurality of reference intermediate points in association with each other. (Step S11). The reference trajectory determining unit 12 determines a reference trajectory passing through the reference intermediate point based on the driving conditions and the reference intermediate point acquired in step S11 (step S12). The control unit 17 controls the motors M1, M2, M3, M4, M5, and M6 of the robot 10 according to the reference trajectory determined in step S12, and the actual trajectory acquisition unit 13 acquires the actual trajectory of the robot 10. (step S13).

Specifically, the control unit 17 determines the target positions of the motors M1, M2, M3, M4, M5 and M6 to move the flange 160 according to the reference trajectory, and activates the motors M1, M2, M3, M4, M5 and M6. Control to move to the target position. The actual trajectory obtaining unit 13 obtains the actual trajectory of the robot 10 based on the rotational positions of the motors M1, M2, M3, M4, M5 and M6 obtained from the encoders E1, E2, E3, E4, E5 and E6. . The learning unit 14 calculates a trajectory error indicating the deviation of the trajectory acquired in step S13 from the ideal trajectory based on the driving conditions acquired in step S11 (step S14). While the calculation of the trajectory error has not been completed for all the reference intermediate points of all the drive conditions stored in the drive condition storage unit 11 (step S15; No), the above-described steps S11 to S14 are repeated. .

When the calculation of the trajectory error is completed for all the reference intermediate points of all the driving conditions stored in the driving condition storage unit 11 (step S15; Yes), the learning unit 14 selects the target intermediate point that minimizes the trajectory error. A trajectory model indicating a target intermediate point corresponding to the drive condition is generated (step S16). When the process of step S16 is completed, the control device 1 terminates the learning process.

Details of the learning process performed by the control device 1 will be described below.
As shown in FIG. 6, the drive condition storage unit 11 holds a drive condition table in which drive conditions are associated with a plurality of reference intermediate points. The drive conditions specify at least the motion start point and motion end point of the flange 160, which is the tip link. In Embodiment 1, the drive conditions include the motion start point, motion end point, speed and attitude of the leading link. The motion start point and motion end point indicate positions in the XYZ orthogonal coordinate system shown in FIGS. The speed indicates the target speed during operation of the leading link. The attitude indicates the orientation of the tip link, more specifically, the angle formed by the tip surface 160a of the flange 160 and the plane to which the robot 10 is fixed.

FIG. 6 shows an example of a plurality of reference intermediate points stored corresponding to one drive condition. For example, the drive condition included in the record on the first row of the drive condition table shown in FIG. , 0, 0) to the motion end point (100, 0, 0) at the target speed V1. In this case, the ideal trajectory extends from the motion start point SP1 (0,0,0) to the motion end point EP1 (100,0,0) and is indicated by a line segment located on the X axis.

In Embodiment 1, a plurality of reference midpoints are defined for each driving condition. In the example of FIG. 6, the flange 160 is moved from the motion start point (0,0,0) to the motion end point (100,0,0) with the tip surface 160a of the flange 160 parallel to the plane to which the robot 10 is fixed. 0) at the target speed V1, a plurality of reference intermediate points are determined.

A plurality of reference intermediate points are three-dimensionally arranged in the space from the motion start point to the motion end point. For example, with respect to a drive condition that instructs movement from the motion start point SP1 to the motion end point EP1, the motion is three-dimensionally arranged in the space S1 from the motion start point SP1 to the motion end point EP1 indicated by the dotted line in FIG. A plurality of reference waypoints RP are defined. For example, if the distance from the motion start point SP1 to the motion end point EP1 is 100 millimeters, reference intermediate points RP that are three-dimensionally arranged at intervals of 5 millimeters in the space S1 are determined. The size of the space S1 may be determined depending on the accuracy of the robot 10 and the possible size of the trajectory error.

The reference trajectory determination unit 12 acquires the associated driving conditions and reference intermediate points from the driving condition table shown in FIG. Then, the reference trajectory determination unit 12 determines a reference trajectory passing through the acquired reference intermediate point based on the drive conditions. More specifically, the reference trajectory determination unit 12 performs spline interpolation based on the motion start point, the reference intermediate point, and the motion end point indicated by the drive condition, and converts the motion start point indicated by the drive condition to the reference intermediate point. A reference trajectory is calculated that passes through the points and reaches the operation end point indicated by the drive condition. The reference trajectory indicates, for example, the position of the leading link in the XYZ orthogonal coordinate system for each control cycle. The control period is determined, for example, according to the arithmetic processing capability of the control device 1 .

For example, the reference trajectory determining unit 12 acquires the driving condition and the reference intermediate point indicated by the record on the first row of the driving condition table shown in FIG. The motion start point (0, 0, 0) and the motion end point (100, 0, 0) included in this drive condition are respectively shown as motion start point SP1 and motion end point EP1 in FIG. 50,0,0) is shown in FIG. 8 as the reference midpoint RP1. The reference trajectory determination unit 12 determines the reference intermediate point RP1 (50, 0, 0) is determined. In this case, the reference trajectory RT1 indicated by the dotted line extends from the motion start point SP1 (0,0,0) to the motion end point EP1 (100,0,0) and is indicated by a line segment positioned on the X axis. After determining the reference trajectory as described above, the reference trajectory determination unit 12 sends the driving conditions and the determined reference trajectory to the learning unit 14 and the control unit 17 .

The control unit 17 shown in FIG. 3 generates control commands to the motors M1, M2, M3, M4, M5, and M6 according to the drive conditions and the reference trajectory acquired from the reference trajectory determination unit 12, and outputs the control commands to the motor M1. , M2, M3, M4, M5 and M6. Specifically, the control unit 17 generates control commands to the motors M1, M2, M3, M4, M5, and M6 in accordance with the reference positions, which are the positions of the leading links indicated by the reference trajectory. Transmission to M2, M3, M4, M5 and M6 is performed for each control period. As a result, the motors M1, M2, M3, M4, M5 and M6 rotate according to the control commands, and the first arm 110, the second arm 120, the third arm 130, the fourth arm 140, the fifth arm 150, and the flange 160 rotate. rotates. As a result, the flange 160, which is the tip link, moves.

The control unit 17 acquires the rotational positions of the motors M1, M2, M3, M4, M5 and M6 from the encoders E1, E2, E3, E4, E5 and E6, and controls the motors M1, M2, M3, M4, M5 and M6. Feedback control is preferred.

The actual trajectory acquisition unit 13 acquires the rotational positions of the motors M1, M2, M3, M4, M5 and M6 from the encoders E1, E2, E3, E4, E5 and E6, and the motors M1, M2, M3, M4, M5, Obtain the actual position of the leading link from the rotational position of M6. The actual trajectory acquisition unit 13 performs the above-described processing for each control cycle to determine the actual trajectory indicating the actual position of the leading link in the XYZ orthogonal coordinate system for each control cycle.

As an example, when the motors M1, M2, M3, M4, M5, and M6 are controlled in accordance with the drive condition indicated by the record in the first row of the drive condition table shown in FIG. 6 and the reference trajectory RT1 based on the reference intermediate point, The actual trajectory AT1 acquired by the actual trajectory acquisition unit 13 is indicated by a solid line in FIG. The actual trajectory acquisition unit 13 sends the actual trajectory acquired as described above to the learning unit 14 .

When the learning unit 14 acquires the actual trajectory from the actual trajectory acquisition unit 13, the learning unit 14 generates a trajectory indicating the deviation of the actual trajectory acquired by the actual trajectory acquisition unit 13 from the ideal trajectory when the robot 10 is controlled according to the reference trajectory. Calculate the error. Specifically, the learning unit 14 determines an ideal trajectory, which is the shortest trajectory from the motion start point to the motion end point, based on the drive conditions, and determines the position of the leading link for each control cycle on the ideal trajectory. Then, the learning unit 14 determines the position of the robot 10 in each control cycle indicated by the ideal trajectory, more specifically, the position of the robot 10 in each control cycle indicated by the ideal position, which is the position of the tip link of the robot 10, and the actual trajectory. Specifically, the distance from the actual position, which is the position of the tip link, is calculated. The learning unit 14 uses the deviation of the actual position from the ideal position when this distance reaches the maximum value as the trajectory error.

For example, when the learning unit 14 acquires from the reference trajectory determining unit 12 the driving condition indicated by the record in the first row of the driving condition table shown in FIG. 0, 0) to the motion end point EP1 (100, 0, 0). In this case, the ideal trajectory extends from the motion start point SP1 (0,0,0) to the motion end point EP1 (100,0,0) shown in FIG. 8, and is indicated by a line segment located on the X axis. The learning unit 14 calculates the distance between the ideal position of the leading link on the ideal trajectory for each control cycle and the actual position of the leading link for each control cycle indicated by the actual trajectory AT1 acquired by the actual trajectory acquisition unit 13, and , the deviation of the actual position from the ideal position when the distance reaches the maximum value is used as the trajectory error. In the example of FIG. 8, the actual position deviates from the ideal position on the XY plane in the positive direction of the Y axis.

The control device 1 performs the above-described processing for all combinations of drive conditions and reference intermediate points stored in the drive condition table shown in FIG. For example, after completing the above-described processing for the driving condition and the reference intermediate point indicated by the record in the first row of the driving condition table shown in FIG. Acquire the drive condition and the reference midpoint indicated by the record of . The drive condition indicated by the record on the second line is the same as the drive condition indicated by the record on the first line. The reference midpoint (50, -5, 0) indicated by the second row record is indicated as the reference midpoint RP2 in FIG. The reference trajectory determination unit 12 determines the reference intermediate point RP2 (50, −5 , 0) is determined. For example, as indicated by the dotted line in FIG. 9, the reference trajectory RT2 is a smooth curve extending from the motion start point SP1 through the reference intermediate point RP2 to the motion end point EP1, and protrudes in the Y-axis negative direction on the XY plane. is shown by a curve that

The control unit 17 shown in FIG. 3 generates control commands to the motors M1, M2, M3, M4, M5, and M6 according to the drive conditions and the reference trajectory RT2 acquired from the reference trajectory determination unit 12, and outputs the control commands to the motors M1, M2, M3, M4, M5, and M6. Send to M1, M2, M3, M4, M5, M6.

The actual trajectory acquisition unit 13 acquires the rotational positions of the motors M1, M2, M3, M4, M5 and M6 from the encoders E1, E2, E3, E4, E5 and E6, and the motors M1, M2, M3, M4, M5, Obtain the actual position of the leading link from the rotational position of M6. As an example, when the motors M1, M2, M3, M4, M5, and M6 are controlled in accordance with the driving condition indicated by the record in the second row of the driving condition table shown in FIG. , the actual trajectory AT2 acquired by the actual trajectory acquisition unit 13 is indicated by a solid line in FIG. The actual trajectory acquisition unit 13 sends the actual trajectory acquired as described above to the learning unit 14 .

The learning unit 14 calculates the distance between the ideal position of the leading link on the ideal trajectory for each control cycle and the actual position of the leading link for each control cycle indicated by the actual trajectory AT2 determined by the actual trajectory acquisition unit 13, and As indicated by an arrow ER2 in 9, the deviation of the actual position from the ideal position when the distance between the ideal position and the actual position reaches the maximum value is used as the trajectory error.

As described above, the learning unit 14 calculates the trajectory error for each reference intermediate point associated with each drive condition, and learns the correspondence between the reference intermediate point and the trajectory error. The learning unit 14 then generates an error table shown in FIG. 10 and stores it in the learned data storage unit 15 . In the example of FIG. 10, a trajectory error is stored for each reference intermediate point for a plurality of reference intermediate points corresponding to one driving condition. Trajectory errors are stored as components of a vector. Specifically, the trajectory error is represented by a component of a vector that starts from the ideal position and indicates the displacement of the actual position.

When the calculation of the trajectory errors for all the reference midpoints of all driving conditions is completed, the learning unit 14 selects the target midpoint that minimizes the trajectory error from the trajectory errors corresponding to the reference midpoints for each driving condition. Then, a trajectory model indicating the target intermediate point is generated according to the driving conditions. Specifically, the learning unit 14 learns changes in trajectory error in response to changes in the reference intermediate point for each driving condition, obtains a target intermediate point that minimizes the trajectory error, and determines the target intermediate point according to the driving condition. Generate a trajectory model showing In Embodiment 1, the learning unit 14 generates a target waypoint table shown in FIG. 11 as a trajectory model and stores it in the learned data storage unit 15 . Specifically, the learning unit 14 performs regression analysis for each drive condition to calculate a target intermediate point that minimizes the trajectory error for each drive condition. In the regression analysis, the XYZ coordinate components of the reference midpoint are used as independent variables, and the trajectory error is used as the objective function. The value of the independent variable when the trajectory error is minimized corresponds to the target waypoint.

An overview of the operation of the operation process, which is the control process of the robot 10 that is performed after the above-described learning process is completed, will be described with reference to FIG. For example, after the learning process is completed, the control device 1 starts the operation process of FIG. 12 when a driving command including driving conditions for the robot 10 is input by an operation from an operation unit (not shown).

When the target trajectory determination unit 16 acquires the drive command including the drive condition, it acquires the target intermediate point corresponding to the drive condition indicated by the drive command from the learned data storage unit 15 (step S21). Then, the target trajectory determination unit 16 determines a target trajectory passing through the target intermediate point acquired in step S21 based on the drive conditions (step S22). The controller 17 controls the motors M1, M2, M3, M4, M5 and M6 of the robot 10 according to the target trajectory determined in step S22 (step S23).

Details of the operational processing performed by the control device 1 will be described below.
Upon acquiring the drive command, the target trajectory determination unit 16 acquires a target intermediate point corresponding to the drive condition indicated by the drive command from the target intermediate point table stored in the learned data storage unit 15 . Then, the target trajectory determining unit 16 determines a target trajectory representing a trajectory from the motion start point indicated by the drive condition, through the target intermediate point, to the motion end point indicated by the drive condition. More specifically, the target trajectory determination unit 16 performs spline interpolation based on the motion start point, target intermediate point, and motion end point indicated by the drive condition, and calculates the target intermediate point from the motion start point indicated by the drive condition. A target trajectory is determined through the points to reach the motion end point indicated by the drive condition. The target trajectory indicates the target position of the leading link for each control cycle.

For example, the target trajectory determination unit 16 moves the flange 160 from the motion start point (0, 0, 0) to the motion end point (100 , 0, 0) at the target speed V1. The target trajectory determination unit 16 acquires the target intermediate point (50, -2.5, 0) indicated by the record in the first row of the target intermediate point table shown in FIG. 11 corresponding to the acquired drive condition. This target midpoint (50, -2.5, 0) is shown as target midpoint TP1 in FIG. Then, the target trajectory determination unit 16 moves from the motion start point SP1 (0,0,0) to the motion end point EP1 (100,0,0) through the target intermediate point TP1 (50,-2.5,0). A target trajectory TT1 to be reached is determined. As indicated by the dashed line in FIG. 13, the target trajectory TT1 is a smooth curve extending from the motion start point SP1 through the target intermediate point TP1 to the motion end point EP1. is indicated by After determining the target trajectory as described above, the target trajectory determination unit 16 sends the driving conditions and the determined target trajectory to the control unit 17 .

The control unit 17 shown in FIG. 3 generates control commands to the motors M1, M2, M3, M4, M5, and M6 according to the target trajectory acquired from the target trajectory determination unit 16, and outputs the control commands to the motors M1, M2, M6, and M6. Send to M3, M4, M5, M6. As a result, the motors M1, M2, M3, M4, M5 and M6 rotate according to the control commands, and the first arm 110, the second arm 120, the third arm 130, the fourth arm 140, the fifth arm 150 and the flange 160 rotate. Rotate. This causes the tip link to move. By controlling the motors M1, M2, M3, M4, M5, and M6 according to the target trajectory that minimizes the trajectory error, the actual trajectory AT1' of the leading end link approaches the ideal trajectory as shown in FIG.

As described above, the control device 1 according to Embodiment 1 learns the correspondence between the reference intermediate point and the trajectory error, obtains the target intermediate point that minimizes the trajectory error, and determines the target intermediate point according to the driving conditions. A trajectory model shown is generated, and the robot 10 is controlled according to the target trajectory passing through the target waypoint. As a result, the actual trajectory approaches the ideal trajectory, and the accuracy of control of the robot 10 by the control device 1 increases.

(Embodiment 2)
When a machining tool attached to a robot, for example, a machining tool attached to a tip link, is used to machine a workpiece, the robot receives force from the workpiece during machining. The trajectory of the tip link may deviate from the ideal trajectory under the influence of the reaction force, which is the force that the robot receives from the workpiece. In the second embodiment, a robot system 101 that includes a robot 30 and a control device 2 that controls the robot 30 according to the reaction force that the robot 30 receives during processing is mainly different from the robot system 100 according to the first embodiment. to explain.

A robot 30 shown in FIG. 14 includes a force sensor 190 in addition to the configuration of the robot 10 according to the first embodiment. A processing tool 200 is attached to the flange 160 . By controlling the robot 30 to which the processing tool 200 is attached, it is possible to perform processing such as deburring of the workpiece, attachment of parts, and the like.

The force sensor 190 outputs a sensor signal corresponding to the reaction force, which is the force that the flange 160 receives during machining of the workpiece by the machining tool 200 .

The control device 2 included in the robot system 101 shown in FIG. and a learning unit 18 that learns the error and the reaction force. The reaction force acquisition unit 19 detects the value of the sensor signal acquired from the force sensor 190 in each control cycle, calculates the reaction force received by the robot 30 from the detected value in each control cycle, and obtains the calculated reaction force. Send to learning unit 18 . The learning unit 18 acquires the reaction force calculated by the reaction force acquiring unit 19, and calculates the reaction force corresponding to the trajectory error, that is, the ideal position of the leading end link in each control cycle and the ideal position of the leading end link in each control cycle indicated by the actual trajectory. Obtain the reaction force when the distance from the actual position is maximum.

The hardware configuration of the control device 2 is the same as that of the control device 1 shown in FIG. The control device 2 is connected to the motors M1, M2, M3, M4, M5 and M6, the encoders E1, E2, E3, E4, E5 and E6 and the force sensor 190 via the interface 33.

The learning process performed by the control device 2 having the above configuration will be explained using FIG. For example, when the robot 30 performs processing for the first time after the robot 30 is installed, the control device 2 starts the learning process of FIG. 16 . The processing from steps S11 to S12 is the same as the processing performed by the control device 1 shown in FIG. The control unit 17 controls the motors M1, M2, M3, M4, M5, and M6 of the robot 10 according to the reference trajectory determined in step S12, and the actual trajectory acquisition unit 13 acquires the actual trajectory of the robot 10. Then, the reaction force acquiring unit 19 calculates the reaction force (step S17). Specifically, in step S17, the motors M1, M2, M3, M4, M5, and M6 are controlled based on the reference trajectory, so that machining by the robot 30 is performed. While the motors M1, M2, M3, M4, M5, and M6 are being controlled based on the reference trajectory, the actual trajectory acquisition unit 13 acquires the actual trajectory of the tip link of the robot 30, and the reaction force acquisition unit 19 , the reaction force is calculated from the sensor signal output by the force sensor 190 .

The learning unit 18 calculates a trajectory error indicating the deviation of the actual trajectory acquired in step S17 from the ideal trajectory based on the driving conditions acquired in step S11, and calculates the distance between the trajectory error and the ideal position and the actual position. The maximum reaction force is associated (step S18).

As in the first embodiment, while the correspondence between the trajectory error and the reaction force is not completed for all the reference intermediate points of all the drive conditions stored in the drive condition storage unit 11 (step S19; No). , the above-described steps S11, S12, S17, and S18 are repeated. In other words, the learning unit 18 calculates the trajectory error for each reference intermediate point associated with each driving condition, and learns the correspondence between the driving condition, the reference intermediate point, the trajectory error, and the reaction force. The learning unit 18 then generates an error table shown in FIG. 17 and stores it in the learned data storage unit 15 . In the example of FIG. 17, a trajectory error and a reaction force are stored for each reference intermediate point with respect to a plurality of reference intermediate points corresponding to one drive condition. The reaction force is represented by force Fx in the X-axis direction, force Fy in the Y-axis direction, and force Fz in the Z-axis direction. In the example of FIG. 17, the reaction force represented by (Fx, Fy, Fz) is stored in the error table.

When the correspondence between the trajectory error and the reaction force is completed for all the reference intermediate points of all the driving conditions (step S19; Yes), the learning unit 18 obtains the target intermediate point that minimizes the trajectory error according to the driving condition. A trajectory model indicating the target waypoint is generated (step S20). Specifically, the learning unit 18 uses a multiple regression model for each driving condition to calculate a target intermediate point that minimizes the trajectory error for each driving condition. In the multiple regression model, the XYZ coordinate components of the reference midpoint and the reaction force are used as independent variables, and the trajectory error is used as the objective function. The reference midpoint at which the trajectory error is minimized corresponds to the target midpoint. The learning unit 18 generates the target waypoint table shown in FIG. 11, as in the first embodiment. When the process of step S20 is completed, the control device 2 ends the learning process.

The operation process performed by the control device 2 after the above-described learning process is completed is the same as in the first embodiment. The control device 2 performs operation processing shown in FIG. 12 to control the robot 30 .

As described above, the control device 2 according to the second embodiment obtains the target intermediate point that minimizes the trajectory error from the trajectory error and the reaction force corresponding to each of the reference intermediate points for each driving condition. A trajectory model indicating the corresponding target waypoint is generated, and the robot 30 is controlled according to the target trajectory passing through the target waypoint. Since the target intermediate point is determined according to the reaction force, which is the force that the robot 30 receives during machining, the controller 2 can control the robot 30 with high accuracy.

(Embodiment 3)
In

Embodiments

1 and 2, all reference midpoints corresponding to each driving condition are stored in advance in the drive condition table, but the reference midpoint may be estimated. In the third embodiment, a robot system 102 that includes a robot 10 and a control device 3 that estimates at least a part of reference intermediate points and performs learning processing will focus on differences from the robot system 100 according to the first embodiment. to explain.

The control device 3 included in the robot system 102 shown in FIG. 18 further includes a midpoint estimation unit 20 in addition to the configuration of the control device 1 according to the first embodiment. The midpoint estimating unit 20 detects a plurality of reference midpoints corresponding to the driving conditions in the driving condition storage unit 11, or when a plurality of reference midpoints corresponding to the driving conditions are stored in the driving condition storage unit 11. If the number of reference waypoints is not sufficient, the reference waypoints are estimated.

Specifically, the midpoint estimation unit 20 estimates a plurality of reference midpoints corresponding to other driving conditions from the target midpoint corresponding to the driving condition indicated by the trajectory model generated by the learning unit 14 . For example, the intermediate point estimating unit 20 selects the driving conditions stored in the learned data storage unit 15 and the target intermediate points corresponding to the driving conditions, and the driving condition storage unit 11 stores a plurality of corresponding reference intermediate points. Estimate multiple reference waypoints for non-zero driving conditions. Then, the midpoint estimation unit 20 associates the drive condition with the estimated plurality of reference midpoints and stores them in the drive condition storage unit 11 .

The hardware configuration of the control device 3 is the same as the hardware configuration of the control device 1 shown in FIG.

The learning process performed by the control device 3 having the above configuration will be explained using FIG. As with the control device 1, for example, when the control device 3 starts operating for the first time after the robot 10 is installed, the control device 3 starts the learning process of FIG. The processing from steps S11 to S16 is the same as the processing performed by the control device 1 shown in FIG. In other words, the trajectory error is calculated and the trajectory model indicating the target intermediate point is generated in the same manner as in the first embodiment based on the driving conditions and a plurality of reference intermediate points stored in the driving condition storage unit 11 in advance. will be

When the processing of step S16 is completed, the control device 3 performs the processing from step S31 onward. The midpoint estimation unit 20 determines whether or not it is necessary to estimate the reference midpoint corresponding to the drive conditions stored in the drive condition storage unit 11 (step S31). Specifically, the intermediate point estimation unit 20 determines that the number of reference intermediate points associated with the driving conditions stored in the driving condition storage unit 11 is the number necessary for the learning unit 14 to generate the trajectory model. It is determined whether or not the reference midpoint needs to be estimated based on whether or not the above conditions are met. As an example, the midpoint estimating unit 20 determines that the reference midpoint needs to be estimated when the driving conditions stored in the driving condition storage unit 11 are not associated with the reference midpoint. As another example, if the number of reference intermediate points corresponding to each driving condition stored in the driving condition storage unit 11 is less than 50, the intermediate point estimating unit 20 determines that estimation of the reference intermediate point is necessary. judge.

For example, in the drive condition table shown in FIG. 20, the flange 160 is moved from the operation start point (0, 0, 0) to the operation end point ( −100, 0, 0) at the target speed V1 is not associated with a reference intermediate point. Similarly, with the tip surface 160a of the flange 160 parallel to the plane to which the robot 10 is fixed, the flange 160 is moved from the motion start point (0,0,0) to the motion end point (0,100,0). No reference midpoint is associated with the drive condition for driving at the target speed V1. Therefore, in the process of step S31 of FIG. 19, the midpoint estimating unit 20 determines that the reference midpoint needs to be estimated for these drive conditions (step S31; Yes).

If it is necessary to estimate the reference intermediate point (step S31; Yes), the intermediate point estimation unit 20 estimates the reference intermediate point from the target intermediate point according to the driving conditions indicated by the trajectory model (step S32).

Specifically, the midpoint estimating unit 20 selects a plurality of driving conditions that require estimation of a reference midpoint from the driving conditions stored in the learned data storage unit 15 and target midpoints corresponding to the driving conditions. Estimate the reference waypoint. For example, the target midpoint of each drive condition is obtained from the target midpoint table shown in FIG. Estimate the target waypoint for no drive condition. Then, the midpoint estimation unit 20 estimates a plurality of reference midpoints three-dimensionally arranged as shown in FIG. 7 in a space centered on the estimated target midpoint. The space centered on the estimated target midpoint is preferably narrower than the space in which the plurality of reference midpoints pre-stored in the driving condition storage unit 11 are positioned, for example, the space S1 in FIG. As a result, the time required to calculate the trajectory error can be shortened, and as a result, the target intermediate point can be obtained efficiently.

In the process of step S32 in FIG. 19, the midpoint estimating unit 20 estimates a plurality of reference midpoints for each driving condition that requires estimation of a reference midpoint as described above, and determines each driving condition and the estimated multiple is stored in the drive condition storage unit 11 in association with the reference midpoint of (step S33). As a result, as shown in FIG. 21, a record is added to the driving condition table stored in the driving condition storage unit 11 to associate the driving condition with the estimated reference intermediate point.

When the process of step S33 in FIG. 19 is completed, the control device 3 repeats the above-described processes from the process of step S11. In step S33, since a record that associates the driving condition with the estimated reference midpoint is added to the driving condition table stored in the driving condition storage unit 11, the driving condition and the estimated reference midpoint are: Learning processing is performed in the same manner as in the first embodiment. It is not necessary to repeat the learning process for driving conditions that have already been subjected to the learning process and for which no record associated with the estimated reference midpoint has been added.

In step S33, a record that associates the driving condition with the estimated reference midpoint is added to the driving condition table stored in the driving condition storage unit 11, and the learning process described above is performed. 20 determines that estimation of the reference midpoint is unnecessary (step S31; No). If estimation of the reference midpoint is unnecessary (step S31; No), the control device 3 terminates the learning process.
The operation process performed by the control device 3 after the above-described learning process is completed is the same as that of the first embodiment.

As described above, the control device 3 according to Embodiment 3 estimates a reference midpoint corresponding to another driving condition from the target midpoint corresponding to the driving condition indicated by the trajectory model. A reference waypoint is estimated based on a trajectory model, a trajectory error is calculated based on the estimated reference waypoint, and a target waypoint is obtained. In comparison, the learning efficiency of the control device 3 is high. By reducing the number of reference intermediate points to be estimated to less than the number of reference intermediate points stored in advance in the drive condition storage unit 11, it is possible to learn the control device 3 more efficiently.

(Embodiment 4)
In

Embodiments

1 and 2, all reference midpoints corresponding to each driving condition are stored in advance in the drive condition table, but the reference midpoint may be estimated. Furthermore, driving conditions may also be estimated. In the fourth embodiment, the robot system 103 including the robot 10 and the control device 4 that estimates at least part of the driving conditions and at least part of the reference intermediate points to perform learning processing is described as the robot according to the first embodiment. The points different from the system 100 will be mainly described.

The control device 4 included in the robot system 103 shown in FIG. 22 further includes a drive condition estimation unit 21 in addition to the configuration of the control device 1 according to Embodiment 1. The driving condition estimation unit 21 estimates a reference intermediate point and stores it in the driving condition storage unit 11 when the number of reference intermediate points stored corresponding to the driving condition in the driving condition storage unit 11 is not sufficient. If there are not enough driving conditions, a driving condition and a plurality of reference intermediate points corresponding to the driving condition are estimated.

More specifically, the driving condition estimating unit 21 uses the trajectory error calculated by the learning unit 14 according to the driving conditions stored in the driving condition storage unit 11 and the reference intermediate point to determine the distance stored in the driving condition storage unit 11. A new reference midpoint associated with the drive condition, a new drive condition not stored in the drive condition storage unit 11, and a plurality of reference midpoints associated with the new drive condition are estimated. Then, the driving condition estimation unit 21 associates the driving conditions stored in the driving condition storage unit 11 with the new reference midpoint estimated based on the driving conditions, and stores them in the driving condition storage unit 11 . The driving condition estimating unit 21 further stores the estimated driving condition and each of the plurality of estimated reference intermediate points in the driving condition storage unit 11 in association with each other.

The hardware configuration of the control device 4 is the same as the hardware configuration of the control device 1 shown in FIG.

The learning process performed by the control device 4 having the above configuration will be explained using FIG. As with the control device 1, for example, when the control device 4 starts operating for the first time after the robot 10 is installed, the control device 4 starts the learning process of FIG. The processing from steps S11 to S15 is the same as the processing performed by the control device 1 shown in FIG. In other words, the locus error calculation process is performed in the same manner as in the first embodiment, based on the driving conditions and the reference intermediate point stored in advance in the driving condition storage unit 11 .

When the processing of step S15 is completed, the control device 4 performs the processing from step S41 onward. The driving condition estimation unit 21 determines whether or not it is necessary to estimate the reference intermediate point corresponding to the driving conditions stored in the driving condition storage unit 11 (step S41). Specifically, the driving condition estimation unit 21 determines whether or not the number of reference intermediate points corresponding to the driving conditions stored in the driving condition storage unit 11 is equal to or greater than the number necessary for generating the trajectory model in the learning unit 14. It is determined whether or not the reference midpoint needs to be estimated. For example, if the number of reference midpoints corresponding to each driving condition stored in the driving condition storage unit 11 is less than 50, the driving condition estimation unit 21 determines that estimation of the reference midpoint is necessary.

For example, when the driving condition table stored in the driving condition storage unit 11 stores only the first row record shown in FIG. 23, the tip surface 160a of the flange 160 is parallel to the plane to which the robot 10 is fixed, and the flange 160 is moved from the operation start point (0, 0, 0) to the operation end point It is determined that the reference midpoint needs to be estimated for the driving condition of driving at the target speed V1 up to (100, 0, 0) (step S41; Yes).

If the reference midpoint needs to be estimated (step S41; Yes), the driving condition estimating unit 21 estimates the reference midpoint using a regression algorithm based on the error table stored in the learned data storage unit 15. (step S42). For example, the drive condition estimator 21 moves the flange 160 to the operation start point (0, 0, 0) with the tip surface 160a of the flange 160 parallel to the plane to which the robot 10 is fixed, using a Gaussian process regression algorithm. A reference intermediate point is estimated with respect to the drive conditions for driving from the point (100, 0, 0) to the end point (100, 0, 0) at the target speed V1. The objective function of Gaussian process regression algorithm is trajectory error.

The driving condition estimation unit 21 stores the reference midpoint estimated in step S42 in the driving condition storage unit 11 in association with the driving condition (step S43). As a result, for example, records from the second line onward shown in FIG. 6 are generated. When the processing of step S43 is completed, the control device 4 repeats the above-described processing from step S11. In step S43, since a record that associates the driving condition with the estimated reference midpoint is added to the driving condition table stored in the driving condition storage unit 11, the driving condition and the estimated reference midpoint are: A locus error calculation process is performed in the same manner as in the first embodiment. It is not necessary to repeat the learning process for the driving conditions that have already been calculated.

As a result of repeating the above-described process, when a sufficient number of reference midpoints are associated with each driving condition and stored in the driving condition storage unit 11, the driving condition estimating unit 21 estimates the reference midpoint as It is determined that it is unnecessary (step S41; No). If estimation of the reference midpoint is unnecessary (step S41; No), the driving condition estimator 21 determines whether estimation of the driving condition is necessary (step S44).

Specifically, the driving condition estimating unit 21 determines whether or not the driving condition stored in the driving condition storage unit 11 is sufficient for the type of control pattern of the robot 10. Judge no. For example, if the number of types of driving conditions stored in the driving condition storage section 11 is less than the predetermined number of types of control patterns for the robot 10, the driving condition estimating section 21 determines that the driving condition needs to be estimated. judge. The driving condition estimating unit 21 only needs to hold information about the types of control patterns of the robot 10 in a memory (not shown).

For example, when the process of step S43 ends, the driving condition stored in the driving condition storage unit 11 is such that the flange 160 is parallel to the plane to which the robot 10 is fixed, and the tip surface 160a of the flange 160 is parallel to the plane to which the robot 10 is fixed. is driven at the target speed V1 from the operation start point (0,0,0) to the operation end point (100,0,0), the drive condition estimator 21 needs to estimate the drive condition. It is determined that there is (step S44; Yes).

If it is necessary to estimate the driving condition (step S44; Yes), the driving condition estimating unit 21 uses a regression algorithm based on the error table stored in the learned data storage unit 15 to estimate other driving conditions. A plurality of reference intermediate points corresponding to the driving conditions are estimated (step S45). Other driving conditions are driving conditions different from the driving conditions stored in the driving condition storage unit 11 . For example, the drive condition estimator 21 moves the flange 160 to the operation start point (0, 0, 0) with the tip surface 160a of the flange 160 parallel to the plane to which the robot 10 is fixed, using a Gaussian process regression algorithm. to the operation end point (−100, 0, 0) at the target speed V1 is estimated, and a plurality of reference intermediate points corresponding to the estimated driving conditions are estimated. Estimation of a plurality of reference midpoints corresponding to the estimated drive conditions is the same as the process of step S42.

The driving condition estimation unit 21 associates the driving conditions estimated in step S45 with the plurality of reference intermediate points and stores them in the driving condition storage unit 11 (step S46). As a result, for example, as shown in FIG. 21, the end surface 160a of the flange 160 is parallel to the plane to which the robot 10 is fixed, and the flange 160 is moved from the operation start point (0, 0, 0) to the end of the operation. A plurality of records are generated for the drive condition to drive at the target speed V1 to the point (-100,0,0). When the processing of step S46 is completed, the control device 4 repeats the above-described processing from step S11. In step S46, since a record that associates the estimated driving condition with each of the plurality of estimated reference intermediate points is added to the driving condition table stored in the driving condition storage unit 11, the estimated driving For a plurality of reference intermediate points estimated as conditions, the locus error calculation process is performed in the same manner as in the first embodiment. It is not necessary to repeat the calculation process for the driving conditions that have already been calculated.

As a result of repeating the above process, when a sufficient number of driving conditions and a sufficient number of reference midpoints corresponding to each driving condition are stored in the driving condition storage unit 11, the driving condition estimation unit 21 It is determined that estimation of the reference intermediate point is unnecessary (step S41; No) and that estimation of the drive condition is unnecessary (step S44; No). If estimation of the reference intermediate point is unnecessary (step S41; No) and estimation of the drive condition is unnecessary (step S44; No), the learning unit 14 calculates the locus error as in the first embodiment. A trajectory model indicating a target intermediate point to be minimized for each drive condition is generated (step S16). When the process of step S16 is completed, the control device 4 terminates the learning process.
Operational processing performed by the control device 4 after the above-described learning processing is completed is the same as that of the first embodiment.

As described above, the control device 4 according to the fourth embodiment estimates the reference midpoint corresponding to the driving condition stored in the driving condition storage unit 11, and based on the estimated reference midpoint, Calculate the trajectory error and determine the target waypoint. Further, the control device 4 estimates driving conditions not stored in the driving condition storage unit 11, estimates a plurality of reference midpoints corresponding to the estimated driving conditions, and calculates the estimated driving conditions and the plurality of reference midpoints. Calculate the trajectory error based on and obtain the target intermediate point. Therefore, even if a large number of driving conditions and a large number of reference intermediate points are not stored in advance in the driving condition storage unit 11, the trajectory error can be calculated for various driving conditions for controlling the robot 10, and the actual trajectory can be calculated. It is possible to approach the ideal trajectory.

(Embodiment 5)
The method of acquiring the trajectory of the robot 10 is not limited to the above example. A robot system 104 that includes a robot 10 and a control device 5 that obtains the trajectory of the robot 10 by a method different from that of the first embodiment will be described, focusing on differences from the robot system 100 according to the first embodiment.

The control device 5 shown in FIG. 24 includes an actual trajectory acquisition unit 22 that acquires the trajectory of the robot 10 regardless of the rotational positions of the motors M1, M2, M3, M4, M5, and M6. The actual trajectory acquisition unit 22 has a measuring device for measuring the trajectory of the robot 10, for example, a three-dimensional measuring device, and acquires the trajectory of the robot 10 according to the measurement values of the measuring device. The three-dimensional measuring device acquires the position of the tip link of the robot 10 by irradiating the movement range of the tip link of the robot 10 with a laser beam and receiving the laser beam reflected by the tip link of the robot 10 .

In the fifth embodiment, the actual trajectory acquisition unit 22 performs the above-described processing for each control cycle to calculate the actual trajectory indicating the actual position of the leading link in the XYZ orthogonal coordinate system for each control cycle. . The actual trajectory acquisition unit 22 then sends the actual trajectory to the learning unit 14 .

The hardware configuration of the control device 5 is the same as that of the control device 1 shown in FIG.
The learning process performed by the control device 5 having the above configuration is the same as that of the control device 1 except for the method of acquiring the trajectory of the leading link. Operation processing performed by the control device 5 is the same as that of the control device 1 .

As described above, the controller 5 according to the fifth embodiment acquires the trajectory of the robot 10 regardless of the rotational positions of the motors M1, M2, M3, M4, M5, and M6. The accuracy of the actual trajectory is improved because the error caused by the mechanical rigidity of the

The present disclosure is not limited to the above embodiments. It is possible to arbitrarily combine multiple embodiments among the above-described embodiments. As an example, the control device 3-5 calculates the reaction force from the measurement value of the force sensor 190 in the same way as the control device 2, and determines the target intermediate point according to the trajectory error and the reaction force corresponding to the trajectory error. good too.

The drive conditions are not limited to the above examples. As an example, the driving conditions may further include information about the workpiece. Specifically, the drive conditions may include the motion start point, motion end point, speed, and attitude of the leading link, and the weight of the workpiece. The learning unit 14 may acquire information about the weight of the workpiece, which is the object to be processed, from an external device, for example, a weight sensor that measures the weight of the object to be processed. Specifically, the control device 1-5 acquires the measured values from the weight sensors, controls the

robots

10 and 30 according to the driving conditions stored in the driving condition storage unit 11, and uses the driving conditions and the measured values of the weight sensors to A trajectory model is generated by repeating calculation of the corresponding trajectory error. As a result, as shown in FIG. 25, the learning unit 14 creates a target intermediate point table that associates driving conditions, including the motion start point, motion end point, speed, and attitude of the leading link and the weight of the workpiece, with the target intermediate points. to generate

During operation, when the target trajectory determination unit 16 acquires a drive command, the target trajectory determination unit 16 acquires from the learned data storage unit 15 the motion start point, motion end point, speed, attitude, and measurement values of the weight sensor indicated by the drive command. Get the target waypoint according to . Then, the target trajectory determination unit 16 determines a target trajectory passing through the target intermediate point based on the motion start point, motion end point, speed, and attitude of the leading link indicated by the drive command. Control accuracy of the

robots

10 and 30 is improved by controlling the

robots

10 and 30 based on the target trajectory corresponding to the weight of the object to be processed.

Information about the workpiece is not limited to the weight of the workpiece, and may include at least one of the weight, shape, and dimensions of the workpiece.

As another example, the drive conditions may include the motion start point, motion end point, time required from the motion start point to the motion end point, and attitude of the leading link.
In the above-described embodiments, the driving conditions instruct linear movement on a plane, but the driving conditions may instruct three-dimensional movement, curvilinear movement, and the like.

In the above-described embodiment, one record of the driving condition table stored in the driving condition storage unit 11 stores one reference midpoint. Intermediate points may be stored. In this case, the reference trajectory determining unit 12 may determine a reference trajectory passing through a plurality of reference intermediate points stored in the same record.

The driving condition storage unit 11 and the learned data storage unit 15 may be provided outside the control device 1-5. As an example, the driving condition storage unit 11 and the learned data storage unit 15 may be implemented as functions of storage devices on a network.

The methods for determining the reference trajectory, ideal trajectory, and target trajectory are not limited to the above examples. Specifically, the reference trajectory RT2 determined by the reference trajectory determination unit 12 in the first embodiment is a smooth curve from the motion start point SP1 through the reference intermediate point RP2 to the motion end point EP1. Although indicated by a curve projecting in the Y-axis negative direction, the reference trajectory RT2 is not limited to such a curve, and may be indicated by curves projecting in a plurality of directions. As an example, the reference trajectory RT2 projects in the negative Y-axis direction from the motion start point SP1 to the motion end point EP1, and extends in the positive Y-axis direction on the XY plane near the motion start point SP1 and the motion end point EP1. It may be represented by a protruding curve.

As another example, the reference trajectory determination unit 12 linearly interpolates the motion start point and the reference intermediate point indicated by the driving condition, and linearly interpolates the reference intermediate point and the operation end point indicated by the driving condition, thereby determining the reference trajectory. You may Even when the reference trajectory is determined by linear interpolation, the control unit 17 controls the motors M1, M2, M3, M4, M5, and M6 to smoothly rotate, so that the tip links move smoothly and the actual trajectory is obtained. becomes a smooth curve.

As another example, the reference trajectory determining unit 12 determines the reference trajectory using a table (not shown) that associates the motion start point, the reference intermediate point, the motion end point, and the reference position of the leading end link for each control cycle. may be determined.

The ideal trajectory and the target trajectory can also be determined by a method similar to the method for determining the reference trajectory described above.

In Embodiment 1, since the robot 10 is moved linearly, the ideal trajectory is a straight line. Specifically, the ideal trajectory may be a circular arc, a curved line, or a combination of straight lines, circular arcs, and curved lines.

The target trajectory TT1 determined by the target trajectory determination unit 16 in the first embodiment is a smooth curve from the motion start point SP1 through the target intermediate point TP1 to the motion end point EP1, and extends in the negative Y-axis direction on the XY plane. , the target trajectory TT1 is not limited to such a curve. As an example, the target trajectory TT1 projects in the negative Y-axis direction from the motion start point SP1 to the motion end point EP1, and extends in the positive Y-axis direction on the XY plane near the motion start point SP1 and the motion end point EP1. It may be represented by a protruding curve.

As another example, the target trajectory determination unit 16 linearly interpolates between the motion start point and the target intermediate point indicated by the drive condition, and linearly interpolates the target midpoint and the motion end point indicated by the drive condition, thereby determining the target trajectory. You may Even when the target trajectory is determined by linear interpolation, the control unit 17 controls the motors M1, M2, M3, M4, M5, and M6 to smoothly rotate, so that the tip links move smoothly and the actual trajectory is obtained. becomes a smooth curve.

The method of determining the target waypoint is not limited to the above example. As an example, the

learners

14, 18 may determine the target waypoint using linear regression, curvilinear regression, polynomial regression, multidimensional function fitting, or the like. For example, the learning

units

14 and 18 may determine the target intermediate point that minimizes the trajectory error by performing multidimensional function fitting with the XYZ coordinate components of the reference intermediate point as independent variables and the trajectory error as the objective function. good.

The method of generating the trajectory model is not limited to the above example. As an example, the learning

units

14 and 18 may perform supervised learning according to the neural network model to generate the trajectory model. Supervised learning means that by giving a large set of input and result datasets to a learner, the learner learns the features in the large dataset and produces a model that infers the result from the input. . For example, the learning

units

14 and 18 generate a trajectory model, which is a neural network model, using the motion start point and motion end point indicated by the drive condition as input data and the target intermediate point as result data. Using the generated trajectory model, it is possible to obtain the target intermediate point that minimizes the trajectory error according to the drive conditions.

In machine learning, it is known that over-learning reduces estimation accuracy, so it is preferable to suppress over-learning when using any learning model. Methods of suppressing over-learning include, for example, a method of normalizing learning data, a method of reducing the number of parameters in a learning model, and the like. As an example, when performing multidimensional function fitting, it is preferable to limit the number of dimensions of the function, for example, to 5 or less. As another example, in a neural network model, it is preferable to limit the number of hidden layers, for example, to 10 layers or less. Moreover, in the neural network model, the output of a specific layer of the learning model may be randomly set to 0 during learning. Furthermore, over-learning may be suppressed by combining the above methods.

The learning

units

14 and 18 may perform learning processing in accordance with the control of the

robots

10 and 30 that will be performed in the immediately following operational processing. As an example, the operation of the learning unit 14 when it is predetermined in the operational process to move the

robots

10 and 30 from the motion start point (0,0,0) to the motion end point (100,0,0) will be described. do. In this case, the learning unit 14, based on the drive conditions including the motion start point (0,0,0) and the motion end point (100,0,0), and a plurality of reference midpoints associated with the drive conditions, A trajectory model can be generated by learning the trajectory error. The learning efficiency is improved by performing the learning process based on the drive conditions corresponding to the control of the

robots

10 and 30 performed in the operation process and the plurality of reference intermediate points associated with the drive conditions.

As another example, in the operational process, the robot 30 is moved from the operation start point (0,0,0) to the operation end point (100,0,0) with the tip surface 160a of the flange 160 positioned horizontally. The operation of the learning unit 14 will be described by taking as an example the case of processing a workpiece with a weight of 4 kg. In this case, the learning unit 14 includes a motion start point (0, 0, 0) and a motion end point (100, 0, 0), a posture of 0 degrees, and a weight of 4 kg. A trajectory model may be generated by learning a trajectory error based on a plurality of reference intermediate points associated with conditions.

In this case, the learning

units

14 and 18 calculate the trajectory error based on not only the driving condition that completely matches the driving condition in the operational process, but also the driving condition that partially matches the driving condition in the operational process. may be used to generate the trajectory model. As described above, the robot 30 is moved from the operation start point (0, 0, 0) to the operation end point (100, 0, 0) with the tip surface 160a of the flange 160 positioned horizontally. The operation of the learning unit 14 will be described by taking as an example the case of machining a workpiece with a weight of 4 kg. The learning unit 14 calculates a trajectory error and generates a trajectory model based on a plurality of reference midpoints associated with the driving condition indicating that the posture is 0 degrees and the weight is 4 kg, and the driving condition. good too. Alternatively, the learning unit 14 includes a plurality of driving conditions that include an operation start point (0, 0, 0) and an operation end point (100, 0, 0) and indicate a weight of 4 kg and a plurality of driving conditions that are associated with the driving condition. A trajectory model may be generated by calculating a trajectory error based on the reference waypoints.

The learning

units

14 and 18 may calculate the distance between the ideal position and the actual position of the leading link for each control cycle, and use the deviation of the actual position from the ideal position when the distance reaches the maximum value as the trajectory error. . In this case, as shown in FIG. 26, the learning

units

14 and 18 may store the deviation of the actual position from the ideal position when the distance reaches the maximum value as the trajectory error in the error table. Then, the learning

units

14 and 18 may determine the target waypoint based on the trajectory error.

The learning

units

14 and 18 may store the actual position of the tip link corresponding to the trajectory error in the error table. For example, when the learning

units

14 and 18 use, as the trajectory error, the deviation of the actual position from the ideal position when the distance between the ideal position and the actual position of the tip link in each control cycle becomes a maximum value, Thus, the actual position of the tip link when the distance reaches the maximum value may be stored in the error table. Then, the learning

units

14 and 18 may determine the target intermediate point based on the trajectory error and the actual position of the leading link corresponding to the trajectory error.

As an example, the learning unit 18 may store the reaction force calculated from the sensor signal output by the force sensor 190 when the distance reaches the maximum value in the error table, as shown in FIG. Then, the learning unit 18 may determine the target intermediate point based on the trajectory error and the reaction force corresponding to the trajectory error. As another example, the learning unit 18 may store the maximum value of the reaction force and the average value of the reaction force in the error table.

The learning

units

14 and 18 may perform the learning process again in parallel with the operation process performed after the learning process is completed. More specifically, the learning

units

14 and 18 determine the deviation of the actual trajectory acquired by the actual trajectory acquisition unit 13 from the ideal trajectory according to the driving condition indicated by the drive command acquired by the target trajectory determination unit 16 during operation processing. A trajectory error indicating is calculated, and a target intermediate point is obtained. By performing the learning process in parallel with the operation process, the precision of the target intermediate point that minimizes the trajectory error is improved, and as a result, the control precision of the

robots

10 and 30 is improved.

The learning

units

14 and 18 may determine the target intermediate point using the absolute value, average value, and median of the deviation of the actual position of the tip link from the ideal position as the trajectory error.

When determining the target waypoint, the learning

units

14 and 18 estimate the trajectory error when the

robots

10 and 30 are controlled according to the target trajectory passing through the target waypoint, and estimate the target waypoint and the trajectory error. A trajectory model may be generated that indicates the value. In this case, the waypoint estimating unit 20 may estimate a plurality of reference waypoints arranged at intervals according to the estimated value of the trajectory error indicated by the trajectory model. For example, the intermediate point estimating section 20 may widen the intervals between the plurality of reference intermediate points to be estimated as the estimated value of the trajectory error becomes smaller. As an example, the intermediate point estimating unit 20 is spaced from each other by a length that is twice the maximum value of the trajectory error when the robot 10 is controlled under other driving conditions indicated by the trajectory model generated by the learning unit 14. A plurality of reference waypoints arranged in three dimensions may be estimated.

The driving condition estimation unit 21 may estimate the driving condition within a predetermined driving condition range. For example, in the driving condition estimating unit 21, the maximum value of the distance from the movement start point to the movement end point and the maximum speed are determined in advance according to the performance of the

robots

10 and 30. to the motion end point is less than the specified maximum value and the speed is less than the specified maximum value.

The learning process performed by the control device 1-5 may be a function of another device. As an example, FIG. 29 shows an example in which the functions of the control device 1 according to Embodiment 1 are implemented by a control device 6 that controls the robot 10 and a learning device 7 that generates a trajectory model. A robot system 105 shown in FIG. 29 includes a robot 10 , a control device 6 that controls the robot 10 , and a learning device 7 that learns a trajectory model for controlling the robot 10 .

The learning device 7 includes a driving condition storage unit 11, a reference trajectory determination unit 12, an actual trajectory acquisition unit 13, a learning unit 14, and a learned data storage unit 15. The function of each part of the learning device 7 is the same as the function of each corresponding part of the control device 1 . The control device 6 includes a target trajectory determination section 16 and a control section 17 . The function of each part of the control device 6 is the same as the function of each corresponding part of the control device 1 . The target trajectory determination unit 16 included in the control device 6 may acquire the trajectory model from the learned data storage unit 15 included in the learning device 7 . A control unit 17 provided in the control device 6 controls the robot 10 according to the reference trajectory acquired from the reference trajectory determination unit 12 provided in the learning device 7 or the target trajectory determined by the target trajectory determination unit 16 . The learned data storage unit 15 may be an independent storage device accessible from the control device 6 and the learning device 7 . Similarly, the functions of the control device 2-5 can be realized by the control device 6 and the learning device 7. FIG.

As another example, in the robot system 105 shown in FIG. 29, among the functions of the learning unit 14 provided in the learning device 7, the control device 6 may be provided with processing for generating a trajectory model. In this case, the learning device 7 may calculate the trajectory error for each of the plurality of reference intermediate points for each driving condition, and store the trajectory error in the learned data storage unit 15 . The control device 6 may acquire and hold the data previously learned by the learning device 7, specifically, the trajectory error stored in the learned data storage unit 15. FIG. Then, the control device 6 may generate a trajectory model indicating a target intermediate point that minimizes the trajectory error for each driving condition.

The trajectories of the

robots

10 and 30 that are controlled by the control device 1-6 are not limited to the trajectories of the tip links, but are the trajectories of any part of the

robots

10 and 30 or the processing tool 200 attached to the robot 30.

The method of acquiring the trajectory of the leading link by the actual trajectory acquisition unit 22 is not limited to the above example. At least the first arm 110, the second arm 120, the third arm 130, the fourth arm 140, the fifth arm 150, and the flange 160 may Any mechanical deflection may occur and the actual trajectory may deviate from the ideal trajectory. For example, the actual trajectory acquisition unit 22 may have a plurality of cameras as measuring instruments in order to accurately acquire the actual trajectory even if mechanical deflection occurs due to an external factor. The actual trajectory acquisition unit 22 may photograph the movement range of the tip link of the robot 10 with a plurality of cameras, and determine the position of the tip link of the robot 10 from the images photographed by the plurality of cameras.

As another example, the actual trajectory acquisition unit 22 has an acceleration sensor attached to the tip link of the robot 10 as a measuring device. The actual trajectory acquisition unit 22 may determine the position of the tip link of the robot 10 based on the measurement value of the acceleration sensor.

The actual trajectory acquisition unit 22 may measure the position of the tip link of the robot 10 in each acquisition cycle independent of the control cycle. In this case, the learning unit 14 interpolates the position of the leading link of the robot 10 for each acquisition cycle, determines the position of the leading link of the robot 10 for each control cycle, and compares it with the ideal position. Specifically, the learning unit 14 performs interpolation processing using linear interpolation, polynomial interpolation, or the like.

The control unit 17 may feedback-control the motors M1, M2, M3, M4, M5, and M6 according to the position of the tip link of the robot 10 acquired by the actual trajectory acquisition unit 22 described above.

In addition, the above hardware configuration and flowchart are examples, and can be changed and modified arbitrarily. As an example, the control device 1-6 and the learning device 7 may start learning processing according to the operation of an operation unit (not shown). As another example, the learning unit 14 included in the control device 1 may generate a trajectory model when trajectory error learning is completed for all reference intermediate points associated with each driving condition. In other words, the learning unit 14 may calculate the trajectory error for all the associated reference intermediate points and generate the trajectory model repeatedly for each driving condition.

In the above embodiment, the

robots

10 and 30 are actually operated to calculate the trajectory error, but the learning process may be performed by simulation.

A computer program for executing the above operations is stored in a computer-readable recording medium such as a flexible disk, CD-ROM (Compact Disc - Read Only Memory), DVD-ROM (Digital Versatile Disc - Read Only Memory). The control device 1-6 and the learning device 7 that execute the above-described operations may be implemented by distributing the software and installing the computer program in a computer. Alternatively, the controller 1-6 and the learning device 7 that perform the above operations may be realized by a dedicated system. The computer program may be superimposed on a carrier wave and provided via a communication network.

The

robots

10 and 30 are not limited to the above examples and are optional. As an example, controller 1-6 may control robot 40, which is a horizontal articulated robot shown in FIG. The robot 40 includes a first arm 410 , a second arm 420 , a third arm 430 and a base 440 . The first arm 410 is attached to the base 440 so as to be rotatable around a rotation axis AX1 parallel to the Z axis. The second arm 420 is connected to the first arm 410 and is rotatable around a rotation axis AX2 parallel to the Z axis. The third arm 430 is connected to the second arm 420 and is rotatable around a rotation axis AX3 parallel to the Z axis. Furthermore, the third arm 430 can extend and contract in the direction along the rotation axis AX3. Base 440 is fixed to a plane and supports first arm 410 . A processing tool 200 can be attached to the third arm 430 . The control device 1 - 6 may control the position of the tip link, which is the tip of the third arm 430 , by controlling the motor (not shown) of the robot 40 .

As another example, the control device 1-6 may control a robot 50 having multiple axes shown in FIG. The robot 50 has a first arm 510 , a second arm 520 and a base 530 . The first arm 510 is attached to the base 530 so as to be movable in the X-axis direction. The second arm 520 is connected to the first arm 510 and is movable in the Y-axis direction. Base 530 is fixed to a plane and supports first arm 510 . A processing tool 200 is attached to the second arm 520 . In the example of FIG. 31, the second arm 520 is a print head that forms electrodes on a substrate on a stage 540 placed on a base 530 by means of a processing tool 200 having a liquid ejection part and ejection nozzles. Specifically, the tip of the processing tool 200 is movable on the stage 540 in the X-axis direction and the Y-axis direction. The control device 1 - 6 may control the position of the tip link, which is the tip of the second arm 520 , by controlling the motor (not shown) of the robot 50 .

In the above example, the positions of the tip links of the

robots

10, 30, 40, 50 were defined in the XYZ orthogonal coordinate system, but may be defined in the cylindrical coordinate system.

Various embodiments and modifications of the present disclosure are possible without departing from the broad spirit and scope of the present disclosure. In addition, the embodiments described above are for explaining this disclosure, and do not limit the scope of this disclosure. That is, the scope of the present disclosure is indicated by the claims rather than the embodiments. Various modifications made within the scope of the claims and within the scope of equivalent disclosure are considered to be within the scope of this disclosure.

This application is based on Japanese Patent Application No. 2021-23847 filed on February 18, 2021. The entire specification, claims, and drawings of Japanese Patent Application No. 2021-23847 are incorporated herein by reference.

1, 2, 3, 4, 5, 6 control device, 7 learning device, 10, 30, 40, 50 robot, 11 drive condition storage unit, 12 reference trajectory determination unit, 13, 22 actual trajectory acquisition unit, 14, 18 learning unit, 15 learned data storage unit, 16 target trajectory determination unit, 17 control unit, 19 reaction force acquisition unit, 20 intermediate point estimation unit, 21 drive condition estimation unit, 31 processor, 32 memory, 33 interface, 34 bus, 100, 101, 102, 103, 104, 105 Robot system, 110, 410, 510 1st arm, 120, 420, 520 2nd arm, 130, 430 3rd arm, 140 4th arm, 150 5th arm, 160 Flange, 160a Tip surface, 170, 440, 530 Base, 180 Fixed part, 190 Force sensor, 200 Processing tool, 540 Stage, AT1, AT1', AT2 Actual trajectory, AX1, AX2, AX3, AX4, AX5, AX6 Rotation Axis, E1, E2, E3, E4, E5, E6　Encoder, EP1　Operation end point, ER1, ER2　Trajectory error, M1, M2, M3, M4, M5, M6　Motor, RP, RP1, RP2　Reference intermediate point, RT1, RT2 　Reference trajectory, S1 　Space, SP1 　Operation start point, TP1 　Target intermediate point, TT1 　Target trajectory.

Claims

A control device for controlling a robot,
a driving condition storage unit that associates and stores a plurality of reference intermediate points with driving conditions specifying at least an operation start point and an operation end point of the robot;
a reference trajectory determination unit that acquires one of the plurality of reference intermediate points from the driving condition storage unit, and determines a reference trajectory passing through the reference intermediate point based on the driving condition for each of the acquired reference intermediate points; ,
an actual trajectory acquisition unit that acquires the actual trajectory of the robot;
Each of the reference intermediate points for which a trajectory error indicating a deviation of the actual trajectory obtained by the actual trajectory obtaining unit from the ideal trajectory based on the driving condition when the robot is controlled according to the reference trajectory is obtained. and learning the correspondence between the reference midpoint and the trajectory error to generate a trajectory model indicating a target midpoint that minimizes the trajectory error according to the driving condition;
a target trajectory determination unit that determines a target trajectory passing through the target waypoint obtained from the trajectory model;
a control unit that controls the robot according to the reference trajectory or the target trajectory;
A control device comprising:
estimating the plurality of reference intermediate points corresponding to the other driving conditions from the target intermediate points corresponding to the driving conditions indicated by the trajectory model, and determining the plurality of reference intermediate points estimated as the other driving conditions; further comprising an intermediate point estimating unit that associates points and stores them in the driving condition storage unit;
A control device according to claim 1 .
The learning unit estimates the trajectory error when the robot is controlled according to the target trajectory, and generates the trajectory model indicating the target waypoint and the estimated trajectory error according to the driving condition. generate and
The waypoint estimating unit estimates the plurality of reference waypoints arranged at intervals according to the size of the estimated trajectory error indicated by the trajectory model.
3. A control device according to claim 2.
Based on the trajectory error, which is the deviation of the actual trajectory from the ideal trajectory when the robot is controlled according to the reference trajectory passing through the reference intermediate point predetermined for the driving condition, the driving condition further comprising a driving condition estimating unit that estimates another reference midpoint corresponding to the driving condition, associates the estimated reference midpoint with the driving condition, and stores the estimated reference midpoint in the driving condition storage unit;
A control device according to any one of claims 1 to 3.
The driving condition estimator estimates the other driving condition and a plurality of reference middle points corresponding to the other driving condition from the driving condition, the reference middle point corresponding to the driving condition, and the trajectory error. and storing the estimated other driving condition and the plurality of estimated reference intermediate points in association with each other in the driving condition storage unit;
5. A control device according to claim 4.
The driving condition estimator estimates the other reference midpoint corresponding to the driving condition by a regression algorithm based on the trajectory error.
6. A control device according to claim 4 or 5.
The driving condition estimator calculates the other driving condition and the plurality of driving conditions corresponding to the other driving condition by a regression algorithm based on the driving condition, the reference midpoint corresponding to the driving condition, and the trajectory error. Estimate the reference midpoint of
A control device according to claim 5 .
The actual trajectory acquisition unit has a measuring device that measures the trajectory of the robot, and acquires the trajectory of the robot according to the measurement value of the measuring device.
A control device according to any one of claims 1 to 7.
A control device for controlling a robot,
The robot is determined according to each of a plurality of reference intermediate points associated with drive conditions that specify at least an operation start point and an operation end point of the robot, and is controlled according to a reference trajectory passing through the reference intermediate points. A correspondence between a trajectory error indicating deviation of the actual trajectory of the robot from an ideal trajectory based on the driving condition and the reference intermediate point is learned, and a target intermediate point that minimizes the trajectory error is indicated according to the driving condition. a learning unit that generates a trajectory model;
a target trajectory determination unit that determines a target trajectory passing through the target waypoint obtained from the trajectory model;
a control unit that controls the robot according to the target trajectory;
A control device comprising:
The learning unit learns changes in the trajectory error according to changes in the reference waypoint, and generates the trajectory model indicating the target waypoint that minimizes the trajectory error according to the driving condition.
A control device according to any one of claims 1 to 9.
further comprising a reaction force acquisition unit that acquires a reaction force that the robot receives from the object when the object is processed by a machining tool attached to the robot,
The learning unit learns the reference intermediate point, the trajectory error, and the correspondence of the reaction force when the trajectory error occurs, and generates the trajectory model.
A control device according to any one of claims 1 to 10.
The trajectory error is calculated from the ideal position of the actual position when the distance between the ideal position, which is the position of the robot indicated by the ideal trajectory, and the actual position, which is the position of the robot indicated by the actual trajectory, reaches a maximum value. showing the deviation of
A control device according to any one of claims 1 to 11.
The maximum value is the maximum value of the distance between the ideal position and the actual position,
13. A control device according to claim 12.
The driving conditions further include information about a workpiece to be machined by a machining tool attached to the robot.
14. A control device according to any one of claims 1 to 13.
The information about the workpiece includes at least one of the weight and dimensions of the workpiece,
15. Control device according to claim 14.
A control device for controlling a robot,
A trajectory indicating a target intermediate point for minimizing a trajectory error indicating a deviation of the actual trajectory of the robot from an ideal trajectory based on the driving condition, according to the driving condition designating at least an operation start point and an operation end point of the robot. a target trajectory determination unit that acquires a model and determines a target trajectory passing through the target waypoint obtained from the trajectory model;
a control unit that controls the robot according to the target trajectory;
A control device comprising:
robot and
17. The control device according to any one of claims 1 to 16, which controls the robot in accordance with a driving condition specifying at least an operation start point and an operation end point of the robot;
A robot system with
A learning device for learning a trajectory model for controlling a robot,
a driving condition storage unit that associates and stores a plurality of reference intermediate points with driving conditions specifying at least an operation start point and an operation end point of the robot;
a reference trajectory determination unit that acquires one of the plurality of reference intermediate points from the driving condition storage unit, and determines a reference trajectory passing through the reference intermediate point based on the driving condition for each of the acquired reference intermediate points; ,
an actual trajectory acquisition unit that acquires the actual trajectory of the robot;
For each driving condition, a trajectory error indicating a deviation of the actual trajectory obtained by the actual trajectory obtaining unit from an ideal trajectory based on the driving condition when the robot is controlled according to the reference trajectory is obtained. learning the correspondence between the reference intermediate points and the trajectory error, and generating a trajectory model indicating a target intermediate point that minimizes the trajectory error in accordance with the driving conditions. Department and
A learning device with
A control method for controlling a robot,
determining a reference trajectory passing through any one of a plurality of reference intermediate points associated with the driving condition based on the driving condition specifying at least an operation start point and an operation end point of the robot;
A trajectory error indicating a deviation of the actual trajectory of the robot from the ideal trajectory based on the driving conditions when the robot is controlled according to the reference trajectory is obtained for each of the reference midpoints, and learning a correspondence with the trajectory error to generate a trajectory model indicating a target intermediate point that minimizes the trajectory error according to the driving condition;
determining a target trajectory through the target waypoint obtained from the trajectory model;
controlling the robot according to the reference trajectory or the target trajectory;
control method.
The computer that controls the robot,
a driving condition storage unit that associates and stores a plurality of reference intermediate points with a driving condition that designates at least an operation start point and an operation end point of the robot;
a reference trajectory determination unit that acquires one of the plurality of reference intermediate points from the driving condition storage unit and determines a reference trajectory passing through the reference intermediate point based on the driving condition for each of the acquired reference intermediate points;
an actual trajectory acquisition unit that acquires the actual trajectory of the robot;
Each of the reference intermediate points for which a trajectory error indicating a deviation of the actual trajectory obtained by the actual trajectory obtaining unit from the ideal trajectory based on the driving condition when the robot is controlled according to the reference trajectory is obtained. and learns the correspondence between the reference midpoint and the trajectory error to generate a trajectory model indicating a target midpoint that minimizes the trajectory error according to the driving condition;
a target trajectory determination unit that determines a target trajectory passing through the target waypoint obtained from the trajectory model; and
a control unit that controls the robot according to the reference trajectory or the target trajectory;
A program to function as