CN110977960A - Visual following system of three drive arms - Google Patents

Visual following system of a three-drive mechanical arm

Info

Publication number
CN110977960A
CN110977960A (application CN201911057970.4A)
Authority
CN
China
Prior art keywords
mechanical arm
target
motor
lower computer
value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911057970.4A
Other languages
Chinese (zh)
Inventor
张雷雨
王洪刚
杨斯钦
雷杨浩
张得阳
彭程
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Technology
Original Assignee
Beijing University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Technology filed Critical Beijing University of Technology
Priority to CN201911057970.4A priority Critical patent/CN110977960A/en
Publication of CN110977960A publication Critical patent/CN110977960A/en
Pending legal-status Critical Current

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00Programme-controlled manipulators
    • B25J9/16Programme controls
    • B25J9/1694Programme controls characterised by use of sensors other than normal servo-feedback from position, speed or acceleration sensors, perception control, multi-sensor controlled systems, sensor fusion
    • B25J9/1697Vision controlled systems
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B25HAND TOOLS; PORTABLE POWER-DRIVEN TOOLS; MANIPULATORS
    • B25JMANIPULATORS; CHAMBERS PROVIDED WITH MANIPULATION DEVICES
    • B25J9/00Programme-controlled manipulators
    • B25J9/16Programme controls
    • B25J9/1602Programme controls characterised by the control system, structure, architecture
    • B25J9/161Hardware, e.g. neural networks, fuzzy logic, interfaces, processor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • G06T7/246Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]

Abstract

The invention provides a vision following system for a three-drive mechanical arm, comprising three parts: an upper-computer vision identification part, an upper-computer/lower-computer communication part, and a lower-computer drive control part. In the proposed system, the upper computer performs visual identification with convolution-based frequency-domain filtering and predicts where the target is likely to appear in the next frame, mainly using the Minimum Output Sum of Squared Error (MOSSE) algorithm to extract the change in the target's position coordinates, which it sends to the lower computer; the upper and lower computers communicate over ZigBee; the lower computer receives the upper computer's command over ZigBee, sends the specified hexadecimal code to the driver to drive the corresponding motor, and the combination of these motions completes the following action of the mechanical arm.

Description

Visual following system of a three-drive mechanical arm
Technical Field
The invention relates to a visual following system, in particular to a visual following system of a mechanical arm.
Background
Machine vision is an integral and important component of artificial intelligence and has developed alongside it. Machine vision refers to the use of optical, non-contact sensing devices to receive images of real scenes and extract information with which to control a machine or a process. Machine vision is frequently integrated with automatic control devices to jointly accomplish relatively complex control tasks that involve object identification and judgment. Mechanical-arm following systems based on machine vision play a decisive role in fields such as industrial robot operation and mechanical-arm grasping, and they must be sufficiently accurate to complete their tasks in high-precision, high-quality, high-volume production. A machine-vision-based following system therefore has a large influence on the working efficiency of the machine; for the production of special equipment it can greatly reduce the workload of manual control and greatly improve working accuracy.
At present, existing machine-vision-based mechanical-arm drive systems follow poorly in use: traditional following systems perform only simple image recognition and following, their accuracy is poor, their applicability is limited, and they cannot adapt to different site environments. Traditional three-dimensional positioning establishes a three-dimensional coordinate system for the target by acquiring its accurate three-dimensional coordinates; in real life, however, objects shake or move quickly under external forces, which causes errors or even recognition failure in existing machine-vision following systems. A vision following system for a three-drive mechanical arm is therefore proposed to solve these problems.
Disclosure of Invention
The invention addresses the shortcomings of existing vision tracking systems for three-drive mechanical arms and provides an efficient, accurate and general vision following system for a mechanical arm driven by three motors. The upper-computer vision recognition part adopts the idea of correlation filtering (the more correlated two signals are, that is, the more similar a region of the video frame is to the initialized target, the larger the response) and applies the convolution theorem (convolution in the time domain is equivalent to multiplication in the frequency domain, and convolution in the frequency domain is equivalent to multiplication in the time domain) to process the images collected by the camera: the objective function of the MOSSE filter is formulated, its closed-form solution is computed, and the peak coordinate is updated to realize target tracking, which yields a very efficient visual identification program. The updated position coordinates X1 and Y1 are then extracted, compared with the centre coordinate (300, 100), and the comparison result is output to the lower computer, which derives the motion of the mechanical arm and drives the three motors so that the arm reaches the target position. For the motor drive on the lower computer, registers are configured in the microprocessor and read/write driver functions are written; communication between the driver and the microprocessor is carried out over an RS485 serial port, and the current position of the mechanical arm is fed back to the microprocessor in real time by an encoder. The read/write driver functions achieve control by changing the state of flag bits in the registers. Concretely, after an instruction from the upper computer is received, the MODBUS protocol frame is assembled bit by bit by the write-driver function, written into the registers and sent to the driver through the serial port; the host sets the driver parameters and controls operation through the MODBUS read/write register functions.
Therefore, the invention adopts the following technical scheme and implementation steps:
Determining the target amounts to finding a filter h whose response is maximal at the target. Let f denote the training image, g the output image and h the filter, with F, G and H their respective frequency-domain values. The value of H is derived from the time-domain and frequency-domain formulas, completing the conversion between time-domain and frequency-domain convolution. The closed-form solution of H is obtained from the objective function of the filter, and the MOSSE filter is updated. The corresponding Python library functions are called to generate an ideal Gaussian-shaped response; the target frame region is extracted and convolved with the filter to obtain a response map, whose maximum is taken as the new target position; the position coordinate is updated and the new target position becomes the centre for selecting the next target region; finally the updated X and Y values are extracted, compared and sent to the lower computer. The actuators of the lower computer are servo motors operating in speed mode: the upper computer sends the motor drive message to the microprocessor over a wireless serial link, and the microprocessor, using the RS485 communication protocol, reads the data and writes the drive message into the motor driver to adjust the servo motor speed. The three motors control different polar-coordinate parameters, which together realize motion in space. The upper computer converts the comparison result (the deviation of the target from the centre of the field of view) into a specific displacement direction and movement time (at a given speed) for the lower computer, so that the mechanical arm adjusts its position and posture as the target moves.
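As a concrete illustration of the peak-extraction step just described, the short Python sketch below computes a filter response in the frequency domain and takes its peak as the new target position. It assumes numpy as the numerical library; the names `patch` and `H_conj` are illustrative and not taken from the patent.

```python
import numpy as np

def correlation_response(patch, H_conj):
    """Correlate an image patch with a filter in the frequency domain.

    patch  : 2-D grayscale patch around the predicted target position
    H_conj : conjugate frequency-domain filter H* (same shape as patch)
    Returns the spatial response map and the peak coordinate (x, y).
    """
    F = np.fft.fft2(patch)                         # patch to frequency domain
    response = np.real(np.fft.ifft2(F * H_conj))   # element-wise product = correlation
    dy, dx = np.unravel_index(np.argmax(response), response.shape)
    return response, (dx, dy)                      # peak = most likely target position
```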
After the camera acquires the video stream, the target to be tracked is selected in the video; Gaussian filtering is then applied to the first frame acquired after selection, the image and the filter are mapped onto a cyclic topological structure, and the boundary is filled by treating the image cyclically, i.e. the left edge of the image wraps to the right edge and the top wraps to the bottom; finally a point-wise cosine window is applied so that the image intensities fall to zero at the edges.
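The preprocessing described above can be sketched as follows. This is a minimal illustration using numpy; the log transform, the normalization constants and the Gaussian sigma are assumed typical values from common MOSSE practice, not figures given in the patent.

```python
import numpy as np

def preprocess(patch):
    """Log-transform, normalize and cosine-window a patch so its edges go to zero."""
    img = np.log(patch.astype(np.float64) + 1.0)       # compress dynamic range
    img = (img - img.mean()) / (img.std() + 1e-5)      # zero mean, unit variance
    h, w = img.shape
    window = np.outer(np.hanning(h), np.hanning(w))    # 2-D cosine (Hanning) window
    return img * window                                # edges fade smoothly to zero

def gaussian_response(h, w, cx, cy, sigma=2.0):
    """Ideal Gaussian-shaped response centred on the selected target."""
    ys, xs = np.mgrid[0:h, 0:w]
    return np.exp(-((xs - cx) ** 2 + (ys - cy) ** 2) / (2.0 * sigma ** 2))
```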
In order to convert the spatial-domain correlation filtering into frequency-domain filtering and so improve the computation speed, the convolution theorem is applied, whereby the correlation in the time domain becomes an element-wise product in the frequency domain:

$g = f \star h$ (1)

$G = F \odot H^{*}$ (2)

$H^{*} = \frac{G}{F}$ (3)

(f denotes the feature matrix of the input image, g denotes the output matrix, h denotes the parameter matrix of the tracker, F, G and H are their frequency-domain values, H* is the complex conjugate of H, and the division in (3) is element-wise)

(the operator itself has no special physical meaning; it can be regarded as the bridge that carries the spatial-domain correlation filtering into the frequency domain so as to improve the running speed of the computer)

The required filter is obtained in MOSSE by minimizing the sum of squared errors between the actual convolution output and the desired output, i.e. by solving:

$\min_{H^{*}} \sum_{i} \left| F_{i} \odot H^{*} - G_{i} \right|^{2}$ (4)

(where i denotes the i-th training sample)

To simplify the optimization, equation (4) is rewritten element by element:

$\min_{H^{*}_{wv}} \sum_{i} \left| F_{i,wv} H^{*}_{wv} - G_{i,wv} \right|^{2}$ (5)

(where the subscript wv denotes the element in row w and column v of the i-th sample)

Expanding gives:

$\min_{H^{*}_{wv}} \sum_{i} \left[ F_{i,wv} F^{*}_{i,wv} H_{wv} H^{*}_{wv} - G_{i,wv} F^{*}_{i,wv} H_{wv} - F_{i,wv} G^{*}_{i,wv} H^{*}_{wv} + G_{i,wv} G^{*}_{i,wv} \right]$ (6)

The expression inside the summation is a function of $H^{*}_{wv}$; the value of $H^{*}_{wv}$ at which its first partial derivative equals zero gives the minimum:

$\frac{\partial}{\partial H^{*}_{wv}} \sum_{i} \left[ F_{i,wv} F^{*}_{i,wv} H_{wv} H^{*}_{wv} - G_{i,wv} F^{*}_{i,wv} H_{wv} - F_{i,wv} G^{*}_{i,wv} H^{*}_{wv} + G_{i,wv} G^{*}_{i,wv} \right] = 0$ (7)

The calculation gives:

$\sum_{i} \left[ F_{i,wv} F^{*}_{i,wv} H_{wv} - F_{i,wv} G^{*}_{i,wv} \right] = 0$ (8)

and finally:

$H_{wv} = \frac{\sum_{i} F_{i,wv} G^{*}_{i,wv}}{\sum_{i} F_{i,wv} F^{*}_{i,wv}}$ (9)

The final closed-form solution of the filter is:

$H^{*} = \frac{\sum_{i} G_{i} \odot F^{*}_{i}}{\sum_{i} F_{i} \odot F^{*}_{i}}$ (10)

To prevent the filter from over-fitting and to let it adapt quickly to rotation, occlusion, scale change and similar disturbances, a learning-rate parameter η is introduced to weight frames of different time steps: the closer a frame is to the current frame, the larger its weight, and the contribution of earlier frames decays exponentially with time.

To give the filter better robustness to external influences such as deformation and illumination changes, an update template incorporating the learning rate is introduced:

$H^{*}_{i} = \frac{A_{i}}{B_{i}}$ (11)

$A_{i} = \eta\, G_{i} \odot F^{*}_{i} + (1-\eta)\, A_{i-1}$ (12)

$B_{i} = \eta\, F_{i} \odot F^{*}_{i} + (1-\eta)\, B_{i-1}$ (13)

(the learning rate η is 1 during initialization and then lies in the range 0-1 as the sequence advances; the subscripts i and i-1 denote the current and previous frames, and A and B are the numerator and denominator of the filter model)
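Equations (10) to (13) translate almost line for line into numpy. The sketch below shows the closed-form initialization and the running update of the numerator A and denominator B; the learning rate eta = 0.125 and the small regularization term added to the denominator are assumptions, not values specified here.

```python
import numpy as np

class MosseFilter:
    """Minimal MOSSE filter: closed-form init (eq. 10) and running update (eqs. 11-13)."""

    def __init__(self, F, G):
        # F, G: frequency-domain training patch and Gaussian target response
        self.A = G * np.conj(F)              # numerator   G_i . F_i*
        self.B = F * np.conj(F)              # denominator F_i . F_i*
        self.H_conj = self.A / (self.B + 1e-5)

    def update(self, F, G, eta=0.125):
        # eqs. (12) and (13): exponentially weighted running average
        self.A = eta * G * np.conj(F) + (1.0 - eta) * self.A
        self.B = eta * F * np.conj(F) + (1.0 - eta) * self.B
        self.H_conj = self.A / (self.B + 1e-5)   # eq. (11)
```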
After the maximum response of the target has been obtained in this way, the coordinate of that maximum response is taken as the new target coordinate, and the updated position coordinates X1 and Y1 are extracted and compared with the centre coordinate (300, 100). The X range 290-310 is set as the lateral jitter interval: when the final X1 of the selected target lies in this interval, the target is regarded as only jittering laterally and 0x11 is sent to the lower computer; when X1 > 310, 0x10 is sent to the lower computer; when X1 < 290, 0x09 is sent to the lower computer. The Y range 90-110 is set as the vertical jitter interval: when the final Y1 of the selected target lies in this interval, the target is regarded as only jittering vertically and 0x08 is sent to the lower computer; when Y1 > 110, 0x07 is sent to the lower computer; when Y1 < 90, 0x06 is sent to the lower computer.
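The comparison of the updated coordinates with the field-of-view centre (300, 100) and the two dead bands maps directly onto a small decision function; the sketch below mirrors the thresholds listed above (the function name and the list return value are illustrative).

```python
def commands_from_position(x1, y1):
    """Map the tracked position (X1, Y1) to the hexadecimal codes sent to the lower computer."""
    cmds = []
    # lateral axis: dead band 290-310 around the centre x = 300
    if 290 <= x1 <= 310:
        cmds.append(0x11)   # lateral jitter only: stop and lock motor No. 3
    elif x1 > 310:
        cmds.append(0x10)   # motor No. 3 reverse at constant speed
    else:
        cmds.append(0x09)   # x1 < 290: motor No. 3 forward at constant speed
    # vertical axis: dead band 90-110 around the centre y = 100
    if 90 <= y1 <= 110:
        cmds.append(0x08)   # vertical jitter only: stop and lock motor No. 2
    elif y1 > 110:
        cmds.append(0x07)   # motor No. 2 reverse at constant speed
    else:
        cmds.append(0x06)   # y1 < 90: motor No. 2 forward at constant speed
    return cmds
```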
The spatial positions of the three motors are given: motor 1 is the telescopic motor of the mechanical arm and controls its extension and retraction; motor 2 is the pitching motor and controls its lifting and sinking; motor 3 is the rotary motor and adjusts its left-right swing. Driving these three motors adjusts the spatial position of the mechanical arm and realizes the following function.
The lower computer receives the instruction of the upper computer through the wireless transmission device and then runs the corresponding motor function to drive the corresponding motor. The servo-motor routine receives the upper computer's instruction through the single-chip microcomputer, writes the MODBUS protocol frame into the registers bit by bit through the write-driver function and sends it to the driver through the serial port; the host sets the driver parameters and controls operation through the MODBUS read/write register functions. The function codes supported by the driver are 0x03 (read register), 0x06 (write register), 0x78 (write target position) and 0x7A (modify device address).
Drawings
Fig. 1 is an overall work flow diagram of the system.
Fig. 2 is a flow chart of the visual recognition and following performed by the upper computer.
Fig. 3 shows the spatial positions of the mechanical arm motors.
Fig. 4 is a motor driving flowchart.
Detailed Description
The present invention is described in detail with reference to the embodiments shown in the drawings, but it should be understood that these embodiments are not intended to limit the present invention, and those skilled in the art should understand that functional, methodological, or structural equivalents or substitutions made by these embodiments are within the scope of the present invention.
Referring to Figs. 1 to 4: Fig. 1 is a flowchart of the overall operation of the system, Fig. 2 is a flowchart of the visual recognition and following performed by the upper computer, Fig. 3 shows the spatial positions of the mechanical arm motors, and Fig. 4 is a flowchart of the motor driving.
As shown in Fig. 1, the vision following system of the three-drive mechanical arm provided by the invention comprises three parts: an upper-computer vision identification part, an upper-computer/lower-computer communication part and a lower-computer drive control part. The upper computer performs visual identification with convolution-based frequency-domain filtering and predicts where the target is likely to appear in the next frame, mainly applying the Minimum Output Sum of Squared Error (MOSSE) algorithm to extract the change in the target's position coordinates, which it sends to the lower computer. The communication between the upper and lower computers uses serial communication, transmitting the data bit by bit. The lower computer receives the upper computer's command wirelessly over the serial link and sends the specified hexadecimal code to the driver over RS485 serial communication to drive the corresponding motor; the combination of these motions completes the following action of the mechanical arm. Motor 1 is the telescopic motor of the mechanical arm and controls its extension and retraction; motor 2 is the pitching motor and controls its lifting and sinking; motor 3 is the rotary motor and adjusts its left-right swing. The invention does not prescribe particular mechanisms for extension, pitching and rotation; it only provides a following system that drives the three corresponding motors.
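Putting the three parts together, the upper-computer side of the loop can be sketched as follows. OpenCV's VideoCapture/selectROI and the serial-link parameters are assumptions used only for illustration, and `preprocess`, `gaussian_response`, `MosseFilter`, `correlation_response` and `commands_from_position` refer to the sketches given earlier in this description.

```python
import cv2
import numpy as np
import serial  # pyserial

def track_and_follow(port='/dev/ttyUSB0', baud=9600):
    """Upper-computer loop: track the selected target and send follow commands."""
    cap = cv2.VideoCapture(0)
    ok, frame = cap.read()
    x, y, w, h = cv2.selectROI('select target', frame)        # operator draws the target box
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    G = np.fft.fft2(gaussian_response(h, w, w / 2, h / 2))    # ideal Gaussian output
    filt = MosseFilter(np.fft.fft2(preprocess(gray[y:y + h, x:x + w])), G)
    link = serial.Serial(port, baud, timeout=0.1)             # wireless serial to the lower computer
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        patch = preprocess(gray[y:y + h, x:x + w])
        _, (dx, dy) = correlation_response(patch, filt.H_conj)
        x += dx - w // 2                                      # re-centre the window on the response peak
        y += dy - h // 2
        filt.update(np.fft.fft2(preprocess(gray[y:y + h, x:x + w])), G)
        for code in commands_from_position(x + w // 2, y + h // 2):
            link.write(bytes([code]))                         # one hexadecimal code per axis
```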
As shown in Fig. 2, after the camera acquires the video stream, the target to be tracked is selected in the video; Gaussian filtering is then applied to the first frame acquired after selection, the image and the filter are mapped onto a cyclic topological structure, and the boundary is filled by treating the image cyclically, i.e. the left edge of the image wraps to the right edge and the top wraps to the bottom; finally a point-wise cosine window is applied so that the image intensities fall to zero at the edges.
In order to convert the spatial-domain correlation filtering into frequency-domain filtering and so improve the computation speed, the convolution theorem is applied, whereby the correlation in the time domain becomes an element-wise product in the frequency domain:

$g = f \star h$ (1)

$G = F \odot H^{*}$ (2)

$H^{*} = \frac{G}{F}$ (3)

(f denotes the feature matrix of the input image, g denotes the output matrix, h denotes the parameter matrix of the tracker, F, G and H are their frequency-domain values, H* is the complex conjugate of H, and the division in (3) is element-wise)

(the operator itself has no special physical meaning; it can be regarded as the bridge that carries the spatial-domain correlation filtering into the frequency domain so as to improve the running speed of the computer)

The required filter is obtained in MOSSE by minimizing the sum of squared errors between the actual convolution output and the desired output, i.e. by solving:

$\min_{H^{*}} \sum_{i} \left| F_{i} \odot H^{*} - G_{i} \right|^{2}$ (4)

(where i denotes the i-th training sample)

To simplify the optimization, equation (4) is rewritten element by element:

$\min_{H^{*}_{wv}} \sum_{i} \left| F_{i,wv} H^{*}_{wv} - G_{i,wv} \right|^{2}$ (5)

(where the subscript wv denotes the element in row w and column v of the i-th sample)

Expanding gives:

$\min_{H^{*}_{wv}} \sum_{i} \left[ F_{i,wv} F^{*}_{i,wv} H_{wv} H^{*}_{wv} - G_{i,wv} F^{*}_{i,wv} H_{wv} - F_{i,wv} G^{*}_{i,wv} H^{*}_{wv} + G_{i,wv} G^{*}_{i,wv} \right]$ (6)

The expression inside the summation is a function of $H^{*}_{wv}$; the value of $H^{*}_{wv}$ at which its first partial derivative equals zero gives the minimum:

$\frac{\partial}{\partial H^{*}_{wv}} \sum_{i} \left[ F_{i,wv} F^{*}_{i,wv} H_{wv} H^{*}_{wv} - G_{i,wv} F^{*}_{i,wv} H_{wv} - F_{i,wv} G^{*}_{i,wv} H^{*}_{wv} + G_{i,wv} G^{*}_{i,wv} \right] = 0$ (7)

The calculation gives:

$\sum_{i} \left[ F_{i,wv} F^{*}_{i,wv} H_{wv} - F_{i,wv} G^{*}_{i,wv} \right] = 0$ (8)

and finally:

$H_{wv} = \frac{\sum_{i} F_{i,wv} G^{*}_{i,wv}}{\sum_{i} F_{i,wv} F^{*}_{i,wv}}$ (9)

The final closed-form solution of the filter is:

$H^{*} = \frac{\sum_{i} G_{i} \odot F^{*}_{i}}{\sum_{i} F_{i} \odot F^{*}_{i}}$ (10)

To prevent the filter from over-fitting and to let it adapt quickly to rotation, occlusion, scale change and similar disturbances, a learning-rate parameter η is introduced to weight frames of different time steps: the closer a frame is to the current frame, the larger its weight, and the contribution of earlier frames decays exponentially with time. To give the filter better robustness to external influences such as deformation and illumination changes, an update template incorporating the learning rate is introduced:

$H^{*}_{i} = \frac{A_{i}}{B_{i}}$ (11)

$A_{i} = \eta\, G_{i} \odot F^{*}_{i} + (1-\eta)\, A_{i-1}$ (12)

$B_{i} = \eta\, F_{i} \odot F^{*}_{i} + (1-\eta)\, B_{i-1}$ (13)

(the learning rate η is 1 during initialization and then lies in the range 0-1 as the sequence advances; the subscripts i and i-1 denote the current and previous frames, and A and B are the numerator and denominator of the filter model)
After the maximum response of the target has been obtained in this way, the coordinate of that maximum response is taken as the new target coordinate, and the updated position coordinates X1 and Y1 are extracted and compared with the centre coordinate (300, 100). The X range 290-310 is set as the lateral jitter interval: when the final X1 of the selected target lies in this interval, the target is regarded as only jittering laterally and 0x11 is sent to the lower computer; when X1 > 310, 0x10 is sent to the lower computer; when X1 < 290, 0x09 is sent to the lower computer. The Y range 90-110 is set as the vertical jitter interval: when the final Y1 of the selected target lies in this interval, the target is regarded as only jittering vertically and 0x08 is sent to the lower computer; when Y1 > 110, 0x07 is sent to the lower computer; when Y1 < 90, 0x06 is sent to the lower computer.
As shown in Fig. 3, the spatial positions of the three motors are given: motor 1 is the telescopic motor of the mechanical arm and controls its extension and retraction; motor 2 is the pitching motor and controls its lifting and sinking; motor 3 is the rotary motor and adjusts its left-right swing. Driving these three motors adjusts the spatial position of the mechanical arm and realizes the following function.
As shown in Fig. 4, the lower computer receives the instruction from the upper computer through the wireless transmission device and then runs the corresponding motor function to drive the corresponding motor. The servo-motor routine receives the upper computer's instruction through the single-chip microcomputer, writes the MODBUS protocol frame into the registers bit by bit through the write-driver function and sends it to the driver through the serial port; the host sets the driver parameters and controls operation through the MODBUS read/write register functions. The function codes supported by the driver are 0x03 (read register), 0x06 (write register), 0x78 (write target position) and 0x7A (modify device address). Controlling the motor speed mode through the driver takes the following steps (a serial-frame sketch follows the list):
(1) Write to device 1 (function code 0x06, register address 0, MODBUS enable) the value 1;
(2) Read from device 1 (function code 0x03, register address 14, alarm code); a reading of 0 means normal;
(3) Write to device 1 (function code 0x06, register address 3, motor acceleration) the value 1;
(4) Write to device 1 (function code 0x06, register address 2, motor target speed) the value 1;
(5) Repeat the above process to control the rotating speed of the motor.
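The register sequence above can be reproduced with a hand-built MODBUS RTU frame over the RS485 serial port; a minimal sketch follows, assuming pyserial for the serial link. The port name, baud rate and device address 1 are assumptions, while the function code 0x06 and register addresses 0, 3 and 2 follow the steps listed above.

```python
import struct
import serial  # pyserial

def crc16_modbus(frame: bytes) -> bytes:
    """Standard MODBUS CRC-16 (polynomial 0xA001), returned low byte first."""
    crc = 0xFFFF
    for byte in frame:
        crc ^= byte
        for _ in range(8):
            crc = (crc >> 1) ^ 0xA001 if crc & 1 else crc >> 1
    return struct.pack('<H', crc)

def write_register(port, slave, address, value):
    """Function code 0x06: write a single holding register of the drive."""
    frame = struct.pack('>BBHH', slave, 0x06, address, value)
    port.write(frame + crc16_modbus(frame))
    return port.read(8)                      # the drive echoes the request (8 bytes) on success

# speed-mode sequence from the description (device address 1 assumed)
with serial.Serial('/dev/ttyUSB0', 9600, timeout=0.5) as rs485:
    write_register(rs485, 1, 0, 1)           # (1) MODBUS enable
    # (2) reading alarm register 14 with function code 0x03 would go here
    write_register(rs485, 1, 3, 1)           # (3) motor acceleration
    write_register(rs485, 1, 2, 1)           # (4) motor target speed
```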
The received upper-computer instructions correspond to the following mechanical-arm actions and motor rotations (a dispatch sketch follows the list):
0x06: downward, motor No. 2 rotates forward at constant speed;
0x07: upward, motor No. 2 rotates in reverse at constant speed;
0x08: motor No. 2 stops and the brake locks;
0x09: to the right, motor No. 3 rotates forward at constant speed;
0x10: to the left, motor No. 3 rotates in reverse at constant speed;
0x11: motor No. 3 stops and the brake locks.
The vision following system of the three-drive mechanical arm provided by the invention has the following beneficial effects:
(1) Machine vision is applied to the field of mechanical arms: by selecting a target in the video, the mechanical arm tracks the selected target, which makes the following more efficient and accurate.
(2) The visual following creatively divides the field of view into six regions, namely the three regions along the X-axis and the three regions along the Y-axis explained in detail above.
(3) The vision following system of the three-drive mechanical arm has very good universality: it can run on a variety of upper computers (Linux systems, Raspberry Pi and the like), and the three driven motors are six-wire servo motors that are mature products on the market.
The above-listed detailed description is only a specific description of a possible embodiment of the present invention, and they are not intended to limit the scope of the present invention, and equivalent embodiments or modifications made without departing from the technical spirit of the present invention should be included in the scope of the present invention.
It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential attributes thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

Claims (7)

1. A vision following system of a three-drive mechanical arm is characterized by comprising three parts, namely an upper computer vision identification part, an upper computer and lower computer communication part and a lower computer drive control part:
(1) the upper-computer vision recognition part adopts the idea of correlation filtering, applies the convolution theorem to process the images collected by the camera, formulates the objective function of the MOSSE filter, solves its closed-form solution, and updates the peak coordinate to realize target tracking, thereby forming a vision recognition program; the updated position coordinates X1 and Y1 are respectively extracted, compared with the centre coordinate (300, 100), and the result is output to the lower computer to obtain the motion trajectory of the mechanical arm;
(2) for the motor drive of a lower computer, a register is configured in a microprocessor to compile a read-write driver function, the communication between a driver and the microprocessor is completed through an RS485 serial port, and the current position information of the mechanical arm is fed back to the microprocessor in real time through an encoder; the read-write driver function achieves the purpose of control by changing the state of the flag bit in the register; the specific implementation is that after receiving the instruction of the upper computer, the MODBUS protocol is written into the register by writing the function of the driver according to the bit, and is sent to the driver through the serial port, and the host computer sets the parameter of the driver and controls the operation through the read-write register function of the MODBUS.
2. The visual following system of a three-drive robotic arm of claim 1, wherein: after the camera acquires the video stream, the target to be tracked is selected in the video; Gaussian filtering is then applied to the first frame acquired after selection, the image and the filter are mapped onto a cyclic topological structure, and the boundary is filled by treating the image cyclically, i.e. the left edge of the image wraps to the right edge and the top wraps to the bottom; finally a point-wise cosine window is applied so that the image intensities fall to zero at the edges.
3. The visual following system of a three-drive robotic arm of claim 1, wherein: the i-th frame image is processed in MOSSE by minimizing the sum of the squared errors between the convolved actual output and the convolved desired output to obtain the required filter.
4. The visual following system of claim 1, wherein a learning efficiency parameter η is introduced to indicate the weights of frames with different time sequences; the learning efficiency parameter is such that the closer a frame is to the current frame, the higher its weight, and the learning result of earlier frames decreases exponentially with time; the range of η is 0-1 as the time sequence changes.
5. The visual following system of a three-drive robotic arm of claim 1, wherein: after the maximum response of the target is obtained, the coordinate of that maximum response is extracted as the new target coordinate, and the updated position coordinates X1 and Y1 are respectively extracted and compared with the centre coordinate (300, 100); the X range 290-310 is set as the lateral jitter interval, i.e. when the final X1 of the selected target lies in this interval the target is regarded as only jittering laterally and 0x11 is sent to the lower computer; when X1 > 310, 0x10 is sent to the lower computer; when X1 < 290, 0x09 is sent to the lower computer;
the Y range 90-110 is set as the vertical jitter interval, i.e. when the final Y1 of the selected target lies in this interval the target is regarded as only jittering vertically and 0x08 is sent to the lower computer; when Y1 > 110, 0x07 is sent to the lower computer; when Y1 < 90, 0x06 is sent to the lower computer;
the spatial positions of the three motors are given, and the first motor is a telescopic motor of the mechanical arm and is used for controlling the extension and retraction of the mechanical arm; the second motor is a pitching motor of the mechanical arm and controls the lifting and sinking of the mechanical arm; the third motor is a rotary motor of the mechanical arm and is used for adjusting the left-right swing of the mechanical arm, and the adjustment of the spatial position of the mechanical arm is completed by driving the three motors, so that the following function is realized;
0x06: downward, motor No. 2 rotates forward at constant speed;
0x07: upward, motor No. 2 rotates in reverse at constant speed;
0x08: motor No. 2 stops and the brake locks;
0x09: to the right, motor No. 3 rotates forward at constant speed;
0x10: to the left, motor No. 3 rotates in reverse at constant speed;
0x11: motor No. 3 stops and the brake locks.
6. The visual following system of a three-drive robotic arm of claim 1, wherein: determining the target is finding a filter h that maximizes its response at the target; f represents a training image, g represents an output image, h represents the filter, and F, G, H respectively correspond to their frequency-domain values; a calculated value of the target H is deduced through the time-domain and frequency-domain formulas to complete the conversion between time-domain and frequency-domain convolution; the closed-form solution of H is solved using the objective function of the filter and the MOSSE filter is updated; the corresponding Python library function is called to generate an ideal Gaussian-shaped response, the target frame region is extracted and convolved with the filter to obtain a response value, the corresponding maximum value is extracted as the new target position, the position coordinate is updated and the new target position is taken as the centre for selecting the next target region, and finally the updated X and Y values are respectively extracted, compared and sent to the lower computer; the execution devices of the lower computer are servo motors operating in speed mode, and the three motors respectively control different polar-coordinate parameters to realize spatial motion; the upper computer converts the comparison result of the target, namely the deviation of the target from the centre of the field of view, into a specific displacement direction and movement time for the lower computer, so that the mechanical arm adjusts its position and posture as the target moves.
7. The visual following system of a three-drive robotic arm of claim 1, wherein: after the camera acquires the video stream, the target to be tracked is selected in the video; Gaussian filtering is then applied to the first frame acquired after selection, the image and the filter are mapped onto a cyclic topological structure, and the boundary is filled by treating the image cyclically, i.e. the left edge of the image wraps to the right edge and the top wraps to the bottom; finally a point-wise cosine window is applied so that the image intensities fall to zero at the edges;
in order to convert the spatial-domain correlation filtering into frequency-domain filtering and so improve the computation speed, the convolution theorem is applied, whereby the correlation in the time domain becomes an element-wise product in the frequency domain:

$g = f \star h$ (1)

$G = F \odot H^{*}$ (2)

$H^{*} = \frac{G}{F}$ (3)

where f denotes the feature matrix of the input image, g denotes the output matrix, h denotes the parameter matrix of the tracker, F, G and H are their frequency-domain values, H* is the complex conjugate of H, and the division in (3) is element-wise;

the required filter is obtained in MOSSE by minimizing the sum of squared errors between the actual convolution output and the desired output, i.e. by solving:

$\min_{H^{*}} \sum_{i} \left| F_{i} \odot H^{*} - G_{i} \right|^{2}$ (4)

wherein i represents the i-th training sample;

to simplify the optimization, equation (4) is rewritten element by element:

$\min_{H^{*}_{wv}} \sum_{i} \left| F_{i,wv} H^{*}_{wv} - G_{i,wv} \right|^{2}$ (5)

where the subscript wv denotes the element in row w and column v of the i-th sample;

expanding gives:

$\min_{H^{*}_{wv}} \sum_{i} \left[ F_{i,wv} F^{*}_{i,wv} H_{wv} H^{*}_{wv} - G_{i,wv} F^{*}_{i,wv} H_{wv} - F_{i,wv} G^{*}_{i,wv} H^{*}_{wv} + G_{i,wv} G^{*}_{i,wv} \right]$ (6)

the expression inside the summation is a function of $H^{*}_{wv}$, and the value of $H^{*}_{wv}$ at which its first partial derivative equals zero gives the minimum:

$\frac{\partial}{\partial H^{*}_{wv}} \sum_{i} \left[ F_{i,wv} F^{*}_{i,wv} H_{wv} H^{*}_{wv} - G_{i,wv} F^{*}_{i,wv} H_{wv} - F_{i,wv} G^{*}_{i,wv} H^{*}_{wv} + G_{i,wv} G^{*}_{i,wv} \right] = 0$ (7)

the calculation gives:

$\sum_{i} \left[ F_{i,wv} F^{*}_{i,wv} H_{wv} - F_{i,wv} G^{*}_{i,wv} \right] = 0$ (8)

and finally:

$H_{wv} = \frac{\sum_{i} F_{i,wv} G^{*}_{i,wv}}{\sum_{i} F_{i,wv} F^{*}_{i,wv}}$ (9)

the final closed-form solution of the filter is:

$H^{*} = \frac{\sum_{i} G_{i} \odot F^{*}_{i}}{\sum_{i} F_{i} \odot F^{*}_{i}}$ (10)

to prevent the filter from over-fitting, a learning-rate parameter η is introduced to weight frames of different time steps, such that the closer a frame is to the current frame, the larger its weight, and the contribution of earlier frames decays exponentially with time;

to give the filter better robustness to external influences, an update template incorporating the learning rate is introduced:

$H^{*}_{i} = \frac{A_{i}}{B_{i}}$ (11)

$A_{i} = \eta\, G_{i} \odot F^{*}_{i} + (1-\eta)\, A_{i-1}$ (12)

$B_{i} = \eta\, F_{i} \odot F^{*}_{i} + (1-\eta)\, B_{i-1}$ (13)

the learning rate η is 1 during initialization and then lies in the range 0-1 as the sequence advances, wherein the subscripts i and i-1 respectively denote the current and previous frames, and A and B represent the numerator and denominator of the filter model;
after the maximum response of the target has been obtained in this way, the coordinate of that maximum response is extracted as the new target coordinate, and the updated position coordinates X1 and Y1 are respectively extracted and compared with the centre coordinate (300, 100); the X range 290-310 is set as the lateral jitter interval, i.e. when the final X1 of the selected target lies in this interval the target is regarded as only jittering laterally and 0x11 is sent to the lower computer; when X1 > 310, 0x10 is sent to the lower computer; when X1 < 290, 0x09 is sent to the lower computer; the Y range 90-110 is set as the vertical jitter interval, i.e. when the final Y1 of the selected target lies in this interval the target is regarded as only jittering vertically and 0x08 is sent to the lower computer; when Y1 > 110, 0x07 is sent to the lower computer; when Y1 < 90, 0x06 is sent to the lower computer;
the spatial positions of the three motors are given: motor 1 is the telescopic motor of the mechanical arm and controls its extension and retraction; motor 2 is the pitching motor and controls its lifting and sinking; motor 3 is the rotary motor and adjusts its left-right swing; driving the three motors adjusts the spatial position of the mechanical arm and realizes the following function;
the lower computer receives the instruction of the upper computer through the wireless transmission device and then runs the corresponding motor function to drive the corresponding motor; the servo-motor routine receives the upper computer's instruction through the single-chip microcomputer, writes the MODBUS protocol frame into the registers bit by bit through the write-driver function and sends it to the driver through the serial port, and the host sets the driver parameters and controls operation through the MODBUS read/write register functions; the function codes supported by the driver are 0x03 (read register), 0x06 (write register), 0x78 (write target position) and 0x7A (modify device address).
CN201911057970.4A 2019-11-01 2019-11-01 Visual following system of three drive arms Pending CN110977960A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911057970.4A CN110977960A (en) 2019-11-01 2019-11-01 Visual following system of three drive arms

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911057970.4A CN110977960A (en) 2019-11-01 2019-11-01 Visual following system of three drive arms

Publications (1)

Publication Number Publication Date
CN110977960A true CN110977960A (en) 2020-04-10

Family

ID=70082857

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911057970.4A Pending CN110977960A (en) 2019-11-01 2019-11-01 Visual following system of three drive arms

Country Status (1)

Country Link
CN (1) CN110977960A (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3405909A1 (en) * 1984-02-18 1985-08-22 Licentia Patent-Verwaltungs-Gmbh, 6000 Frankfurt DEVICE FOR DETECTING, MEASURING ANALYSIS AND / OR REGULATING TECHNICAL PROCEDURES
CN106426279A (en) * 2016-11-15 2017-02-22 北京工业大学 Telescopic mechanical arm
CN107363837A (en) * 2017-09-08 2017-11-21 桂林加宏汽车修理有限公司 A kind of method of manipulator control system and control machinery hand captures object
CN109177175A (en) * 2018-07-10 2019-01-11 泉州装备制造研究所 A kind of 3D printing spray head end real-time tracking localization method
CN108890656A (en) * 2018-07-11 2018-11-27 北京工业大学 A kind of telescopic service robot of the wheel drive containing friction

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
David S. Bolme, J. Ross Beveridge, Bruce A. Draper, Yui Man Lui: "Visual object tracking using adaptive correlation filters", 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition *
么跃轩: "Visual tracking and machine learning of a manipulator for automotive power seat adjustment", China Master's Theses Full-text Database *
刘海成: "Principles of AVR Microcontrollers and Their Application in Measurement and Control Engineering, Based on ATmega48/ATmega16", 30 June 2015 *
夏菠: "Research on dynamic target localization and grasping based on binocular vision", China Master's Theses Full-text Database *
王西彬, 焦黎, 周天丰: "Fundamentals of Precision Manufacturing Engineering", 31 January 2018 *

Similar Documents

Publication Publication Date Title
CN112132894B (en) Mechanical arm real-time tracking method based on binocular vision guidance
CN107901041B (en) Robot vision servo control method based on image mixing moment
CN111775152B (en) Method and system for guiding mechanical arm to grab scattered stacked workpieces based on three-dimensional measurement
CN111360827B (en) Visual servo switching control method and system
CN103325106B (en) Based on the Moving Workpieces method for sorting of LabVIEW
CN111462154B (en) Target positioning method and device based on depth vision sensor and automatic grabbing robot
CN111872934B (en) Mechanical arm control method and system based on hidden semi-Markov model
CN113771045B (en) Vision-guided high-adaptability positioning and grabbing method for middle frame of right-angle robot mobile phone
CN110605721A (en) Mechanical arm dragging teaching method based on terminal six-dimensional force sensor
CN105014677A (en) Visual mechanical arm control device and method based on Camshift visual tracking and D-H modeling algorithms
CN110977990A (en) Mechanical arm dragging teaching method based on terminal six-dimensional force sensor
CN113706621B (en) Mark point positioning and posture obtaining method and system based on marked image
CN112589232A (en) Weld joint tracking method and device based on independent deviation correction type deep learning
Husain et al. Realtime tracking and grasping of a moving object from range video
CN109465829A (en) A kind of industrial robot geometric parameter discrimination method based on transition matrix error model
CN113400301B (en) Robot 3D hand-eye calibration method, system, device and medium
CN113172632A (en) Simplified robot vision servo control method based on images
CN111993422A (en) Robot axis and hole alignment control method based on uncalibrated vision
CN113618367B (en) Multi-vision space assembly system based on seven-degree-of-freedom parallel double-module robot
CN113910218B (en) Robot calibration method and device based on kinematic and deep neural network fusion
CN111331604A (en) Machine vision-based valve screwing flexible operation method
CN116872216B (en) Robot vision servo operation method based on finite time control
CN110977960A (en) Visual following system of three drive arms
CN117340929A (en) Flexible clamping jaw grabbing and disposing device and method based on three-dimensional point cloud data
CN114434441A (en) Mobile robot visual servo tracking control method based on self-adaptive dynamic programming

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20200410