EA038279B1

EA038279B1 - Method and system for grasping an object by means of a robotic device

Info

Publication number: EA038279B1
Application number: EA201990572A
Authority: EA
Inventors: Альберт Рувимович ЕФИМОВ; Александр Николаевич СЁМОЧКИН; Святослав Дмитриевич МОРОШКИН
Original assignee: Публичное Акционерное Общество "Сбербанк России" (Пао Сбербанк)
Priority date: 2019-03-21
Filing date: 2019-03-22
Publication date: 2021-08-04
Also published as: RU2700246C1; EA201990572A1; WO2020190166A1

Abstract

The invention generally relates to the field of robotic devices and in particular to methods and systems for providing grasping different objects by means of robotic devices. A method of grasping an object by means of a robotic device includes the steps of training the robotic device with the aid of at least one machine learning algorithm for identifying and remembering object grasp points, wherein the training is performed on data describing photographic images of objects and corresponding 3D models of said objects in different perspectives; forming at least one grasp point of an object represented on a graphic image of the object; obtaining a photographic image of the object using a camera of the robotic device, and determining a perspective that depicts the object grasp point formed; obtaining a three-dimensional object point cloud in the determined perspective by means of a depth sensor of the robotic device; determining the location of the object grasp point according to the point cloud obtained; determining the orientation and position of a grasping device of the robotic device in the grasp point found, wherein the determining is performed by means of calculating the rotation matrices of the robotic device; grasping the object by means of the robotic device on the basis of a calculation of an average rotation matrix for the given grasp point. The technical result is an increase in the accuracy with which a grasp region is identified on an object by a robotic device by means of determining the orientation of a grasping device on the basis of a calculation of a rotation matrix to grasp the object for the given grasp point.

Description

Область техникиTechnology area

Заявленное техническое решение в общем относится к области робототехнических устройств, а в частности к способам и системам для обеспечения захвата различных объектов с помощью роботизированных устройств.The claimed technical solution generally relates to the field of robotic devices, and in particular to methods and systems for ensuring the capture of various objects using robotic devices.

Уровень техникиState of the art

Использование роботизированных устройств для обеспечения манипуляций с различными объектами широко известно из существующего уровня техники и находит широкое применение в сфере промышленных роботов в различных отраслях производства.The use of robotic devices for manipulating various objects is widely known from the state of the art and is widely used in the field of industrial robots in various industries.

Основной сложностью в процессе применения роботизированных устройств при манипуляциях с различными типами объектов является необходимость сложного процесса обучения, который заключается в создании графических образов объектов с их точными геометрическими параметрами и последующую отладку роботизированных манипуляторов для взаимодействия с ними.The main difficulty in the process of using robotic devices when manipulating various types of objects is the need for a complex learning process, which consists in creating graphic images of objects with their exact geometric parameters and subsequent debugging of robotic manipulators to interact with them.

В настоящее время для процесса обучения захвата объектов предлагаются различные подходы на базе алгоритмов и моделей машинного обучения, в частности с использованием нейронных сетей, с помощью которых выполняется обучение роботизированных манипуляторов для захватов объектов по обученным заранее точкам захвата. Такой подход раскрывается в работе Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation (Peter R. Florence et al., CSAIL, Massachusetts Institute of Technology, 07.09.2018). Известное решение предлагает использовать облака точек объектов в процессе обучения роботизированных устройств для того, чтобы сформировать дескрипторы объектов на основании фотографических и 3D моделей таких объектов для целей последующего указания точек захвата и взаимодействия с объектами в указанных точках.Currently, for the learning process of object capture, various approaches are proposed based on algorithms and machine learning models, in particular, using neural networks, with the help of which robotic manipulators are trained to capture objects using pre-trained capture points. This approach is disclosed in Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation (Peter R. Florence et al., CSAIL, Massachusetts Institute of Technology, 09/07/2018). The known solution proposes to use point clouds of objects in the process of teaching robotic devices in order to form object descriptors based on photographic and 3D models of such objects for the purpose of further specifying points of capture and interaction with objects at these points.

Основной недостаток известного решения заключается в ограничении возможности захвата различных типов объектов, что обусловлено отсутствием анализа поворота захватного устройства робота. С учетом того, что объект может находится в различном положении при работе роботизированного устройства, необходимо оценивать идеальное положение захватного устройства для более точного и правильного взаимодействия с объектом.The main disadvantage of the known solution is to limit the ability to grip various types of objects, which is due to the lack of analysis of the rotation of the gripper of the robot. Taking into account the fact that the object can be in different positions during the operation of the robotic device, it is necessary to evaluate the ideal position of the gripper for more accurate and correct interaction with the object.

Раскрытие изобретенияDisclosure of invention

Решаемой технической задачей или технической проблемой является обеспечение нового процесса взаимодействия с объектами с помощью их захвата роботизированными устройствами.The technical problem or technical problem to be solved is to provide a new process of interaction with objects by means of their capture by robotic devices.

Техническим результатом, достигающимся при решении вышеуказанной технической проблемы, является повышение точности распознавания области захвата объекта роботизированным устройством за счет определения ориентации захватного устройства на основании расчета матрицы поворотов для цели захвата объекта в требуемой точке захвата.The technical result achieved by solving the above technical problem is to improve the recognition accuracy of the object gripping area by the robotic device by determining the orientation of the gripping device based on the calculation of the rotation matrix for the purpose of gripping the object at the desired gripping point.

Заявленный технический результат достигается за счет способа захвата объекта с помощью роботизированного устройства, содержащего этапы, на которых обучают роботизированное устройство с помощью, по меньшей мере, алгоритма машинного обучения для распознавания и запоминания точек захвата объектов, причем обучение выполняется на данных, характеризующих фотографические изображения объектов и соответствующие 3D модели объектов в различных ракурсах;The claimed technical result is achieved by means of a method for capturing an object using a robotic device, containing the stages at which the robotic device is taught using at least a machine learning algorithm for recognizing and memorizing the capture points of objects, and the training is performed on data characterizing photographic images of objects and corresponding 3D models of objects in various angles;

формируют по меньшей мере одну точку захвата объекта, представленного на графическом изображении объекта;form at least one gripping point of the object represented on the graphical image of the object;

получают фотографическое изображение объекта с помощью камеры роботизированного устройства и определяют ракурс, отображающий сформированную точку захвата объекта;obtaining a photographic image of the object using the camera of the robotic device and determining the angle displaying the formed gripping point of the object;

получают трехмерное облако точек объекта в выявленном ракурсе с помощью сенсора глубины роботизированного устройства;obtaining a three-dimensional cloud of points of the object in the identified perspective using the depth sensor of the robotic device;

определяют по полученному облаку точек месторасположение точки захвата объекта;determine by the obtained cloud of points the location of the point of capture of the object;

определяют ориентацию и положение захватного устройства роботизированного устройства в обнаруженной точке захвата, причем определение осуществляется с помощью вычисления матриц поворотов роботизированного устройства;determining the orientation and position of the gripping device of the robotic device at the detected gripping point, and the determination is carried out by calculating the matrices of rotations of the robotic device;

осуществляют захват объекта с помощью роботизированного устройства на основании вычисления усредненной матрицы поворотов для заданной точки захвата.the object is gripped by a robotic device based on the calculation of the averaged rotation matrix for a given gripping point.

В частном варианте осуществления способа роботизированное устройство представляет собой роботизированный манипулятор или роботизированную руку.In a particular embodiment of the method, the robotic device is a robotic arm or a robotic arm.

В другом частном варианте осуществления способа на этапе обучения дополнительно указываются две и более точек захвата объекта на соответствующей 3D модели объекта.In another particular embodiment of the method, at the stage of training, two or more points of gripping of the object are additionally indicated on the corresponding 3D model of the object.

В другом частном варианте осуществления способа на этапе обучения дополнительно указывают и запоминают расположение роботизированного устройства в точке захвата в зависимости от расположения объекта.In another particular embodiment of the method, at the training stage, the position of the robotic device at the gripping point is additionally indicated and stored depending on the position of the object.

В другом частном варианте осуществления способа определение месторасположения точки захвата на изображении осуществляется с помощью модели машинного обучения.In another particular embodiment of the method, the location of the capture point in the image is determined using a machine learning model.

В другом частном варианте осуществления способа дополнительно определяется принцип захвата объекта в зависимости от типа захватного устройства и/или типа объекта.In another particular embodiment of the method, the principle of gripping the object is additionally determined depending on the type of gripping device and / or the type of object.

- 1 038279- 1 038279

В другом частном варианте осуществления способа для нескольких точек захвата объекта указывается оптимальная точка захвата объекта на основании ее взаимного расположения относительно других точек захвата и относительно захватного устройства в момент осуществления захвата.In another particular embodiment of the method, for several points of object gripping, the optimal gripping point of the object is indicated based on its relative position relative to other gripping points and relative to the gripping device at the moment of gripping.

В другом частном варианте осуществления способа роботизированный манипулятор калибруется по поверхности взаимодействия с объектом.In another particular embodiment of the method, the robotic arm is calibrated against the surface of interaction with the object.

В другом частном варианте осуществления способа калибровка осуществляется с помощью нанесения на поверхность графических кодов дополненной реальности.In another particular embodiment of the method, calibration is performed by applying graphical augmented reality codes to the surface.

Заявленный технический результат достигается также за счет системы захвата объекта с помощью роботизированного устройства, которая содержит роботизированное устройство, содержащее по меньшей мере одну камеру, по меньшей мере один сенсор глубины и захватное устройство;The claimed technical result is also achieved by a system for gripping an object using a robotic device, which contains a robotic device containing at least one camera, at least one depth sensor and a gripping device;

вычислительное устройство, соединенное с роботизированным устройством, причем вычислительное устройство взаимодействует с моделью машинного обучения, которая обучена на фотографических изображениях объектов и соответствующих 3D моделей объектов в различных ракурсах для распознавания и запоминания точек захвата объектов;a computing device connected to the robotic device, the computing device interacting with a machine learning model that has been trained on photographic images of objects and corresponding 3D models of objects in various angles to recognize and store gripping points of objects;

формировать по меньшей мере одну точку захвата объекта, представленного на графическом изображении объекта;to form at least one gripping point of the object represented on the graphical image of the object;

получать фотографическое изображение объекта от камеры роботизированного устройства и определять ракурс, отображающий сформированную точку захвата объекта;receive a photographic image of the object from the camera of the robotic device and determine an angle displaying the generated gripping point of the object;

получать трехмерное облако точек объекта в выявленном ракурсе с помощью сенсора глубины роботизированного устройства;receive a three-dimensional cloud of points of the object in the identified angle using the depth sensor of the robotic device;

определять по полученному облаку точек месторасположение точки захвата объекта;determine the location of the capture point of the object by the received cloud of points;

определять ориентацию и положение захватного устройства в обнаруженной точке захвата, причем определение осуществляется с помощью вычисления матриц поворотов роботизированного устройства;determine the orientation and position of the gripping device at the detected gripping point, and the determination is carried out by calculating the matrices of the rotations of the robotic device;

формировать команду для захвата объекта с помощью роботизированного устройства на основании вычисления усредненной матрицы поворотов для заданной точки захвата.generate a command for gripping an object using a robotic device based on the calculation of the averaged rotation matrix for a given gripping point.

В частном варианте реализации системы роботизированное устройство представляет собой роботизированный манипулятор или роботизированную руку.In a particular implementation of the system, the robotic device is a robotic arm or a robotic arm.

В другом частном варианте реализации системы вычислительное устройство представляет собой компьютер, систему на кристалле (SoC) или сервер.In another particular implementation of the system, the computing device is a computer, system on a chip (SoC), or server.

В другом частном варианте реализации системы вычислительное устройство соединено проводным ил беспроводным способом с роботизированным устройством.In another particular implementation of the system, the computing device is wired or wirelessly connected to a robotic device.

В другом частном варианте реализации системы модель машинного обучения представляет собой по меньшей мере одну нейронную сеть.In another particular implementation of the system, the machine learning model is at least one neural network.

В другом частном варианте реализации системы модель машинного обучения выполнена с возможностью автоматического переобучения на основании новых данных об объектах.In another particular embodiment of the system, the machine learning model is made with the possibility of automatic retraining based on new data about objects.

В другом частном варианте реализации системы новые данные об объектах передаются от облачного сервера.In another particular implementation of the system, new data about objects is transmitted from a cloud server.

В другом частном варианте реализации системы нейронная сеть расположена на облачном сервере.In another particular implementation of the system, the neural network is located on a cloud server.

Краткое описание чертежейBrief Description of Drawings

Признаки и преимущества настоящего технического решения станут очевидными из приводимого ниже подробного описания и прилагаемых чертежей.The features and advantages of the present technical solution will become apparent from the following detailed description and accompanying drawings.

Фиг. 1 иллюстрирует общий вид заявленной системы.FIG. 1 illustrates a general view of the claimed system.

Фиг. 2 иллюстрирует схематичный вид манипулятора роботизированного устройства.FIG. 2 illustrates a schematic view of a manipulator of a robotic device.

Фиг. 3 иллюстрирует блок-схему общей процедуры обучения роботизированного устройства.FIG. 3 illustrates a flow diagram of a general robotic device teaching procedure.

Фиг. 4 иллюстрирует пример обучения роботизированного устройства с помощью изображений и 3D-моделей объекта.FIG. 4 illustrates an example of teaching a robotic device using images and 3D models of an object.

Фиг. 5 иллюстрирует пример обучения роботизированного устройства с помощью нескольких точек захвата на объекте.FIG. 5 illustrates an example of teaching a robotic device using multiple gripping points on site.

Фиг. 6 иллюстрирует блок-схему выполнения заявленного способа.FIG. 6 illustrates a block diagram of the implementation of the claimed method.

Фиг. 7 иллюстрирует пример центрирования захватного устройства для определения требуемой точки захвата на объекте.FIG. 7 illustrates an example of centering the gripper to determine the desired gripping point on an object.

Фиг. 8 иллюстрирует пример системы с получением изображений в реальном времени.FIG. 8 illustrates an example of a real-time imaging system.

Фиг. 9 иллюстрирует пример вычислительной системы.FIG. 9 illustrates an example of a computing system.

Детальное описание осуществления изобретенияDetailed description of the implementation of the invention

Данное техническое решение может быть реализовано на компьютере, в виде автоматизированной информационной системы (АИС) или машиночитаемого носителя, содержащего инструкции для выполнения вышеупомянутого способа. Техническое решение может быть реализовано в виде распределенной компьютерной системы, которая может быть установлена на централизованном сервере (наборе серверов). Доступ пользователей к системе возможен как из сети Интернет, так и из внутренней сети предприятия/организации посредством мобильного устройства связи, на котором установлено программное обеспечение с соответствующим графическим интерфейсом пользователя, или персонального компьютеThis technical solution can be implemented on a computer, in the form of an automated information system (AIS) or a computer-readable medium containing instructions for performing the above method. The technical solution can be implemented as a distributed computer system that can be installed on a centralized server (set of servers). User access to the system is possible both from the Internet and from the internal network of an enterprise / organization through a mobile communication device on which software with a corresponding graphical user interface is installed, or a personal computer

- 2 038279 ра с доступом к веб-версии системы с соответствующим графическим интерфейсом пользователя. Ниже будут описаны термины и понятия, необходимые для реализации настоящего технического решения.- 2 038279 pa with access to the web version of the system with the corresponding graphical user interface. Below will be described the terms and concepts necessary for the implementation of this technical solution.

В данном решении под системой подразумевается компьютерная система, ЭВМ (электронновычислительная машина), ЧПУ (числовое программное управление), ПЛК (программируемый логический контроллер), компьютеризированные системы управления и любые другие устройства, способные выполнять заданную, четко определенную последовательность вычислительных операций (действий, инструкций). Под устройством обработки команд подразумевается электронный блок либо интегральная схема (микропроцессор), исполняющая машинные инструкции (программы). Устройство обработки команд считывает и выполняет машинные инструкции (программы) с одного или более устройства хранения данных. В роли устройства хранения данных могут выступать, но не ограничиваясь, жесткие диски (HDD), флеш-память, ПЗУ (постоянное запоминающее устройство), твердотельные накопители (SSD), оптические приводы.In this solution, the system means a computer system, a computer (electronic computing machine), CNC (numerical control), PLC (programmable logic controller), computerized control systems and any other devices capable of performing a given, well-defined sequence of computational operations (actions, instructions ). A command processing device means an electronic unit or an integrated circuit (microprocessor) that executes machine instructions (programs). A command processor reads and executes machine instructions (programs) from one or more storage devices. The role of data storage devices can be, but are not limited to, hard disks (HDD), flash memory, ROM (read only memory), solid state drives (SSD), optical drives.

Программа - последовательность инструкций, предназначенных для исполнения устройством управления вычислительной машины или устройством обработки команд.A program is a sequence of instructions intended for execution by a computer control device or a command processing device.

Как представлено на фиг. 1, система (100) для реализации заявленного решения содержит вычислительное устройство (110), обеспечивающее необходимые процессы вычислительной обработки алгоритма для функционирования роботизированного устройства (200), канал передачи данных (130), обеспечивающий процессы приема/передачи данных между элементами системы (100), а также удаленное вычислительное устройство (120), например, облачный сервер для специализированного выполнения задач.As shown in FIG. 1, the system (100) for implementing the claimed solution comprises a computing device (110) that provides the necessary computational processing of the algorithm for the operation of the robotic device (200), a data transmission channel (130) that provides processes for receiving / transmitting data between the elements of the system (100) as well as a remote computing device (120), such as a cloud server for specialized tasks.

В качестве вычислительного устройства (110), с которым взаимодействует пользователь (10), может использоваться, например, персональный компьютер, ноутбук, планшет, сервер, смартфон, серверный кластер, мейнфрейм, суперкомпьютер, тонкий клиент, система на кристалле (англ. System on a Chip, сокр. SoC) и т.п. Конкретная аппаратная реализация вычислительного устройства (110) зависит от предъявляемых требований и должна обеспечивать необходимые процессы вычислительной обработки цифровой информации для обеспечения должного функционирования устройства (110) в составе системы (100).As a computing device (110), with which the user (10) interacts, can be used, for example, a personal computer, laptop, tablet, server, smartphone, server cluster, mainframe, supercomputer, thin client, system on a chip (English System on a Chip, abbreviated SoC), etc. The specific hardware implementation of the computing device (110) depends on the requirements and must provide the necessary computational processing of digital information to ensure the proper functioning of the device (110) as part of the system (100).

В качестве канала передачи данных (130) может применяться различный вид связи, например проводного и/или беспроводного типа, в частности LAN, WAN, WLAN, Bluetooth, Wi-Fi, GSM/GPRS/LTE/5G, Ethernet, USB, RS232, RJ45 и т.п. Удаленное вычислительное устройство (120), как правило, представляет собой облачное хранилище данных или сервер для решения специализированных задач или программной логики. Описание работы устройства (120) будет раскрыто далее в настоящем описании.As a data transmission channel (130), a different type of communication can be used, for example, a wired and / or wireless type, in particular LAN, WAN, WLAN, Bluetooth, Wi-Fi, GSM / GPRS / LTE / 5G, Ethernet, USB, RS232, RJ45 etc. The remote computing device (120), as a rule, is a cloud storage or a server for solving specialized tasks or program logic. The description of the operation of the device (120) will be described later in the present description.

В качестве роботизированного устройства (200) может применяться как стационарный роботизированный манипулятор промышленного или коллаборативного назначения, например, Universal Robotics™, KUKA™, Fanuc™, ABB™ и т.п., так и роботизированные манипуляторы, устанавливаемые на подвижные автономные роботизированные устройства, в качестве конечностей, в частности роботизированные руки.As a robotic device (200) can be used as a stationary robotic arm for industrial or collaborative purposes, for example, Universal Robotics ™, KUKA ™, Fanuc ™, ABB ™, etc., and robotic manipulators installed on mobile autonomous robotic devices, as limbs, in particular robotic arms.

На фиг. 2 представлено схематическое изображение примера роботизированного устройства (200) в виде манипулятора, снабженного захватным устройством (210) и модулем, содержащим RGB камеру (211) и камеру с сенсором глубины (212) (также называемую камерой глубины или RGB-D камерой). С помощью RGB камеры (211) выполняется получения двумерных графических изображений объекта (20), с которым предполагается взаимодействие с помощью роботизированного устройства (200). Камера глубины (212) применяется для получения 3D моделей объекта (20), а также центрированию захватного устройства (210) и определения расстояния до объекта (20).FIG. 2 is a schematic diagram of an example of a robotic device (200) in the form of a manipulator equipped with a gripper (210) and a module containing an RGB camera (211) and a camera with a depth sensor (212) (also called a depth camera or RGB-D camera). The RGB camera (211) is used to obtain two-dimensional graphic images of the object (20), with which it is supposed to interact using a robotic device (200). The depth camera (212) is used to obtain 3D models of the object (20), as well as to center the gripper (210) and determine the distance to the object (20).

В качестве захватного устройства (210) могут применяться различные типы схватов, например в виде клешней, антропоморфных кистей рук и т.п. Захватное устройство, как правило, обеспечивает 6 степеней свободы для выполнения процесса манипуляций с объектом.As a gripping device (210), various types of grips can be used, for example, in the form of pincers, anthropomorphic hands, etc. The gripper typically provides 6 degrees of freedom for manipulating an object.

Как показано на фиг. 1, объект (20), с которым осуществляется взаимодействие роботизированного устройства (200), может размещаться на поверхности, на которую наносятся маркеры дополненной реальности (201), применяемые для калибровки роботизированного устройства (200) для целей формирования области его действия (205), в которой объект (20) может быть захвачен. Маркеры дополненной реальности (201) представляют собой графические коды дополненной реальности, также называемые AR-кодами (англ. Augmented reality codes), которые используются для ограничения поверхности взаимодействия с объектом (20). Объект (20) может представлять собой широкий спектр изделий, имеющих мягкую (деформируемую) или твердую форму. На фиг. 3 представлен процесс обучения (300) роботизированного устройства (200). Для обучения применяется подход из указанной выше научной публикации. На первом шаге (301) осуществляется съемка объекта (20) с помощью RGB камеры (211). Съемка осуществляется с различных ракурсов. Далее с помощью камеры глубины (212) на каждом из полученных ракурсов создается соответствующее ракурсу облако точек (302). По сформированным на этапе (302) облаками точек осуществляется построение 3D модели объекта (20) за счет объединения полученных облаков точек для каждого из ракурсов съемки на этапе (301). Полученное множество позиций камеры (211), соотнесенных с изображениями объекта, полученных на соответствующих ракурсах на этапе (301), а также итоговая 3D модель объекта (20), полученная на этапе (303), используются для автоматической подгоAs shown in FIG. 1, the object (20) with which the robotic device (200) interacts can be placed on a surface on which augmented reality markers (201) are applied, which are used to calibrate the robotic device (200) in order to form its area of action (205), in which the object (20) can be captured. Augmented reality markers (201) are graphical augmented reality codes, also called AR-codes, which are used to limit the surface of interaction with an object (20). Object (20) can represent a wide range of products having a soft (deformable) or hard shape. FIG. 3 shows the learning process (300) of a robotic device (200). For training, the approach from the above scientific publication is applied. At the first step (301), the object (20) is captured using an RGB camera (211). Shooting is carried out from various angles. Then, using the depth camera (212), a cloud of points (302) corresponding to the angle is created at each of the obtained angles. Based on the point clouds formed at step (302), a 3D model of the object (20) is built by combining the obtained point clouds for each of the survey angles at step (301). The resulting set of camera positions (211), correlated with the images of the object obtained at the corresponding angles at step (301), as well as the final 3D model of the object (20), obtained at step (303), are used for automatic preparation

- 3 038279 товки обучающей выборки для тренировки модели машинного обучения на этапе (304), в частности сверточной нейронной сети (СНС).- 3,038279 training set for training a machine learning model at stage (304), in particular, a convolutional neural network (CNN).

На фиг. 4 представлен пример получения ракурсов (Р1)-(Р4) изображений объекта (20). По каждому ракурсу (Р1)-(Р4) с помощью камеры глубины (212) строится соответствующая модель (М1)-(М4) на основании получаемого облака точек. Все облака точек (М1)-(М4), слитые в единое целое, образуют 3D модель объекта (21). В итоге по полученным моделям ракурсов (М1)-(М4) создается итоговая 3D модель объекта (21) с помощью слияния всех полученных облаков точек (М1)-(М4) по ракурсам (Р1)-(Р4) изображений объекта. Необходимо отметить, что количество ракурсов изображений объекта (20) может быть любым и не ограничиваться приведенным примером количества ракурсов. Данный подход позволяет по выбранной точке на изображении объекта, например фотографии или трехмерной модели, найти соответствующую ей 3D точку на поверхности модели объекта для его захвата.FIG. 4 shows an example of obtaining views (P1) - (P4) of images of an object (20). For each angle (P1) - (P4), using the depth camera (212), a corresponding model (M1) - (M4) is built based on the resulting point cloud. All point clouds (M1) - (M4), merged into a single whole, form a 3D model of the object (21). As a result, according to the obtained models of the angles (M1) - (M4), the final 3D model of the object (21) is created by merging all the obtained point clouds (M1) - (M4) by the angles (P1) - (P4) of the object images. It should be noted that the number of views of images of the object (20) can be any and not limited to the given example of the number of views. This approach allows for a selected point on the image of an object, for example a photograph or a three-dimensional model, to find the corresponding 3D point on the surface of the object model to capture it.

С помощью модели машинного обучения выполняется отображение из множества изображений заданного размера в множество дескрипторов, подобранных во время обучения таким образом, что для двух точек из двух разных изображений расстояние между соответствующими им дескрипторами будет меньше определенного значения, если эти точки являются проекциями одной и той же точки на поверхности 3D модели объекта (21). Данная операция производится несколько раз для разных положений объекта (20) в области досягаемости (205) роботизированного устройства (200). При построении обучающей выборки для модели машинного обучения для каждой фотографии объекта (20) с соответствующего ракурса выбирается по одной точке, для каждой из которых имеется соответствующая 3D точка с поверхности модели объекта, полученная из облака точек соответствующего ракурса съемки объекта (20). В качестве примера можно рассмотреть пары фотографий одного объекта (20) с соответствующими точками. Если пара 3D точек на соответствующих 2D изображениях принадлежат одному и тому же вокселю/вертексу 3D модели объекта (21), то считается, что эти точки являются соответствующими (совпадающими), иначе несоответствующими (несовпадающими). Таким образом, из всех фотографий можно получить миллионы пар пикселов как совпадающих, так и несовпадающих. На основании данной информации строится обучающая выборка для тренировки нейронной сети.Using a machine learning model, a mapping is performed from a set of images of a given size to a set of descriptors selected during training in such a way that for two points from two different images, the distance between the corresponding descriptors will be less than a certain value if these points are projections of the same points on the surface of the 3D model of the object (21). This operation is performed several times for different positions of the object (20) in the reach (205) of the robotic device (200). When constructing a training sample for a machine learning model, for each photograph of an object (20) from the corresponding angle, one point is selected, for each of which there is a corresponding 3D point from the surface of the object model, obtained from the point cloud of the corresponding angle of shooting of the object (20). As an example, we can consider pairs of photographs of one object (20) with the corresponding points. If a pair of 3D points on the corresponding 2D images belong to the same voxel / vertex of the 3D model of the object (21), then these points are considered to be corresponding (coinciding), otherwise inconsistent (mismatched). Thus, from all photos, you can get millions of pairs of pixels, both coinciding and non-coinciding. Based on this information, a training sample is constructed for training a neural network.

В режиме эксплуатации на вход нейронной сети подаются фотографии объекта (20), состоящие из пикселов, на выходе каждому пикселю соответствует дескриптор (последовательность чисел одинаковой длины n, или точка в n-мерном пространстве, на котором задана метрика - функция вычисления расстояния между точками этого пространства). Нейронную сеть тренируют на множестве пар пикселов из обучающей выборки так, чтобы для совпадающих пикселов расстояние между соответствующими им дескрипторами было меньше определенного порога, а для несовпадающих - больше. На фиг. 5 представлен пример обучения позициям захвата объекта (20) с помощью указания нескольких точек захвата (25, 26) объекта (20) с помощью захватного устройства (210). При обучении роботизированного устройства (200) взаимодействию с объектами (20) по точкам захвата (25, 26) формируется взаимная ориентация отрезков (22) между выделенными точками захвата (25, 26) и ориентацией захватного устройства (210). Далее для пар отрезков (22) между точками захвата на исходных данных и соответствующих найденных точках захвата (25, 26) на объекте (20) при его съемке с помощью RGB камеры (211) и камеры глубины (212) вычисляются матрицы поворотов. По всем парам отрезков (22) вычисляется усредненная матрица поворота, которая применяется к сохраненной конфигурации, соответствующей ранее заданной точке захвата, и захватное устройство (210) приводится в движение для захвата объекта в выбранной точке.In the operating mode, photographs of the object (20), consisting of pixels, are fed to the input of the neural network, at the output to each pixel there corresponds a descriptor (a sequence of numbers of the same length n, or a point in n-dimensional space on which the metric is set - the function of calculating the distance between the points of this space). The neural network is trained on a set of pairs of pixels from the training sample so that for matching pixels, the distance between their corresponding descriptors is less than a certain threshold, and for non-matching ones, more. FIG. 5 shows an example of teaching positions of gripping an object (20) by specifying several gripping points (25, 26) of an object (20) using a gripper (210). When teaching the robotic device (200) to interact with objects (20) by the gripping points (25, 26), a mutual orientation of the segments (22) is formed between the selected gripping points (25, 26) and the orientation of the gripping device (210). Further, for pairs of segments (22) between the capture points on the initial data and the corresponding found capture points (25, 26) on the object (20) when shooting it with the RGB camera (211) and the depth camera (212), the rotation matrices are calculated. For all pairs of segments (22), an averaged rotation matrix is calculated, which is applied to the saved configuration corresponding to the previously specified gripping point, and the gripper (210) is set in motion to grip the object at the selected point.

На этапе обучения (300) роботизированного устройства (200) дополнительно захватное устройство (210) тренируется по взаимодействию с объектами (20) в зависимости от их формы и степени деформации, а также в зависимости от конфигурации захватного устройства (210). Конфигурация и принцип захвата устройства (210) играют важную роль при анализе позиции и возможности взаимодействия с объектом (20) в зависимости от его расположения в области досягаемости (205) устройства (200). В случае работы с мягкими или легкодеформируемыми объектами (20), например плюшевыми игрушками, пакетами, тканевыми упаковками и т.п., принцип захвата может являться допустимым для работы с объектом (20) с помощью его захвата любым возможным способом в указанной точке захвата (25).At the stage of training (300) of the robotic device (200), the gripper (210) is additionally trained to interact with objects (20), depending on their shape and degree of deformation, as well as depending on the configuration of the gripper (210). The configuration and gripping principle of the device (210) play an important role in the analysis of the position and the possibility of interaction with the object (20), depending on its location in the reach (205) of the device (200). In the case of working with soft or easily deformable objects (20), for example, plush toys, bags, tissue packages, etc., the capture principle can be acceptable for working with an object (20) by capturing it in any possible way at the specified capture point ( 25).

Если взаимодействие осуществляется с твердым объектом (20), например книга, телефон, коробка и т.п., то захватное устройство (210) тренируется для взаимодействия с допустимой частью объекта (20) (в целях исключения нарушения его целостности или недопустимой деформации), а также в зависимости от конфигурации установленного захватного устройства (210). Например, антропоморфные захваты (210) в виде роботизированной руки имеют большую степень возможности для взаимодействия с требуемой точкой захвата (25) объекта (20) по сравнению с клешневидным схватом, представленным на фиг. 2.If the interaction is carried out with a solid object (20), for example, a book, telephone, box, etc., then the gripper (210) is trained to interact with an admissible part of the object (20) (in order to avoid violation of its integrity or unacceptable deformation), and also depending on the configuration of the installed gripper (210). For example, the anthropomorphic grips (210) in the form of a robotic arm have a greater degree of ability to interact with the desired grip point (25) of the object (20) compared to the pincer gripper shown in FIG. 2.

Процесс обучения роботизированного устройства (200) может также выполняться с помощью внешнего контроллера, например перчатки-контроллера, позволяющего запоминать принцип взаимодействия с объектом (20) с помощью манипуляции движений оператора (10).The learning process of the robotic device (200) can also be performed using an external controller, for example, a glove controller, which makes it possible to memorize the principle of interaction with an object (20) by manipulating the operator's movements (10).

На фиг. 6 представлен процесс выполнения способа (400) осуществления захвата объекта (20) роботизированным устройством (200). Описание процесса работы (400) роботизированного устройства (200) также будет описано с отсылкой к фиг. 7. После прохождения вышеописанного процесса обучения (300) роботизированного устройства (200) его функционирование может осуществляться в автоматизированFIG. 6 shows a process of performing a method (400) for gripping an object (20) by a robotic device (200). A description of the operating process (400) of the robotic device (200) will also be described with reference to FIG. 7. After passing the above-described learning process (300) of the robotic device (200), its operation can be carried out in an automated

- 4 038279 ном режиме. Оператор (10) с помощью вычислительного устройства (110) осуществляет формирование (этап 401) точки захвата (25), например, с помощью указания на изображении объекта (23) с помощью графического интерфейса пользователя (GUI). Таким образом, на изображении (23) отмечается пиксель или группа пикселей, соответствующий точке захвата (25) или окрестности нахождения данной точки (25) на объекте (20).- 4 038279 nominal mode. The operator (10), using the computing device (110), generates (step 401) the capture point (25), for example, by pointing to the object (23) on the image using a graphical user interface (GUI). Thus, a pixel or a group of pixels corresponding to the capture point (25) or the vicinity of the location of this point (25) on the object (20) is marked on the image (23).

По полученной информации о точке захвата (25) роботизированное устройство (200) осуществляет обзор и фотографирование (этап 402) с помощью RGB камеры (211) объекта захвата (20). Роботизированное устройство (200) делает серию фотографий (не более 10 снимков) с нескольких различных позиций (ракурсов) камеры (211). С помощью модели машинного обучения, в частности обученной СНС, из серии полученных с камеры (211) фотографий выбирается одна с наиболее вероятными координатами расположения заданной точки интереса (25), после чего роботизированное устройство (200) размещает камеру (211) в позицию, с которой была сделана данная фотография (этап 403). На основании обработки получаемых фотографий объекта (20) для каждой фотографии из этой серии СНС вычисляет дескрипторы, среди которых осуществляется поиск и нахождение дескриптора, расстояние от которого до координаты пикселя точки захвата (25) является минимальным.Based on the received information about the capture point (25), the robotic device (200) reviews and photographs (step 402) using the RGB camera (211) of the capture object (20). The robotic device (200) takes a series of photographs (no more than 10 shots) from several different positions (angles) of the camera (211). Using a machine learning model, in particular a trained SNS, from a series of photos received from the camera (211), one with the most probable coordinates of the location of a given point of interest (25) is selected, after which the robotic device (200) places the camera (211) in the position with which photo was taken (step 403). Based on the processing of the obtained photographs of the object (20) for each photograph from this series, the SNS calculates descriptors, among which the search and finding of the descriptor is carried out, the distance from which to the pixel coordinate of the capture point (25) is minimal.

Далее осуществляется локализация точки интереса (этап 404) на поверхности 3D модели объекта (21) с помощью облака точек, получаемого с камеры глубины (212), как представлено на фиг. 7. Для точной локализации точки захвата (25) (этап 405) интегрируются несколько облаков точек за определенный промежуток времени, и находится точка в полученном облаке точек, ближайшая к прямой, проведенной из оптического центра (27) камеры (212) и проходящей через плоскость изображения камеры в проекции точки интереса (25).Next, the point of interest is localized (step 404) on the surface of the 3D model of the object (21) using the point cloud obtained from the depth camera (212), as shown in FIG. 7. To accurately localize the capture point (25) (step 405), several point clouds are integrated over a certain period of time, and the point in the obtained point cloud is located closest to the straight line drawn from the optical center (27) of the camera (212) and passing through the plane camera images in the projection of the point of interest (25).

Далее на этапе (406) выполняется расчет матриц поворотов для осуществления захвата объекта (20) и по вычислению усредненной матрицы поворотов захватного устройства (210) анализируется позиция и траектория движения устройства (210) для осуществления захвата (этап 407) объекта (20).Next, at step (406), the calculation of the rotation matrices for the capture of the object (20) is performed, and the position and trajectory of the movement of the device (210) for the capture (step 407) of the object (20) are analyzed by calculating the averaged rotation matrix of the gripper (210).

Как было показано на фиг. 5, в процессе обучения вариантов захвата устройством (210) на объекте (20) задают несколько точек (25, 26), относительно которых запоминают положение и ориентацию захватного устройства (210). В случае захвата объекта (20), находящегося в произвольном положении, на объекте (20) распознаются заданные точки (25, 26), и матрицы поворотов захватного устройства (210) вычисляются таким образом, чтобы взаимная ориентация точек захвата (25, 26) и захватного устройства (210) соответствовали сохраненным значениям во время обучения.As shown in FIG. 5, in the process of learning the options for gripping by the device (210), several points (25, 26) are set on the object (20), relative to which the position and orientation of the gripping device (210) are memorized. In the case of gripping an object (20) located in an arbitrary position, the specified points (25, 26) are recognized on the object (20), and the rotation matrices of the gripper (210) are calculated so that the relative orientation of the gripping points (25, 26) and gripper (210) matched the stored values during training.

В случае когда используются захватные устройства (210) с большим количеством степеней свободы, при формировании захвата в заданной точке на этапе обучения запоминается серия позиций, ориентации и конфигураций захватного устройства (210). После вычисления матриц поворотов захватного устройства (210) для каждого элемента этой серии с помощью алгоритмов инверсной кинематики роботизированное устройство (200) приводится в соответствующее положение. При планировании траектории движения роботизированного устройства (200) соответствующим алгоритмом учитываются возможные коллизии как между узлами самого устройства (200), так и с окружающими предметами.In the case when grippers (210) with a large number of degrees of freedom are used, a series of positions, orientations and configurations of the gripper (210) are stored during the formation of the gripper at a given point at the training stage. After calculating the rotational matrices of the gripper (210) for each element of this series using inverse kinematics algorithms, the robotic device (200) is brought to the appropriate position. When planning the trajectory of movement of the robotic device (200), the appropriate algorithm takes into account possible collisions both between the nodes of the device itself (200) and with surrounding objects.

Указанная модель машинного обучения может содержаться как непосредственно на устройстве (110), связанным с роботизированным устройством (200), так и находится на удаленном устройстве (120), например сервере, и осуществлять обработку данных по запросу, формируемому от вычислительного устройства (110) на основании данных, получаемых от роботизированного устройства (200). Как представлено на фиг. 8, в частном примере реализации системы (100) может использоваться несколько вычислительных устройств, каждое из которых предназначено для реализации требуемого функционала. Например, в качестве исходного изображения (23) для формирования точки захвата (25), может применяться изображение, полученное с камеры смартфона (110). После получения изображения (23) пользователь (10) указывается требуемую точку захвата (25) для соответствующего объекта (20), с которым взаимодействует роботизированное устройство (200). Устройство (200) может осуществлять обращение к серверу (120) для инициирования процесса обработки данных о схожем типе объекта (20), который находится в области работы роботизированного устройства (205).The specified machine learning model can be contained both directly on the device (110) associated with the robotic device (200) and located on a remote device (120), for example, a server, and perform data processing on request generated from the computing device (110) on based on data received from the robotic device (200). As shown in FIG. 8, in a particular example of the implementation of the system (100), several computing devices can be used, each of which is designed to implement the required functionality. For example, an image obtained from a smartphone camera (110) can be used as an initial image (23) for forming a gripping point (25). After acquiring the image (23), the user (10) specifies the desired gripping point (25) for the corresponding object (20) with which the robotic device (200) interacts. The device (200) can contact the server (120) to initiate the processing of data about a similar type of object (20), which is in the area of operation of the robotic device (205).

Обработка данных для распознавания объекта (20) и выявления требуемой точки захвата (25) выполняется с помощью обученной нейронной сети, которая может располагаться на сервере (120) и обновляться по мере поступления новых данных о соответствующих типах объектов (121) и вариантов взаимодействия с ними. Библиотека моделей (121) может храниться непосредственно на сервере (120) или стороннем ресурсе. При этом при поступлении новой информации о моделях объектов СНС может автоматически переобучаться для расширения вариантов классификации и повышения процесса автономной работы роботизированного устройства (200). Данный процесс может дополнительно улучшить взаимодействия роботизированного устройства (200) с заданными типами объектов, что позволит расширить его применение в заданных отраслях промышленности.Data processing for object recognition (20) and identification of the required capture point (25) is performed using a trained neural network, which can be located on the server (120) and updated as new data on the corresponding types of objects (121) and options for interaction with them arrive. ... The model library (121) can be stored directly on the server (120) or a third-party resource. At the same time, upon receipt of new information about the models of objects, the SNS can be automatically retrained to expand the classification options and increase the process of autonomous operation of a robotic device (200). This process can further improve the interactions of a robotic device (200) with specified types of objects, which will expand its application in specified industries.

Роботизированное устройство (200) может быть также сконфигурировано для автоматической работы с заданным типом объектов с помощью тренировки захвата за определенную часть объекта (20). Как показано на фиг. 8, в качестве объекта взаимодействия (20) может применяться кружка. В зависимости от типа установленного захватного устройства (210) роботизированное устройство (200) может при поThe robotic device (200) can also be configured to automatically work with a given type of object by training gripping for a certain part of the object (20). As shown in FIG. 8, a circle can be used as an object of interaction (20). Depending on the type of gripper (210) installed, the robotic device (200) can be

- 5 038279 явлении объекта (20) в области его работы (205) автоматически распознавать местоположение зоны захвата объекта (20), например ручки или корпуса. В зависимости от типа установленного захватного устройства (210) определяются возможные модели и позиции для осуществления процедуры захвата объекта (20).- 5 038279 the phenomenon of the object (20) in the area of its operation (205) to automatically recognize the location of the gripping area of the object (20), for example a handle or a body. Depending on the type of installed gripper (210), possible models and positions are determined for carrying out the procedure for gripping an object (20).

На фиг. 9 представлена общий вид вычислительной системы (500). На базе представленной системы (500) может реализовываться различный спектр вычислительных устройств, например устройство (110), сервер (120) и т.п. В общем виде система (500) содержит объединенные общей шиной информационного обмена один или несколько процессоров (501), средства памяти, такие как ОЗУ (502) и ПЗУ (503), интерфейсы ввода/вывода (504), устройства ввода/вывода (505) и устройство для сетевого взаимодействия (506).FIG. 9 shows a general view of the computing system (500). On the basis of the presented system (500), a different range of computing devices can be implemented, for example, a device (110), a server (120), etc. In general, the system (500) contains one or more processors (501) united by a common bus of information exchange, memory means such as RAM (502) and ROM (503), input / output interfaces (504), input / output devices (505 ) and a device for networking (506).

Процессор (501) (или несколько процессоров, многоядерный процессор и т.п.) может выбираться из ассортимента устройств, широко применяемых в настоящее время, например, таких производителей как Intel™, AMD™, Apple™, Samsung Exynos™, MediaTEK™, Qualcomm Snapdragon™ и т.п. Под процессором или одним из используемых процессоров в системе (500) также необходимо учитывать графический процессор, например GPU NVIDIA или Graphcore, тип которых также является пригодным для полного или частичного выполнения способов (300, 400), а также может применяться для обучения и применения моделей машинного обучения в различных информационных системах.The processor (501) (or multiple processors, multi-core processor, etc.) can be selected from a range of devices currently widely used, for example, such manufacturers as Intel ™, AMD ™, Apple ™, Samsung Exynos ™, MediaTEK ™, Qualcomm Snapdragon ™ etc. Under the processor or one of the processors used in the system (500), it is also necessary to take into account a graphics processor, for example, NVIDIA GPU or Graphcore, the type of which is also suitable for full or partial execution of methods (300, 400), and can also be used for training and applying models machine learning in various information systems.

ОЗУ (502) представляет собой оперативную память и предназначено для хранения исполняемых процессором (501) машиночитаемых инструкций для выполнение необходимых операций по логической обработке данных. ОЗУ (502), как правило, содержит исполняемые инструкции операционной системы и соответствующих программных компонент (приложения, программные модули и т.п.). При этом в качестве ОЗУ (502) может выступать доступный объем памяти графической карты или графического процессора.RAM (502) is a random access memory and is intended for storing machine-readable instructions executed by the processor (501) for performing the necessary operations for logical data processing. RAM (502), as a rule, contains executable instructions of the operating system and corresponding software components (applications, software modules, etc.). In this case, the available memory of the graphics card or graphics processor can act as RAM (502).

ПЗУ (503) представляет собой одно или более средств для постоянного хранения данных, например жесткий диск (HDD), твердотельный накопитель данных (SSD), флэш-память (EEPROM, NAND и т.п.), оптические носители информации (CD-R/RW, DVD-R/RW, BlueRay Disc, MD) и др.ROM (503) is one or more means for permanent storage of data, such as a hard disk drive (HDD), solid state data storage device (SSD), flash memory (EEPROM, NAND, etc.), optical storage media (CD-R / RW, DVD-R / RW, BlueRay Disc, MD), etc.

Для организации работы компонентов системы (500) и организации работы внешних подключаемых устройств применяются различные виды интерфейсов В/В (504). Выбор соответствующих интерфейсов зависит от конкретного исполнения вычислительного устройства, которые могут представлять собой, не ограничиваясь, PCI, AGP, PS/2, IrDa, FireWire, LPT, COM, SATA, IDE, Lightning, USB (2.0, 3.0, 3.1, micro, mini, type C), TRS/Audio jack (2.5, 3.5, 6.35), HDMI, DVI, VGA, Display Port, RJ45, RS232 и т.п. Для обеспечения взаимодействия пользователя с вычислительной системой (500) применяются различные средства (505) В/В информации, например клавиатура, дисплей (монитор), сенсорный дисплей, тачпад, джойстик, манипулятор мышь, световое перо, стилус, сенсорная панель, трекбол, динамики, микрофон, средства дополненной реальности, оптические сенсоры, планшет, световые индикаторы, проектор, камера, средства биометрической идентификации (сканер сетчатки глаза, сканер отпечатков пальцев, модуль распознавания голоса) и т.п.To organize the operation of system components (500) and to organize the operation of external connected devices, various types of I / O interfaces (504) are used. The choice of appropriate interfaces depends on the specific design of the computing device, which can be, but are not limited to, PCI, AGP, PS / 2, IrDa, FireWire, LPT, COM, SATA, IDE, Lightning, USB (2.0, 3.0, 3.1, micro, mini, type C), TRS / Audio jack (2.5, 3.5, 6.35), HDMI, DVI, VGA, Display Port, RJ45, RS232, etc. To ensure user interaction with the computing system (500), various means (505) of I / O information are used, for example, a keyboard, display (monitor), touch display, touchpad, joystick, mouse manipulator, light pen, stylus, touch panel, trackball, speakers , microphone, augmented reality, optical sensors, tablet, light indicators, projector, camera, biometric identification (retina scanner, fingerprint scanner, voice recognition module), etc.

Средство сетевого взаимодействия (506) обеспечивает передачу данных посредством внутренней или внешней вычислительной сети, например Интранет, Интернет, ЛВС и т.п. В качестве одного или более средств (506) может использоваться, но не ограничиваться, Ethernet карта, GSM модем, GPRS модем, LTE модем, 5G модем, модуль спутниковой связи, NFC модуль, Bluetooth и/или BLE модуль, Wi-Fi модуль и др. Дополнительно могут применяться также средства спутниковой навигации в составе системы (500), например GPS, ГЛОНАСС, BeiDou, Galileo.The networking tool (506) provides data transmission via an internal or external computer network, such as an Intranet, Internet, LAN, and the like. As one or more means (506), an Ethernet card, a GSM modem, a GPRS modem, an LTE modem, a 5G modem, a satellite communication module, an NFC module, a Bluetooth and / or BLE module, a Wi-Fi module, and etc. Additionally, satellite navigation means can also be used as part of the system (500), for example, GPS, GLONASS, BeiDou, Galileo.

Конкретный выбор элементов системы (500) для реализации различных программно-аппаратных архитектурных решений может варьироваться с сохранением обеспечиваемого требуемого функционала от того или иного типа устройства. Представленные материалы изобретения раскрывают предпочтительные примеры реализации технического решения и не должны трактоваться как ограничивающие иные, частные примеры его воплощения, не выходящие за пределы испрашиваемой правовой охраны, которые являются очевидными для специалистов соответствующей области техники.The specific choice of system elements (500) for the implementation of various software and hardware architectural solutions can vary while maintaining the required functionality provided from a particular type of device. The presented materials of the invention disclose preferred examples of the implementation of the technical solution and should not be construed as limiting other, particular examples of its implementation, which do not go beyond the scope of the claimed legal protection, which are obvious to specialists in the relevant field of technology.

Claims

CLAIM

1. A method of capturing an object using a robotic device, comprising the steps of teaching a robotic device using at least one machine learning algorithm to recognize and memorize capture points of objects, and training is performed on data characterizing photographic images of objects and corresponding 3D models objects in different angles;

at least one gripping point of the object represented on the graphical image of the object is formed;

taking a photographic image of the object using the camera of the robotic device and determining the angle displaying the formed gripping point of the object;

a three-dimensional cloud of points of an object is obtained in a detected perspective using a depth sensor of a robotic device;

- 6 038279 determine the location of the capture point of the object by the obtained point cloud;

determining the orientation and position of the gripping device of the robotic device at the detected gripping point, and the determination is carried out by calculating the matrices of the rotations of the robotic device;

the object is gripped by a robotic device based on the calculation of the averaged rotation matrix for a given gripping point.

2. The method according to claim 1, characterized in that the robotic device is a robotic arm or a robotic arm.

3. The method according to claim 1, characterized in that at the training stage, two or more points of gripping of the object are additionally indicated on the corresponding 3D model of the object.

4. The method according to claim 1, characterized in that at the stage of training, the location of the robotic device at the gripping point is additionally indicated and stored depending on the location of the object.

5. The method according to claim 1, characterized in that the location of the capture point in the image is determined using a machine learning model.

6. The method according to claim 5, characterized in that the principle of gripping the object is additionally determined depending on the type of gripping device and / or the type of object.

7. The method according to claim 3, characterized in that for several points of object gripping, the optimal gripping point of the object is indicated based on its relative position relative to other gripping points and relative to the gripping device at the moment of gripping.

8. The method according to claim 2, characterized in that the robotic arm is calibrated against the surface of interaction with the object.

9. The method according to claim 8, characterized in that the calibration is performed by applying graphical augmented reality codes to the surface.

10. A system for gripping an object using a robotic device, comprising a robotic device containing at least one camera, at least one depth sensor and a gripping device;

a computing device connected to a robotic device, and the computing device interacts with a machine learning model that is trained on photographic images of objects and corresponding 3D models of objects in various angles to recognize and memorize gripping points of objects, configured to form at least one gripping point the object represented on the graphical image of the object;

to take a photographic image of the object from the camera of the robotic device and determine the angle that displays the formed gripping point of the object;

to acquire a three-dimensional cloud of points of the object in the identified angle using the depth sensor of the robotic device;

determine the location of the capture point of the object by the obtained cloud of points;

determine the orientation and position of the gripper at the detected gripping point, the determination being made by calculating the rotation matrices of the robotic device;

generate a command for gripping an object using a robotic device based on the calculation of the averaged rotation matrix for a given gripping point.

11. The system of claim 10, wherein the robotic device is a robotic arm or a robotic arm.

12. The system of claim 10, wherein the computing device is a computer, a system on a chip (SoC), or a server.

13. The system of claim 12, wherein the computing device is wired or wirelessly connected to the robotic device.

14. The system of claim 10, wherein the machine learning model is at least one neural network.

15. The system according to claim 14, characterized in that the machine learning model is made with the possibility of automatic retraining based on new data about objects.

16. The system according to claim 15, characterized in that new data about objects is transmitted from a cloud server.

17. The system according to claim 14, characterized in that the neural network is located on a cloud server.