WO2023062558A1

WO2023062558A1 - Intelligent self-checkout terminal

Info

Publication number: WO2023062558A1
Application number: PCT/IB2022/059776
Authority: WO
Inventors: Mehdi AFRAITE-SEUGNET
Original assignee: Mo-Ka
Priority date: 2021-10-13
Filing date: 2022-10-12
Publication date: 2023-04-20
Also published as: FR3128048A1

Abstract

The invention relates to a self-checkout terminal (10) comprising: - an image sensor (13) having a field of view at least partially covering a presentation space intended to receive an item to be registered; and - a processing unit capable of: - detecting, in a plurality of successive images, a moving object; - identifying, in said object, a hand (30) and an item (31); - recognising, with a confidence index, said item (31); - classifying the posture for gripping the item (31) into a first plurality of classes and/or classifying the movement of the item (31) into a second plurality of predefined classes; and - generating, when the item is recognised with a confidence index lower than a threshold value, a message associated with the class of the gripping posture and/or with the class of the movement of the item (31).

Description

Intelligent automatic payment terminal

The present invention relates to cashing systems, and more particularly to automatic cashing terminals.

The term “automatic payment terminals” means the devices or terminals, commonly called self-service checkouts, automatic checkouts or even, in English, “self-service checkouts”, generally placed at an exit from a point of sale allowing the customer to scan their own purchases.

To identify the articles which are successively presented to him by the customer, these automatic payment terminals comprise optical readers, commonly called “scanners”, able to read a barcode associated with each article. These scanners can be manipulated by hand by the customer (barcode scanner) or incorporated into the cash counter. The automated reading of barcodes is at the heart of the collection procedure, which is entirely based on the identifiers collected by the scanners.

However, a major drawback of scanner-based self-checkout kiosks is that items must be handled with precise gestures to properly present the barcode to the scanner. The customer must therefore observe the item to look for the location of the barcode, which changes from one item to another, then turn their gaze towards the scanner to show them the location of the barcode. As a result, the customer is often required to turn, turn over, or even pass the same article several times in front of the scanner before it is actually registered. In addition to being time-consuming, these manipulations can be painful and cause nervous tension in the client.

If the customer repeatedly fails to register an item, a cashier must intervene to manually enter the barcode digits. This further increases the customer's checkout time and, consequently, also the queue, as well as requiring the availability of a checkout staff.

In addition, the presence of more than one barcode on the item (such as a promotional barcode and an initial barcode or the barcode of an element of a pack and the barcode of the pack), or a total or partial absence of a barcode on the item (due, for example, to a stain, a scratch, a deformation, or a crumpling of the packaging) disrupt the procedure of collection with all the resulting inconveniences.

Another disadvantage of the existing automatic payment terminals to be scanned is that they increase the risk of unknown shrinkage (theft), in particular via fraudulent acts consisting in modifying the barcodes of articles or, more generally, via a change voluntary barcodes or labels. Verification by the automatic payment terminal of a correspondence between an expected weight of the scanned article and a measured weight can be foiled by reducing the weight of the article deposited on the scale to the weight associated with the scanned barcode.

Relying on staff or a cashier to ensure that customers actually present the items to be purchased and do not make intentional or unintentional errors is not effective enough. Given the number of customers often at different stages of the cashing procedure to be supervised simultaneously, the cashier staff is limited in practice to monitoring the sound emitted by the automatic cashing terminal indicating the recording of an item , without paying sufficient attention to the designation or price of the registered item. Thus, changing the barcode of a first item to that of a second item having substantially the same weight often goes unnoticed by the checkout staff. Shrinkage is further facilitated when the customer is in collusion with the person supervising the automatic payment terminals.

An object of the present invention is to remedy the aforementioned drawbacks.

Another object of the present invention is to accompany and assist the customer when going through the automatic checkout.

Another object of the present invention is to avoid shrinkage and unintentional errors during a collection procedure.

Another object of the present invention is to reduce the customer's automatic checkout time.

To this end, it is proposed, first, an automatic payment terminal comprising
- a first image sensor having a first field of vision covering at least partially a presentation space intended to receive an item to be recorded by the automatic payment terminal;
- a processing unit capable of
- detecting in a plurality of successive images at least partially acquired by the first image sensor a moving object in the presentation space;
- identify, in said object, a hand and an article, the article being gripped by the hand during at least part of the movement of the object;
- determining a first attribute of the identified article;
- recognizing with a first confidence index said article on the basis of at least said first attribute;
- determining a second attribute of the gripping posture of the article and classifying this gripping posture in a first plurality of predefined classes on the basis of at least said second attribute and/or determining a third attribute of the movement of the article when this article is grasped by the hand and classifying this movement in a second plurality of predefined classes on the basis of at least said third attribute;
- generating, when the article is recognized with a confidence index lower than a predefined threshold value, a message associated with the class of the gripping posture and/or with the class of the movement of the article.

Various additional features may be provided, alone or in combination:
- the second attribute is chosen from a list comprising the number of fingers of the hand which take part in gripping the article, the surfaces of the hand which take part in gripping the article;
- the first plurality of classes comprises a first class in which the palm of the hand participates in gripping the article;
- the third attribute is chosen from a list comprising the speed of the movement of the article, the trajectory of the movement of the article, a rotational movement of the article;
- the second plurality of classes comprises a second class in which the speed of the movement is greater than a predefined threshold speed, and/or a third class in which the trajectory of the movement is substantially at the limit of the presentation space, and/ or a fourth class in which the motion includes rotational motion;
- the generated message includes, when the class of the gripping posture is the first class, an alert intended for a checkout staff;
- the message generated includes, when the class of movement of the article is the fourth class, an indication relating to a predefined presentation of the article, this presentation allowing recognition of the article with a second confidence index greater than the first confidence index ;
- the first attribute is chosen from a list comprising a barcode, a color, a text, a design, a symbol, a logo, a dimension, a shape, a volume;
- the processing unit is, moreover, capable of detecting an outline on the article around the barcode;
- the automatic payment terminal further comprises a second image sensor having a second field of vision covering at least partially the presentation space, a third image sensor having a third field of vision covering at least partially the presentation space, the first image sensor being placed above the presentation space, the second and the third sensor being placed on either side of the presentation space.

Other characteristics and advantages of the invention will appear more clearly and concretely on reading the following description of embodiments, which is made with reference to the appended drawings in which:

the figure schematically illustrates a first perspective view of an automatic payment terminal according to various embodiments;

the figure schematically illustrates a second perspective view of the automatic payment terminal according to various embodiments;

the figure schematically illustrates a third perspective view of the payment terminal according to various embodiments;

the figure schematically illustrates the operation of the automatic payment terminal according to various embodiments.

Referring to the appended figures, there is displayed an automatic payment terminal 10 able to manage collection operations in a point of sale. A point of sale means, here, any place of physical self-service retail sale, regardless of its size or its field of activity, where a customer can freely compose his shopping basket, before going to the automatic payment terminal 10 to pay for purchases. "Collection operations" means any activity relating to a collection procedure such as the identification and registration of items, informing the customer of the registered items, assisting the customer in the procedure of collection, management of one or more payment methods, invoicing, printing or communication of a receipt, management of customer loyalty, and/or management of commercial advantages.

The automatic collection terminal 10 comprises a processing unit 1 connected to a plurality of cash register peripherals or, more generally, to information input and/or output devices used during a collection procedure. The processing unit 1 is, in one embodiment, a microcomputer, a server or, in general, calculation means arranged inside a piece of furniture 2 of the terminal 10 for automatic payment.

The input and/or output devices fitted to the automatic payment terminal 10 include, for example, a possibly touch-sensitive screen 3 , a keyboard, a payment device such as an electronic payment terminal 4 and/or an automatic coin mechanism , a receipt printer, a loudspeaker, semaphores, one or more scales 5 - 7 . In one embodiment, the automatic payment terminal 10 comprises a side scale 6 integrated into a side basket support 8 and/or a central scale 5 integrated into the upper face of the cabinet 2 . In another embodiment, the automatic payment terminal 10 also comprises an additional scale 7 making it possible to obtain a price for one or more categories of articles such as fruits and vegetables.

The automatic payment terminal 10 also comprises one or more image sensors 11 - 14 . These image sensors 11 - 14 are, for example, 2D cameras or 3D cameras such as time-of-flight cameras, stereoscopic cameras, or structured light cameras. Advantageously, 3D cameras provide an estimation of the shape and/or the volume of the captured object, without resorting to multi-view processing.

By referring to the , each of the image sensors 11 - 14 is arranged so that its field of vision 21 - 24 at least partially covers a space 9 (or a volume) of presentation intended to receive at least one article to be recorded by the terminal 10 d automatic collection. The upper face of the cabinet 2 is, in one embodiment, included in the base of the space 9 of presentation. When an object is introduced into the presentation space 9 , this object is at least partially in the field of vision 21 - 24 of at least one of the image sensors 11 - 14 . In one embodiment, the presentation space 9 is included in intersections of the fields of vision 21 - 24 of the image sensors 11 - 14 so that an object present therein is included in the fields of vision 21 - 24 of at least two sensors 11 - 14 image.

In one embodiment, at least one of the image sensors 11 - 14 has a substantially vertical shooting axis. In other words, at least one of the image sensors 11 - 14 is arranged above the presentation space 9 . This image sensor 11 - 12 is, for example, fixed to the lower edge of the screen 3 overhanging the presentation space 9 so that it is sufficiently discreet and does not pick up an image of the customer's face.

In another embodiment, the automatic payment terminal 10 comprises image sensors 13 - 14 on either side of the presentation space 9 . These image sensors 13 - 14 are, for example, fixed to edges of the upper face of the cabinet 2 . The image sensors 13 - 14 are advantageously arranged in the side edges of the upper face of the cabinet 2 so as not to cover the space facing the automatic payment terminal 10 where the customer is expected. The image sensors 11 - 14 are, in one embodiment, arranged so that their optical axes (or shooting axes) are substantially concurrent at the center of the presentation space 9 .

For example, the image sensors 11 - 14 are arranged so that the intersection of their fields of vision 21 - 24 covers a space 9 of presentation or a visual envelope of 40, 50 or 60 cm3 of visibility in 3D above. top of the piece of furniture 2 and a surface of 80, 90 or 100 cm2 of 2D visibility on the upper face of the piece of furniture 2 .

Each of the image sensors 11 - 14 is capable of acquiring a plurality of successive images, in particular a sequence of video images or a video stream, of at least part of the presentation space 9 . The images acquired are, in one embodiment, 3D images provided by 3D image sensors 11 - 14 or obtained from multi-view 2D images acquired simultaneously by a combination of image sensors 11 - 14 . In order to control the brightness in the presentation space 9 and avoid shadows, one or more lighting devices 15 - 18 fixed to the edges of the upper face of the cabinet 2 and/or to the lower edge of the screen 3 can be considered. A substantially uniform lighting facilitates the segmentation of the acquired images and, consequently, the identification of the objects present in the space 9 of presentation.

The processing unit 1 is capable of detecting (isolating or extracting) from a plurality of successive images at least partially acquired by an image sensor 11 - 14 an object 30 - 31 moving in the presentation space 9 . The segmentation of the content of the images acquired by the static image sensors 11 - 14 into a background and a foreground comprising a moving object 30 - 31 can be obtained by any known method of the state of the art allowing detection of moving objects. These methods include, for example, background subtraction methods, moving edge search methods, or deep machine learning-based methods such as neural networks convolutional (better known by the acronym CNN for “Convolutional Neural Networks”), convolutional neural networks based on regions (known as, according to English terminology, R-CNN for “Region-based Convolutional Neural Networks “), convolutional neural networks based on fast regions (known as, according to Anglo-Saxon terminology, Fast R-CNN for “Fast Region-based Convolutional Neural Networks”), convolutional neural networks based on faster regions (Faster RCNN), or any other equivalent model. These successive images can be 2D images acquired by any one of the image sensors 11 - 14 or 3D images provided by a 3D image sensor 11 - 14 or obtained by combining multi-view 2D images acquired simultaneously . In order to facilitate the detection of a moving object 30 - 31 in the presentation space 9 , the image sensors 11 - 14 are, in one embodiment, oriented so as to have a substantially static background. The upper face of the cabinet 2 of the automatic payment terminal 10 is preferably of uniform color. An article 32-33 already present in the presentation space 9 is integrated into the background of the object 30-31 detected in motion.

The processing unit 1 identifies, in the object 30 - 31 in motion detected, a hand 30 (or, more generally, a gripping means) and an article 31 offered for sale by the point of sale. The detected hand 30 can be a right hand, a left hand, or both hands. The article 31 identified in the detected moving object 30 - 31 is gripped by the hand 30 during at least part of the movement of this object 30 - 31 in the presentation space 9 . The gripping of the article 30 designates, here, the grip of the latter, to present it to the terminal 10 of automatic collection. A sliding observation window of a predefined duration or comprising a predefined number of successive images acquired by at least one of the image sensors 11 - 14 makes it possible to follow the object 30 - 31 in motion detected. When only the hand 30 or only an article 31 is detected moving in this observation window, this moving object 30 - 31 is not taken into consideration. The condition according to which the article 31 is in prehension by the hand 30 during at least part of the movement of the object 30 - 31 detected in a plurality of successive images makes it possible to filter (i.e. not to take considered) image sequences where only the hand 30 or only an article 31 is detected in motion (for example, the movement of the hand 30 from the presentation space to the basket 36 to retrieve another article 34 - 35 to be recorded or the movement of an article 32 - 33 already recorded).

At least one attribute of the identified article 31 is determined by the processing unit 1 . This attribute is any parameter, descriptor, or characteristic of the article 31 identified. This attribute can, in fact, be a text (in particular, distinctive key words of Article 31 ), a drawing, a symbol, a logo, a barcode (one-dimensional or two-dimensional such as a code matrix or a quick response code better known as the English name QR code), a color and/or a plurality of colors, a dimension, a shape, and/or a volume. Attributes are, in one embodiment, determined by means of optical character recognition visible on the article 31 (better known by the Anglo-Saxon terminologies “Optical Character Recognition” or OCR). A semantic analysis advantageously makes it possible to correct or complete missing data in a textual attribute (for example, to correct "fromge" to "fromage"), by using for example the N-gram model. The use of a plurality of attributes also makes it possible to exploit incomplete data, such as a drawing, a text, or a barcode determined only partially from the acquired images.

In one embodiment, a plurality of attributes including at least one barcode are associated with the most coveted and/or most expensive items at the point of sale. These attributes can be determined from any one of the images of the plurality of successive images having allowed the detection of the moving object 30 - 31 , when the article 31 is gripped by the hand 30 or when it is placed in the space 9 of presentation.

Preferably, a plurality of attributes of the article 31 are determined so as to form an attribute vector. The determined attribute or attributes is/are supplied as input to a recognition module of the processing unit 1 capable of estimating, with a coefficient, a score or a confidence index, the article 31 on the basis of at least this or these attributes. The recognition module (CNN, R-CNN, Fast R-CNN, Faster R-CNN, or equivalent models) is, in one embodiment, based on machine learning trained with previously captured images of the section 31 . The higher the number of attributes, the higher the confidence index. In one embodiment, the weight of the article 31 determined by a variation of the weight measured by the side scale 6 and/or by the central scale 5 can be used as an additional verification attribute in addition to the visual attributes determined from acquired images. Recognizing Section 31 based on more than one attribute (eg, barcode and text or design) advantageously helps to combat shrinkage and unintentional errors.

When an attribute is sufficiently discriminating (such as the barcode), a single attribute can be used by the processing unit 1 to recognize the article 31 . When the barcode is among the determined attributes, the processing unit 1 is, in one embodiment, capable of detecting an outline on the article 31 around this barcode. This contour detection function advantageously makes it possible to check whether this barcode is printed directly on the article 31 or on a label glued/affixed thereto (the edges of this label form a contour around the barcode determined). Fraudulent manipulation of the barcodes can thus be detected.

Furthermore, at least one attribute of the gripping posture of the article 31 by the hand 30 is determined by the processing unit 1 . This attribute is, for example, the number of fingers and/or the surfaces of the hand which participate in gripping the article 31 to present it to the automatic payment terminal 10 and/or the shape of the hand 30 ( closed hand or open hand). Different gripping postures can, in fact, be adopted to present the article 31 such as bidigital postures, tridigital postures, quadridigital postures, postures involving the five fingers, or postures involving the palm of the hand. The gripping posture of the article 31 adopted by the customer is classified by the processing unit 1 into a plurality of predefined classes on the basis of at least one attribute of this posture.

In one embodiment, at least two classes of gripping postures are defined, namely a first class in which only the fingers participate in gripping the article 31 (i.e. the palm does not participate in the gripping). gripping), and a second class in which the palm of the hand 30 participates in gripping the article 31 . Thus, based on the surfaces of the hand 30 involved in gripping the article 31 , the gripping posture is classified in the first class when only the fingers are in contact with the article, and in the second class when the palm of the hand 30 is in contact with the article 31 .

When the shape, the size and/or the weight of the article 31 allow it, the gripping posture of this expected article 31 is a precision posture where only the fingers are spontaneously solicited to present it to the terminal 10 of collection automatic (ie without the palm of the hand 30 participating in this gripping). It is, in fact, considered that the gripping posture of article 31 depends on the client's motivation, in other words on his goal of the gripping posture adopted. A gripping posture involving the palm of the hand 30 is interpreted by the processing unit 1 as being likely to want to conceal part of the article 31 comprising an attribute relevant for the recognition of this article 31 (such as the code -bars, a text or a distinctive design of this article 31 ). A machine learning model (like CNN, R-CNN, Fast R-CNN, Faster R-CNN, or equivalents) regularly trained with grip postures adopted during Article 31 and other flight situations adopted during the correct recordings of this article 31 makes it possible, advantageously, to suitably classify and, therefore, interpret the gripping posture which is submitted to it.

The shape (spherical, cubic, or cylindrical for example), the dimensions and/or the weight of the article 31 are used (or taken into consideration) by the processing unit 1 in the classification of the grip posture. Section 31 attributes may, in some embodiments, influence the classification (or interpretation) of a gripping posture as natural/spontaneous or intentional (to hide one or more relevant Section 31 attributes ).

In another embodiment, at least one attribute of the movement of the article 31 when it is gripped in the presentation space 9 is determined by the processing unit 1 . These attributes include, for example, the trajectory of the movement, the speed of the movement, and/or a rotational movement (even partial) of the article 31 . These attributes aim to allow the classification or, more generally, an interpretation of the movement followed by the article 31 during its presentation at the automatic payment terminal 10 in a plurality of predefined classes (or categories). The movement of article 31 is, for example, classified
- in a first class when its speed (in particular, the average speed) is greater than a predefined threshold speed, this first class of movements not facilitating the recognition of article 31 or being considered/interpreted as doubtful movements (for example , rapid removal of an item 32 - 33 from the presentation space before the collection procedure is closed);
- in a second class when the trajectory of the movement of the article 31 is substantially at the limit of the presentation space 9 (in particular, when the article 31 is partially outside the presentation space or, equivalently , when a part of this article 31 is not included in any of the fields of vision 21 - 24 ), this second class of movements not favoring the determination of attributes of the article 31 and, consequently, also its recognition ;
- in a third class when it includes one or more rotational movements of the article 31 , this class of movements designating or, more generally, interpreted as being manipulations of the customer encountering difficulties in presenting / orienting the article 31 properly to save it.

A classification or a recognition/identification of the movement of Article 31 can be obtained by a measure of similarity between the signature of this movement and signatures predefined by a machine learning model or equivalent (multilayer neural networks or models of Hidden Markovs for example) trained with predefined scenarios (in particular, previously captured scenarios of theft and/or difficulty in recording Article 31 ).

Here, classification of a gripping posture of Article 31 or of a movement of Article 31 is understood to mean all operations making it possible to identify, recognize or interpret a gripping posture or a movement of the Article 31 based on attributes and pre-established knowledge (eg, rules, functions, or patterns).

When the confidence index with which the article 31 is recognized by the processing unit 1 is lower than a predefined threshold value (for example, 96%, 97%, 98% or 99%), the processing unit 1 processing is configured to generate a message associated with the class of the gripping posture of this article 31 and/or the class of the movement of this article 31 . This message (audio and/or visual) includes, for example,
- an indication relating to a presentation of article 31 allowing its recognition with a higher index of confidence when the class of the posture of prehension is the class in which the palm takes part in the prehension of article 31 and/or the class of the movement of the article 31 is the class in which the movement of the article 31 includes a rotational movement, and/or the class of the movement of the article 31 is the class in which the trajectory of the movement of the article 31 is substantially at the limit of the space 9 of presentation. This indication is sent to the client;
- an alert when the class of the gripping posture is the class in which the palm participates in the gripping of the article 31 , and/or the class of the movement of the article 31 is the class in which the speed of the movement of section 31 is greater than the predefined threshold speed. This alert is, in one embodiment, issued to a cashier staff requiring their intervention or their attention.

When the confidence index with which the article 31 is recognized is greater than or equal to the predefined threshold value, the processing unit 1 updates the list of recorded articles. This update includes adding item 31 to the list when it is filed in the presentation space 9 or removing this item 31 from the list when it is removed from the presentation space 9 .

In one embodiment, a customer places his basket 36 on the side support 8 and successively presents his items 31-35 to the automatic payment terminal 10 by depositing them on the upper face of the cabinet 2 . This upper face of the cabinet 2 is included in the space 9 of presentation. The processing unit 1 takes care of the segmentation of the images acquired by the image sensors 11 - 14 in order to detect therein an object 30 - 31 in motion. An article 31 and a hand 30 are identified in this object 30-31 by the processing unit 1 , the article 31 being gripped by the

hand

30 during at least part of the movement of the object 30-31 . Characteristics (i.e. attributes) of the article 31 , of the gripping posture of the article 31 by the hand 30 , and/or of the movement of the article 31 when it is gripped by the hand 30 are, respectively, supplied as input to corresponding classification modules. When the article is recognized with a confidence index lower than a predefined threshold value, a message associated with the determined class of the gripping posture and/or of the movement of the article 31 is generated by the processing unit 1 at intended for the customer and/or a cashier. When all of the customer's articles 31 - 35 are recognized with a confidence index greater than the threshold value, the customer completes the collection procedure by paying for his purchases and bagging his articles 31 - 35 . A receipt, or more generally a physical or dematerialized proof of his purchase is subsequently issued to him.

In another embodiment, the automatic payment terminal 10 operates in a network with other nodes of a network of entities. These entities may be self-checkout kiosks, shopping assistance devices integrated into portable shopping carts or wheeled shopping carts, reading or price information devices, or any other connected object. These interconnected entities can exist in the same point of sale or in distant points of sale. The automatic collection terminal 10 places the computing resources of its processing unit 1 , when they are available (off-peak periods or periods of inactivity of the processing unit 1 ), available to the other entities. This results, advantageously, in optimal use of the calculation resources of the terminal 10 for automatic collection. A sharing of computing resources between several terminals 10 of automatic collection also makes it possible to improve their performance.

In addition, the automatic payment terminal 10 contributes to the supply, preferably after validation by an operator, of a database of 2D and/or 3D images of the article 31 , or of gripping postures adopted to present this article 31 , and/or movements followed by this article 31 during its presentation in order to improve, respectively, the classification models of this article 31 , the classification models of the gripping postures of this article 31 , and /or movement classification models followed by this article 31 when it is presented. This results, advantageously, in evolutionary classification (identification or recognition) models. The customer thus participates in the production of high value-added data for the point of sale.

Advantageously, the embodiments described above allow a visual interpretation of the movements of the customer's hand during his presentation of the items at the automatic collection terminal 10 in order to assist him in the collection procedure, to speed up his passage at checkout, and reduce shrinkage and unintentional errors.

Although the automatic payment terminal is described above with respect to embodiments and variants, those skilled in the art will understand that these embodiments and variants are not limiting and can be combined with each other and/or with any other equivalent embodiment.

Claims

Automatic payment terminal ( 10 ) comprising
- a first image sensor ( 11 - 14 ) having a first field ( 21 - 24 ) of vision at least partially covering a presentation space ( 9 ) intended to receive an article ( 31 ) to be recorded by the terminal ( 10 ) automatic collection;
- a processing unit ( 1 ) capable of
- detecting in a plurality of successive images at least partially acquired by the first image sensor ( 21-24 ) an object ( 30-31 ) moving in the presentation space ( 9 ) ;
- identify, in said object ( 30 - 31 ), a hand ( 30 ) and an article ( 31 ), the article ( 31 ) being gripped by the hand ( 30 ) during at least part of the movement of the object ( 30 - 31 );
- determining a first attribute of the item ( 31 ) identified;
- recognizing with a first confidence index said article ( 31 ) on the basis of at least said first attribute;
- determining a second attribute of the gripping posture of the article ( 31 ) and classifying this gripping posture in a first plurality of predefined classes on the basis of at least said second attribute and/or determining a third attribute of the movement of the article ( 31 ) when this article ( 31 ) is gripped by the hand ( 30 ) and classifying this movement into a second plurality of predefined classes on the basis of at least said third attribute;
- generating, when the article ( 31 ) is recognized with a confidence index lower than a predefined threshold value, a message associated with the class of the grip posture and/or the class of the movement of the article ( 31 ) .
Automatic payment terminal ( 10 ) according to the preceding claim, characterized in that the second attribute is chosen from a list comprising the number of fingers of the hand ( 30 ) which take part in gripping the article ( 31 ), the surfaces of the hand ( 30 ) which assist in gripping the article ( 31 ).
Automatic payment terminal ( 10 ) according to the preceding claim, characterized in that the first plurality of classes comprises a first class in which the palm of the hand ( 30 ) takes part in gripping the article ( 31 ).
Automatic payment terminal ( 10 ) according to any one of the preceding claims, characterized in that the third attribute is chosen from a list comprising the speed of movement of the article ( 31 ), the trajectory of the movement of the article ( 31 ), a rotational movement of the article ( 31 ).
Automatic payment terminal ( 10 ) according to the preceding claim, characterized in that the second plurality of classes comprises a second class in which the speed of the movement is greater than a predefined threshold speed, and/or a third class in which the trajectory of the movement is substantially at the limit of the space ( 9 ) of presentation, and/or a fourth class in which the movement comprises a rotational movement.
Automatic payment terminal ( 10 ) according to Claim 3, characterized in that the message generated comprises, when the class of the grip posture is the first class, an alert intended for a cashier.
Automatic payment terminal ( 10 ) according to claim 5, characterized in that the message generated comprises, when the class of movement of the article ( 31 ) is the fourth class, an indication relating to a predefined presentation of the article ( 31 ), this presentation allowing recognition of the article ( 31 ) with a second confidence index greater than the first confidence index.
Automatic payment terminal ( 10 ) according to any one of the preceding claims, characterized in that the first attribute is chosen from a list comprising a barcode, a color, a text, a drawing, a symbol, a logo, a dimension, a shape, a volume.
Terminal ( 10 ) for automatic collection according to the preceding claim, characterized in that the processing unit ( 1 ) is, moreover, capable of detecting an outline on the article ( 31 ) around the barcode.
Automatic payment terminal ( 10 ) according to any one of the preceding claims, characterized in that it further comprises a second image sensor ( 13 ) having a second field ( 23 ) of vision covering at least partially the presentation space ( 9 ), a third image sensor ( 14 ) having a third field ( 24 ) of vision at least partially covering the presentation space ( 9 ), the first sensor ( 11 - 12 ) of image being arranged above the space ( 9 ) for presentation, the second and the third sensor ( 13 - 14 ) being arranged on either side of the space ( 9 ) for presentation.