WO2024100935A1 - Input device and input method


Info

Publication number
WO2024100935A1
Authority
WO
WIPO (PCT)
Prior art keywords
input
user
gaze position
gaze
processor
Application number
PCT/JP2023/026902
Other languages
French (fr)
Japanese (ja)
Inventor
匡夫 濱田
毅 吉原
智広 森川
要介 田中
Original Assignee
Panasonic Intellectual Property Management Co., Ltd.
Application filed by Panasonic Intellectual Property Management Co., Ltd.
Publication of WO2024100935A1

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/03Arrangements for converting the position or the displacement of a member into a coded form
    • G06F3/033Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor
    • G06F3/0346Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor with detection of the device orientation or free movement in a 3D space, e.g. 3D mice, 6-DOF [six degrees of freedom] pointers using gyroscopes, accelerometers or tilt-sensors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/03Arrangements for converting the position or the displacement of a member into a coded form
    • G06F3/033Pointing devices displaced or positioned by the user, e.g. mice, trackballs, pens or joysticks; Accessories therefor
    • G06F3/038Control and interface arrangements therefor, e.g. drivers or device-embedded control circuitry

Definitions

  • This disclosure relates to an input device and an input method.
  • Patent Document 1 discloses an information processing device that acquires a history of information indicating the correspondence between the position of a user's gaze point and the position of an index indicating the operation position operated by the user, detects the user's gaze point, and, based on the acquired history, controls the display position of the index so that the index is displayed at a position corresponding to the detected current gaze point.
  • However, this gaze-point calibration process requires accumulating information that indicates the correspondence between the gaze-point position and the operation position. In an environment where such correspondence information cannot be accumulated for each user, such as an Automatic Teller Machine (ATM) used outside the home, the information processing device may be unable to perform sufficient calibration; the resulting error in the operation position can make input operations based on the gaze position difficult.
  • The present disclosure has been devised in view of the above circumstances, and aims to provide an input device and an input method that make calibration of the gaze position in gaze input more efficient.
  • The present disclosure provides an input device capable of accepting an input operation based on a user's gaze position, the input device comprising: a display unit that displays an input screen capable of accepting the input operation; a camera that images the user; a calculation unit that calculates a correction parameter for calibrating the gaze position of a first user with respect to the input screen based on the gaze position of the first user shown in a first captured image; and a processor that detects the gaze position of a second user shown in a second captured image taken after the correction parameter is calculated, calibrates the gaze position of the second user using the correction parameter, and accepts the input operation with respect to the input screen based on the calibrated gaze position of the second user.
  • The present disclosure also provides an input method performed by an input device capable of accepting an input operation based on a user's gaze position, the input method including: displaying an input screen capable of accepting the input operation; acquiring a first captured image of the user; calculating a correction parameter for calibrating the gaze position of a first user with respect to the input screen based on the gaze position of the first user shown in the first captured image; acquiring a second captured image of the user; calibrating the gaze position of a second user shown in the second captured image using the correction parameter; and accepting the input operation with respect to the input screen based on the calibrated gaze position of the second user.
  • The present disclosure also provides an input device capable of accepting an input operation based on a user's gaze position, the input device including a processor that detects the gaze position of a first user shown in a first captured image captured by a camera and the gaze position of a second user shown in a second captured image captured by the camera, and accepts the input operation on an input screen capable of accepting the input operation, based on the gaze position of the first user and the gaze position of the second user.
  • This disclosure makes it possible to more efficiently calibrate gaze position during gaze input.
  • FIG. 1 is a block diagram showing an example of the internal configuration of a gaze input device according to a first embodiment.
  • FIG. 2 is a diagram for explaining an example of an operation procedure of the gaze input device according to the first embodiment.
  • FIG. 3 is a diagram for explaining an example of an operation procedure of the gaze input device according to the first embodiment.
  • FIG. 4 is a diagram for explaining an example of a method for calibrating a gaze position.
  • FIG. 5 is a diagram for explaining a method for calculating the movement direction of the gaze position.
  • FIG. 6 is a diagram for explaining a method of accepting a gaze input operation based on the movement direction of the gaze position.
  • FIG. 7 is a diagram showing an example of an angle at which a gaze input operation can be accepted based on the movement direction of the gaze position.
  • FIG. 8 is a diagram for explaining an example of a dead region.
  • FIG. 9 is a diagram for explaining a first example of a gaze input operation procedure.
  • FIG. 10 is a diagram for explaining a first example of a gaze input operation procedure.
  • FIG. 11 is a diagram for explaining a second example of a gaze input operation procedure.
  • FIG. 12 is a diagram for explaining a second example of a gaze input operation procedure.
  • FIG. 13 is a diagram showing another example of an input screen.
  • FIG. 14 is a diagram showing another example of an input screen.
  • Fig. 1 is a block diagram showing an example of the internal configuration of the gaze input device P1 according to embodiment 1. It should be noted that the gaze input device P1 shown in Fig. 1 is merely an example, and needless to say, the present invention is not limited to this.
  • The gaze input device P1 is equipped with a camera 13 capable of capturing an image of the face of a user looking at a display 14, and is realized by, for example, a personal computer (hereinafter "PC"), a notebook PC, a tablet terminal, or a smartphone.
  • The gaze input device P1 is capable of accepting gaze input operations based on the user's gaze position.
  • The gaze input device P1 is a system capable of accepting input operations based on the user's gaze, and includes a processor 11, a memory 12, a camera 13, a display 14, and a database DB1. Note that the database DB1 may be configured separately from the gaze input device P1.
  • The camera 13 and the display 14 may also be configured separately from the gaze input device P1.
  • The processor 11, which is an example of a calculation unit, is configured using, for example, a Central Processing Unit (CPU) or a Field Programmable Gate Array (FPGA), and performs various processes and controls in cooperation with the memory 12. Specifically, the processor 11 references the programs and data stored in the memory 12 and executes the programs to realize the functions of each unit.
  • The processor 11 outputs the calibration screen SC0 (see FIG. 4; an example of an input screen) to the display 14 for display.
  • The processor 11 then causes the camera 13 to capture an image of the user looking at the calibration screen SC0, and calculates correction parameters (e.g., a transformation matrix) for correcting (transforming) the positional deviation of the user's gaze position detected from the captured image output by the camera 13.
  • After calculating the correction parameters, the processor 11 displays the input screen SC1 (see FIGS. 6, 7, and 8) and starts accepting gaze input operations by the user on the input screen SC1.
  • The processor 11 uses the correction parameters to correct and store the user's gaze position detected from the captured image output by the camera 13, and accepts input operations of input information (e.g., PIN code, symbols, pictograms, stroke order, password, etc.) based on the time-series changes in the gaze position.
  • The processor 11 performs image analysis on the captured image output from the camera 13 and generates measurement environment information. Based on the generated measurement environment information, the processor 11 determines a threshold value for the variability of the detected gaze positions.
  • The measurement environment information here includes, for example, any of the following: the size of the display 14, the brightness of the user's face area shown in the face image, the distance between the display 14 and the user (imaging distance), and the angle of the user's face. A sketch of one possible threshold heuristic follows below.
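  • The patent does not fix a concrete mapping from measurement environment to threshold; the following is a minimal sketch of one possible heuristic, assuming pixel units, a 0-255 brightness scale, and nominal values chosen purely for illustration.

```python
# Hypothetical heuristic: relax the positional-blur threshold when the face
# image is dark or the user is far from the display, since gaze detection is
# noisier in those conditions. All names and factors here are assumptions.

def blur_threshold_px(display_height_px: float,
                      face_brightness: float,
                      imaging_distance_mm: float,
                      base_threshold_px: float = 30.0) -> float:
    brightness_factor = 1.5 if face_brightness < 80 else 1.0  # 0-255 scale
    distance_factor = max(imaging_distance_mm / 500.0, 1.0)   # 500 mm nominal
    size_factor = display_height_px / 1080.0                  # 1080 px nominal
    return base_threshold_px * brightness_factor * distance_factor * size_factor
```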
  • The processor 11 compares the input information based on the accepted input operations with the user's registration information (e.g., PIN code, password, etc.) registered in the database DB1.
  • The user's registration information may be registered in the memory 12 instead of the database DB1.
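  • The comparison itself is not detailed in the patent; as a minimal sketch under that assumption, a constant-time comparison is the idiomatic way to check an entered PIN against the registered one (the function names are illustrative).

```python
import hmac

def verify_pin(entered_pin: str, registered_pin: str) -> bool:
    # hmac.compare_digest runs in constant time, avoiding timing side channels
    # when comparing secrets such as PIN codes or passwords.
    return hmac.compare_digest(entered_pin.encode(), registered_pin.encode())
```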
  • The memory 12 has, for example, a Random Access Memory (hereinafter "RAM") used as a working memory when executing each process of the processor 11, and a Read Only Memory (hereinafter "ROM") that stores programs and data defining the operation of the processor 11. Data or information generated or acquired by the processor 11 is temporarily stored in the RAM. Programs that define the operation of the processor 11 are written in the ROM. The memory 12 stores the size of the display 14, etc., and may also store the installation position and distance of the camera 13 relative to the display 14.
  • The camera 13 has at least an image sensor (not shown) and a lens (not shown).
  • The image sensor is, for example, a solid-state imaging element such as a Charge-Coupled Device (CCD) or a Complementary Metal Oxide Semiconductor (CMOS) sensor, and converts the optical image formed on the imaging surface into an electrical signal.
  • The display 14, which is an example of a display unit, is configured using, for example, a Liquid Crystal Display (LCD) or an organic electroluminescence (EL) display.
  • The display 14 displays the calibration screen SC0 (see FIG. 4), the input screen SC1 (see FIGS. 6, 7, and 8), etc., output from the processor 11.
  • Database DB1 is a so-called storage device, and is configured using a storage medium such as a flash memory, a hard disk drive (HDD), or a solid state drive (SSD).
  • Database DB1 stores user registration information (e.g., PIN code, password, etc.) in a manner that allows it to be managed for each user.
  • Figure 2 is a diagram for explaining an example of the operation procedure of the eye-gaze input device P1 according to embodiment 1.
  • Figure 3 is a diagram for explaining an example of the operation procedure of the eye-gaze input device P1 according to embodiment 1.
  • FIG. 4 is a diagram for explaining an example of a method for calibrating the eye-gaze position.
  • The calibration screen SC0 shown in FIG. 4 is similar to the input screen SC1 (see FIG. 6) for accepting input operations of input information, but the calibration screen is not limited to this example.
  • The processor 11 outputs the calibration screen SC0 (see FIG. 4) including the center point "A" to the display 14 and displays it (St11).
  • The calibration screen SC0 includes a center point "A".
  • The center point "A" is located approximately in the center of the calibration screen SC0.
  • The processor 11 requests the user to look at (gaze at) the center point "A" included in the calibration screen SC0. This request may be made by displaying a message on the display 14 requesting the user to gaze at the center point "A", or by outputting such a message as audio from a speaker (not shown).
  • The camera 13 captures an image of the user gazing at the calibration screen SC0 (St12) and outputs the captured image to the processor 11.
  • The processor 11 detects the face of the user (person) from the captured image output from the camera 13, and detects the user's gaze position on the calibration screen SC0 using a gaze detection algorithm (St13).
  • The processor 11 stores information on the detected gaze position (coordinates) in the memory 12 (St13).
  • The processor 11 calculates the amount of positional blur of the gaze positions Pt0 accumulated over a predetermined time (e.g., 0.3 seconds or 0.5 seconds) (St14).
  • The amount of positional blur is a standard deviation value indicating the variation of the detected gaze positions.
  • The processor 11 determines whether the calculated amount of positional blur of the gaze positions Pt0 for the predetermined time is less than a threshold value (St15).
  • The threshold value is a predetermined value determined based on the measurement environment information.
  • If the processor 11 determines in step St15 that the calculated amount of positional blur of the gaze positions Pt0 for the predetermined time is less than the threshold value (St15, YES), it calculates the center position PtC0 of the gaze positions Pt0 for the predetermined time (St16).
  • If the processor 11 determines in step St15 that the calculated amount of positional blur is not less than the threshold value (St15, NO), it returns to step St12.
  • The processor 11 calculates a correction parameter (transformation matrix DR0) for coordinate transformation of the center position PtC0 to the center position Ct of the center point "A" (i.e., the center position of the area ARA), based on the calculated center position PtC0 and the center position Ct (see FIG. 8) of the center point "A" (St17).
  • The correction parameter calculated here is a parameter for transforming (correcting) the user's detected gaze position (i.e., the center position PtC0 of the gaze positions Pt0) into the input position intended by the user.
  • Using the correction parameter, the processor 11 can correct the positional shift of the user's gaze position, for example, by converting the gaze positions Pt0 for a predetermined time into input positions Pt0' and the gaze positions Pt1 for a predetermined time into input positions Pt1'. A minimal sketch of this calibration flow follows below.
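  • The following is a minimal sketch of the calibration flow of steps St14 to St17, assuming gaze samples are 2-D screen coordinates in pixels and reducing the correction to a pure translation; the patent only says the parameter may be a transformation matrix, so the exact form is an assumption.

```python
import numpy as np

def positional_blur(samples: np.ndarray) -> float:
    """Standard deviation of the gaze samples around their mean (St14)."""
    return float(np.linalg.norm(samples.std(axis=0)))

def calibrate(samples: np.ndarray, target: np.ndarray, threshold: float):
    """Return a 3x3 homogeneous correction matrix, or None if too blurry."""
    if positional_blur(samples) >= threshold:  # St15 NO: capture again
        return None
    center = samples.mean(axis=0)              # St16: center position PtC0
    shift = target - center                    # St17: map PtC0 onto Ct
    return np.array([[1.0, 0.0, shift[0]],
                     [0.0, 1.0, shift[1]],
                     [0.0, 0.0, 1.0]])

def apply_correction(correction: np.ndarray, gaze: np.ndarray) -> np.ndarray:
    """Convert a detected gaze position into a corrected input position."""
    return (correction @ np.append(gaze, 1.0))[:2]
```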
  • After calculating the correction parameters, the processor 11 ends the calibration process, and outputs and displays on the display 14 the input screen SC1 for accepting input of input information (St18).
  • The input screen SC1 includes a center point "A" and input keys "1", "2", "3", and "4" corresponding to four numbers.
  • The center point "A" is located approximately in the center of the input screen SC1.
  • The input keys "1" to "4" are located at approximately equal intervals on a concentric circle centered on the center point "A".
  • The camera 13 captures an image of the user gazing at the input screen SC1 (St19) and outputs the captured image to the processor 11.
  • Processor 11 detects the face of the user (person) from the captured image output from camera 13, and detects the user's gaze position on input screen SC1 using a gaze detection algorithm (St20). Processor 11 accumulates information on the detected gaze position (coordinates) for a predetermined period of time (e.g., 0.3 seconds, 0.5 seconds, etc.) in memory 12 in chronological order (St20). Processor 11 may also accumulate information on the detected gaze position in association with imaging time information of the captured image in which this gaze position was detected.
  • The processor 11 calculates an approximate line based on the gaze positions accumulated for a predetermined time and the center position Ct (see FIG. 8) of the center point "A", which is the starting point of the gaze position movement (St22).
  • The processor 11 calculates the angle of the calculated approximate line and accumulates it in the memory 12 in chronological order (St23).
  • The processor 11 calculates an angular blur amount (e.g., the angular blur amount θA shown in FIG. 5, or the angular blur amount θB shown in FIG. 6) indicating the amount of blur in the user's gaze position, based on the angles of the multiple approximate lines accumulated through at least two executions of the approximate line calculation process Rp1 (St24).
  • The processor 11 determines whether the angular blur amount is less than a threshold value corresponding to a specific input key (St25).
  • The threshold value referred to here is, for example, the threshold θ1 shown in FIG. 7, or the thresholds θ2 and θ3 shown in FIGS. 13 and 14, and is used to determine, based on the direction of movement of the user's gaze position (i.e., the angle of the approximate line), which of the input keys displayed on the input screens SC1, SC2, and SC3 (the center point "A", the number "1", and so on) has been input.
  • If the processor 11 determines in step St25 that the angular blur amount is less than the threshold value (St25, YES), it confirms the input content (input key) based on the input key (e.g., center point "A", number "1", and so on) positioned in the direction of movement of the user's gaze position indicated by the approximate line, and accepts the input operation based on the user's gaze (St26). A minimal sketch of these steps follows below.
  • If the processor 11 determines in step St25 that the angular blur amount is not less than the threshold value (St25, NO), it returns to step St19.
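  • As a minimal sketch of steps St22 to St25: each approximate line can be taken as the ray from the center position Ct through the mean of one window of gaze samples, and the angular blur as the spread of the accumulated angles. The fitting method is an assumption, since the patent only speaks of an "approximation line", and wrap-around of angles near ±180° is ignored here for brevity.

```python
import math
import statistics

def line_angle(ct: tuple, window: list) -> float:
    """Angle (degrees) of the line from Ct through the window's mean (St22-St23)."""
    mx = sum(p[0] for p in window) / len(window)
    my = sum(p[1] for p in window) / len(window)
    return math.degrees(math.atan2(my - ct[1], mx - ct[0]))

def angular_blur(angles: list) -> float:
    """Amount of blur in the movement direction: spread of the angles (St24)."""
    return statistics.pstdev(angles)

def movement_direction(angles: list, threshold_deg: float):
    """Return the mean direction if the blur is below the threshold (St25)."""
    if len(angles) >= 2 and angular_blur(angles) < threshold_deg:
        return statistics.mean(angles)
    return None  # St25 NO: keep capturing (back to St19)
```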
  • The processor 11 determines whether input operations for a predetermined number of digits (e.g., three digits or four digits) have been completed, based on the number of input contents (input keys) accepted as input operations (St27).
  • If the processor 11 determines in step St27 that the input operations for the predetermined number of digits have been completed (St27, YES), the operation procedure shown in FIG. 3 ends. The processor 11 then acquires input information based on the input contents (input keys) for the predetermined number of digits, and proceeds to compare the input information with the registration information previously registered in the database DB1.
  • If the processor 11 determines in step St27 that the input operations for the predetermined number of digits have not been completed (St27, NO), it returns to step St19.
  • In this way, the gaze input device P1 can accept an input operation based on the direction of movement (i.e., the time-series change) of the user's gaze position, even when the user's gaze position is not detected within the areas ARA, AR1, AR2, AR3, and AR4 of the input keys, or when the gaze position detected within the area ARA (an example of a first input section) or the areas AR1 to AR4 (an example of a second input section) does not satisfy a predetermined condition for accepting an input operation (for example, that the gaze position be detected continuously within the areas ARA and AR1 to AR4 for a predetermined time or more).
  • The gaze input device P1 can thus accept input key operations without the user having to gaze directly at the areas ARA and AR1 to AR4 of each input key, so the time each user needs for input operations can be reduced more effectively.
  • The gaze input device P1 can also receive input operations based on the user's gaze with a high degree of accuracy. Even when the device is used by an unspecified number of users and correction parameters are not recorded and stored for each user in advance, it can calculate correction parameters from the user gazing at as little as one point (for example, the center point "A"), even if the resulting calibration accuracy is low, and can therefore more effectively reduce the time required to calculate correction parameters for each user.
  • FIG. 5 is a diagram explaining the method of calculating the moving direction of the gaze position.
  • In the example shown in FIG. 5, the processor 11 executes the approximate line calculation process Rp1 three times.
  • The processor 11 accumulates the gaze positions for a predetermined time (St21), and calculates the approximate line DR21 based on the accumulated gaze positions Pt21' and the center position Ct of the center point "A", which is the starting point of the gaze position movement (St22).
  • The processor 11 calculates the angle of the approximate line DR21 and accumulates it in chronological order in the memory 12 (St23).
  • The processor 11 then returns to step St19, further accumulates gaze positions for a predetermined time (St21), and calculates the approximate line DR22 based on the accumulated gaze positions Pt22' and the center position Ct (St22).
  • The processor 11 calculates the angle of the approximate line DR22 and accumulates it in chronological order in the memory 12 (St23).
  • The processor 11 again returns to step St19, further accumulates gaze positions for a predetermined time (St21), and calculates the approximate line DR23 based on the accumulated gaze positions Pt23' and the center position Ct (St22).
  • The processor 11 calculates the angle of the approximate line DR23 and accumulates it in chronological order in the memory 12 (St23).
  • The processor 11 then calculates the angular blur amount θA, which indicates the amount of blur in the user's gaze position, based on the angles of the three approximate lines DR21 to DR23 accumulated by the approximate line calculation process Rp1 (St24).
  • In this way, the gaze input device P1 can calculate the direction in which the user's gaze position moves and the amount of blur in that direction.
  • Fig. 6 is a diagram explaining a method of accepting an eye-gaze input operation based on the moving direction of the eye-gaze position.
  • Fig. 7 is a diagram showing an example of an angle at which an eye-gaze input operation based on the moving direction of the eye-gaze position can be accepted.
  • In the example shown in FIG. 6, the processor 11 calculates four approximate lines DR31 to DR34, one for each window of accumulated gaze positions, based on the four sets of gaze positions and the center position Ct of the center point "A" (see FIG. 8) (St22).
  • The processor 11 calculates the angle of each of the four approximate lines DR31 to DR34 and accumulates them in the memory 12 (St23).
  • The processor 11 calculates the angular blur amount θB, which indicates the amount of blur in the user's gaze position, based on the angles of the approximate lines DR31 to DR34 accumulated by the approximate line calculation process Rp1 (St24).
  • The processor 11 determines whether the calculated angular blur amount θB is less than the threshold θ1 (St25).
  • The threshold θ1 may be an angle of 90° or less.
  • If the processor 11 determines that the calculated angular blur amount θB is less than the threshold θ1, it determines which of the angle regions θ11, θ12, θ13, and θ14 the angle of each approximate line falls in, in the chronological order in which the angles of the approximate lines DR31 to DR34 were calculated. If the processor 11 determines from these results that the angle of the approximate line falls within the same angle region a predetermined number of times (e.g., two times or five times) or more consecutively, it accepts an input operation whose input content is the input key that corresponds to this angle region and is positioned in the direction of movement of the user's gaze position indicated by the approximate line (St26).
  • The angle region θ11 shown in FIG. 7 is the region from -45° or more to less than +45°, with the position of the input key "1" as the reference (0°).
  • The angle region θ12 is the region from +45° or more to less than +135°, with the position of the input key "1" as the reference (0°).
  • The angle region θ13 is the region from +135° or more to less than +225°, with the position of the input key "1" as the reference (0°).
  • The angle region θ14 is the region from +225° or more to less than +315° (i.e., less than -45°), with the position of the input key "1" as the reference (0°).
  • The processor 11 determines which of the angle regions θ11 to θ14 includes the angle of the approximate line DR31. After determining that the angle of the approximate line DR31 is included in the angle region θ11, the processor 11 determines whether the angle of the approximate line DR32, which is calculated next, is included in the same angle region θ11.
  • The processor 11 then determines whether the angle of the approximate line DR33, which is calculated next after DR32, is included in the same angle region θ11 as the angles of the approximate lines DR31 and DR32.
  • The processor 11 further determines whether the angle of the approximate line DR34, which is calculated next after DR33, is included in the same angle region θ11 as the angles of the approximate lines DR31 to DR33.
  • After determining that the angle of the approximate line DR34 is included in the angle region θ11, the processor 11 detects that the angles of the approximate lines DR31 to DR34 have been determined to fall within the same angle region θ11 four times in succession (i.e., the predetermined number of times). The processor 11 therefore accepts an input operation whose input content is the input key "1" corresponding to the angle region θ11 that includes the angles of the approximate lines DR31 to DR34. A sketch of this decision follows below.
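  • The following is a minimal sketch of the angle-region decision of FIG. 7, assuming the four keys sit at 0°, 90°, 180°, and 270° relative to the input key "1" and that each key owns a ±45° sector; the sector-to-key assignment is an assumption consistent with the regions θ11 to θ14 described above.

```python
KEYS = ["1", "2", "3", "4"]  # assumed order: key "1" at 0 deg, then every 90 deg

def key_for_angle(angle_deg: float) -> str:
    """Map a line angle (relative to key "1" = 0 deg) to the key whose
    +/-45 deg sector (theta11..theta14) contains it."""
    sector = int(((angle_deg + 45.0) % 360.0) // 90.0)
    return KEYS[sector]

def accept_key(angles: list, required_hits: int = 4):
    """Accept a key once its sector is hit `required_hits` times in a row (St26)."""
    if len(angles) < required_hits:
        return None
    recent = [key_for_angle(a) for a in angles[-required_hits:]]
    return recent[0] if len(set(recent)) == 1 else None

# Example: four consecutive line angles near 0 deg accept input key "1".
print(accept_key([-10.0, 5.0, 12.0, -3.0]))  # -> "1"
```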
  • In this way, the gaze input device P1 can estimate the input content that the user is about to input based on the direction of movement of the user's gaze position, and can accept it as an input operation before the user gazes at the input key. This makes it possible to reduce the time each user needs to input information more efficiently. Furthermore, because the gaze input device P1 estimates the direction of movement of the user's gaze position while checking the angular blur amount, it can more effectively prevent erroneous input of input information even when the calibration accuracy is low.
  • FIG. 8 is a diagram illustrating an example of the insensitive area ARN. Note that setting the insensitive area ARN is not essential and may be optional.
  • The insensitive area ARN is an area in which the estimation of the input content based on the angle of the approximate line is disabled; it lies outside the area ARA of the center point "A" and within a radius R1 from the center position Ct of the area ARA.
  • The insensitive area ARN may also be set in other shapes (e.g., an ellipse or a diamond) based on the aspect ratio and size of the display 14 or the arrangement of the input keys.
  • When the processor 11 determines that the detected gaze position of the user is within the insensitive area ARN, it does not use that gaze position in the calculation of the approximate line and its angle. In other words, the processor 11 calculates the approximate line and its angle based only on gaze positions detected outside the insensitive area ARN, estimates the input content that the user is about to input based on the calculated angle, and accepts it as an input operation before the user gazes at the input key.
  • By using only gaze positions detected at least a predetermined distance (radius R1) away from the center point "A" to estimate the input content, the gaze input device P1 can exclude gaze positions detected near the center point "A", where the variation in the angle of the approximate line tends to be small and erroneous determination of the input content is likely. The gaze input device P1 can therefore more effectively suppress erroneous determination in the process of estimating the input content from the movement direction of the user's gaze position. A sketch of this filtering follows below.
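  • A minimal sketch of the insensitive-area filtering, assuming 2-D pixel coordinates; the radius R1 and the tuple representation are illustrative.

```python
import math

def outside_dead_zone(samples: list, ct: tuple, r1: float) -> list:
    """Keep only gaze samples farther than R1 from the center position Ct,
    so points inside the insensitive area ARN never reach the line fit."""
    return [p for p in samples
            if math.hypot(p[0] - ct[0], p[1] - ct[1]) > r1]
```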
  • Fig. 9 is a diagram for explaining an example of the first gaze input operation procedure.
  • Fig. 10 is a diagram for explaining an example of the first gaze input operation procedure.
  • The first gaze input operation procedure alternately accepts input operations for the center point "A" and input operations for the input keys "1" to "4", and corresponds to the processing of steps St19 to St26 shown in FIG. 3.
  • The input screens SC41, SC42, SC43, SC44, SC45, SC46, SC47, and SC48 shown in FIGS. 9 and 10 are merely examples and are not limiting.
  • The number of numeric input keys is not limited to four.
  • The arrangement of the numeric input keys is not limited to the arrangement shown in the input screens SC41 to SC48; they may be rotated by any angle (for example, 45°) and arranged.
  • The processor 11 displays the input screen SC41 on the display 14, instructs the user to look at the center point "A" (by outputting a voice, a message, etc.), and enables (makes acceptable) only the input operation for the center point "A" out of the five input keys.
  • Processor 11 detects the user's gaze position based on the captured image. Processor 11 determines whether the user is gazing within area ARA of center point "A" or in the direction in which center point "A" is located based on the detected user's gaze position. Processor 11 may generate correction parameters for calibrating the user's gaze position at this timing.
  • If the processor 11 determines, based on the detected user's gaze position, that the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located, it displays the input screen SC42 on the display 14 and instructs the user to gaze at one of the number input keys. On the input screen SC42, the processor 11 disables input operations for the center point "A" and enables input operations for each of the four input keys "1" to "4" arranged around the center point "A".
  • Processor 11 detects the user's gaze position based on the captured image, determines whether the user is gazing at the area AR1 of the input key "1", the area AR2 of the input key "2", the area AR3 of the input key "3", or the area AR4 of the input key "4" based on the detected gaze position or the movement direction DR41 of the gaze position (the angle of the approximate line), and accepts the input operation of the number input key "1" on the input screen SC42.
  • After accepting an input operation of one of the numeric input keys (here, input key "1"), the processor 11 displays the input screen SC43 on the display 14, instructs the user to gaze at the center point "A" again, and enables (makes acceptable) only the input operation for the center point "A" out of the five input keys.
  • Processor 11 detects the user's gaze position based on the captured image. Processor 11 determines whether the user is gazing within area ARA of center point "A" or in the direction in which center point "A" is located based on the detected user's gaze position or the movement direction DR42 of the user's gaze position (angle of the approximated line).
  • If the processor 11 determines, based on the detected user's gaze position, that the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located, it displays the input screen SC44 on the display 14 and instructs the user to gaze at one of the number input keys. On the input screen SC44, the processor 11 disables input operations for the center point "A" and enables input operations for each of the four input keys "1" to "4" arranged around the center point "A".
  • Processor 11 detects the user's gaze position based on the captured image, determines whether the user is gazing at the area AR1 of the input key "1", the area AR2 of the input key "2", the area AR3 of the input key "3", or the area AR4 of the input key "4" based on the detected gaze position or the movement direction DR43 of the gaze position (the angle of the approximate line), and accepts the input operation of the number input key "2" on the input screen SC44.
  • After accepting an input operation of one of the numeric input keys (here, input key "2"), the processor 11 displays the input screen SC45 on the display 14, instructs the user to gaze at the center point "A" again, and enables only the input operation for the center point "A" out of the five input keys.
  • Processor 11 detects the user's gaze position based on the captured image. Processor 11 determines whether the user is gazing within area ARA of center point "A" or in the direction in which center point "A” is located based on the detected user's gaze position or the movement direction DR44 of the user's gaze position (angle of the approximated line).
  • If the processor 11 determines, based on the detected user's gaze position, that the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located, it displays the input screen SC46 on the display 14 and instructs the user to gaze at one of the number input keys. On the input screen SC46, the processor 11 disables input operations for the center point "A" and enables input operations for each of the four input keys "1" to "4" arranged around the center point "A".
  • Processor 11 detects the user's gaze position based on the captured image, determines whether the user is gazing at the area AR1 of the input key "1", the area AR2 of the input key "2", the area AR3 of the input key "3", or the area AR4 of the input key "4" based on the detected gaze position or the movement direction DR45 of the gaze position (the angle of the approximate line), and accepts the input operation of the number input key "3" on the input screen SC46.
  • After accepting an input operation of one of the numeric input keys (here, input key "3"), the processor 11 displays the input screen SC47 on the display 14, instructs the user to gaze at the center point "A" again, and enables only the input operation for the center point "A" out of the five input keys.
  • Processor 11 detects the user's gaze position based on the captured image. Processor 11 determines whether the user is gazing within area ARA of center point "A" or in the direction in which center point "A" is located based on the detected user's gaze position or the movement direction DR46 of the user's gaze position (angle of the approximated line).
  • If the processor 11 determines, based on the detected user's gaze position, that the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located, it displays the input screen SC48 on the display 14 and instructs the user to gaze at one of the number input keys. On the input screen SC48, the processor 11 disables input operations for the center point "A" and enables input operations for each of the four input keys "1" to "4" arranged around the center point "A".
  • Processor 11 detects the user's gaze position based on the captured image, determines whether the user is gazing at the area AR1 of the input key "1", the area AR2 of the input key "2", the area AR3 of the input key "3", or the area AR4 of the input key "4" based on the detected gaze position or the movement direction DR46 of the gaze position (the angle of the approximate line), and accepts the input operation of the number input key "4" on the input screen SC48.
  • After accepting the input of the four numbers "1", "2", "3", and "4", the processor 11 compares the input numbers (input information) with the registration information previously registered in the database DB1 and performs user authentication.
  • In this way, the gaze input device P1 can more effectively suppress erroneous input of input information by alternately accepting an input operation of input information (here, a number) and an input operation of the center point "A" in the first gaze input operation procedure. Furthermore, by alternating input operations for the center point "A" and input operations for the input keys "1" to "4" (i.e., input information), the gaze input device P1 can more accurately accept consecutive inputs of the same input information (for example, when the number "1" is input two or more times in succession).
  • In addition, the gaze input device P1 can prevent the center point "A" and any of the input keys "1" to "4" from being positioned on the same straight line. This makes it possible to accept input operations based on the time-series changes in the direction of movement of the user's gaze position (the angle of the approximate line), and thus to more effectively suppress erroneous inputs.
  • The input screens SC42, SC44, SC46, and SC48 shown in FIGS. 9 and 10 may display the four input keys "1" to "4", for which input operations are enabled, with a solid line or with emphasis (for example, a thick line or a red frame), and only the center point "A", for which input operations are disabled, with a dashed line or with a suppressed display (for example, a thin line or a gray frame).
  • When the processor 11 accepts an input operation for input information, it may enlarge the displayed input key corresponding to the input information, or change the luminance of the input screens SC41 to SC48 to make them flash. This allows the gaze input device P1 to notify the user that acceptance of the input operation has been completed. A sketch of the alternation between the center point and the number keys follows below.
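  • As a minimal sketch of the first procedure (FIGS. 9 and 10): the screen alternates between a state where only the center point "A" is enabled and a state where only the number keys are enabled. Here `gaze_hit` is a hypothetical callback that returns the enabled key the user's gaze selects (or None); the patent does not define such an interface.

```python
def run_first_procedure(gaze_hit, digits_required: int = 4) -> list:
    """Alternate center-point and number-key input until enough digits arrive."""
    entered = []
    state = "CENTER"  # start by requesting a gaze at the center point "A"
    while len(entered) < digits_required:
        if state == "CENTER":
            if gaze_hit(enabled={"A"}) == "A":       # only "A" is enabled
                state = "KEYS"
        else:
            key = gaze_hit(enabled={"1", "2", "3", "4"})
            if key is not None:
                entered.append(key)                   # accept one digit
                state = "CENTER"                      # then re-center
    return entered
```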
  • Fig. 11 is a diagram for explaining an example of the second gaze input operation procedure.
  • Fig. 12 is a diagram for explaining an example of the second gaze input operation procedure.
  • The second gaze input operation procedure accepts input operations of the input keys "1" to "4" consecutively, and corresponds to the processing of steps St19 to St26 shown in FIG. 3.
  • The input screens SC51, SC52, SC53, SC54, and SC55 shown in FIGS. 11 and 12 are merely examples and are not limiting.
  • The number of numeric input keys is not limited to four.
  • The arrangement of the numeric input keys is not limited to the arrangement shown in the input screens SC51 to SC55; they may be rotated by any angle (for example, 45°) and arranged.
  • The processor 11 displays the input screen SC51 on the display 14, instructs the user to gaze at the center point "A" (by outputting a voice, a message, etc.), and enables (makes acceptable) only the input operation for the center point "A" out of the five input keys.
  • Processor 11 detects the user's gaze position based on the captured image. Processor 11 determines whether the user is gazing within area ARA of center point "A" or in the direction in which center point "A" is located based on the detected user's gaze position. Processor 11 may generate correction parameters for calibrating the user's gaze position at this timing.
  • If the processor 11 determines, based on the detected user's gaze position, that the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located, it displays the input screen SC52 on the display 14 and instructs the user to gaze at one of the number input keys. On the input screen SC52, the processor 11 disables input operations for the center point "A" and enables input operations for each of the four input keys "1" to "4" arranged around the center point "A".
  • Processor 11 detects the user's gaze position based on the captured image. Processor 11 determines which of area AR1 of input key "1" to area AR4 of input key "4" the user is gazing at based on the detected user's gaze position or the movement direction DR51 of the user's gaze position (angle of the approximated line). Processor 11 accepts the input operation of the number input key "1" on input screen SC52.
  • After accepting an input operation of one of the number input keys (here, input key "1"), the processor 11 displays the input screen SC53 on the display 14 and instructs the user to look at the next number input key. It goes without saying that this instruction is not essential and may be omitted.
  • Processor 11 detects the user's gaze position based on the captured image, determines which of the areas AR1 of input key "1" to AR4 of input key "4" the user is gazing at based on the detected gaze position or the movement direction DR52 of the gaze position (the angle of the approximate line), and accepts the input operation of the number input key "2" on the input screen SC53.
  • After accepting an input operation of one of the numeric input keys (here, input key "2"), the processor 11 displays the input screen SC54 on the display 14 and instructs the user to gaze at the next numeric input key.
  • Processor 11 detects the user's gaze position based on the captured image, determines which of the areas AR1 of input key "1" to AR4 of input key "4" the user is gazing at based on the detected gaze position or the movement direction DR53 of the gaze position (the angle of the approximate line), and accepts the input operation of the number input key "3" on the input screen SC54.
  • After accepting an input operation of one of the numeric input keys (here, input key "3"), the processor 11 displays the input screen SC55 on the display 14 and instructs the user to gaze at the next numeric input key.
  • Processor 11 detects the user's gaze position based on the captured image, determines which of the areas AR1 of input key "1" to AR4 of input key "4" the user is gazing at based on the detected gaze position or the movement direction DR54 of the gaze position (the angle of the approximate line), and accepts the input operation of the number input key "4" on the input screen SC55.
  • After accepting the input of the four numbers "1", "2", "3", and "4", the processor 11 compares the input numbers (input information) with the registration information previously registered in the database DB1 and performs user authentication.
  • In this way, the gaze input device P1 can more effectively suppress erroneous input of input information by continuously accepting input operations for a predetermined number of digits of input information (here, numbers) after accepting an input operation of the center point "A" in the second gaze input operation procedure. Furthermore, because the gaze input device P1 does not alternate between input operations for the center point "A" and input operations for the input keys "1" to "4" (i.e., input information), it can shorten the time spent on input operations for the center point "A" between each piece of input information, and more efficiently shorten the gaze input operation time per user.
  • The input screen SC51 shown in FIGS. 11 and 12 may display only the center point "A", for which input operations are enabled, with a solid line or with highlighting (for example, a thick line or a red frame), and each of the four input keys "1" to "4", for which input operations are disabled, with a dashed line or with a suppressed display (for example, a thin line or a gray frame).
  • Conversely, the four input keys "1" to "4" for which input operations are enabled may be displayed with a solid line or with highlighting (for example, a thick line or a red frame), and only the center point "A", for which input operations are disabled, with a dashed line or with a suppressed display (for example, a thin line or a gray frame).
  • When the processor 11 accepts an input operation for input information, it may enlarge the displayed input key corresponding to the input information, or change the luminance of the input screens SC51 to SC55 to make them flash. This allows the gaze input device P1 to notify the user that acceptance of the input operation has been completed.
  • Fig. 13 is a diagram showing another example of an input screen.
  • Fig. 14 is a diagram showing another example of an input screen. It goes without saying that the input screens SC2 and SC3 shown in FIGS. 13 and 14 are merely examples and are not limiting.
  • The input screen SC2 includes a center point "A" and eight input keys corresponding to the numeric input keys "1" to "8".
  • In gaze input using the input screen SC2, the processor 11 sets the threshold θ2 for the angular blur of the approximate line to 45° or less. In this case, the processor 11 accepts the input operation of the input key "1" when the angular blur is within the threshold θ2 and the angle of the approximate line is -22.5° or more and less than +22.5°, with the position of the input key "1" as the reference (0°).
  • The input screen SC3 includes a center point "A" and ten input keys corresponding to the numeric input keys "0" to "9".
  • In gaze input using the input screen SC3, the processor 11 sets the threshold θ3 for the angular blur of the approximate line to 36° or less. In this case, the processor 11 accepts the input operation of the input key "0" when the angular blur is within the threshold θ3 and the angle of the approximate line is -18° or more and less than +18°, with the position of the input key "0" as the reference (0°). The relationship between the number of keys and these thresholds is sketched below.
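  • These thresholds follow directly from spacing N keys evenly on the circle: each key owns a 360°/N sector, i.e. ±180°/N around its own direction. A small sketch of the arithmetic, for illustration only:

```python
# With N evenly spaced keys, each key's acceptance sector spans 360/N degrees,
# i.e. +/-(180/N) degrees around the key's direction: +/-45 deg for 4 keys
# (threshold 90 deg), +/-22.5 deg for 8 keys (threshold 45 deg), and
# +/-18 deg for 10 keys (threshold 36 deg), matching theta1 to theta3.

def sector_half_width_deg(num_keys: int) -> float:
    return 180.0 / num_keys

for n in (4, 8, 10):
    half = sector_half_width_deg(n)
    print(f"{n} keys: +/-{half} deg sector, threshold {2 * half} deg")
```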
  • As described above, the gaze input device P1 (an example of an input device) according to the first embodiment is capable of accepting an input operation based on a user's gaze position, and includes: a display 14 (an example of a display unit) that displays the calibration screen SC0 (an example of an input screen) and the input screen SC1, each capable of accepting an input operation; a camera 13 that captures an image of the user; and a processor 11 (an example of a calculation unit) that calculates a correction parameter for calibrating the gaze position of a first user with respect to the input screen SC1 based on the gaze position of the first user shown in a captured first image, detects the gaze position of a second user shown in a second captured image taken after the correction parameter is calculated, calibrates the gaze position of the second user using the correction parameter, and accepts an input operation with respect to the input screen SC1 based on the calibrated gaze position of the second user.
  • As a result, the gaze input device P1 can calibrate the gaze position more efficiently and accept input operations based on the user's gaze more efficiently, even when correction parameters for each user are not recorded and stored in advance and the device is used by an unspecified number of users.
  • The input screen of the gaze input device P1 includes an area ARA (an example of a first input section) of the center point "A" that accepts a first input operation, and areas AR1 to AR4 (an example of a second input section) corresponding to the multiple input keys "1" to "4" that accept a second input operation different from the first input operation.
  • This allows the eye-gaze input device P1 according to the first embodiment to accept an input operation for calibration and an input operation for input information on a single input screen SC1.
  • The first captured image in the gaze input device P1 according to embodiment 1 is an image of a user looking at the area ARA of the center point "A", and the processor 11 calculates the correction parameters based on the gaze position of the first user and the position of the area ARA of the center point "A".
  • This allows the gaze input device P1 according to embodiment 1 to accept input operations based on the user's gaze more efficiently, even when correction parameters for each user are not recorded and stored in advance and the device is used by an unspecified number of users.
  • The second captured image in the gaze input device P1 according to the first embodiment is an image of a user looking at the areas AR1 to AR4 corresponding to any one of the input keys "1" to "4".
  • The processor 11 accepts an input operation based on the gaze position of the second user and the positions of the areas AR1 to AR4 corresponding to the multiple input keys "1" to "4".
  • This allows the gaze input device P1 according to the first embodiment to more effectively suppress erroneous input of input information by continuously accepting input operations for a predetermined number of digits of input information (here, numbers) after accepting an input operation of the center point "A" in the second gaze input operation procedure.
  • Furthermore, because the gaze input device P1 does not alternate between input operations for the center point "A" and input operations for the input keys "1" to "4" (i.e., input information), it can shorten the time spent on input operations for the center point "A" between each piece of input information, and more efficiently shorten the gaze input operation time per user.
  • Alternatively, the second captured image in the gaze input device P1 according to the first embodiment is an image of a user looking at the area ARA of the center point "A" or at the areas AR1 to AR4 corresponding to any one of the input keys "1" to "4".
  • In this case, the processor 11 alternately accepts the first input operation and the second input operation.
  • This allows the gaze input device P1 according to the first embodiment to more effectively suppress erroneous input of input information by alternately accepting input operations of input information (here, numbers) and input operations of the center point "A" in the first gaze input operation procedure.
  • Moreover, because the gaze input device P1 accepts an input of the center point "A" between inputs of the same input information, it can more accurately accept consecutive input operations of the same input information.
  • The processor 11 in the gaze input device P1 according to embodiment 1 enables the area ARA of the center point "A" on the input screen SC1 and disables the areas AR1 to AR4 corresponding to the multiple input keys "1" to "4" when accepting the first input operation, and enables the areas AR1 to AR4 corresponding to the multiple input keys "1" to "4" on the input screen SC1 and disables the area ARA of the center point "A" when accepting the second input operation.
  • This allows the gaze input device P1 according to embodiment 1 to more effectively suppress erroneous inputs by enabling (accepting) the input operation of only either the center point "A" or the input keys "1" to "4" at a time.
  • When the processor 11 in the gaze input device P1 according to embodiment 1 accepts a first input operation or a second input operation, it highlights the area ARA of the center point "A" or the areas AR1 to AR4 of the input keys "1" to "4" corresponding to the accepted input operation. This allows the gaze input device P1 according to embodiment 1 to notify the user that acceptance of the input operation has been completed.
  • The area ARA of the center point "A" in the gaze input device P1 according to embodiment 1 is located approximately in the center of the input screen SC1.
  • Areas AR1 to AR4 corresponding to the multiple input keys "1" to "4" are each located at approximately the same distance from the area ARA of the center point "A". This allows the eye-gaze input device P1 according to embodiment 1 to more clearly distinguish between the first input operation and the second input operation, and to more accurately accept the second input operation.
  • the areas AR1 to AR4 corresponding to the multiple input keys "1" to "4" in the eye-gaze input device P1 according to embodiment 1 are arranged at approximately equal intervals on a circumference centered on the area ARA at the center point "A". This allows the eye-gaze input device P1 according to embodiment 1 to more accurately receive the second input operation.
  • the input screen SC1 of the eye-gaze input device P1 has an insensitive area ARN around the area ARA of the center point "A" that disables input operations.
  • when the processor 11 determines that the gaze position of the second user is within the insensitive area ARN, it omits acceptance of input operations based on that gaze position.
  • the eye-gaze input device P1 estimates the input content using only gaze positions detected outside the insensitive area ARN, i.e., at least a predetermined distance (radius R1) away from the center point "A". Because this excludes the gaze positions most prone to erroneous determination of the input content, erroneous determination can be suppressed more effectively.
  • the processor 11 in the gaze input device P1 detects the gaze position of the second user from a plurality of second captured images captured by the camera 13, accumulates them in chronological order, calculates the movement direction of the user's gaze position based on the accumulated gaze positions of the second user, and accepts an input operation based on the movement direction of the user's gaze position.
  • the gaze input device P1 can accept an input operation based on the movement direction of the user's gaze position even when the user's gaze position is not detected within the areas ARA, AR1 to AR4 of the input keys, or when the gaze position detected in those areas does not satisfy a predetermined condition for accepting an input operation (for example, that the gaze position is detected within the areas ARA, AR1 to AR4 continuously for a predetermined time or longer).
  • the processor 11 in the eye-gaze input device P1 according to the first embodiment calculates and accumulates the moving direction of the user's eye gaze position based on the accumulated gaze position of the second user for a predetermined time, calculates the amount of blur in the moving direction of the user's eye gaze position based on the accumulated multiple moving directions, and accepts an input operation based on the moving direction when it is determined that the calculated amount of blur is equal to or less than a threshold.
  • the eye-gaze input device P1 according to the first embodiment can estimate the input content that the user is about to input based on the moving direction of the user's eye gaze position and accept it as an input operation before the user gazes at the input key, so that the time required for input information per user can be more efficiently shortened.
  • the eye-gaze input device P1 can more accurately estimate the moving direction of the user's eye gaze position based on the amount of angular blur of the user's eye gaze position, so that erroneous input of input information can be more effectively suppressed even when the calibration accuracy is low.
  • the input method performed by the gaze input device P1 capable of accepting an input operation based on a user's gaze position includes displaying a calibration screen SC0 (an example of an input screen) and an input screen SC1 capable of accepting an input operation, acquiring a first captured image of the user, calculating a correction parameter for calibrating the gaze position of the first user relative to the input screen based on the gaze position of the first user shown in the first captured image, acquiring a second captured image of the user, calibrating the gaze position of the second user shown in the second captured image using the correction parameter, and accepting an input operation on the input screen based on the calibrated gaze position of the second user.
  • the eye-gaze input device P1 can more efficiently calibrate the eye-gaze position and more efficiently accept input operations based on the user's eye gaze, even when correction parameters for each user are not recorded and stored in advance and the device is used by an unspecified number of users.
  • the present disclosure is useful as an input device and input method that makes calibration of eye gaze input more efficient.
  • 11 Processor
  • 12 Memory
  • 13 Camera
  • 14 Display
  • DB1 Database
  • P1 Eye-gaze input device
  • SC0 Calibration screen
  • SC1, SC2, SC3, SC41 to SC48, SC51 to SC55 Input screens

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Position Input By Displaying (AREA)

Abstract

This input device comprises: a display unit that displays an input screen image capable of accepting an input operation based on the gaze position of a user; a camera that captures an image of the user; a calculation unit that calculates, on the basis of the gaze position of a first user appearing in a captured first image, a correction parameter for calibrating the gaze position of the first user with respect to the input screen image; and a processor that calibrates the gaze position of a second user using the correction parameter and, on the basis of the calibrated gaze position of the second user, accepts an input operation on the input screen image.

Description

Input device and input method
This disclosure relates to an input device and an input method.
Patent Document 1 discloses an information processing device that acquires a history of information indicating the correspondence between the position of a user's gaze point and the position of an index indicating an operation position operated by the user, detects the user's gaze point, and controls the display position of the index based on the acquired history of information indicating the correspondence so that the index is displayed at a position corresponding to the detected current position of the gaze point.
International Publication No. 2016/147499
However, in Patent Document 1, the gaze point calibration process requires the accumulation of information indicating the correspondence between the gaze point position and the operation position. Therefore, in an environment where information indicating the correspondence cannot be accumulated for each user, such as an Automatic Teller Machine (ATM) used outside the home, the information processing device may not be able to perform sufficient calibration, which may result in an error in the operation position, making it difficult to perform input operations based on the gaze position.
The present disclosure has been devised in consideration of the above-mentioned conventional situation, and aims to provide an input device and an input method that make calibration of the gaze position in gaze input more efficient.
The present disclosure provides an input device capable of accepting an input operation based on a user's gaze position, the input device comprising: a display unit that displays an input screen capable of accepting the input operation; a camera that images the user; a calculation unit that calculates a correction parameter for calibrating the gaze position of a first user with respect to the input screen based on the gaze position of the first user shown in a first captured image; and a processor that detects the gaze position of a second user shown in a second captured image taken after calculation of the correction parameter, calibrates the gaze position of the second user using the correction parameter, and accepts the input operation with respect to the input screen based on the calibrated gaze position of the second user.
The present disclosure also provides an input method performed by an input device capable of accepting an input operation based on a user's gaze position, the input method including: displaying an input screen capable of accepting the input operation; acquiring a first captured image of the user; calculating a correction parameter for calibrating the gaze position of a first user with respect to the input screen based on the gaze position of the first user shown in the first captured image; acquiring a second captured image of the user; calibrating the gaze position of a second user shown in the second captured image using the correction parameter; and accepting the input operation with respect to the input screen based on the calibrated gaze position of the second user.
The present disclosure also provides an input device capable of accepting an input operation based on a user's gaze position, the input device including a processor that detects the gaze position of a first user shown in a first captured image captured by a camera and the gaze position of a second user shown in a second captured image captured by the camera, and accepts the input operation on an input screen capable of accepting the input operation based on the gaze position of the first user and the gaze position of the second user.
This disclosure makes it possible to calibrate the gaze position in gaze input more efficiently.
FIG. 1 is a block diagram showing an example of the internal configuration of the gaze input device according to embodiment 1.
FIG. 2 is a diagram explaining an example of the operation procedure of the gaze input device according to embodiment 1.
FIG. 3 is a diagram explaining an example of the operation procedure of the gaze input device according to embodiment 1.
FIG. 4 is a diagram explaining an example of a method for calibrating the gaze position.
FIG. 5 is a diagram explaining a method for calculating the movement direction of the gaze position.
FIG. 6 is a diagram explaining a method of accepting a gaze input operation based on the movement direction of the gaze position.
FIG. 7 is a diagram showing an example of angles at which a gaze input operation based on the movement direction of the gaze position can be accepted.
FIG. 8 is a diagram explaining an example of the insensitive area.
FIG. 9 is a diagram explaining a first example of a gaze input operation procedure.
FIG. 10 is a diagram explaining a first example of a gaze input operation procedure.
FIG. 11 is a diagram explaining a second example of a gaze input operation procedure.
FIG. 12 is a diagram explaining a second example of a gaze input operation procedure.
FIG. 13 is a diagram showing another example of an input screen.
FIG. 14 is a diagram showing another example of an input screen.
Below, with reference to the drawings as appropriate, embodiments that specifically disclose an input device and an input method according to the present disclosure will be described in detail. However, more detailed explanation than necessary may be omitted. For example, detailed explanations of already well-known matters and duplicate explanations of substantially identical configurations may be omitted. This is to avoid the following explanation becoming unnecessarily redundant and to facilitate understanding by those skilled in the art. Note that the attached drawings and the following explanation are provided to enable those skilled in the art to fully understand the present disclosure, and are not intended to limit the subject matter described in the claims.
(Embodiment 1)
First, a gaze input device P1 according to embodiment 1 will be described with reference to FIG. 1. FIG. 1 is a block diagram showing an example of the internal configuration of the gaze input device P1 according to embodiment 1. It should be noted that the gaze input device P1 shown in FIG. 1 is merely an example, and needless to say, the present invention is not limited to this.
The gaze input device P1, as an example of an input device, includes a camera 13 capable of capturing an image of the face of a user looking at a display 14, and is realized by, for example, a personal computer (hereinafter "PC"), a notebook PC, a tablet terminal, a smartphone, or the like. The gaze input device P1 can accept gaze input operations based on the user's gaze position. The gaze input device P1 is a system capable of accepting input operations based on the user's gaze, and includes a processor 11, a memory 12, a camera 13, a display 14, and a database DB1. Note that the database DB1 may be configured separately from the gaze input device P1. The camera 13 and the display 14 may also be configured separately from the gaze input device P1.
The processor 11, which is an example of a calculation unit, is configured using, for example, a Central Processing Unit (CPU) or a Field Programmable Gate Array (FPGA), and performs various processes and controls in cooperation with the memory 12. Specifically, the processor 11 refers to the programs and data held in the memory 12 and executes the programs to realize the functions of each unit.
First, the processor 11 outputs a calibration screen SC0 (see FIG. 4, an example of an input screen) to the display 14 and displays it. The processor 11 causes the camera 13 to capture an image of the user looking at the calibration screen SC0, and calculates a correction parameter (for example, a transformation matrix) for correcting (transforming) the positional deviation of the user's gaze position detected using the captured image of the user output from the camera 13.
After calculating the correction parameter, the processor 11 displays the input screen SC1 (see FIGS. 6, 7, and 8) and starts accepting gaze input operations by the user on the input screen SC1. Using the correction parameter, the processor 11 corrects and accumulates the user's gaze position detected based on the captured images output from the camera 13, and accepts input operations of input information (for example, a PIN code, symbols, pictograms, a one-stroke drawing order, a password, etc.) based on the time-series change of the gaze position.
In addition, the processor 11 performs image analysis on the captured images output from the camera 13 and generates measurement environment information. Based on the generated measurement environment information, the processor 11 determines a threshold for the variability of the detected gaze positions.
The measurement environment information here includes, for example, any one of the size of the display 14, the brightness of the user's face area shown in the face image, the distance between the display 14 and the user (imaging distance), the angle of the user's face, and the like.
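The disclosure fixes only that this threshold is a predetermined value derived from the measurement environment information, not how it is derived. The following Python sketch is therefore purely illustrative; the key names, baseline value, and scaling factors are assumptions and not part of the disclosure.

```python
def blur_threshold(env):
    """Illustrative mapping from measurement environment information to a
    positional-blur threshold. `env` is a dict with hypothetical keys such
    as "distance_mm" and "face_brightness"; values are assumptions."""
    base = 20.0  # pixels; assumed baseline for the standard-deviation check
    if env.get("distance_mm", 600) > 800:
        base *= 1.5   # a farther user tends to yield noisier gaze estimates
    if env.get("face_brightness", 128) < 64:
        base *= 1.3   # a dim face area degrades detection accuracy
    return base
```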
The processor 11 compares the input information based on the accepted input operation results with the user's registered information (for example, a PIN code, a password, etc.) registered in the database DB1. Note that the user's registered information may be registered in the memory 12 instead of the database DB1.
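As a minimal sketch of this verification step (the schema of the database DB1 is not specified in the disclosure, so `registered_pin` is a hypothetical placeholder), a constant-time comparison of the two ASCII strings can be used:

```python
import hmac

def verify_input(entered_pin: str, registered_pin: str) -> bool:
    """Compare the entered input information with the user's registered
    information; hmac.compare_digest avoids leaking timing information."""
    return hmac.compare_digest(entered_pin, registered_pin)
```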
The memory 12 has, for example, a Random Access Memory (hereinafter "RAM") as a work memory used when each process of the processor 11 is executed, and a Read Only Memory (hereinafter "ROM") that stores programs and data defining the operation of the processor 11. Data or information generated or acquired by the processor 11 is temporarily stored in the RAM. Programs defining the operation of the processor 11 are written in the ROM. The memory 12 stores the size of the display 14 and the like. The memory 12 may also store the installation position of the camera 13 relative to the display 14, its distance, and the like.
The camera 13 includes at least an image sensor (not shown) and a lens (not shown). The image sensor is, for example, a solid-state imaging element such as a Charged-Coupled Device (CCD) or a Complementary Metal Oxide Semiconductor (CMOS), and converts the optical image formed on the imaging surface into an electrical signal. The camera 13 outputs the captured image of the user to the processor 11.
The display 14, which is an example of a display unit, is configured using, for example, a Liquid Crystal Display (LCD) or an organic Electroluminescence (EL) display. The display 14 displays the calibration screen SC0 (see FIG. 4), the input screens SC1 (see FIGS. 6, 7, and 8), and the like output from the processor 11.
The database DB1 is a so-called storage and is configured using a storage medium such as a flash memory, a Hard Disk Drive (HDD), or a Solid State Drive (SSD). The database DB1 stores the registered information of users (for example, PIN codes, passwords, etc.) so that it can be managed for each user.
Next, an example of the operation procedure of the gaze input device P1 according to embodiment 1 will be described with reference to FIGS. 2 to 4. FIGS. 2 and 3 are diagrams explaining an example of the operation procedure of the gaze input device P1 according to embodiment 1.
First, the calibration process (steps St11 to St18) executed by the gaze input device P1 will be described with reference to FIG. 4. FIG. 4 is a diagram explaining an example of a method for calibrating the gaze position. The calibration screen SC0 shown in FIG. 4 is an example that resembles the input screen SC1 (see FIG. 6) for accepting input operations of input information, but it is not limited to this.
The processor 11 outputs the calibration screen SC0 (see FIG. 4) including the center point "A" to the display 14 and displays it (St11).
The calibration screen SC0 includes the center point "A". The center point "A" is located approximately in the center of the calibration screen SC0.
The processor 11 outputs the calibration screen SC0 to the display 14 for display. The processor 11 requests the user to look at (gaze at) the center point "A" included in the calibration screen SC0. This request may be made by displaying a message requesting gazing at the center point "A" on the display 14, or by outputting such a message as audio from a speaker (not shown).
The camera 13 captures an image of the user gazing at the calibration screen SC0 (St12). The camera 13 outputs the captured image to the processor 11.
The processor 11 detects the face of the user (person) from the captured image output from the camera 13, and detects the user's gaze position on the calibration screen SC0 using a gaze detection algorithm (St13). The processor 11 accumulates information on the detected gaze positions (coordinates) in the memory 12 (St13).
The processor 11 calculates the amount of positional blur of the gaze positions Pt0 accumulated for a predetermined time (for example, 0.3 seconds, 0.5 seconds, etc.) (St14). The amount of positional blur here is a standard deviation value indicating the variability of the detected gaze positions.
The processor 11 determines whether the calculated amount of positional blur of the gaze positions Pt0 for the predetermined time is less than a threshold (St15). The threshold is a predetermined fixed value determined based on the measurement environment information.
If the processor 11 determines in step St15 that the calculated amount of positional blur of the gaze positions Pt0 for the predetermined time is less than the threshold (St15, YES), it calculates the center position PtC0 of the gaze positions Pt0 for the predetermined time (St16).
On the other hand, if the processor 11 determines in step St15 that the calculated amount of positional blur of the gaze positions Pt0 for the predetermined time is not less than the threshold (St15, NO), it returns to step St12.
The processor 11 calculates a correction parameter (transformation matrix DR0) for coordinate transformation of the center position PtC0 to the center position Ct (see FIG. 8) of the center point "A" (that is, of the area ARA), based on the calculated center position PtC0 and the center position Ct (St17). The correction parameter calculated here is a parameter for transforming (correcting) the user's gaze position (that is, the center position PtC0 of the gaze positions Pt0) into the user's gaze input position.
By using the transformation matrix DR0, the processor 11 can correct the positional deviation of the user's gaze position, for example by coordinate-transforming the gaze positions Pt0 for a predetermined time into an input position Pt0' and the gaze positions Pt1 for a predetermined time into an input position Pt1'. This allows the processor 11 to, for example, accept the user's input operation at the input position Pt0' after transforming the gaze positions Pt0 into the input position Pt0', or to determine, when the user's gaze position moves from the gaze position Pt0 to the gaze position Pt1, that it has moved from the input position Pt0' toward the input position Pt1'.
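Steps St13 to St17 can be pictured with the following Python sketch. It is a simplification under stated assumptions: gaze positions are 2D screen coordinates, the positional blur amount is taken as the per-axis standard deviation, and the transformation matrix DR0 is reduced to a pure translation, which is the most that a single calibration point can determine; names such as `gaze_samples` and `ct` are illustrative.

```python
import numpy as np

def compute_correction(gaze_samples, ct, blur_threshold):
    """Single-point calibration sketch (St14 to St17).

    gaze_samples   -- (N, 2) gaze positions Pt0 accumulated over the
                      predetermined time while the user gazes at "A"
    ct             -- (2,) screen coordinates of the center position Ct
    blur_threshold -- maximum allowed positional blur (standard deviation)
    Returns a 2-vector translation (the correction parameter), or None
    if the samples are too scattered and capture must retry (St15, NO).
    """
    pts = np.asarray(gaze_samples, dtype=float)
    blur = pts.std(axis=0).max()          # positional blur amount (St14)
    if blur >= blur_threshold:
        return None                       # back to image capture (St12)
    ptc0 = pts.mean(axis=0)               # center position PtC0 (St16)
    return np.asarray(ct, dtype=float) - ptc0   # maps PtC0 onto Ct (St17)

def apply_correction(gaze_pos, correction):
    """Transform a raw gaze position into a corrected input position,
    e.g. Pt0 -> Pt0' (the role played by transformation matrix DR0)."""
    return np.asarray(gaze_pos, dtype=float) + correction
```

With more calibration points, the same routine could return a full affine transform instead of a translation; the disclosure leaves the exact form of DR0 open.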
After calculating the correction parameter, the processor 11 ends the calibration process and outputs and displays on the display 14 the input screen SC1 for accepting input of input information (St18). The input screen SC1 includes the center point "A" and the input keys "1", "2", "3", and "4" corresponding to four numbers. The center point "A" is located approximately in the center of the input screen SC1. The input keys "1" to "4" are arranged at approximately equal intervals on a circle centered on the center point "A".
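For reference, placing the input keys at approximately equal intervals on a circle around the center point "A" reduces to simple trigonometry. A sketch follows; the radius, starting angle, and the hypothetical function name `key_positions` are design choices not fixed by the disclosure.

```python
import math

def key_positions(center, radius, n_keys=4, start_deg=90.0):
    """Place n_keys at equal angular intervals on a circle around `center`.
    start_deg=90 puts input key "1" directly above the center point "A"
    (an assumption; screen coordinates are taken with the y axis down)."""
    cx, cy = center
    step = 360.0 / n_keys
    return [(cx + radius * math.cos(math.radians(start_deg + i * step)),
             cy - radius * math.sin(math.radians(start_deg + i * step)))
            for i in range(n_keys)]
```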
Next, the gaze input operation acceptance process (steps St19 to St27) executed by the gaze input device P1 will be described.
The camera 13 captures an image of the user gazing at the input screen SC1 (St19). The camera 13 outputs the captured image to the processor 11.
The processor 11 detects the face of the user (person) from the captured image output from the camera 13, and detects the user's gaze position on the input screen SC1 using a gaze detection algorithm (St20). The processor 11 accumulates information on the gaze positions (coordinates) detected for a predetermined time (for example, 0.3 seconds, 0.5 seconds, etc.) in the memory 12 in chronological order (St20). The processor 11 may accumulate the information on a detected gaze position in association with the capture time information of the captured image from which that gaze position was detected.
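A minimal sketch of this chronological accumulation, assuming gaze positions arrive as (x, y) screen coordinates and that the capture time is approximated by the processing time when no explicit timestamp is supplied:

```python
import time
from collections import deque

class GazeBuffer:
    """Keep the most recent gaze positions together with the capture time
    of the image they were detected from (St20). window_s is the
    predetermined accumulation time, e.g. 0.3 or 0.5 seconds."""

    def __init__(self, window_s=0.5):
        self.window_s = window_s
        self.samples = deque()  # entries are (timestamp, (x, y))

    def add(self, xy, t=None):
        t = time.monotonic() if t is None else t
        self.samples.append((t, xy))
        # Drop samples older than the accumulation window.
        while self.samples and t - self.samples[0][0] > self.window_s:
            self.samples.popleft()

    def positions(self):
        return [xy for _, xy in self.samples]
```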
The processor 11 calculates an approximate line based on the accumulated gaze positions for the predetermined time and the center position Ct (see FIG. 8) of the center point "A", which is the starting point of the gaze position movement (St22). The processor 11 calculates the angle of the calculated approximate line and accumulates it in the memory 12 in chronological order (St23).
The processor 11 calculates an angular blur amount indicating the amount of blur in the user's gaze position (for example, the angular blur amount θA shown in FIG. 5 or the angular blur amount θB shown in FIG. 6) based on the angles of the multiple approximate lines accumulated through at least two executions of the approximate line calculation process Rp1 (St24).
The processor 11 determines whether the angular blur amount is less than a threshold corresponding to a given input key (St25). The threshold here is, for example, the threshold θ1 shown in FIG. 7 or the thresholds θ2 and θ3 shown in FIG. 13, and is a threshold for determining which of the input keys displayed on the input screens SC1, SC2, and SC3 (the center point "A", the number "1", and so on) has been input, based on the movement direction of the user's gaze position (that is, the angle of the approximate line).
If the processor 11 determines in step St25 that the angular blur amount is less than the threshold (St25, YES), it determines the input content (input key) based on the input key (for example, the center point "A", the number "1", etc.) arranged in the movement direction of the user's gaze position indicated by the approximate line, and accepts an input operation based on the user's gaze (St26).
On the other hand, if the processor 11 determines in step St25 that the angular blur amount is not less than the threshold (St25, NO), it returns to step St19.
The processor 11 determines whether input operations for a predetermined number of digits (for example, 3 digits, 4 digits, etc.) have been completed, based on the number of input contents (input keys) accepted as input operations (St27).
If the processor 11 determines in step St27 that the input operations for the predetermined number of digits have been completed (St27, YES), it ends the operation procedure shown in FIG. 3. The processor 11 acquires the input information based on the input contents (input keys) for the predetermined number of digits, and proceeds to the process of comparing it with the input information registered in advance in the database DB1.
On the other hand, if the processor 11 determines in step St27 that the input operations for the predetermined number of digits have not been completed (St27, NO), it returns to step St19.
As described above, the gaze input device P1 according to embodiment 1 can accept an input operation based on the movement direction (that is, the time-series change) of the user's gaze position even when the user's gaze position is not detected in the areas ARA, AR1, AR2, AR3, and AR4 of the input keys, or when the user's gaze position detected in the areas ARA (an example of a first input section) and AR1 to AR4 (an example of a second input section) of the input keys does not satisfy a predetermined condition for accepting an input operation (for example, that the user's gaze position is continuously detected within the areas ARA, AR1 to AR4 for a predetermined time or longer).
In other words, because the gaze input device P1 can accept input key operations without the user having to gaze at the areas ARA, AR1 to AR4 of the input keys, the time required for each user's input operations can be shortened more effectively.
Furthermore, even when the calibration accuracy is low, the gaze input device P1 can accept input operations based on the user's gaze with higher accuracy. Therefore, even when correction parameters for each user are not recorded and accumulated in advance and the device is used by an unspecified number of users, the gaze input device P1 can calculate a correction parameter (even one of low calibration accuracy) from the user gazing at just one point (for example, the center point "A"), so the time required to calculate the correction parameter per user can be shortened more effectively.
Next, the process of calculating the movement direction of the user's gaze position (steps St19 to St24) will be described in detail with reference to FIG. 5. FIG. 5 is a diagram explaining the method of calculating the movement direction of the gaze position.
First, the approximate line calculation process Rp1 will be described with reference to FIG. 5. In FIG. 5, for ease of understanding, an example is described in which three approximate lines DR21, DR22, and DR23, each indicating a time-series change in the user's gaze position, are calculated based on the three accumulated gaze positions Pt21', Pt22', and Pt23' and the center position Ct of the center point "A" (see FIG. 8). The explanations of steps St19 and St20 are the same as above and are therefore omitted.
In the example shown in FIG. 5, the processor 11 executes the approximate line calculation process Rp1 three times.
In the first approximate line calculation process Rp1, the processor 11 accumulates the gaze positions for the predetermined time (St21), and calculates the approximate line DR21 based on the accumulated gaze positions Pt21' for the predetermined time and the center position Ct of the center point "A", which is the starting point of the gaze position movement (St22). The processor 11 calculates the angle of the approximate line DR21 and accumulates it in the memory 12 in chronological order (St23).
In the second approximate line calculation process Rp1, the processor 11 returns to step St19, further accumulates the gaze positions for the predetermined time (St21), and calculates the approximate line DR22 based on the accumulated gaze positions Pt22' for the predetermined time and the center position Ct of the center point "A", which is the starting point of the gaze position movement (St22). The processor 11 calculates the angle of the approximate line DR22 and accumulates it in the memory 12 in chronological order (St23).
In the third approximate line calculation process Rp1, the processor 11 returns to step St19, further accumulates the gaze positions for the predetermined time (St21), and calculates the approximate line DR23 based on the accumulated gaze positions Pt23' for the predetermined time and the center position Ct of the center point "A", which is the starting point of the gaze position movement (St22). The processor 11 calculates the angle of the approximate line DR23 and accumulates it in the memory 12 in chronological order (St23).
The processor 11 calculates the angular blur amount θA, which indicates the amount of blur in the user's gaze position, based on the angles of the three approximate lines DR21 to DR23 accumulated through the approximate line calculation processes Rp1 (St24).
In this way, the gaze input device P1 can calculate the movement direction of the user's gaze position and the amount of blur in that movement direction.
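The direction and blur calculation (St22 to St24) can be sketched as follows. Two simplifications are assumed because the disclosure does not fix them: the approximate line's direction is taken from Ct toward the mean of the accumulated window rather than from a full least-squares fit, and the angular blur amount is taken as the spread of the accumulated angles around their circular mean.

```python
import math
import numpy as np

def line_angle(ct, window):
    """Angle in degrees of the approximate line through the center position
    Ct and a window of corrected gaze positions (St22, St23)."""
    mean = np.asarray(window, dtype=float).mean(axis=0)
    dx, dy = mean[0] - ct[0], mean[1] - ct[1]
    return math.degrees(math.atan2(-dy, dx)) % 360.0  # screen y points down

def angular_blur(angles_deg):
    """Angular blur amount such as thetaA or thetaB (St24): the maximum
    deviation of the accumulated line angles from their circular mean,
    doubled to express a total spread. Computed on the unit circle so
    that 359 deg and 1 deg count as only 2 deg apart."""
    rad = np.radians(angles_deg)
    mean_dir = math.atan2(np.sin(rad).mean(), np.cos(rad).mean())
    diffs = (rad - mean_dir + math.pi) % (2 * math.pi) - math.pi
    return math.degrees(2.0 * np.abs(diffs).max())
```

An input operation is then accepted only when this blur amount is below the threshold θ1 (St25), which keeps noisy, wandering gaze traces from triggering a key.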
Next, the process of accepting an input operation based on the angular blur amount (steps St25 to St26) will be described in detail with reference to FIGS. 6 and 7. FIG. 6 is a diagram explaining the method of accepting a gaze input operation based on the movement direction of the gaze position. FIG. 7 is a diagram showing an example of angles at which a gaze input operation based on the movement direction of the gaze position can be accepted.
First, the approximate line calculation process Rp1 will be described with reference to FIG. 6. In FIG. 6, for ease of understanding, an example is described in which approximate lines DR31, DR32, DR33, and DR34, each indicating a time-series change in the user's gaze position, are calculated based on four accumulated gaze positions.
In the example shown in FIG. 6, the processor 11 calculates the four approximate lines DR31 to DR34 corresponding to the respective gaze positions, based on each of the four accumulated gaze positions and the center position Ct of the center point "A" (see FIG. 8) (St22). The processor 11 calculates the angle of each of the four calculated approximate lines DR31 to DR34 and accumulates them in the memory 12 (St23).
The processor 11 calculates the angular blur amount θB, which indicates the amount of blur in the user's gaze position, based on the angles of the approximate lines DR31 to DR34 accumulated through the approximate line calculation processes Rp1 (St24).
The processor 11 determines whether the calculated angular blur amount θB is less than the threshold θ1 (St25). In the example shown in FIG. 7, the threshold θ1 may be any angle of 90° or less.
If the processor 11 determines that the calculated angular blur amount θB is less than the threshold θ1, it determines which of the angle regions θ11, θ12, θ13, and θ14 contains the angle of each approximate line, in the chronological order in which the angles of the approximate lines DR31 to DR34 were calculated. If the processor 11 determines from the determination results that the angles of the approximate lines fall within the same angle region a predetermined number of consecutive times (for example, 2 times, 5 times, etc.) in chronological order, it accepts an input operation whose input content is the input key corresponding to that angle region and arranged in the movement direction of the user's gaze position indicated by the approximate lines (St26).
Here, the angle region θ11 shown in FIG. 7 is the region of -45° or more and less than +45° with the position of the input key "1" as the reference (0 (zero)°). The angle region θ12 is the region of +45° or more and less than +135° with the position of the input key "1" as the reference (0 (zero)°). The angle region θ13 is the region of +135° or more and less than +225° with the position of the input key "1" as the reference (0 (zero)°). The angle region θ14 is the region of +225° or more and less than +315° (that is, less than -45°) with the position of the input key "1" as the reference (0 (zero)°).
Below, the processing of the processor 11 when it determines that the angles of the approximate lines fall within the same angle region four consecutive times in chronological order, and accepts an input operation whose input content is the input key corresponding to that angle region and arranged in the movement direction of the user's gaze position indicated by the approximate lines, will be described using a specific example.
The processor 11 determines which of the angle regions θ11 to θ14 contains the angle of the approximate line DR31. After determining that the angle of the approximate line DR31 is contained in the angle region θ11, the processor 11 determines whether the angle of the approximate line DR32, calculated after the approximate line DR31, is contained in the same angle region θ11 as the angle of the approximate line DR31.
After determining that the angle of the approximate line DR32 is contained in the angle region θ11, the processor 11 determines whether the angle of the approximate line DR33, calculated after the approximate line DR32, is contained in the same angle region θ11 as the angles of the approximate lines DR31 and DR32.
After determining that the angle of the approximate line DR33 is contained in the angle region θ11, the processor 11 determines whether the angle of the approximate line DR34, calculated after the approximate line DR33, is contained in the same angle region θ11 as the angles of the approximate lines DR31 to DR33.
After determining that the angle of the approximate line DR34 is contained in the angle region θ11, the processor 11 detects that it has determined four consecutive times in chronological order (that is, the predetermined number of times) that the angles of the approximate lines DR31 to DR34 are each contained in the same angle region θ11. The processor 11 accepts an input operation whose input content is the input key "1" corresponding to the angle region θ11 containing the angles of the approximate lines DR31 to DR34.
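The quadrant test and the consecutive-count acceptance just described can be condensed into a short sketch. The region-to-key mapping follows the corrected FIG. 7 layout (θ11 centered on key "1" at the 0° reference, θ12 on key "2", and so on); the class name and interface are illustrative assumptions.

```python
KEYS = ["1", "2", "3", "4"]      # keys for angle regions theta11..theta14

def classify_region(angle_deg):
    """Map a line angle to the index of its 90-degree angle region:
    theta11 covers -45..+45 deg around the key "1" reference (0 deg),
    theta12 covers +45..+135 deg, and so on."""
    return int(((angle_deg + 45.0) % 360.0) // 90.0)

class DirectionAcceptor:
    """Accept an input key once its angle region repeats for the
    predetermined number of consecutive line angles (e.g. 2, 4, or 5)."""

    def __init__(self, required=4):
        self.required = required
        self.last_region = None
        self.run_length = 0

    def feed(self, angle_deg):
        region = classify_region(angle_deg)
        self.run_length = self.run_length + 1 if region == self.last_region else 1
        self.last_region = region
        if self.run_length >= self.required:
            self.run_length = 0      # reset after acceptance
            return KEYS[region]      # accepted input key (St26)
        return None
```

Feeding the four angles of DR31 to DR34, all inside θ11, makes `feed` return "1" on the fourth call; in the full flow this runs only after the angular blur check against θ1 has passed.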
As described above, the gaze input device P1 according to embodiment 1 can estimate the input content that the user is about to input based on the movement direction of the user's gaze position, and can accept it as an input operation before the user gazes at the input key. This makes it possible to shorten the time required for each user to input the input information more efficiently. Furthermore, because the gaze input device P1 can estimate the movement direction of the user's gaze position more accurately based on the angular blur amount of the user's gaze position, it can more effectively suppress erroneous input of the input information even when the calibration accuracy is low.
Next, the insensitive area ARN will be described with reference to FIG. 8. FIG. 8 is a diagram illustrating an example of the insensitive area ARN. Note that setting the insensitive area ARN is not essential and may be optional.
The insensitive area ARN is an area in which the process of estimating the input content based on the angle of the approximate line is disabled, and is the area outside the area ARA of the center point "A" and within the radius R1 from the center position Ct of the area ARA. The insensitive area ARN may be set in another shape (for example, an ellipse, a rhombus, etc.) based on the aspect ratio or size of the display 14 or the arrangement of the input keys.
If the processor 11 determines that the detected gaze position of the user is within the insensitive area ARN, it does not use that gaze position for the calculation of the approximate line and its angle. In other words, the processor 11 calculates the approximate line and its angle based on the user's gaze positions detected outside the insensitive area ARN, estimates the input content that the user is about to input based on the calculated angle of the approximate line, and accepts it as an input operation before the user gazes at the input key.
As a result, by using only gaze positions detected at least a predetermined distance (radius R1) away from the center point "A" for estimating the input content, the gaze input device P1 according to embodiment 1 can remove gaze positions that are detected near the center point "A", where the variation in the angle of the approximate line tends to be small and erroneous determination of the input content is likely to occur. Therefore, the gaze input device P1 can more effectively suppress erroneous determination of the input content in the process of estimating the input content based on the movement direction of the user's gaze position.
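A sketch of the insensitive-area filter, assuming the circular ARN described above (another shape such as an ellipse would only change the containment test):

```python
import math

def outside_insensitive_area(gaze_pos, ct, r1):
    """True if a corrected gaze position lies outside the insensitive area
    ARN, i.e. at least the radius R1 away from the center position Ct."""
    return math.dist(gaze_pos, ct) >= r1

def usable_samples(samples, ct, r1):
    """Keep only the gaze positions that may feed the approximate-line
    calculation; points inside ARN are discarded."""
    return [p for p in samples if outside_insensitive_area(p, ct, r1)]
```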
Next, the first gaze input operation procedure will be described with reference to FIGS. 9 and 10. FIGS. 9 and 10 are diagrams explaining an example of the first gaze input operation procedure. The first gaze input operation procedure is a gaze input operation procedure in which input operations for the center point "A" and input operations for the input keys "1" to "4" are accepted alternately, and corresponds to the processing of steps St19 to St26 shown in FIG. 3.
The input screens SC41, SC42, SC43, SC44, SC45, SC46, SC47, and SC48 shown in FIGS. 9 and 10 are examples and are not limited to these. For example, the number of numeric input keys is not limited to four. Also, the arrangement of the numeric input keys is not limited to the arrangement shown in the input screens SC41 to SC48, and they may be arranged rotated by an arbitrary angle (for example, 45°).
The processor 11 displays the input screen SC41 on the display 14, instructs the user (by audio output, message output, etc.) to gaze at the center point "A", and enables (makes acceptable) only the input operation for the center point "A" among the five input keys.
The processor 11 detects the user's gaze position based on the captured image. Based on the detected gaze position, the processor 11 determines whether the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located. The processor 11 may generate the correction parameter for calibrating the user's gaze position at this timing.
If the processor 11 determines, based on the detected gaze position, that the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located, it displays the input screen SC42 on the display 14 and instructs the user to gaze at one of the numeric input keys. On the input screen SC42, the processor 11 disables the input operation for the center point "A" and enables the input operations for each of the four input keys "1" to "4" arranged around the center point "A".
The processor 11 detects the user's gaze position based on the captured image. Based on the detected gaze position or the movement direction DR41 of the gaze position (the angle of the approximate line), the processor 11 determines which of the areas AR1 of the input key "1", AR2 of the input key "2", AR3 of the input key "3", and AR4 of the input key "4" the user is gazing at. The processor 11 accepts the input operation of the numeric input key "1" on the input screen SC42.
After accepting the input operation of one of the numeric input keys (here, the input key "1"), the processor 11 displays the input screen SC43 on the display 14, instructs the user again to gaze at the center point "A", and enables (makes acceptable) only the input operation for the center point "A" among the five input keys.
The processor 11 detects the user's gaze position based on the captured image. Based on the detected gaze position or the movement direction DR42 of the gaze position (the angle of the approximate line), the processor 11 determines whether the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located.
If the processor 11 determines, based on the detected gaze position, that the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located, it displays the input screen SC44 on the display 14 and instructs the user to gaze at one of the numeric input keys. On the input screen SC44, the processor 11 disables the input operation for the center point "A" and enables the input operations for each of the four input keys "1" to "4" arranged around the center point "A".
The processor 11 detects the user's gaze position based on the captured image. Based on the detected gaze position or the movement direction DR43 of the gaze position (the angle of the approximate line), the processor 11 determines which of the areas AR1 of the input key "1", AR2 of the input key "2", AR3 of the input key "3", and AR4 of the input key "4" the user is gazing at. The processor 11 accepts the input operation of the numeric input key "2" on the input screen SC44.
After accepting the input operation of one of the numeric input keys (here, the input key "2"), the processor 11 displays the input screen SC45 on the display 14, instructs the user again to gaze at the center point "A", and enables only the input operation for the center point "A" among the five input keys.
The processor 11 detects the user's gaze position based on the captured image. Based on the detected gaze position or the movement direction DR44 of the gaze position (the angle of the approximate line), the processor 11 determines whether the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located.
 プロセッサ11は、検出されたユーザの視線位置に基づいて、ユーザが中心点「A」の領域ARA内、あるいは、中心点「A」が配置された方向を注視していると判定した場合、入力画面SC46をディスプレイ14に表示し、ユーザにいずれかの数字の入力キーを注視させるように指示する。プロセッサ11は、入力画面SC46において、中心点「A」に対する入力操作を無効化するとともに、中心点「A」の周囲に配置された4つの入力キー「1」~「4」のそれぞれに対する入力操作を有効化する。 If processor 11 determines, based on the detected user's gaze position, that the user is gazing within area ARA of center point "A" or in the direction in which center point "A" is located, it displays input screen SC46 on display 14 and instructs the user to gaze at one of the number input keys. On input screen SC46, processor 11 disables input operations for center point "A" and enables input operations for each of the four input keys "1" to "4" arranged around center point "A".
 プロセッサ11は、撮像画像に基づいて、ユーザの視線位置を検出する。プロセッサ11は、検出されたユーザの視線位置、あるいはユーザの視線位置の移動方向DR45(近似直線の角度)に基づいて、ユーザが4つの入力キー「1」の領域AR1、入力キー「2」の領域AR2、入力キー「3」の領域AR3、入力キー「4」の領域AR4のいずれを注視しているか否かを判定する。プロセッサ11は、入力画面SC46で数字の入力キー「3」の入力操作を受け付ける。 Processor 11 detects the user's gaze position based on the captured image. Processor 11 determines whether the user is gazing at area AR1 of the four input keys "1", area AR2 of input key "2", area AR3 of input key "3", or area AR4 of input key "4" based on the detected user's gaze position or the movement direction DR45 of the user's gaze position (angle of the approximated line). Processor 11 accepts the input operation of the number input key "3" on input screen SC46.
 プロセッサ11は、いずれかの数字の入力キー(ここでは、入力キー「3」)の入力操作を受け付けた後、入力画面SC47をディスプレイ14に表示し、ユーザに中心点「A」を注視させるように再度指示するとともに、5つの入力キーのうち中心点「A」に対する入力操作のみを有効化する。 After accepting an input operation of any of the numeric input keys (here, input key "3"), the processor 11 displays the input screen SC47 on the display 14, instructs the user to gaze at the center point "A" again, and enables only the input operation for the center point "A" out of the five input keys.
 プロセッサ11は、撮像画像に基づいて、ユーザの視線位置を検出する。プロセッサ11は、検出されたユーザの視線位置、あるいはユーザの視線位置の移動方向DR46(近似直線の角度)に基づいて、ユーザが中心点「A」の領域ARA内、あるいは、中心点「A」が配置された方向を注視しているか否かを判定する。 Processor 11 detects the user's gaze position based on the captured image. Processor 11 determines whether the user is gazing within area ARA of center point "A" or in the direction in which center point "A" is located based on the detected user's gaze position or the movement direction DR46 of the user's gaze position (angle of the approximated line).
 プロセッサ11は、検出されたユーザの視線位置に基づいて、ユーザが中心点「A」の領域ARA内、あるいは、中心点「A」が配置された方向を注視していると判定した場合、入力画面SC48をディスプレイ14に表示し、ユーザにいずれかの数字の入力キーを注視させるように指示する。プロセッサ11は、入力画面SC48において、中心点「A」に対する入力操作を無効化するとともに、中心点「A」の周囲に配置された4つの入力キー「1」~「4」のそれぞれに対する入力操作を有効化する。 If processor 11 determines, based on the detected user's gaze position, that the user is gazing within area ARA of center point "A" or in the direction in which center point "A" is located, it displays input screen SC48 on display 14 and instructs the user to gaze at one of the number input keys. On input screen SC48, processor 11 disables input operations for center point "A" and enables input operations for each of the four input keys "1" to "4" arranged around center point "A".
 プロセッサ11は、撮像画像に基づいて、ユーザの視線位置を検出する。プロセッサ11は、検出されたユーザの視線位置、あるいはユーザの視線位置の移動方向DR46(近似直線の角度)に基づいて、ユーザが4つの入力キー「1」の領域AR1、入力キー「2」の領域AR2、入力キー「3」の領域AR3、入力キー「4」の領域AR4のいずれを注視しているか否かを判定する。プロセッサ11は、入力画面SC48で数字の入力キー「4」の入力操作を受け付ける。 Processor 11 detects the user's gaze position based on the captured image. Processor 11 determines whether the user is gazing at area AR1 of the four input keys "1," AR2 of the input key "2," AR3 of the input key "3," or area AR4 of the input key "4" based on the detected user's gaze position or the movement direction DR46 of the user's gaze position (angle of the approximated line). Processor 11 accepts the input operation of the number input key "4" on the input screen SC48.
 プロセッサ11は、4つの数字「1」,「2」,「3」,「4」の入力操作を受け付けた後、入力された数字(入力情報)と、データベースDB1に事前に登録された登録情報とを照合し、ユーザ認証を実行する。 After accepting the input of the four numbers "1," "2," "3," and "4," the processor 11 compares the input numbers (input information) with the registration information previously registered in the database DB1 and performs user authentication.
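A minimal sketch of this collation step, with DB1 modeled as a plain dictionary (an assumption; the publication does not describe the database schema or the comparison method):

```python
import hmac

# DB1 modeled as a dict from user ID to registered PIN; an illustrative
# stand-in only, not the publication's database.
DB1 = {"user-0001": "1234"}


def authenticate(user_id: str, entered_digits: str) -> bool:
    """Compare the gaze-entered digits against the registered PIN.

    hmac.compare_digest is used so the comparison time does not leak
    how many leading digits matched.
    """
    registered = DB1.get(user_id)
    if registered is None:
        return False
    return hmac.compare_digest(entered_digits, registered)


print(authenticate("user-0001", "1234"))  # True
print(authenticate("user-0001", "1243"))  # False
```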
As described above, by alternately accepting an input operation for input information (here, a digit) and an input operation for the center point "A" in the first gaze input operation procedure, the gaze input device P1 according to the first embodiment can suppress erroneous input of the input information more effectively. Furthermore, because the device alternates between operations on the center point "A" and operations on the input keys "1" to "4" (that is, the input information), it can accept repeated entries of the same input information more accurately (for example, when the digit "1" is entered two or more times in succession).
In addition, by enabling (accepting) input operations on only one of the center point "A" and the input keys "1" to "4" at a time, the gaze input device P1 prevents the center point "A" and any of the input keys "1" to "4" from being simultaneous targets on the same straight line. The device can therefore accept input operations based on the time-series change in the movement direction of the gaze position (the angle of the approximated straight line), which suppresses erroneous input more effectively.
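The alternation itself can be pictured as a small state machine in which exactly one target set is enabled at a time; the class and method names below are illustrative assumptions, not from the publication.

```python
# Minimal sketch of the alternating enable/disable logic: at any moment
# either the center point "A" or the digit keys accept input, never both.

class AlternatingInput:
    def __init__(self):
        self.enabled = {"A"}           # start by accepting only the center point
        self.digits = []

    def accept(self, target: str) -> bool:
        if target not in self.enabled:
            return False               # disabled targets are ignored
        if target == "A":
            self.enabled = {"1", "2", "3", "4"}
        else:
            self.digits.append(target)
            self.enabled = {"A"}       # force a return to the center point
        return True


ai = AlternatingInput()
for t in ["A", "1", "1", "A", "1"]:    # the second "1" in a row is rejected
    ai.accept(t)
print(ai.digits)                       # ['1', '1']
```

Note how the forced return to "A" is what makes a doubled digit such as "1", "1" unambiguous: each accepted digit requires a fresh pass through the center point.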
Note that on each of the input screens SC41, SC43, SC45, and SC47 shown in Figs. 9 and 10, only the center point "A", for which the input operation is enabled, may be drawn with a solid line or highlighted (for example, with a thick line or a red frame), while each of the four input keys "1" to "4", for which the input operations are disabled, may be drawn with a dashed line or de-emphasized (for example, with a thin line or a gray frame).
Similarly, on the input screens SC42, SC44, SC46, and SC48 shown in Figs. 9 and 10, each of the four input keys "1" to "4", for which the input operations are enabled, may be drawn with a solid line or highlighted (for example, with a thick line or a red frame), while only the center point "A", for which the input operation is disabled, may be drawn with a dashed line or de-emphasized (for example, with a thin line or a gray frame).
In addition, at the timing when an input operation is accepted, the processor 11 may enlarge the input key corresponding to the entered information or may flash the input screens SC41 to SC48 by changing their illumination level. In this way, the gaze input device P1 can notify the user that acceptance of the input operation has been completed.
Next, the second gaze input operation procedure will be described with reference to Figs. 11 and 12, each of which illustrates an example of the procedure. The second gaze input operation procedure is used when the input operations of the input keys "1" to "4" are accepted consecutively, and corresponds to the processing of steps St19 to St26 shown in Fig. 3.
Note that the input screens SC51, SC52, SC53, SC54, and SC55 shown in Figs. 11 and 12 are merely examples, and no limitation is intended. For example, the number of numeric input keys is not limited to four. Likewise, the arrangement of the numeric input keys is not limited to that shown on the input screens SC51 to SC55; the keys may be rotated by an arbitrary angle (for example, 45°).
The processor 11 displays the input screen SC51 on the display 14, instructs the user (by voice output or message output) to gaze at the center point "A", and enables (makes acceptable) only the input operation for the center point "A" among the five input keys.
Based on the captured image, the processor 11 detects the user's gaze position and determines, from the detected position, whether the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located. At this timing, the processor 11 may generate the correction parameters for calibrating the user's gaze position.
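One plausible form for such a correction parameter is a simple translation offset estimated while the user looks at the known position of the center point "A"; the publication does not fix the form of the parameter, so the sketch below is an assumption.

```python
import statistics

# While the user is known to be gazing at the center point "A", the offset
# between the detected gaze samples and the true position of "A" is taken
# as the correction parameter. A pure translation (dx, dy) is assumed.
CENTER_A = (400, 300)


def compute_correction(samples):
    """samples: gaze positions detected while the user gazes at A."""
    dx = CENTER_A[0] - statistics.mean(x for x, _ in samples)
    dy = CENTER_A[1] - statistics.mean(y for _, y in samples)
    return dx, dy


def apply_correction(gaze, correction):
    return gaze[0] + correction[0], gaze[1] + correction[1]


corr = compute_correction([(388, 305), (392, 309), (390, 307)])
print(corr)                                 # (10.0, -7.0)
print(apply_correction((690, 310), corr))   # calibrated gaze position
```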
If the processor 11 determines, based on the detected gaze position, that the user is gazing within the area ARA of the center point "A" or in the direction in which the center point "A" is located, it displays the input screen SC52 on the display 14 and instructs the user to gaze at one of the numeric input keys. On the input screen SC52, the processor 11 disables the input operation for the center point "A" and enables the input operations for each of the four input keys "1" to "4" arranged around the center point "A".
Based on the captured image, the processor 11 detects the user's gaze position. Based on the detected gaze position, or on the movement direction DR51 of the gaze position (the angle of the approximated straight line), the processor 11 determines which of the areas AR1 of input key "1" to AR4 of input key "4" the user is gazing at. The processor 11 then accepts the input operation of the numeric input key "1" on the input screen SC52.
After accepting the input operation of one of the numeric input keys (here, input key "1"), the processor 11 displays the input screen SC53 on the display 14 and instructs the user to gaze at the next numeric input key. Needless to say, this instruction is not essential and may be omitted.
Based on the captured image, the processor 11 detects the user's gaze position. Based on the detected gaze position, or on the movement direction DR52 of the gaze position (the angle of the approximated straight line), the processor 11 determines which of the areas AR1 of input key "1" to AR4 of input key "4" the user is gazing at. The processor 11 then accepts the input operation of the numeric input key "2" on the input screen SC53.
After accepting the input operation of one of the numeric input keys (here, input key "2"), the processor 11 displays the input screen SC54 on the display 14 and instructs the user to gaze at the next numeric input key.
Based on the captured image, the processor 11 detects the user's gaze position. Based on the detected gaze position, or on the movement direction DR53 of the gaze position (the angle of the approximated straight line), the processor 11 determines which of the areas AR1 of input key "1" to AR4 of input key "4" the user is gazing at. The processor 11 then accepts the input operation of the numeric input key "3" on the input screen SC54.
After accepting the input operation of one of the numeric input keys (here, input key "3"), the processor 11 displays the input screen SC55 on the display 14 and instructs the user to gaze at the next numeric input key.
Based on the captured image, the processor 11 detects the user's gaze position. Based on the detected gaze position, or on the movement direction DR54 of the gaze position (the angle of the approximated straight line), the processor 11 determines which of the areas AR1 of input key "1" to AR4 of input key "4" the user is gazing at. The processor 11 then accepts the input operation of the numeric input key "4" on the input screen SC55.
After accepting the input operations for the four digits "1", "2", "3", and "4", the processor 11 collates the entered digits (input information) with the registration information registered in advance in the database DB1 and performs user authentication.
As described above, in the second gaze input operation procedure, the gaze input device P1 according to the first embodiment accepts the input operations for a predetermined number of digits of input information (here, digits) consecutively after accepting the input operation of the center point "A", which suppresses erroneous input of the input information more effectively. Furthermore, because the device does not alternate between operations on the center point "A" and operations on the input keys "1" to "4" (that is, the input information), the time spent on center-point operations between entries is eliminated, and the gaze input operation time per user can be shortened more efficiently.
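The overall flow of the second procedure, one center-point gaze followed by consecutive digits, might be sketched as below. Here detect_gaze_target() is a hypothetical stand-in for the camera and gaze-detection pipeline, and debouncing of repeated detections of the same key is omitted for brevity.

```python
# Sketch of the second procedure: a single gaze at the center point "A"
# (the moment also used for calibration), then a fixed number of digits
# accepted back to back without returning to "A".

def run_second_procedure(detect_gaze_target, digits_required=4):
    entered = []
    while detect_gaze_target() != "A":   # wait until the center point is gazed at
        pass
    while len(entered) < digits_required:
        target = detect_gaze_target()
        if target in {"1", "2", "3", "4"}:
            entered.append(target)
    return entered


# Example with a scripted sequence of detections ("?" = no valid target):
script = iter(["?", "A", "1", "?", "2", "3", "4"])
print(run_second_procedure(lambda: next(script)))  # ['1', '2', '3', '4']
```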
Note that on the input screen SC51 shown in Figs. 11 and 12, only the center point "A", for which the input operation is enabled, may be drawn with a solid line or highlighted (for example, with a thick line or a red frame), while each of the four input keys "1" to "4", for which the input operations are disabled, may be drawn with a dashed line or de-emphasized (for example, with a thin line or a gray frame).
Similarly, on each of the input screens SC52 to SC55 shown in Figs. 11 and 12, each of the four input keys "1" to "4", for which the input operations are enabled, may be drawn with a solid line or highlighted (for example, with a thick line or a red frame), while only the center point "A", for which the input operation is disabled, may be drawn with a dashed line or de-emphasized (for example, with a thin line or a gray frame).
In addition, at the timing when an input operation is accepted, the processor 11 may enlarge the input key corresponding to the entered information or may flash the input screens SC51 to SC55 by changing their illumination level. In this way, the gaze input device P1 can notify the user that acceptance of the input operation has been completed.
Next, other examples of input screens will be described with reference to Figs. 13 and 14, each of which shows another example of an input screen. Needless to say, the input screens SC2 and SC3 shown in Fig. 13 are merely examples, and no limitation is intended.
The input screen SC2 includes the center point "A" and eight input keys corresponding to the numeric input keys "1" to "8". For gaze input using the input screen SC2, the processor 11 sets the threshold θ2 for the angular blur of the approximated straight line to 45° or less. In this case, with the threshold θ2 at 45° or less and taking the position of the input key "1" as the reference (0°), the processor 11 accepts the input operation of the input key "1" when the angle of the approximated straight line is at least -22.5° and less than +22.5°.
The input screen SC3 includes the center point "A" and ten input keys corresponding to the numeric input keys "0" to "9". For gaze input using the input screen SC3, the processor 11 sets the threshold θ3 for the angular blur of the approximated straight line to 36° or less. In this case, with the threshold θ3 at 36° or less and taking the position of the input key "0" as the reference (0°), the processor 11 accepts the input operation of the input key "0" when the angle of the approximated straight line is at least -18° and less than +18°.
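These two examples follow a single rule: with n keys spread evenly around the center point, each key owns an angular sector of 360/n degrees, i.e. the fitted-line angle must fall within ±(360/n)/2 of the key's own direction (±45° for 4 keys, ±22.5° for 8, ±18° for 10). A small generalization, with key numbering taken counterclockwise from an assumed reference direction:

```python
# Generalized sector test for n keys on a circle around the center point.
# The counterclockwise numbering from key0_angle_deg is an illustrative
# convention, not one fixed by the publication.

def accepted_key(angle_deg, n_keys, key0_angle_deg=0.0):
    """Return the index of the key whose angular sector contains angle_deg."""
    sector = 360.0 / n_keys
    offset = (angle_deg - key0_angle_deg) % 360.0
    return int((offset + sector / 2.0) // sector) % n_keys


print(accepted_key(20.0, 8))    # 0: within ±22.5° of key 0
print(accepted_key(25.0, 8))    # 1: just outside key 0's sector
print(accepted_key(-17.0, 10))  # 0: within ±18° of key 0
```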
As described above, the gaze input device P1 (an example of an input device) according to the first embodiment can accept input operations based on the user's gaze position and includes: a display 14 (an example of a display unit) that displays the calibration screen SC0 (an example of an input screen) and the input screen SC1, each capable of accepting input operations; a camera 13 that captures images of the user; the processor 11 (an example of a calculation unit), which calculates, based on the gaze position of a first user shown in a captured first image, a correction parameter for calibrating the first user's gaze position with respect to the input screen SC1; and the processor 11, which detects the gaze position of a second user shown in a second image captured after the correction parameter has been calculated, calibrates the second user's gaze position using the correction parameter, and accepts input operations on the input screen SC1 based on the calibrated gaze position of the second user.
As a result, even when correction parameters are not recorded and accumulated for each user in advance and the device is used by an unspecified number of users, the gaze input device P1 according to the first embodiment can calibrate the gaze position more efficiently and can accept input operations based on the user's gaze more efficiently.
The gaze input device P1 according to the first embodiment also includes the area ARA of the center point "A" (an example of a first input section), which accepts a first input operation, and the areas AR1 to AR4 corresponding to the plurality of input keys "1" to "4" (an example of second input sections), which accept a second input operation different from the first input operation. The device can therefore accept both the input operation for calibration and the input operation for the input information on a single input screen SC1.
In the gaze input device P1 according to the first embodiment, the first captured image is an image capturing the user looking at the area ARA of the center point "A", and the processor 11 calculates the correction parameter based on the gaze position of the first user and the position of the area ARA. As a result, the device can accept input operations based on the user's gaze more efficiently even when no per-user correction parameters have been recorded and accumulated in advance and the device is used by an unspecified number of users.
In the gaze input device P1 according to the first embodiment, the second captured image is an image capturing the user looking at one of the areas AR1 to AR4 corresponding to the input keys "1" to "4". The processor 11 accepts the input operation based on the gaze position of the second user and the positions of the areas AR1 to AR4. As a result, in the second gaze input operation procedure, the device accepts the input operations for a predetermined number of digits of input information (here, digits) consecutively after accepting the input operation of the center point "A", which suppresses erroneous input more effectively. Furthermore, because the device does not alternate between operations on the center point "A" and operations on the input keys "1" to "4" (that is, the input information), the time spent on center-point operations between entries is eliminated, and the gaze input operation time per user can be shortened more efficiently.
In the gaze input device P1 according to the first embodiment, the second captured image is an image capturing the user looking at the area ARA of the center point "A" or at one of the areas AR1 to AR4 corresponding to the input keys "1" to "4". The processor 11 alternately accepts the first input operation and the second input operation. As a result, in the first gaze input operation procedure, the device alternates between accepting an input operation for input information (here, a digit) and accepting an input operation for the center point "A", which suppresses erroneous input more effectively. Furthermore, because an input on the center point "A" is accepted between successive entries, the device can accurately accept consecutive entries of the same input information (for example, when the digit "1" is entered two or more times in succession).
In the gaze input device P1 according to the first embodiment, the processor 11 enables the area ARA of the center point "A" and disables the areas AR1 to AR4 corresponding to the input keys "1" to "4" on the input screen SC1 to accept the first input operation, and enables the areas AR1 to AR4 and disables the area ARA to accept the second input operation. By enabling (accepting) input operations on only one of the center point "A" and the input keys "1" to "4" at a time, the device suppresses erroneous input more effectively.
When the processor 11 of the gaze input device P1 according to the first embodiment accepts the first input operation or the second input operation, it highlights the area ARA of the center point "A" or the area among AR1 to AR4 corresponding to the accepted operation. This allows the device to notify the user that acceptance of the input operation has been completed.
In the gaze input device P1 according to the first embodiment, the area ARA of the center point "A" is arranged at approximately the center of the input screen SC1, and the areas AR1 to AR4 corresponding to the input keys "1" to "4" are each arranged at approximately the same distance from the area ARA. This allows the device to distinguish the first input operation from the second input operation more clearly and to accept the second input operation more accurately.
In the gaze input device P1 according to the first embodiment, the areas AR1 to AR4 corresponding to the input keys "1" to "4" are arranged at approximately equal intervals on a circumference centered on the area ARA of the center point "A". This allows the device to accept the second input operation more accurately.
The input screen SC1 of the gaze input device P1 according to the first embodiment also has, around the area ARA of the center point "A", an insensitive area ARN in which input operations are disabled. When the processor 11 determines that the gaze position of the second user is within the insensitive area ARN, it omits acceptance of the input operation based on that gaze position. By using only gaze positions detected outside the insensitive area ARN, that is, at least a predetermined distance (radius R1) away from the center point "A", to estimate the input content, the device removes the gaze positions that are most prone to misjudgment and thus suppresses erroneous determination of the input content more effectively.
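A dead-zone filter of this kind reduces to a radius test; the values of R1 and the coordinates below are illustrative assumptions.

```python
import math

# Gaze samples inside radius R1 of the center point "A" are discarded
# before any key decision is made.
CENTER_A = (400, 300)
R1 = 120.0


def usable_samples(samples):
    """Keep only gaze positions outside the insensitive area ARN."""
    return [
        (x, y)
        for x, y in samples
        if math.hypot(x - CENTER_A[0], y - CENTER_A[1]) >= R1
    ]


samples = [(405, 310), (460, 330), (620, 340), (680, 350)]
print(usable_samples(samples))  # the first two fall inside ARN and are dropped
```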
The processor 11 of the gaze input device P1 according to the first embodiment also detects the gaze position of the second user from each of a plurality of second captured images captured by the camera 13, accumulates the detected positions in time series, calculates the movement direction of the gaze position from the accumulated positions, and accepts input operations based on that movement direction. As a result, the device can accept an input operation based on the movement direction of the gaze position even when the gaze position is not detected within any of the areas ARA and AR1 to AR4, or when the gaze positions detected within those areas do not satisfy a predetermined acceptance condition (for example, that the gaze position be detected continuously within the area for at least a predetermined time).
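One plausible reading of the "approximated straight line" is a least-squares line fitted through the accumulated gaze samples, as sketched below; the publication does not spell out the fitting method, so this is an assumption.

```python
import math

# Fit a straight line through the accumulated gaze samples by least
# squares and report its angle as the movement direction.

def movement_angle_deg(samples):
    n = len(samples)
    mx = sum(x for x, _ in samples) / n
    my = sum(y for _, y in samples) / n
    sxx = sum((x - mx) ** 2 for x, _ in samples)
    sxy = sum((x - mx) * (y - my) for (x, y) in samples)
    # Slope of the least-squares line y = a*x + b; this simple form assumes
    # the gaze track is not vertical (a real implementation would
    # special-case sxx close to zero).
    a = sxy / sxx
    return math.degrees(math.atan2(a, 1.0))


track = [(400, 300), (460, 296), (530, 305), (600, 301)]
print(movement_angle_deg(track))  # close to 0°: the gaze is moving rightward
```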
The processor 11 also calculates and accumulates the movement direction of the gaze position from the gaze positions of the second user accumulated over a predetermined time, calculates the amount of blur in the movement direction from the plurality of accumulated directions, and accepts the input operation based on the movement direction when it determines that the calculated amount of blur is equal to or less than a threshold. This allows the gaze input device P1 to estimate the content the user is about to enter from the movement direction of the gaze position and to accept it as an input operation before the user fixates on the input key, which shortens the input time per user more efficiently. In addition, because the device can estimate the movement direction of the gaze position more accurately from the amount of angular blur, it can suppress erroneous input more effectively even when the calibration accuracy is low.
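On top of such a direction estimate, the blur check can be a spread measure over a sliding window of recent fitted-line angles; the spread measure (maximum pairwise angular difference) and the threshold value below are illustrative assumptions.

```python
# Accept the input only once the spread of recently estimated movement
# directions drops below a threshold.

def angular_diff(a, b):
    return abs((a - b + 180.0) % 360.0 - 180.0)


def blur_amount(angles):
    """Maximum pairwise angular difference over the window, in degrees."""
    return max(angular_diff(a, b) for a in angles for b in angles)


def accept_if_stable(angles, threshold_deg):
    return blur_amount(angles) <= threshold_deg


recent = [2.0, -3.5, 1.0, -1.0]          # fitted-line angles, newest last
print(blur_amount(recent))               # 5.5
print(accept_if_stable(recent, 22.5))    # True: stable enough to accept a key
```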
As described above, in the first embodiment, the input method performed by the gaze input device P1 (an example of an input device) capable of accepting input operations based on a user's gaze position includes: displaying the calibration screen SC0 (an example of an input screen) and the input screen SC1, each capable of accepting input operations; acquiring a first captured image of the user; calculating, based on the gaze position of a first user shown in the first captured image, a correction parameter for calibrating the first user's gaze position with respect to the input screen; acquiring a second captured image of the user; calibrating the gaze position of a second user shown in the second captured image using the correction parameter; and accepting input operations on the input screen based on the calibrated gaze position of the second user.
As a result, even when correction parameters are not recorded and accumulated for each user in advance and the device is used by an unspecified number of users, the gaze input device P1 according to the first embodiment can calibrate the gaze position more efficiently and can accept input operations based on the user's gaze more efficiently.
Although various embodiments have been described above with reference to the drawings, it goes without saying that the present disclosure is not limited to these examples. It will be apparent to those skilled in the art that various changes, modifications, substitutions, additions, deletions, and equivalents can be conceived within the scope of the claims, and it is understood that they naturally belong to the technical scope of the present disclosure. Furthermore, the components of the embodiments described above may be combined in any manner without departing from the spirit of the invention.
This application is based on Japanese Patent Application No. 2022-181386 filed on November 11, 2022, the contents of which are incorporated herein by reference.
The present disclosure is useful as an input device and an input method that make calibration for gaze input more efficient.
11 Processor
12 Memory
13 Camera
14 Display
DB1 Database
P1 Gaze input device
SC0 Calibration screen
SC1, SC2, SC3, SC41, SC42, SC43, SC44, SC45, SC46, SC47, SC48, SC51, SC52, SC53, SC54, SC55 Input screens

Claims (15)

1.  An input device capable of accepting an input operation based on a user's gaze position, the input device comprising:
    a display unit that displays an input screen capable of accepting the input operation;
    a camera that captures an image of the user;
    a calculation unit that calculates, based on a gaze position of a first user shown in a captured first image, a correction parameter for calibrating the gaze position of the first user with respect to the input screen; and
    a processor that detects a gaze position of a second user shown in a second image captured after the calculation of the correction parameter, calibrates the gaze position of the second user using the correction parameter, and accepts the input operation on the input screen based on the calibrated gaze position of the second user.
2.  The input device according to claim 1, wherein the input screen includes a first input unit that accepts a first input operation, and a plurality of second input units that accept a second input operation different from the first input operation.
3.  The input device according to claim 2, wherein the first captured image is an image capturing the user looking at the first input unit, and the calculation unit calculates the correction parameter based on the gaze position of the first user and a position of the first input unit.
4.  The input device according to claim 2, wherein the second captured image is an image capturing the user looking at any one of the second input units, and the processor accepts the input operation based on the gaze position of the second user and positions of the plurality of second input units.
5.  The input device according to claim 2, wherein the second captured image is an image capturing the user looking at the first input unit or at any one of the second input units, and the processor alternately accepts the first input operation and the second input operation.
6.  The input device according to claim 4, wherein the processor enables the first input unit and disables the plurality of second input units on the input screen to accept the first input operation, and enables the plurality of second input units and disables the first input unit on the input screen to accept the second input operation.
7.  The input device according to claim 2, wherein, when the first input operation or the second input operation is accepted, the processor highlights the first input unit or the second input unit corresponding to the accepted input operation.
8.  The input device according to claim 2, wherein the first input unit is arranged at approximately the center of the input screen, and the plurality of second input units are each arranged at approximately the same distance from the first input unit.
9.  The input device according to claim 8, wherein the plurality of second input units are arranged at approximately equal intervals on a circumference centered on the first input unit.
10.  The input device according to claim 2, wherein the input screen has, around the first input unit, an insensitive area in which the input operation is disabled, and wherein the processor, when determining that the gaze position of the second user is within the insensitive area, omits acceptance of the input operation based on the gaze position of the second user.
11.  The input device according to claim 1, wherein the processor detects the gaze position of the second user from each of a plurality of the second captured images captured by the camera, accumulates the detected gaze positions in time series, calculates a movement direction of the gaze position of the second user based on the accumulated gaze positions, and accepts the input operation based on the movement direction.
12.  The input device according to claim 11, wherein the processor calculates and accumulates the movement direction of the gaze position based on the gaze positions of the second user accumulated over a predetermined time, calculates an amount of blur in the movement direction based on the plurality of accumulated movement directions, and accepts the input operation based on the movement direction when determining that the calculated amount of blur is equal to or less than a threshold.
13.  An input method performed by an input device capable of accepting an input operation based on a user's gaze position, the input method comprising:
    displaying an input screen capable of accepting the input operation;
    acquiring a first captured image of the user;
    calculating, based on a gaze position of a first user shown in the first captured image, a correction parameter for calibrating the gaze position of the first user with respect to the input screen;
    acquiring a second captured image of the user;
    calibrating a gaze position of a second user shown in the second captured image using the correction parameter; and
    accepting the input operation on the input screen based on the calibrated gaze position of the second user.
14.  An input device capable of accepting an input operation based on a user's gaze position, the input device comprising a processor that detects a gaze position of a first user shown in a first captured image captured by a camera and a gaze position of a second user shown in a second captured image captured by the camera, and accepts, based on the gaze position of the first user and the gaze position of the second user, the input operation on an input screen capable of accepting the input operation.
15.  The input device according to claim 14, wherein the input screen includes a first input unit that accepts a first input operation, and a plurality of second input units that accept a second input operation different from the first input operation.
PCT/JP2023/026902 2022-11-11 2023-07-21 Input device and input method WO2024100935A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2022181386 2022-11-11
JP2022-181386 2022-11-11

Publications (1)

Publication Number Publication Date
WO2024100935A1 (en)

Family

ID=91032109

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2023/026902 WO2024100935A1 (en) 2022-11-11 2023-07-21 Input device and input method

Country Status (1)

Country Link
WO (1) WO2024100935A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007136000A (en) * 2005-11-21 2007-06-07 Nippon Telegr & Teleph Corp <Ntt> Apparatus, method and program for measuring visual axis
JP2020502628A (en) * 2016-10-27 2020-01-23 アリババ・グループ・ホールディング・リミテッドAlibaba Group Holding Limited User interface for information input in virtual reality environment
JP2022169043A (en) * 2021-04-27 2022-11-09 パナソニックIpマネジメント株式会社 Authentication device and authentication system


Similar Documents

Publication Publication Date Title
RU2672181C2 (en) Method and device for team generation
JP4755490B2 (en) Blur correction method and imaging apparatus
KR20150108303A (en) Method and device for detecting straight line
WO2013089190A1 (en) Imaging device and imaging method, and storage medium for storing tracking program processable by computer
WO2020238626A1 (en) Image state determination method and device, apparatus, system, and computer storage medium
US20050163498A1 (en) User interface for automatic red-eye removal in a digital image
JP2012029245A (en) Imaging apparatus
JP6502511B2 (en) Calculation device, control method of calculation device, and calculation program
JP5676956B2 (en) Image processing apparatus, image processing method, and program
CN111080571B (en) Camera shielding state detection method, device, terminal and storage medium
US20170193322A1 (en) Optical reading of external segmented display
JP6690092B2 (en) Heat source detection device, heat source detection method, and heat source detection program
JP2011164742A (en) Apparatus, method, and program for controlling display, and recording medium
US9569838B2 (en) Image processing apparatus, method of controlling image processing apparatus and storage medium
WO2024100935A1 (en) Input device and input method
JP6564136B2 (en) Image processing apparatus, image processing method, and program
JP5554438B2 (en) Improved control of image capture devices
US11330191B2 (en) Image processing device and image processing method to generate one image using images captured by two imaging units
JPH11296304A (en) Screen display inputting device and parallax correcting method
WO2023098249A1 (en) Method for assisting oct in scanning, and pc terminal, storage medium and system
JP2008015979A (en) Method, program and apparatus for pattern detection, and imaging apparatus
CN112637587B (en) Dead pixel detection method and device
JP2008211534A (en) Face detecting device
JP2007206963A (en) Image processor, image processing method, program and storage medium
US20200193601A1 (en) Processing system, processing apparatus, terminal apparatus, processing method, and program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23888290

Country of ref document: EP

Kind code of ref document: A1