WO2014101219A1 - Action recognition method and television - Google Patents

Info

Publication number
WO2014101219A1
WO2014101219A1 PCT/CN2012/088111 CN2012088111W WO2014101219A1 WO 2014101219 A1 WO2014101219 A1 WO 2014101219A1 CN 2012088111 W CN2012088111 W CN 2012088111W WO 2014101219 A1 WO2014101219 A1 WO 2014101219A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
user
shield
time
moment
Prior art date
Application number
PCT/CN2012/088111
Other languages
French (fr)
Chinese (zh)
Inventor
葛中峰
刘丽丽
刘卫东
Original Assignee
青岛海信信芯科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 青岛海信信芯科技有限公司 filed Critical 青岛海信信芯科技有限公司
Priority to PCT/CN2012/088111 priority Critical patent/WO2014101219A1/en
Publication of WO2014101219A1 publication Critical patent/WO2014101219A1/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20 Movements or behaviour, e.g. gesture recognition

Definitions

  • The present application belongs to the field of pattern recognition, and specifically relates to a motion recognition method and a television set.
  • Background art
  • A somatosensory game is a new type of video game that is controlled through changes in the player's body movements. It breaks away from traditional input with handheld controller buttons, so that players can immerse themselves in the game freely.
  • Human body motion can also be recognized by using a 2D camera.
  • In the prior art, the character data, size and predetermined display position of a preset initial image are required as reference data; the currently acquired character image is adjusted against them, and the image of the person corresponding to the reference data is cut out and displayed at the predetermined position.
  • The embodiment of the invention provides a motion recognition method that solves the technical problem in the prior art that program complexity is high, the amount of computation is large and the recognition process is complicated, and achieves the technical effect of recognizing actions with a simple algorithm and a small amount of computation.
  • A motion recognition method is applied to an electronic device that has a video playback function and includes a camera, the method comprising:
  • a second image of the first area including the first user is obtained by the camera;
  • A television set including a camera, the television set comprising:
  • an image obtaining module configured to obtain, by the camera, a first image of a first area including a first user at a first moment, and, at a second moment after the first moment, obtain, by the camera, a second image of the first area including the first user;
  • a first obtaining module configured to obtain, based on the first image, a first centroid coordinate position of the first user at the first moment, and obtain, based on the second image, a second centroid coordinate position of the first user at the second moment;
  • an identification module configured to determine, according to the first centroid coordinate position and the second centroid coordinate position, the operation action of the first user between the first moment and the second moment.
  • In the embodiments, a first image of the first user is obtained at the first moment and a second image of the first user is obtained at a second moment after the first moment; a first centroid coordinate position and a second centroid coordinate position are obtained from the first image and the second image respectively; and the operation action of the first user between the first moment and the second moment is identified based at least on these two centroid coordinate positions. This solves the prior-art problems of high program complexity and a large amount of computation, and achieves the technical effect of recognizing the action with a simple algorithm and a small amount of computation. For example, the prior art requires the character data, size and predetermined display position of a preset initial image as reference data, identifies the currently acquired character image to obtain character feature data, adjusts it proportionally against the reference data, and then cuts out the part of the person image corresponding to the reference data and displays it at a predetermined position. The present invention only needs to determine the centroid position in the captured image of the person and perform a difference operation on the centroid coordinates; based on the change in the centroid position, the action of the person can be identified, so the algorithm is simple and the amount of computation is small.
  • FIG. 1 is a flow chart of a motion recognition method according to an embodiment of the present invention.
  • FIG. 2(a) is a schematic diagram of a foreground image before removing a shadow image according to an embodiment of the present invention.
  • FIG. 2(b) is a schematic diagram of a foreground image after removing a shadow image according to an embodiment of the present invention.
  • FIG. 3(a)-3(k) are schematic diagrams of various actions in the base motion model library according to an embodiment of the present invention.
  • FIG. 4 is a structural diagram of a television set according to an embodiment of the present invention.
  • Detailed description
  • The embodiment of the invention provides a motion recognition method that solves the technical problem in the prior art that program complexity is high, the amount of computation is large and the recognition process is complicated, and achieves the technical effect of recognizing actions with a simple algorithm and a small amount of computation.
  • An embodiment of the present invention provides a motion recognition method applied to an electronic device that has a video playback function and includes a camera. The electronic device may be an existing high-end television with a 2D camera. The television itself has a certain data processing capability: after an image containing the user is obtained through the camera, this capability can be used to analyze the image and thereby identify the action performed by the user.
  • the method includes:
  • Step 101: When the first user is not in the first area, obtain a first background image of the first area by using the camera.
  • Step 102: After the first background image is collected, obtain, at the first moment, a first image of the first area including the first user by using the camera; and, at a second moment after the first moment, obtain a second image of the first area including the first user by using the camera.
  • Step 103: After step 102 is completed, obtain, based on the first image, a first centroid coordinate position of the first user at the first moment; and obtain, based on the second image, a second centroid coordinate position of the first user at the second moment.
  • In other words, a first background image containing only the background area is first collected by the camera; then, at a first moment after the user enters the image collection area and at a second moment after the first moment, the first image and the second image, each containing the user and the background area, are collected.
  • The white balance method uses total reflection theory: it assumes that the brightest point in the image is a white point and uses this white point as the reference for automatically white-balancing the image.
  • With this reference, the color component values of each pixel in the color-corrected image can be obtained, giving a color restored image, where
  • R_max, G_max and B_max are the three color component values of the brightest point on the original image; R, G and B are the three color component values of the brightest point after white balance (usually 255 or slightly smaller); R_B, G_B and B_B are the three color component values of each pixel on the original image; and R_A, G_A and B_A are the three color component values of each pixel after white balance.
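The white-balance step can be illustrated with a small sketch. The scaling rule used here (each channel multiplied by the ratio of the target white value to the brightest point's value) is an assumption based on the symbol definitions above, since the formula itself is not legible in this text. The function name `white_balance`, the choice of the brightest point as the pixel with the largest channel sum, and the default target of 255 are all hypothetical.

```python
def white_balance(pixels, target=(255, 255, 255)):
    """Scale every pixel so that the brightest pixel maps to `target`.

    `pixels` is a list of (R, G, B) integer tuples. The brightest point is
    taken to be the pixel with the largest R+G+B sum (an assumption).
    """
    maxima = max(pixels, key=sum)  # (R_max, G_max, B_max)
    balanced = []
    for pixel in pixels:
        balanced.append(tuple(
            # Integer scaling, clamped to 255; a zero maximum leaves the channel unchanged.
            min(255, value * goal // peak) if peak else value
            for value, goal, peak in zip(pixel, target, maxima)
        ))
    return balanced
```

For instance, with a brightest pixel of (200, 180, 160), that pixel maps to (255, 255, 255) and every other pixel is scaled by the same per-channel ratios.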
  • Obtaining, based on the first image, the first centroid coordinate position of the first user at the first moment specifically includes:
  • the first processed image includes a foreground image composed of pixels having a first color value and a second background image composed of pixels having a second color value different from the first color value;
  • The first color restored image wb_Pre_rgb and the second color restored image wb_Bg_rgb are obtained respectively.
  • Using these two images and formula two, the foreground image Fg is obtained: for each pixel, the difference between wb_Pre_i and wb_Bg_i is compared with a threshold T for the channels i ∈ {r, g, b}.
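A minimal sketch of foreground extraction by background differencing follows. Formula two is not fully legible in this text, so the rule used here (a pixel is foreground when any channel differs from the background by more than the threshold) is an assumption, as are the function name `foreground_mask` and the default threshold value.

```python
def foreground_mask(frame, background, threshold=30):
    """Return a binary mask (1 = foreground, 0 = background).

    `frame` and `background` are same-sized 2D lists of (R, G, B) tuples,
    both assumed to be white-balanced already. A pixel is marked as
    foreground when any channel differs by more than `threshold`
    (an assumed reading of formula two).
    """
    mask = []
    for frame_row, background_row in zip(frame, background):
        mask.append([
            1 if any(abs(f - b) > threshold for f, b in zip(fp, bp)) else 0
            for fp, bp in zip(frame_row, background_row)
        ])
    return mask
```

The resulting mask plays the role of the first processed image: foreground pixels share one value and background pixels another.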
  • Obtaining, according to the first processed image, the first centroid coordinate position of the first user at the first moment specifically includes: determining whether there is a shadow image in the foreground image;
  • if the shadow image exists in the foreground image, the shadow image is removed to obtain a de-shadowed first processed image;
  • The obtained foreground image may contain a shadow image caused by the user's shadow, as shown in FIG. 2(a). The shadow image is a continuous region containing only a small number of white points and is separated from the user's image. If the shadow image exists in the foreground image, it is removed to obtain the de-shadowed first processed image, as shown in FIG. 2(b).
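One way to remove a small, detached shadow region is to keep only the largest connected foreground region. The sketch below makes that assumption explicitly; the text does not state how the removal is performed, and the function name `keep_largest_region` and the use of 4-connectivity are hypothetical choices.

```python
from collections import deque


def keep_largest_region(mask):
    """Return a copy of a non-empty binary mask with only its largest
    4-connected foreground region kept; smaller detached regions
    (assumed to be shadow) are cleared."""
    height, width = len(mask), len(mask[0])
    seen = [[False] * width for _ in range(height)]
    largest = []
    for row in range(height):
        for col in range(width):
            if not mask[row][col] or seen[row][col]:
                continue
            # Breadth-first search collects one connected region.
            region = []
            queue = deque([(row, col)])
            seen[row][col] = True
            while queue:
                y, x = queue.popleft()
                region.append((y, x))
                for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and mask[ny][nx] and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if len(region) > len(largest):
                largest = region
    cleaned = [[0] * width for _ in range(height)]
    for y, x in largest:
        cleaned[y][x] = 1
    return cleaned
```

This relies on the description that the shadow is a small region separated from the user's image; a shadow that touches the user's silhouette would not be removed by this sketch.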
  • Obtaining, according to the de-shadowed first processed image, the first centroid coordinate position of the first user at the first moment specifically includes:
  • A first coordinate system composed of an X axis and a Y axis is established in the first processed image. In the embodiment of the present application, the right side of the X axis is the positive direction and the upper side of the Y axis is the positive direction. Based on this coordinate system, the horizontal and vertical coordinates of each pixel in the first processed image can be obtained.
  • The centroid is computed as $x_0 = \frac{1}{n}\sum_{i=1}^{n} x_i$ and $y_0 = \frac{1}{n}\sum_{i=1}^{n} y_i$, where $x_i$ and $y_i$ are the horizontal and vertical coordinate values of the pixel points of the foreground image in the first processed image, and $n$ is the total number of pixel points in the foreground image.
  • The centroid coordinate $(x_0, y_0)$ of the foreground image of each frame can be obtained by the above centroid method.
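The center (centroid) coordinate described above is the mean of the foreground pixel coordinates. A minimal sketch, assuming the foreground is given as a binary mask stored row by row from the top of the image, so that row indices are flipped to make the Y axis point upward; the function name `centroid` is hypothetical.

```python
def centroid(mask):
    """Return (x0, y0), the mean coordinates of all foreground pixels,
    or None when the mask has no foreground pixel.

    X grows to the right and Y grows upward, so the row index (which
    grows downward in the list) is flipped.
    """
    height = len(mask)
    sum_x = sum_y = count = 0
    for row_index, row in enumerate(mask):
        for col_index, value in enumerate(row):
            if value:
                sum_x += col_index
                sum_y += height - 1 - row_index
                count += 1
    if count == 0:
        return None
    return sum_x / count, sum_y / count
```

Running this once per frame yields the sequence of center coordinates that the later steps compare.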
  • After step 103, step 104 is performed: determining, according to the first centroid coordinate position and the second centroid coordinate position, the operation action of the first user between the first moment and the second moment.
  • Step 104 specifically includes:
  • the operation action between the first moment and the second moment is a left movement.
  • the operation action between the first moment and the second moment is a squat action.
  • the method further includes:
  • N is an integer greater than or equal to 4.
  • The camera acquires 15 to 25 frames per second. The sampling rate of a standard camera is 25 frames/second, while industrial cameras reach 60 frames/second, 200 frames/second or even higher. On a television, the PAL format is 25 frames per second and the NTSC format is 30 frames per second.
  • Initialization is performed first: the centroid coordinate position of the foreground image in the first frame image is assigned simultaneously to three preset parameters, namely the reference frame centroid coordinates reference_frame (r_x0, r_y0), the previous frame centroid coordinates previous_frame (p_x0, p_y0) and the current frame centroid coordinates current_frame (c_x0, c_y0).
  • These three preset parameters then change according to the user's action.
  • The difference between the current frame centroid abscissa value and the previous frame centroid abscissa value is dx_cp = c_x0 − p_x0.
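The bookkeeping of the three frame parameters can be sketched as a small class. The names `CentroidTracker`, `update` and `reset_reference` are hypothetical; the second returned value, the difference against the reference frame, is written dx_cr here on the assumption that it is defined analogously to dx_cp.

```python
class CentroidTracker:
    """Keeps the reference, previous and current frame center coordinates."""

    def __init__(self, first_centroid):
        # Initialization: the first frame's coordinate is assigned to all three.
        self.reference = first_centroid
        self.previous = first_centroid
        self.current = first_centroid

    def update(self, centroid):
        """Shift current to previous, store the new coordinate, and return
        (dx_cp, dx_cr): the abscissa differences against the previous frame
        and against the reference frame."""
        self.previous = self.current
        self.current = centroid
        dx_cp = self.current[0] - self.previous[0]
        dx_cr = self.current[0] - self.reference[0]
        return dx_cp, dx_cr

    def reset_reference(self):
        """Use the current frame as the new reference once an action ends."""
        self.reference = self.current
```

The ordinate differences can be tracked in the same way by reading index 1 of each coordinate pair.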
  • Determining that the operation action of the first user between the first moment and the second moment is a right movement specifically includes:
  • Determining, when the first difference is less than the second threshold, that the operation action of the first user between the first moment and the second moment is a left movement specifically includes:
  • The first centroid coordinate of the first moment is assigned to the reference centroid coordinate.
  • Based on the first centroid coordinate position and the second centroid coordinate position, if dx_cr > T_lr, it is determined that the first user's operation action has a right shift trend; if dx_cr < −T_lr, it is determined that the first user's operation action has a left shift trend.
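The trend test can be sketched over a sequence of abscissa values. This assumes that dx_cr is the difference between the current frame abscissa and the reference frame abscissa, and that the first frame serves as the reference; the function name `first_horizontal_trend` is hypothetical.

```python
def first_horizontal_trend(xs, t_lr):
    """Scan center abscissa values frame by frame and return
    (frame_index, "right" | "left") for the first frame whose offset
    from the reference exceeds the threshold, or None if none does."""
    reference = xs[0]  # the first frame is used as the reference frame
    for index, x in enumerate(xs[1:], start=1):
        dx_cr = x - reference
        if dx_cr > t_lr:
            return index, "right"
        if dx_cr < -t_lr:
            return index, "left"
    return None
```

A detected trend is only a candidate: the text goes on to require an end flag before the movement is confirmed.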
  • When the operation action of the first user has a right shift trend, it is determined whether there is a right shift end flag; when there is a right shift end flag, the first user's operation action is a right movement.
  • The application provides only two implementations of the right shift end flag; those skilled in the art may also use other methods as the flag for the end of a right shift.
  • Manner 2: When the operation action has a right shift trend, that is, dx_cr > T_lr, and the sign of ⁇ is always positive, it is determined whether the value of
  • The centroid coordinate of the current frame is then assigned to the reference frame centroid coordinate, and the operation action in the next time interval is determined.
  • The determination of a left movement is exactly the reverse of the determination of a right movement; a person of ordinary skill in the art can derive it from the right-movement process, so it is not repeated here.
  • Determining that the operation action of the first user between the first moment and the second moment is a jumping action specifically includes:
  • Determining that the operation action between the first moment and the second moment is a squat action specifically includes:
  • When the operation action of the first user has a jumping trend, it is determined whether there is a jump end flag; when there is a jump end flag, the operation action of the first user is a jump action. The jump end flag is determined by checking whether, within a certain period of time, the sign of dy_cp is positive in the first half of the period and negative in the second half and, finally, dy_cr < T_j. If so, a jump end flag exists, and the operation action of the first user between the first moment and the second moment is determined to be a jump action.
  • The determination of a squat action is exactly the reverse of the determination of a jump action; a person of ordinary skill in the art can derive it from the jump determination process, so it is not repeated here.
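The jump end flag check can be sketched as follows. Splitting the observation window exactly at its midpoint and requiring every frame-to-frame difference in each half to have the expected sign are assumptions; the text only says the sign is positive in the first half of the period and negative in the second half. The function name `has_jump_end_flag` is hypothetical.

```python
def has_jump_end_flag(dy_cp_series, dy_cr_final, t_j):
    """Return True when the vertical motion looks like a completed jump.

    dy_cp_series: frame-to-frame ordinate differences over the window.
    dy_cr_final:  ordinate difference between the last frame and the
                  reference frame.
    t_j:          jump threshold.
    """
    half = len(dy_cp_series) // 2
    if half == 0:
        return False
    rising = all(d > 0 for d in dy_cp_series[:half])    # moving up first
    falling = all(d < 0 for d in dy_cp_series[half:])   # then coming down
    return rising and falling and dy_cr_final < t_j
```

A squat check would mirror this with the signs reversed, as the text notes.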
  • the method further includes:
  • a second area of the first body part image is obtained at the second moment.
  • A body part that does not change during the user's movement, for example the face, is selected as the first body part.
  • Determining, according to the first centroid coordinate position and the second centroid coordinate position, the operation action of the first user between the first moment and the second moment specifically includes:
  • Other user actions can also be determined from the change in the centroid position and in the area of the body part image.
  • the method further includes:
  • The action model library includes basic models corresponding to actions such as squat, jump, left shift, right shift and no action.
  • The basis for establishing the model is as follows: J. H. Yoo et al. established a human body line graph model based on knowledge of human anatomy. Assuming that the total height of the human body is H, the relative lengths of the parts of the human body can be obtained, as shown in Table 1 below:
  • The coordinates of the 17 joint points of the human body can then be obtained, giving the real skeleton model of the user during the action, and the skeleton model can be displayed on the television.
  • The skeleton model data can be passed to an upper-layer application for game development, so that the skeleton model controls the characters on the game screen in a somatosensory game and gives the player a feeling of immersion in the game.
  • The ordinate value y_top of the highest point in the foreground image can also be obtained, as can the ordinate value y_bottom of the lowest point, the leftmost abscissa value x_left and the rightmost abscissa value x_right.
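The four extreme values can be read directly from a binary foreground mask. This sketch assumes the mask is stored row by row from the top of the image and flips row indices so that Y grows upward; the function name `foreground_extremes` and the dictionary keys are hypothetical.

```python
def foreground_extremes(mask):
    """Return y_top, y_bottom, x_left and x_right of the foreground,
    or None when the mask has no foreground pixel."""
    height = len(mask)
    points = [
        (col, height - 1 - row)  # flip rows so that Y grows upward
        for row, values in enumerate(mask)
        for col, value in enumerate(values)
        if value
    ]
    if not points:
        return None
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return {
        "y_top": max(ys),
        "y_bottom": min(ys),
        "x_left": min(xs),
        "x_right": max(xs),
    }
```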
  • an embodiment of the present invention provides a television set including a camera. Referring to FIG. 4, the television includes:
  • a background obtaining submodule configured to obtain, by the camera, a first background image of the first area when the first user is not in the first area;
  • an image obtaining module 401 configured to obtain, by using the camera, a first image of a first area including a first user at a first moment, and, at a second moment after the first moment, obtain, by the camera, a second image of the first area including the first user;
  • a first obtaining module 402 configured to obtain, based on the first image, a first centroid coordinate position of the first user at the first moment, and obtain, based on the second image, a second centroid coordinate position of the first user at the second moment.
  • In other words, a first background image containing only the background area is first collected by the camera; then, at a first moment after the user enters the image collection area and at a second moment after the first moment, the first image and the second image, each containing the user and the background area, are collected.
  • The white balance method uses total reflection theory: it assumes that the brightest point in the image is a white point and uses this white point as the reference for automatically white-balancing the image.
  • With this reference, the color component values of each pixel in the color-corrected image can be obtained, giving a color restored image, where
  • R_max, G_max and B_max are the three color component values of the brightest point on the original image; R, G and B are the three color component values of the brightest point after white balance (usually 255 or slightly smaller); R_B, G_B and B_B are the three color component values of each pixel on the original image; and R_A, G_A and B_A are the three color component values of each pixel after white balance.
  • the first obtaining module specifically includes:
  • the background obtaining submodule;
  • a color restoration submodule configured to perform color correction processing on the first image to obtain a first color restored image, and perform color correction processing on the first background image to obtain a second color restored image;
  • an image processing submodule configured to obtain a first processed image based on the first color restored image and the second color restored image, wherein the first processed image includes a foreground image composed of pixels having a first color value and a second background image composed of pixels having a second color value different from the first color value;
  • The first color restored image wb_Pre_rgb and the second color restored image wb_Bg_rgb are obtained respectively.
  • the first obtaining module further includes:
  • a judging submodule configured to judge whether a shadow image exists in the foreground image;
  • a de-shadowed image obtaining submodule configured to remove the shadow image when the shadow image exists in the foreground image, to obtain a de-shadowed first processed image.
  • The obtained foreground image may contain a shadow image caused by the user's shadow, as shown in FIG. 2(a). The shadow image is a continuous region containing only a small number of white points and is separated from the user's image. If the shadow image exists in the foreground image, it is removed to obtain the de-shadowed first processed image, as shown in FIG. 2(b).
  • the first obtaining module further includes:
  • a second obtaining submodule configured to obtain, according to the abscissa value and the ordinate value of each pixel point, a first abscissa value and a first ordinate value of the first centroid of the first user at the first moment, and thereby obtain the first centroid coordinate position.
  • A first coordinate system composed of an X axis and a Y axis is established in the first processed image. In the embodiment of the present application, the right side of the X axis is the positive direction and the upper side of the Y axis is the positive direction. Based on this coordinate system, the horizontal and vertical coordinates of each pixel in the first processed image can be obtained.
  • The centroid is computed as $x_0 = \frac{1}{n}\sum_{i=1}^{n} x_i$ and $y_0 = \frac{1}{n}\sum_{i=1}^{n} y_i$, where $x_i$ and $y_i$ are the horizontal and vertical coordinate values of the pixel points of the foreground image in the first processed image, and $n$ is the total number of pixel points in the foreground image.
  • The centroid coordinate $(x_0, y_0)$ of the foreground image of each frame can be obtained by the above centroid method.
  • the television further includes:
  • the identification module 403 is configured to determine, according to the first centroid coordinate position and the second centroid coordinate position, the operation action of the first user between the first moment and the second moment.
  • the identifying module 403 specifically includes:
  • a first difference obtaining submodule configured to obtain a first difference by subtracting the first abscissa value of the first centroid from the second abscissa value of the second centroid;
  • a first judging submodule configured to judge whether the first difference is greater than a first threshold;
  • a first determining submodule configured to determine, when the first difference is greater than the first threshold, that the operation action of the first user between the first moment and the second moment is a right movement;
  • a second judging submodule configured to judge, when the first difference is not greater than the first threshold, whether the first difference is less than a second threshold;
  • a second determining submodule configured to determine, when the first difference is less than the second threshold, that the operation action of the first user between the first moment and the second moment is a left movement;
  • a second difference obtaining submodule configured to obtain a second difference by subtracting the first ordinate value of the first centroid from the second ordinate value of the second centroid;
  • a third judging submodule configured to judge whether the second difference is greater than a third threshold;
  • a third determining submodule configured to determine, when the second difference is greater than the third threshold, that the operation action of the first user between the first moment and the second moment is a jumping action;
  • a fourth judging submodule configured to judge, when the second difference is not greater than the third threshold, whether the second difference is less than a fourth threshold;
  • a fourth determining submodule configured to determine, when the second difference is less than the fourth threshold, that the operation action of the first user between the first moment and the second moment is a squat action.
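The four threshold comparisons listed above can be sketched as one function. The threshold values, the function name `classify_action`, and the choice to test the horizontal difference before the vertical one are assumptions; the text does not state which axis takes priority when both differences exceed their thresholds.

```python
def classify_action(first, second, t1=20, t2=-20, t3=20, t4=-20):
    """Classify the movement between two center coordinates.

    first, second: (x, y) coordinates at the first and second moments.
    t1..t4: the first to fourth thresholds (illustrative values only).
    """
    first_difference = second[0] - first[0]    # horizontal change
    second_difference = second[1] - first[1]   # vertical change
    if first_difference > t1:
        return "right"
    if first_difference < t2:
        return "left"
    if second_difference > t3:
        return "jump"
    if second_difference < t4:
        return "squat"
    return "none"
```

For example, `classify_action((100, 50), (130, 50))` yields `"right"` with the default thresholds, because the abscissa grew by 30.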
  • the television further includes:
  • a second obtaining module configured to take i from 3 to N in sequence and obtain an i-th abscissa value and an i-th ordinate value of the i-th centroid of the first user at an i-th moment after the second moment, where N is an integer greater than or equal to 4.
  • The camera acquires 15 to 25 frames per second. The sampling rate of a standard camera is 25 frames/second, while industrial cameras reach 60 frames/second, 200 frames/second or even higher. On a television, the PAL format is 25 frames per second and the NTSC format is 30 frames per second.
  • Initialization is performed first: the centroid coordinate position of the foreground image in the first frame image is assigned simultaneously to three preset parameters, namely the reference frame centroid coordinates reference_frame (r_x0, r_y0), the previous frame centroid coordinates previous_frame (p_x0, p_y0) and the current frame centroid coordinates current_frame (c_x0, c_y0).
  • These three preset parameters then change according to the user's action.
  • The difference between the current frame centroid abscissa value and the previous frame centroid abscissa value is dx_cp = c_x0 − p_x0.
  • The first determining submodule specifically includes:
  • a first determining unit configured to determine, when the first difference is greater than the first threshold, that the operation action of the first user between the first moment and the second moment has a right shift trend;
  • a first judging unit configured to judge, according to the second abscissa value and at least one i-th abscissa value, whether a right shift end flag exists for the operation action;
  • a second determining unit configured to determine, when the right shift end flag exists, that the operation action is a right movement;
  • or the second determining submodule specifically includes:
  • a third determining unit configured to determine, when the first difference is less than the second threshold, that the operation action of the first user between the first moment and the second moment has a left shift trend;
  • a second judging unit configured to judge, according to the second abscissa value and at least one i-th abscissa value, whether a left shift end flag exists for the operation action;
  • a fourth determining unit configured to determine, when the left shift end flag exists, that the operation action is a left movement.
  • The first centroid coordinate of the first moment is assigned to the reference centroid coordinate.
  • Based on the first centroid coordinate position and the second centroid coordinate position, if dx_cr > T_lr, it is determined that the first user's operation action has a right shift trend; if dx_cr < −T_lr, it is determined that the first user's operation action has a left shift trend.
  • When the operation action of the first user has a right shift trend, it is determined whether there is a right shift end flag; when there is a right shift end flag, the first user's operation action is a right movement.
  • The application provides only two implementations of the right shift end flag; those skilled in the art may also use other methods as the flag for the end of a right shift.
  • Manner 2: When the operation action has a right shift trend, that is, dx_cr > T_lr, and the sign of ⁇ is always positive, it is determined whether the value of
  • The centroid coordinate of the current frame is then assigned to the reference frame centroid coordinate, and the operation action in the next time interval is determined.
  • The determination of a left movement is exactly the reverse of the determination of a right movement; a person of ordinary skill in the art can derive it from the right-movement process, so it is not repeated here.
  • The third determining submodule specifically includes:
  • a fifth determining unit configured to determine, when the second difference is greater than the third threshold, that the operation action of the first user between the first moment and the second moment has a jumping trend;
  • a third judging unit configured to judge, according to the second ordinate value and at least one i-th ordinate value, whether a jump end flag exists for the operation action;
  • a sixth determining unit configured to determine, when the jump end flag exists, that the operation action is a jump action;
  • or the fourth determining submodule specifically includes:
  • a seventh determining unit configured to determine, when the second difference is less than the fourth threshold, that the operation action of the first user between the first moment and the second moment has a squatting trend;
  • a fourth judging unit configured to judge, according to the second ordinate value and at least one i-th ordinate value, whether a squat end flag exists for the operation action;
  • an eighth determining unit configured to determine, when the squat end flag exists, that the operation action is a squat action.
  • The first centroid coordinate position of the first moment is assigned to the reference centroid coordinate position. Based on the first centroid coordinate position and the second centroid coordinate position, if dy_cr > T_j, it is determined that the first user's operation action has a jumping trend; if dy_cr < −T_s, it is determined that the first user's operation action has a squatting trend.
  • When the operation action of the first user has a jumping trend, it is determined whether there is a jump end flag; when there is a jump end flag, the operation action of the first user is a jump action. The jump end flag is determined by checking whether, within a certain period of time, the sign of dy_cp is positive in the first half of the period and negative in the second half and, finally, dy_cr < T_j. If so, a jump end flag exists, and the operation action of the first user between the first moment and the second moment is determined to be a jump action.
  • The determination of a squat action is exactly the reverse of the determination of a jump action; a person of ordinary skill in the art can derive it from the jump determination process, so it is not repeated here.
  • the television further includes:
  • an area obtaining module configured to obtain, based on the first image, a first area of a first body part image of a first body part of the first user at the first moment, and obtain, based on the second image, a second area of the first body part image at the second moment.
  • A body part that does not change during the user's movement, for example the face, is selected as the first body part.
  • The identifying module is specifically configured to:
  • when it is determined, according to the first centroid coordinate position and the second centroid coordinate position, that the operation action of the first user between the first moment and the second moment is a jump action, determine whether the first area is larger than the second area; if the first area is larger than the second area, the first user has moved away from the camera, and the operation action is determined to be a backward jump; if the first area is smaller than the second area, the first user has moved closer to the camera, and the operation action is determined to be a forward jump.
  • Other user actions can also be determined from the change in the centroid position and in the area of the body part image.
  • the television further includes:
  • a matching module configured to match the operation action with a standard action model in the action model library to obtain a first matching result
  • a display module configured to display, when the first matching result indicates that the matching is successful, the standard action model corresponding to the operation action.
  • the action model library includes various basic models corresponding to actions such as squatting, jumping, left shifting, right shifting, and no motion.
  • the basis for establishing the model is as follows: J. H. Yoo et al. established a human body line-graph model based on knowledge of human anatomy. Assuming that the total height of the human body is H, the relative lengths of the various parts of the human body can be obtained, as shown in Table 1 below:
  • the coordinates of the 17 joint points of the human body can be obtained, and then the real skeleton model of the user during the action can be obtained, and the skeleton model can be displayed on the television.
  • the data of the skeleton model can be transmitted to the upper application for game development, so that the skeleton model can be used to control the characters on the game screen to perform the somatosensory game, so that the player has a feeling of immersing in the game.
  • the ordinate value y_top of the highest point in the foreground image, the ordinate value y_bottom of the lowest point, the leftmost abscissa value x_left and the rightmost abscissa value x_right can also be obtained.
  • the action is determined to be standing with one hand, and when the action is determined to stand with one hand, if the peak is Xtop-xo ⁇ O, stand for the left hand, if Xtop-xo ⁇ O, stand for the right hand; if Hi-H ⁇ T h , and
  • the first image of the first user is obtained at the first time
  • the second image of the first user is obtained at the second time after the first time
  • the operation action between the first moment and the second moment is recognized, which solves the prior-art technical problem of a complicated recognition process caused by high computational complexity and a large amount of computation, and achieves the technical effect of recognizing actions with a simple algorithm and a small amount of computation.
  • for example, the prior art has to use the character feature data, size and predetermined display position of a preset initial image as reference data in order to recognize the currently acquired character image and obtain the character features.
  • the present invention only has to locate the centroid in the captured character image and apply a simple difference operation to the centroid coordinates; the character's action can be recognized from the change in centroid position, so the algorithm is simple and the amount of computation is small;
  • embodiments of the present invention can be provided as a method, a system, or a computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware. Moreover, the present invention can take the form of a computer program product embodied on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, optical storage, etc.) containing computer-usable program code.
  • the computer program instructions can also be stored in a computer readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer readable memory produce an article of manufacture comprising the instruction device.
  • the instruction device implements the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
  • These computer program instructions can also be loaded onto a computer or other programmable data processing device such that a series of operational steps are performed on a computer or other programmable device to produce computer-implemented processing for execution on a computer or other programmable device.
  • the instructions provide steps for implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.
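
The forward/backward jump test described among the definitions above (comparing the area of a stable body part, such as the face, at the two moments once a jump has been detected) can be sketched as follows. This is only an illustration: the function name, the string labels and the handling of equal areas are assumptions and do not come from the source.

```python
def jump_direction(first_area, second_area):
    """Classify a detected jump by how the body-part area changed.

    first_area / second_area: pixel area of the same body part (e.g. the face)
    at the first and second moments.
    """
    if first_area > second_area:
        # The body part shrank, so the user moved away from the camera.
        return "backward"
    if first_area < second_area:
        # The body part grew, so the user moved toward the camera.
        return "forward"
    # Equal areas are not covered by the source; treated here as a jump in place.
    return "in_place"
```

For example, a face that covers 900 pixels at the first moment and 700 pixels at the second would be reported as a backward jump.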

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Psychiatry (AREA)
  • Social Psychology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

Disclosed are an action recognition method and a television, which are applied in an electronic device comprising a camera and having a video playing function. The method comprises: obtaining a first image of a first area comprising a first user at a first moment via the camera; obtaining a second image of the first area comprising the first user at a second moment after the first moment via the camera; on the basis of the first image, obtaining a first barycentric coordinate location of the first user at the first moment; on the basis of the second image, obtaining a second barycentric coordinate location of the first user at the second moment; and at least on the basis of the first barycentric coordinate location and the second barycentric coordinate location, recognizing and determining an operation action of the first user between the first moment and the second moment.

Description

Action recognition method and television

Technical field

The present application belongs to the field of pattern recognition, and specifically relates to an action recognition method and a television.

Background art
A somatosensory (motion-sensing) game is a new type of video game that is controlled through changes in body movement. It breaks away from the traditional mode of operation that relies purely on controller buttons, so that players can immerse themselves in the game as they please. When motion-sensing games are played on the Wii and PS Move platforms, the player still has to wear or hold auxiliary equipment for body movements to be sensed, which is inconvenient. When motion-sensing games are played on the Xbox platform with the Kinect peripheral, no controller is needed, but Kinect has to combine a depth camera with a color camera to recognize human motion.
In the prior art, human motion can also be recognized with a 2D camera. In one such method, the character feature data, size and predetermined display position of a preset initial image are used as reference data; the currently acquired character image is adjusted against them, and the part of the character image that matches the reference data is cropped out and displayed at the predetermined position.

In the course of implementing the technical solutions of the embodiments of the present application, the inventors found at least the following technical problem in the prior art:

The prior-art method has to extract character feature data, and because feature-extraction algorithms are complex and computationally expensive, the recognition process is complicated.

Summary of the invention
Embodiments of the present invention provide an action recognition method that solves the prior-art technical problem of a complicated recognition process caused by high computational complexity and a large amount of computation, and achieves the technical effect of recognizing actions with a simple algorithm and a small amount of computation.
An action recognition method, applied to an electronic device that includes a camera and has a video playback function, the method comprising:
at a first moment, obtaining, through the camera, a first image of a first area that includes a first user;
at a second moment after the first moment, obtaining, through the camera, a second image of the first area that includes the first user;
obtaining, based on the first image, a first centroid coordinate position of the first user at the first moment; obtaining, based on the second image, a second centroid coordinate position of the first user at the second moment;
recognizing and determining, based at least on the first centroid coordinate position and the second centroid coordinate position, the operation action of the first user between the first moment and the second moment.

A television including a camera, the television comprising:
an image obtaining module, configured to obtain, at a first moment and through the camera, a first image of a first area that includes a first user, and to obtain, at a second moment after the first moment and through the camera, a second image of the first area that includes the first user;
a first obtaining module, configured to obtain, based on the first image, a first centroid coordinate position of the first user at the first moment, and to obtain, based on the second image, a second centroid coordinate position of the first user at the second moment;
an identification module, configured to recognize and determine, based at least on the first centroid coordinate position and the second centroid coordinate position, the operation action of the first user between the first moment and the second moment.
One or more technical solutions provided in the embodiments of the present application have at least the following technical effects or advantages:

1. In the embodiments of the present invention, a first image of the first user is obtained at a first moment and a second image of the first user is obtained at a second moment after the first moment; a first centroid coordinate position and a second centroid coordinate position are obtained from the first image and the second image respectively; and the operation action of the first user between the first moment and the second moment is recognized based at least on those two positions. This solves the prior-art technical problem of a complicated recognition process caused by high computational complexity and a large amount of computation, and achieves the technical effect of recognizing actions with a simple algorithm and a small amount of computation. For example, the prior art has to use the character feature data, size and predetermined display position of a preset initial image as reference data, recognize the currently acquired character image to obtain character feature data, scale it against the reference data, crop out the part of the character image that matches the reference data, and display it at a predetermined position. The present invention only has to locate the centroid in the captured character image and apply a simple difference operation to the centroid coordinates; the character's action can be recognized from the change in centroid position, so the algorithm is simple and the amount of computation is small.

2. Because actions can be recognized using only the data processing capability and the 2D camera that existing high-end televisions already have, the hardware cost does not increase and the functions of the television are enriched.

Brief description of the drawings
Figure 1 is a flowchart of an action recognition method according to an embodiment of the present invention;
Figure 2(a) is a schematic diagram of a foreground image before the shadow image is removed, according to an embodiment of the present invention;
Figure 2(b) is a schematic diagram of a foreground image after the shadow image is removed, according to an embodiment of the present invention;
Figures 3(a) to 3(k) are schematic diagrams of the actions in the basic action model library according to an embodiment of the present invention;
Figure 4 is a structural diagram of a television according to an embodiment of the present invention.

Detailed description
Embodiments of the present invention provide an action recognition method that solves the prior-art technical problem of a complicated recognition process caused by high computational complexity and a large amount of computation, and achieves the technical effect of recognizing actions with a simple algorithm and a small amount of computation.

To solve the above problem, the general idea of the technical solution in the embodiments of the present invention is as follows:

At a first moment, a first image of a first area that includes a first user is obtained through the camera; at a second moment after the first moment, a second image of the first area that includes the first user is obtained through the camera; based on the first image, a first centroid coordinate position of the first user at the first moment is obtained; based on the second image, a second centroid coordinate position of the first user at the second moment is obtained; and, based at least on the first centroid coordinate position and the second centroid coordinate position, the operation action of the first user between the first moment and the second moment is recognized. This solves the prior-art technical problem of a complicated recognition process caused by high computational complexity and a large amount of computation, and achieves the technical effect of recognizing actions with a simple algorithm and a small amount of computation.
For a better understanding of the above technical solution, it is described in detail below with reference to the accompanying drawings and specific embodiments.

An embodiment of the present application provides an action recognition method applied to an electronic device that includes a camera and has a video playback function. The electronic device may be an existing high-end television that has a 2D camera and a certain amount of data processing capability of its own; after an image containing the user is obtained through the camera, that processing capability can be used to analyze the image and thereby recognize the action the user performs.

Referring to Figure 1, the method includes:

Step 101: when the first user is not in the first area, obtain a first background image of the first area through the camera.

After the first background image is captured, perform step 102: at a first moment, obtain, through the camera, a first image of the first area that includes the first user; at a second moment after the first moment, obtain, through the camera, a second image of the first area that includes the first user.

After step 102 is completed, perform step 103: based on the first image, obtain a first centroid coordinate position of the first user at the first moment; based on the second image, obtain a second centroid coordinate position of the first user at the second moment.
In a specific implementation, first, before the user enters the image capture area, the camera captures a first background image containing only the background. Then, at a first moment after the user enters the image capture area and at a second moment after the first moment, it captures a first image and a second image that contain both the user and the background. After any image is captured by the camera, it must be preprocessed for color restoration, which removes the effect that uneven illumination caused by lighting changes has on the image's true colors and thereby restores them. Color is usually corrected by the white balance method. Because the RGB components of white light are equal (R = G = B = 255), white light is corrected first and light of other colors is corrected along with it; likewise, because the RGB components of gray light are also equal, white balance can also be achieved by correcting gray light. Specifically, the white balance method uses the total reflection theory: it assumes that the brightest point in the image is a white point and uses that point as the reference for automatically white-balancing the image. In practical engineering applications, the brightest point is defined as the point with the largest value of R + G + B in the image. Then, from the three color component values of the brightest point and Formula 1:

$$R_A = \frac{R'_{\max}}{R_{\max}} R_B, \qquad G_A = \frac{G'_{\max}}{G_{\max}} G_B, \qquad B_A = \frac{B'_{\max}}{B_{\max}} B_B$$

the color component values of every pixel in the color-corrected image can be obtained, which yields the color-restored image. Here $R_{\max}$, $G_{\max}$, $B_{\max}$ are the three color component values of the brightest point in the original image; $R'_{\max}$, $G'_{\max}$, $B'_{\max}$ are the three color component values of the brightest point after white balance (usually 255 or slightly less); $R_B$, $G_B$, $B_B$ are the three color component values of each pixel in the original image; and $R_A$, $G_A$, $B_A$ are the three color component values of each pixel after white balance.

In the embodiments of the present application, obtaining, in step 103, the first centroid coordinate position of the first user at the first moment based on the first image specifically includes:
performing color correction on the first image to obtain a first color-restored image, and performing color correction on the first background image to obtain a second color-restored image;
obtaining a first processed image based on the first color-restored image and the second color-restored image, where the first processed image includes a foreground image composed of pixels having a first color value and a second background image composed of pixels having a second color value different from the first color value;
obtaining, based on the first processed image, the first centroid coordinate position of the first user at the first moment.

In a specific implementation, after the first image and the first background image are processed with the color-correction preprocessing described above, a first color-restored image wb_Pre_rgb and a second color-restored image wb_Bg_rgb are obtained. From these two images and Formula 2:

$$Fg = \begin{cases} 255, & |wb\_Pre_i - wb\_Bg_i| > T \ \text{for some } i \in \{r, g, b\} \\ 0, & \text{otherwise} \end{cases}$$

the binarized first processed image is obtained, where i = r, g, b denotes the r, g and b channels respectively. Formula 2 means that wherever the difference between the first color-restored image and the second color-restored image exceeds the threshold T in at least one channel, that part is foreground and is set to white; the remainder is the second background image and is set to black. The result is a first processed image that contains only two color values. In the implementation of the present application, after the second image or any other image is obtained, the same binarization must be applied to it in order to separate the foreground of the image from the background.

Further, obtaining, based on the first processed image, the first centroid coordinate position of the first user at the first moment specifically includes:

determining whether a shadow image exists in the foreground image;
when the shadow image exists in the foreground image, removing the shadow image to obtain a shadow-free first processed image;
obtaining, based on the shadow-free first processed image, the first centroid coordinate position of the first user at the first moment.

In a specific implementation, the foreground image may contain a shadow image cast by the user's shadow, as shown in Figure 2(a). A shadow image is a connected region that contains few white points and is separated from the image of the user. If the foreground image contains such a shadow image, it is removed to obtain the shadow-free first processed image, as shown in Figure 2(b).
Further, obtaining, based on the shadow-free first processed image, the first centroid coordinate position of the first user at the first moment specifically includes:
obtaining, based on a first coordinate system formed by an X axis and a Y axis in the shadow-free first processed image, the abscissa value and ordinate value of each pixel that makes up the foreground image;
obtaining, based on the abscissa value and ordinate value of each pixel, the first abscissa value and first ordinate value of the first centroid of the first user at the first moment, and thereby the first centroid coordinate position.
In a specific implementation, after the shadow-free first processed image is obtained, a first coordinate system formed by an X axis and a Y axis is established in it; in the embodiments of the present application, the positive X direction points right and the positive Y direction points up. Based on this coordinate system, the abscissa and ordinate of every pixel in the first processed image can be obtained. Then, with Formula 3:

$$x_0 = \frac{1}{N}\sum_{i=1}^{N} x_i, \qquad y_0 = \frac{1}{N}\sum_{i=1}^{N} y_i$$

the first abscissa value $x_0$ and the first ordinate value $y_0$ of the first centroid are obtained, where $x_i$ and $y_i$ are the abscissa and ordinate of each pixel of the foreground image in the first processed image, and $N$ is the total number of pixels in the foreground image. In the implementation of the present application, this method yields the centroid coordinates $(x_0, y_0)$ of the foreground image of every frame.

After step 103 is completed, perform step 104: based at least on the first centroid coordinate position and the second centroid coordinate position, recognize and determine the operation action of the first user between the first moment and the second moment.
In the implementation of the present application, step 104 specifically includes:
obtaining a first difference by subtracting the first abscissa value of the first centroid from the second abscissa value of the second centroid;
determining whether the first difference is greater than a first threshold;
when the first difference is greater than the first threshold, determining that the operation action of the first user between the first moment and the second moment is a right-shift action;
when the first difference is not greater than the first threshold, determining whether the first difference is less than a second threshold;
when the first difference is less than the second threshold, determining that the operation action of the first user between the first moment and the second moment is a left-shift action;
obtaining a second difference by subtracting the first ordinate value of the first centroid from the second ordinate value of the second centroid;
determining whether the second difference is greater than a third threshold;
when the second difference is greater than the third threshold, determining that the operation action of the first user between the first moment and the second moment is a jump action;
when the second difference is not greater than the third threshold, determining whether the second difference is less than a fourth threshold;
when the second difference is less than the fourth threshold, determining that the operation action of the first user between the first moment and the second moment is a squat action.
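
The centroid computation and the threshold tests of step 104 can be sketched as follows. This is an illustrative sketch under stated assumptions: the function names, the mask representation (rows of 0/255 values) and the returned string labels are hypothetical, and the row index is flipped so that Y grows upward, as the description specifies.

```python
def centroid(mask):
    """Mean (x, y) of all foreground (non-zero) pixels, or None if there are none."""
    height = len(mask)
    xs, ys = [], []
    for row_index, row in enumerate(mask):
        for x, value in enumerate(row):
            if value:
                xs.append(x)
                # Flip the row index so that larger y means higher in the image.
                ys.append(height - 1 - row_index)
    if not xs:
        return None
    return sum(xs) / len(xs), sum(ys) / len(ys)


def classify(first, second, t1, t2, t3, t4):
    """Compare two centroids against the four thresholds of step 104.

    t1/t2 bound the horizontal difference (right/left shift);
    t3/t4 bound the vertical difference (jump/squat).
    """
    dx = second[0] - first[0]  # first difference
    dy = second[1] - first[1]  # second difference
    actions = []
    if dx > t1:
        actions.append("right")
    elif dx < t2:
        actions.append("left")
    if dy > t3:
        actions.append("jump")
    elif dy < t4:
        actions.append("squat")
    return actions
```

Only a running sum of coordinates and two subtractions are needed per frame pair, which is the source of the low computational cost the description claims.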
In the embodiments of the present application, after step 103, the method further includes:
taking i from 3 to N in turn, obtaining the i-th abscissa value and the i-th ordinate value of the i-th centroid of the first user at the i-th moment after the second moment, where N is an integer greater than or equal to 4.
In a specific implementation, the camera acquires 15 to 25 frames per second. The sampling rate of a general-purpose standard camera is 25 frames/second, and the sampling rate of an industrial camera can reach 60 frames/second, 200 frames/second, or even higher; a television, however, plays PAL files at 25 frames/second and NTSC files at 30 frames/second. When the first frame is obtained, initialization is performed first: the centroid coordinate position of the foreground image in the first frame is assigned simultaneously to three preset parameters, namely the reference frame centroid coordinates reference_frame ($r\_x_0$, $r\_y_0$), the previous frame centroid coordinates previous_frame ($p\_x_0$, $p\_y_0$), and the current frame centroid coordinates current_frame ($c\_x_0$, $c\_y_0$). At different moments, the three preset parameters change according to the user's actions. From the three parameters, four differences can be obtained: the difference between the current frame centroid ordinate and the reference centroid ordinate, $dy_{cr} = c\_y_0 - r\_y_0$; the difference between the current frame centroid abscissa and the reference centroid abscissa, $dx_{cr} = c\_x_0 - r\_x_0$; the difference between the current frame centroid ordinate and the previous frame centroid ordinate, $dy_{cp} = c\_y_0 - p\_y_0$; and the difference between the current frame centroid abscissa and the previous frame centroid abscissa, $dx_{cp} = c\_x_0 - p\_x_0$.
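The bookkeeping of the three centroid parameters and the four differences can be illustrated with a minimal Python sketch. The class name `CentroidTracker` and its methods are hypothetical names chosen for this illustration and do not appear in the application; the sketch also assumes that the previous-frame value is simply the centroid of the frame processed immediately before.

```python
from dataclasses import dataclass
from typing import Tuple

Point = Tuple[float, float]  # (abscissa, ordinate) of a centroid


@dataclass
class CentroidTracker:
    """Hypothetical holder for the reference, previous and current centroids."""
    reference: Point
    previous: Point
    current: Point

    @classmethod
    def from_first_frame(cls, centroid: Point) -> "CentroidTracker":
        # Initialization: the first frame's centroid fills all three parameters.
        return cls(centroid, centroid, centroid)

    def update(self, centroid: Point) -> None:
        # The old current centroid becomes the previous one (assumption).
        self.previous = self.current
        self.current = centroid

    def differences(self) -> Tuple[float, float, float, float]:
        # Returns (dx_cr, dy_cr, dx_cp, dy_cp).
        (cx, cy), (rx, ry), (px, py) = self.current, self.reference, self.previous
        return cx - rx, cy - ry, cx - px, cy - py
```

The reference centroid is left untouched by `update`; whether and when it is replaced depends on the action that has been recognized.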
In the embodiment of the present application, determining, when the first difference is greater than the first threshold, that the operation action of the first user between the first moment and the second moment is a right-move action specifically includes:
when the first difference is greater than the first threshold, determining that the operation action of the first user between the first moment and the second moment has a right-shift tendency;
determining, based on the second abscissa value and at least one i-th abscissa value, whether a right-shift end flag exists for the operation action;
when the right-shift end flag exists, determining that the operation action is a right-move action; or
determining, when the first difference is less than the second threshold, that the operation action of the first user between the first moment and the second moment is a left-move action specifically includes:
when the first difference is less than the second threshold, determining that the operation action of the first user between the first moment and the second moment has a left-shift tendency;
determining, based on the second abscissa value and at least one i-th abscissa value, whether a left-shift end flag exists for the operation action;
when the left-shift end flag exists, determining that the operation action is a left-move action.
In a specific implementation, once the first centroid coordinates at the first moment have been assigned to the reference centroid coordinates, the tendency is determined from the first centroid coordinate position and the second centroid coordinate position: if $dx_{cr} > T_{lr}$, the operation action of the first user has a right-shift tendency; if $dx_{cr} < -T_{lr}$, it has a left-shift tendency. When the operation action of the first user has a right-shift tendency, it is determined whether a right-shift end flag exists; if it does, the operation action of the first user is a right-move action. The present application provides only two implementations of the right-shift end flag; a person of ordinary skill in the art may use other criteria to mark the end of a right shift.
Manner 1: While the operation action has a right-shift tendency, that is, $dx_{cr} > T_{lr}$, when the sign of $dx_{cp}$ changes from positive to negative, it is determined whether $|dx_{cp}|$ is greater than the threshold $T_n$. If $|dx_{cp}|$ is not greater than $T_n$, the right shift has not ended, and the sign change of $dx_{cp}$ is merely interference caused by the user swaying left and right. If $|dx_{cp}|$ is greater than $T_n$, a right-shift end flag exists, and the operation action of the first user between the first moment and the second moment is determined to be a right-move action.
Manner 2: While the operation action has a right-shift tendency, that is, $dx_{cr} > T_{lr}$, and the sign of $dx_{cp}$ remains positive, it is determined whether $|dx_{cp}|$ is less than 2 in every one of a preset number of consecutive frames, for example 5 or 6 consecutive frames. If so, a right-shift end flag exists after the user has moved some distance, and the operation action of the first user between the first moment and the second moment is determined to be a right-move action.
In addition, after the operation action between the first moment and the second moment is determined to be a right-move action, the centroid coordinates of the current frame are assigned to the reference frame centroid coordinates, and the operation action for the next interval is determined.
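The two right-shift end flags described above can be sketched in Python as follows. This is an illustrative sketch under stated assumptions, not the applicant's implementation: the function name `detect_right_move`, its parameter names, and the treatment of a zero frame-to-frame difference as "not negative" in Manner 2 are all assumptions made for the example.

```python
from typing import List, Optional


def detect_right_move(xs: List[float], t_lr: float, t_n: float,
                      still_eps: float = 2.0,
                      still_frames: int = 5) -> Optional[int]:
    """Return the frame index at which a right move is confirmed, else None.

    xs[0] is the reference centroid abscissa; xs[k] is the abscissa in frame k.
    """
    x_ref = xs[0]
    still_run = 0  # consecutive frames with a small non-negative dx_cp
    for k in range(1, len(xs)):
        dx_cr = xs[k] - x_ref       # current frame vs. reference
        dx_cp = xs[k] - xs[k - 1]   # current frame vs. previous frame
        if dx_cr <= t_lr:
            still_run = 0
            continue                # no right-shift tendency in this frame
        prev_dx_cp = xs[k - 1] - xs[k - 2] if k >= 2 else 0.0
        # Manner 1: the sign flips from positive to negative with a large step.
        if prev_dx_cp > 0 and dx_cp < 0 and abs(dx_cp) > t_n:
            return k
        # Manner 2: the centroid has nearly stopped for several frames in a row.
        if 0 <= dx_cp < still_eps:
            still_run += 1
            if still_run >= still_frames:
                return k
        else:
            still_run = 0
    return None
```

After a frame index is returned, a caller would assign that frame's centroid to the reference and start the next interval from there, as described above. A small sign flip (not exceeding `t_n`) is ignored as sway.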
In a specific implementation, the determination of a left-move action mirrors the determination of a right-move action. A person of ordinary skill in the art can derive it from the right-move determination process, so it is not repeated here.
In the embodiment of the present application, determining, when the second difference is greater than the third threshold, that the operation action of the first user between the first moment and the second moment is a jump action specifically includes:
when the second difference is greater than the third threshold, determining that the operation action of the first user between the first moment and the second moment has a jump tendency;
determining, based on the second ordinate value and at least one i-th ordinate value, whether a jump end flag exists for the operation action;
when the jump end flag exists, determining that the operation action is a jump action; or
determining, when the second difference is less than the fourth threshold, that the operation action of the first user between the first moment and the second moment is a squat action specifically includes:
when the second difference is less than the fourth threshold, determining that the operation action of the first user between the first moment and the second moment has a squat tendency;
determining, based on the second ordinate value and at least one i-th ordinate value, whether a squat end flag exists for the operation action;
when the squat end flag exists, determining that the operation action is a squat action.
In a specific implementation, once the first centroid coordinate position at the first moment has been assigned to the reference centroid coordinate position, the tendency is determined from the first centroid coordinate position and the second centroid coordinate position: if $dy_{cr} > T_j$, the operation action of the first user has a jump tendency; if $dy_{cr} < -T_s$, it has a squat tendency. When the operation action of the first user has a jump tendency, it is determined whether a jump end flag exists; if it does, the operation action of the first user is a jump action. The jump end flag is specifically the following condition: within a certain period, the sign of $dy_{cp}$ is positive during the first half of the period and negative during the second half, and finally $dy_{cr} < T_j$. If this condition is met, a jump end flag exists, and the operation action of the first user between the first moment and the second moment is determined to be a jump action.
In addition, after the operation action between the first moment and the second moment is determined to be a jump action, the reference coordinates are not replaced.
In a specific implementation, the determination of a squat action mirrors the determination of a jump action. A person of ordinary skill in the art can derive it from the jump determination process, so it is not repeated here.
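The jump determination can likewise be sketched as a small state machine. The sketch below is only one possible reading of the jump end flag: it assumes the Y axis points upward, treats "first half positive, second half negative" as a rise followed by a fall, and uses the hypothetical name `detect_jump`; none of these details are fixed by the application.

```python
from typing import List, Optional


def detect_jump(ys: List[float], t_j: float) -> Optional[int]:
    """Return the frame index at which a jump is confirmed, else None.

    ys[0] is the reference centroid ordinate; ys[k] is the ordinate in frame k.
    """
    y_ref = ys[0]
    trend = False    # True once dy_cr has exceeded t_j (jump tendency)
    falling = False  # True once dy_cp has turned negative after the rise
    for k in range(1, len(ys)):
        dy_cr = ys[k] - y_ref       # current frame vs. reference
        dy_cp = ys[k] - ys[k - 1]   # current frame vs. previous frame
        if not trend:
            trend = dy_cr > t_j
            continue
        if dy_cp < 0:
            falling = True
        # End flag: the user has risen, fallen, and dropped back below t_j.
        if falling and dy_cr < t_j:
            return k
    return None
```

Because the reference coordinates are not replaced after a jump, a caller would keep `ys[0]` as the reference for the next interval.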
In the embodiment of the present application, after step 103, the method further includes:
obtaining, based on the first image, a first area of a first body part image of a first body part of the first user at the first moment;
obtaining, based on the second image, a second area of the first body part image at the second moment.
In a specific implementation, a body part that does not change while the user moves, for example the face, is selected as the first body part. After the first image and the second image are preprocessed, a first processed image is obtained, and the first area and the second area are obtained based on the number of pixels in the foreground image of the processed image: the more pixels the image contains, the larger the area, and the fewer pixels, the smaller the area.
In the embodiment of the present application, identifying and determining, based at least on the first centroid coordinate position and the second centroid coordinate position, the operation action of the first user between the first moment and the second moment is specifically:
identifying and determining the operation action of the first user between the first moment and the second moment based on the first centroid coordinate position, the second centroid coordinate position, the first area, and the second area.
In a specific implementation, after the operation action of the first user between the first moment and the second moment has been determined to be a jump action based on the first centroid coordinate position and the second centroid coordinate position, it is determined whether the first area is greater than the second area. If the first area is greater than the second area, the first user has moved away from the camera, and the operation action of the first user is determined to be a backward jump. If the first area is less than the second area, the first user has moved closer to the camera, and the operation action of the first user is determined to be a forward jump.
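The area comparison described above amounts to counting pixels and comparing two counts. A minimal Python sketch follows; the names `region_area` and `jump_direction`, the representation of a body part image as a nested list of 0/255 values, and the label returned when the two areas are equal are assumptions made for illustration only.

```python
from typing import List


def region_area(mask: List[List[int]]) -> int:
    """Area of a body part image, measured as its number of non-zero pixels."""
    return sum(1 for row in mask for value in row if value)


def jump_direction(first_area: int, second_area: int) -> str:
    """Classify a confirmed jump from the body part areas at the two moments."""
    if first_area > second_area:
        return "backward"   # the body part shrank: the user moved away
    if first_area < second_area:
        return "forward"    # the body part grew: the user moved closer
    return "in place"       # equal areas: case not covered by the source


face_first = [[0, 255, 255],
              [0, 255, 255]]
face_second = [[255, 255, 255],
               [255, 255, 255]]
direction = jump_direction(region_area(face_first), region_area(face_second))
```

In practice a tolerance around equality would probably be needed, since pixel counts of the same body part fluctuate from frame to frame.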
As described above, the first area and the second area of the first body part make it possible to determine whether the user moves forward or backward, so actions can be recognized more precisely, the recognition method is enriched, and more actions can be recognized.
In addition, combining the change in the centroid coordinate position with the area of the body part image makes it possible to determine other operation actions of the user.
In the embodiment of the present application, after step 104, the method further includes:
matching the operation action against the standard action models in an action model library to obtain a first matching result;
when the first matching result indicates a successful match, displaying the standard action model corresponding to the operation action.
In a specific implementation, as shown in FIG. 3(a) to (k), the action model library includes basic models corresponding to actions such as squatting, jumping, moving left, moving right, and no action. The models are built on the following basis: J. H. Yoo et al. established a line-drawing model of the human body from knowledge of human anatomy; assuming that the total height of the human body is H, the relative length of each part of the body can be obtained, as shown in Table 1 below:
[Table 1: relative lengths of the parts of the human body as fractions of the total height H; provided as images imgf000010_0002 and imgf000010_0001 in the original publication]
Furthermore, based on the user's operation action and the proportions of the parts of the human body, the coordinates of 17 joint points of the human body can be obtained, which yields the user's actual skeleton model during the action. The skeleton model can be displayed on the television, and its data can also be passed to an upper-layer application for game development, so that the skeleton model controls a character on the game screen in a somatosensory game and gives the player a sense of immersion.
In addition, based on the coordinate position of each pixel in the foreground image, the ordinate of the highest point $y_{top}$, the ordinate of the lowest point $y_{bottom}$, the abscissa of the leftmost point $x_{left}$, and the abscissa of the rightmost point $x_{right}$ of the foreground image can also be obtained. When the user is standing, the various operation actions in the standing state can then be determined from $W' = x_{right} - x_{left}$, $H' = y_{top} - y_{bottom}$, the centroid, and the standard values W and H. For example, if $H' - H > T_h$, the user is raising a hand. Each point of the foreground image in this state is then examined to find, for each abscissa value, the highest ordinate value, which yields an array. If the curve formed by the array has two peaks, the action is determined to be standing with both hands raised; if the curve has a single peak, the action is determined to be standing with one hand raised. In the latter case, the sign of $x_{top} - x_0$, the offset of the peak abscissa from the centroid abscissa, determines whether the action is standing with the left hand raised or standing with the right hand raised. If $H' - H < T_h$ and $|W' - W| > T_w$, the user is extending a hand or foot horizontally: if $|x_{right}' - x_0'| - |x_{left}' - x_0'| < -T_l$, the action is standing with the right hand or right foot extended; if $|x_{right}' - x_0'| - |x_{left}' - x_0'| > T_l$, the action is standing with the left hand or left foot extended; otherwise the action is standing with both hands extended.
Based on the same concept, an embodiment of the present invention provides a television including a camera. Referring to FIG. 4, the television includes:
a background obtaining submodule, configured to obtain, through the camera, a first background image of the first area when the first user is not in the first area;
an image obtaining module 401, configured to obtain, through the camera, a first image of a first area including a first user at a first moment, and to obtain, through the camera, a second image of the first area including the first user at a second moment after the first moment;
a first obtaining module 402, configured to obtain, based on the first image, a first centroid coordinate position of the first user at the first moment, and to obtain, based on the second image, a second centroid coordinate position of the first user at the second moment.
In a specific implementation, before the user enters the image capture area, the camera first captures a first background image containing only the background area. Then, at a first moment after the user has entered the image capture area and at a second moment after the first moment, the camera captures a first image and a second image that contain both the user and the background area. Every image captured by the camera must be preprocessed for color restoration, which removes the effect that uneven illumination caused by lighting changes has on the image's true colors and thereby restores those colors. Color is usually corrected with the white balance method. Because the RGB components of white light are equal (R=G=B=255), white light is corrected first and light of other colors is corrected along with it. Likewise, because the RGB components of gray light are also equal, white balance can also be achieved by correcting gray light. Specifically, the white balance method uses the total reflection theory: the brightest point in the image is assumed to be a white point, and this white point serves as the reference for automatic white balancing of the image. In practical engineering applications, the brightest point is defined as the point at which R+G+B is maximal in the image. Then, from the three color component values of the brightest point and Formula 1:
$$R_A = \frac{R'_{max}}{R_{max}} R_B, \quad G_A = \frac{G'_{max}}{G_{max}} G_B, \quad B_A = \frac{B'_{max}}{B_{max}} B_B$$
the color component values of each pixel in the color-corrected image can be obtained, which yields the color-restored image. Here $R_{max}$, $G_{max}$, $B_{max}$ are the three color component values of the brightest point in the original image; $R'_{max}$, $G'_{max}$, $B'_{max}$ are the three color component values of the brightest point after white balance (usually 255 or slightly less); $R_B$, $G_B$, $B_B$ are the three color component values of each pixel in the original image; and $R_A$, $G_A$, $B_A$ are the three color component values of each pixel after white balance.
In the embodiment of the present application, the first obtaining module specifically includes:
the background obtaining submodule;
a color restoration submodule, configured to perform color correction on the first image to obtain a first color-restored image, and to perform color correction on the first background image to obtain a second color-restored image;
an image processing submodule, configured to obtain a first processed image based on the first color-restored image and the second color-restored image, wherein the first processed image includes a foreground image composed of pixels having a first color value and a second background image composed of pixels having a second color value different from the first color value;
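The white balance step performed by the color restoration submodule can be illustrated with a short Python sketch of the brightest-point approach: the pixel with the largest R+G+B is treated as white, and every channel is scaled so that this pixel maps to the target value. The function name `white_balance`, the flat list-of-tuples image representation, the clamping to 255, and the handling of a zero channel are assumptions of this sketch rather than details taken from the application.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each in 0..255


def white_balance(pixels: List[Pixel], target: int = 255) -> List[Pixel]:
    """Scale each channel so the brightest pixel (max R+G+B) becomes white."""
    r_max, g_max, b_max = max(pixels, key=sum)

    def scale(value: int, channel_max: int) -> int:
        if channel_max == 0:
            return value  # channel absent at the white point: leave unchanged
        # Clamp, because another pixel may exceed the white point in one channel.
        return min(255, round(value * target / channel_max))

    return [(scale(r, r_max), scale(g, g_max), scale(b, b_max))
            for r, g, b in pixels]


# A scene with a color cast: the brightest point is (200, 250, 125).
balanced = white_balance([(200, 250, 125), (80, 100, 50), (0, 0, 0)])
```

In this example the pixel (80, 100, 50), which carries the same cast as the white point, comes out as a neutral gray, which is the intended effect of the correction.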
In a specific implementation, the color correction preprocessing described above is applied to the first image and the first background image, yielding the first color-restored image wb_Pre_rgb and the second color-restored image wb_Bg_rgb respectively. Using these two images and Formula 2:
$$p(x, y) = \begin{cases} 255, & |wb\_Pre_i(x, y) - wb\_Bg_i(x, y)| > T \text{ for any } i \in \{r, g, b\} \\ 0, & \text{else} \end{cases}$$
the binarized first processed image can be obtained, where $p(x, y)$ is the value of a pixel of the first processed image and i = r, g, b denotes the three channels r, g, and b. Formula 2 states that wherever the difference between the first color-restored image and the second color-restored image exceeds the threshold T in any one channel, that part belongs to the foreground image and is set to white; the rest is the second background image and is set to black. The result is a first processed image containing only two color values. In the implementation of the present application, after the second image or any other image is obtained, it must undergo the same binarization so that the foreground and background parts of the image are distinguished.
In the embodiment of the present application, the first obtaining module further includes:
a judgment submodule, configured to determine whether a shadow image exists in the foreground image;
a shadow removal submodule, configured to remove the shadow image when the shadow image exists in the foreground image, to obtain a shadow-free first processed image;
In a specific implementation, the foreground image obtained may contain a shadow image caused by the user's shadow, as shown in FIG. 2(a). The shadow image is a continuous region that contains relatively few white points and is separated from the image of the user. If the shadow image exists in the foreground image, it is removed to obtain the shadow-free first processed image, as shown in FIG. 2(b).
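The binarization that produces the foreground image, followed by a simple shadow removal, can be sketched as below. This is a hedged illustration: the use of an absolute per-channel difference and, above all, the heuristic of keeping only the largest 4-connected white region are assumptions of the sketch; the application only states that the shadow is a separate region with relatively few white points. The names `binarize` and `keep_largest_component` are hypothetical.

```python
from collections import deque
from typing import List, Tuple

Pixel = Tuple[int, int, int]


def binarize(image: List[List[Pixel]], background: List[List[Pixel]],
             t: int) -> List[List[int]]:
    """255 where any channel differs from the background by more than t, else 0."""
    return [[255 if any(abs(a - b) > t for a, b in zip(p, q)) else 0
             for p, q in zip(img_row, bg_row)]
            for img_row, bg_row in zip(image, background)]


def keep_largest_component(mask: List[List[int]]) -> List[List[int]]:
    """Keep the largest 4-connected white region; drop smaller ones (assumed shadow)."""
    h, w = len(mask), len(mask[0])
    seen = [[False] * w for _ in range(h)]
    best: List[Tuple[int, int]] = []
    for sy in range(h):
        for sx in range(w):
            if not mask[sy][sx] or seen[sy][sx]:
                continue
            component, queue = [], deque([(sy, sx)])
            seen[sy][sx] = True
            while queue:  # breadth-first flood fill of one region
                y, x = queue.popleft()
                component.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if 0 <= ny < h and 0 <= nx < w and mask[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            if len(component) > len(best):
                best = component
    out = [[0] * w for _ in range(h)]
    for y, x in best:
        out[y][x] = 255
    return out


bg = [[(10, 10, 10)] * 4 for _ in range(3)]
img = [[(200, 10, 10), (200, 10, 10), (10, 10, 10), (10, 10, 60)],
       [(200, 10, 10), (10, 10, 10), (10, 10, 10), (10, 10, 10)],
       [(10, 10, 10)] * 4]
mask = binarize(img, bg, 30)
clean = keep_largest_component(mask)
```

A shadow that touches the user's silhouette would survive this heuristic, so a real system would likely need a color- or brightness-based shadow test in addition.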
In the embodiment of the present application, the first obtaining module further includes:
a creating submodule, configured to create, in the first processed image, a first coordinate system composed of an X axis and a Y axis;
a first obtaining submodule, configured to obtain, based on the first coordinate system, the abscissa value and the ordinate value of each pixel that makes up the foreground image;
a second obtaining submodule, configured to obtain, based on the abscissa value and the ordinate value of each pixel, the first abscissa value and the first ordinate value of the first centroid of the first user at the first moment, and thereby the first centroid coordinate position.
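The centroid obtained by the second obtaining submodule is the mean of the foreground pixel coordinates. A minimal Python sketch is given below; the function name `centroid`, the nested-list mask, and the choice of the bottom-left pixel as the origin (so that the Y axis points upward) are assumptions made for the example.

```python
from typing import List, Optional, Tuple


def centroid(mask: List[List[int]]) -> Optional[Tuple[float, float]]:
    """Mean (x, y) of all non-zero pixels, with Y measured upward from the bottom row."""
    height = len(mask)
    sum_x = sum_y = count = 0
    for row_index, row in enumerate(mask):
        y = height - 1 - row_index  # flip so that larger y means higher up
        for x, value in enumerate(row):
            if value:
                sum_x += x
                sum_y += y
                count += 1
    if count == 0:
        return None  # no foreground pixels: the centroid is undefined
    return sum_x / count, sum_y / count


point = centroid([[0, 255, 255],
                  [0, 255, 255],
                  [0, 0, 0]])
```

Flipping the row index matters for the later steps: with Y pointing upward, a jump increases the centroid ordinate and a squat decreases it.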
In a specific implementation, after the shadow-free first processed image is obtained, a first coordinate system composed of an X axis and a Y axis is established in the first processed image. In the embodiment of the present application, the positive direction of the X axis is to the right and the positive direction of the Y axis is upward. Based on this coordinate system, the abscissa and ordinate values of each pixel in the first processed image can be obtained. Then, with Formula 3:
$$x_0 = \frac{1}{N} \sum_{i=1}^{N} x_i, \quad y_0 = \frac{1}{N} \sum_{i=1}^{N} y_i$$
the first abscissa value $x_0$ and the first ordinate value $y_0$ of the first centroid can be obtained, where $x_i$ and $y_i$ are the abscissa and ordinate values of each pixel of the foreground image in the first processed image, and N is the total number of pixels in the foreground image. In the implementation of the present application, the centroid coordinates ($x_0$, $y_0$) of the foreground image of each frame can be obtained with this method.
In the embodiment of the present application, the television further includes:
an identification module 403, configured to identify and determine, based at least on the first centroid coordinate position and the second centroid coordinate position, the operation action of the first user between the first moment and the second moment.
In the embodiment of the present application, the identification module 403 specifically includes:
a first difference obtaining submodule, configured to obtain a first difference by subtracting the first abscissa value of the first centroid from the second abscissa value of the second centroid;
a first judgment submodule, configured to determine whether the first difference is greater than a first threshold;
a first determining submodule, configured to determine, when the first difference is greater than the first threshold, that the operation action of the first user between the first moment and the second moment is a right-move action;
a second judgment submodule, configured to determine, when the first difference is not greater than the first threshold, whether the first difference is less than a second threshold;
a second determining submodule, configured to determine, when the first difference is less than the second threshold, that the operation action of the first user between the first moment and the second moment is a left-move action;
a second difference obtaining submodule, configured to obtain a second difference by subtracting the first ordinate value of the first centroid from the second ordinate value of the second centroid;
a third judgment submodule, configured to determine whether the second difference is greater than a third threshold;
a third determining submodule, configured to determine, when the second difference is greater than the third threshold, that the operation action of the first user between the first moment and the second moment is a jump action;
a fourth judgment submodule, configured to determine, when the second difference is not greater than the third threshold, whether the second difference is less than a fourth threshold;
a fourth determining submodule, configured to determine, when the second difference is less than the fourth threshold, that the operation action of the first user between the first moment and the second moment is a squat action.
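The chain of difference, judgment, and determining submodules listed above reduces to two threshold comparisons per axis. The following Python sketch illustrates that reading; the function name `classify_action`, the returned labels, and the decision to evaluate the horizontal and vertical axes independently are assumptions of the example rather than requirements stated in the application.

```python
from typing import Tuple

Point = Tuple[float, float]  # (abscissa, ordinate) of a centroid


def classify_action(first: Point, second: Point,
                    t1: float, t2: float, t3: float, t4: float) -> Tuple[str, str]:
    """Return (horizontal, vertical) labels from two centroid positions.

    t1/t2 are the first/second thresholds on the abscissa difference,
    t3/t4 the third/fourth thresholds on the ordinate difference.
    """
    first_difference = second[0] - first[0]
    second_difference = second[1] - first[1]

    if first_difference > t1:
        horizontal = "right"
    elif first_difference < t2:
        horizontal = "left"
    else:
        horizontal = "none"

    if second_difference > t3:
        vertical = "jump"
    elif second_difference < t4:
        vertical = "squat"
    else:
        vertical = "none"
    return horizontal, vertical
```

For example, `classify_action((100, 50), (120, 52), 10, -10, 15, -15)` yields a right move with no vertical action, since only the abscissa difference of 20 exceeds its threshold.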
In the embodiment of the present application, the television further includes:
a second obtaining module, configured to take i from 3 to N in turn and obtain the i-th abscissa value and the i-th ordinate value of the i-th centroid of the first user at the i-th moment after the second moment, where N is an integer greater than or equal to 4.
In a specific implementation, the camera acquires 15 to 25 frames per second. The sampling rate of a general-purpose standard camera is 25 frames/second, and the sampling rate of an industrial camera can reach 60 frames/second, 200 frames/second, or even higher; a television, however, plays PAL files at 25 frames/second and NTSC files at 30 frames/second. When the first frame is obtained, initialization is performed first: the centroid coordinate position of the foreground image in the first frame is assigned simultaneously to three preset parameters, namely the reference frame centroid coordinates reference_frame ($r\_x_0$, $r\_y_0$), the previous frame centroid coordinates previous_frame ($p\_x_0$, $p\_y_0$), and the current frame centroid coordinates current_frame ($c\_x_0$, $c\_y_0$). At different moments, the three preset parameters change according to the user's actions. From the three parameters, four differences can be obtained: the difference between the current frame centroid ordinate and the reference centroid ordinate, $dy_{cr} = c\_y_0 - r\_y_0$; the difference between the current frame centroid abscissa and the reference centroid abscissa, $dx_{cr} = c\_x_0 - r\_x_0$; the difference between the current frame centroid ordinate and the previous frame centroid ordinate, $dy_{cp} = c\_y_0 - p\_y_0$; and the difference between the current frame centroid abscissa and the previous frame centroid abscissa, $dx_{cp} = c\_x_0 - p\_x_0$.
In this embodiment of the application, the first determining submodule specifically includes:
a first determining unit, configured to determine, when the first difference is greater than the first threshold, that the action of the first user between the first moment and the second moment has a right-move trend;
a first judging unit, configured to judge, based on the second abscissa value and at least one i-th abscissa value, whether a right-move end flag exists for the action;
a second determining unit, configured to determine, when the right-move end flag exists, that the action is a right-move action; or
the second determining submodule specifically includes:
a third determining unit, configured to determine, when the first difference is less than the second threshold, that the action of the first user between the first moment and the second moment has a left-move trend;
a second judging unit, configured to judge, based on the second abscissa value and at least one i-th abscissa value, whether a left-move end flag exists for the action;
a fourth determining unit, configured to determine, when the left-move end flag exists, that the action is a left-move action.
In a specific implementation, once the first centroid coordinates at the first moment have been assigned to the reference centroid coordinates, the first and second centroid coordinate positions are compared: if $dx_{cr} > T_{lr}$, the action of the first user is determined to have a right-move trend; if $dx_{cr} < -T_{lr}$, it is determined to have a left-move trend. When the action has a right-move trend, it is judged whether a right-move end flag exists; if it does, the action of the first user is a right-move action. This application provides only two implementations of the right-move end flag; a person of ordinary skill in the art may also use other criteria as the flag that a right move has ended.
Mode 1: when the action has a right-move trend, that is, $dx_{cr} > T_{lr}$, and the sign of $dx_{cp}$ changes from positive to negative, judge whether $|dx_{cp}|$ is greater than the threshold $T_n$. If $|dx_{cp}|$ is not greater than $T_n$, the right move has not ended, and the sign change of $dx_{cp}$ is merely interference caused by the user swaying left and right. If $|dx_{cp}|$ is greater than $T_n$, a right-move end flag exists, and the action of the first user between the first moment and the second moment is determined to be a right-move action.
Mode 2: when the action has a right-move trend, that is, $dx_{cr} > T_{lr}$, and the sign of $dx_{cp}$ remains positive, judge whether $|dx_{cp}|$ is less than 2 in every frame of a preset number of consecutive frames, for example 5 or 6 consecutive frames. If $|dx_{cp}|$ is less than 2 in all of them, a right-move end flag exists for that stretch of movement, and the action of the first user between the first moment and the second moment is determined to be a right-move action.
In addition, after the action between the first moment and the second moment has been determined to be a right-move action, the centroid coordinates of the current frame are assigned to the reference-frame centroid coordinates, and the action in the next time interval is then determined.
In a specific implementation, the process of judging a left-move action is exactly the mirror image of the process for a right move; a person of ordinary skill in the art can derive it from the right-move judgment process, so it is not repeated here.
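The right-move judgment described above (trend test followed by either of the two end flags) can be sketched as follows. This is an illustrative reading, not the application's implementation: the function name `detect_right_move`, its parameters, and the threshold values used in the examples are hypothetical, and treating a zero `dx_cp` as "still positive" in Mode 2 is an assumption.

```python
def detect_right_move(xs, t_lr, t_n, still_frames=5, still_eps=2.0):
    """Return the frame index at which a right move is confirmed, or None.

    xs[0] is taken as the reference centroid abscissa; the X axis is assumed
    to be positive to the right.
    """
    ref = xs[0]
    prev_dx_cp = 0.0
    still_run = 0
    for i in range(1, len(xs)):
        dx_cr = xs[i] - ref          # current frame vs. reference frame
        dx_cp = xs[i] - xs[i - 1]    # current frame vs. previous frame
        if dx_cr > t_lr:             # right-move trend
            # Mode 1: dx_cp flips from positive to negative by more than t_n;
            # smaller flips are treated as left/right swaying and ignored.
            if prev_dx_cp > 0 and dx_cp < 0 and abs(dx_cp) > t_n:
                return i
            # Mode 2: dx_cp stays non-negative and below still_eps for
            # still_frames consecutive frames.
            if 0 <= dx_cp < still_eps:
                still_run += 1
                if still_run >= still_frames:
                    return i
            else:
                still_run = 0
        else:
            still_run = 0
        prev_dx_cp = dx_cp
    return None


# Mode 1: a clear reversal after moving right.
reversal = detect_right_move([0, 5, 12, 20, 28, 24], t_lr=10, t_n=3)
# A small backward wobble (|dx_cp| = 1) is ignored; the later reversal counts.
wobble = detect_right_move([0, 5, 12, 20, 19, 27, 22], t_lr=10, t_n=3)
# Mode 2: the user moves right and then nearly stops for five frames.
settled = detect_right_move([0, 6, 14, 15, 16, 17, 18, 19], t_lr=10, t_n=3)
# No trend at all.
nothing = detect_right_move([0, 1, 2, 1, 0], t_lr=10, t_n=3)
```

Once a frame index is returned, the caller would assign that frame's centroid to the reference centroid before judging the next interval, as the text states. A left-move detector would mirror the comparisons.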
In this embodiment of the application, the third determining submodule specifically includes:
a fifth determining unit, configured to determine, when the second difference is greater than the third threshold, that the action of the first user between the first moment and the second moment has a jump trend;
a third judging unit, configured to judge, based on the second ordinate value and at least one i-th ordinate value, whether a jump end flag exists for the action;
a sixth determining unit, configured to determine, when the jump end flag exists, that the action is a jump action; or
the fourth determining submodule specifically includes:
a seventh determining unit, configured to determine, when the second difference is less than the fourth threshold, that the action of the first user between the first moment and the second moment has a squat trend;
a fourth judging unit, configured to judge, based on the second ordinate value and at least one i-th ordinate value, whether a squat end flag exists for the action;
an eighth determining unit, configured to determine, when the squat end flag exists, that the action is a squat action.
In a specific implementation, once the first centroid coordinate position at the first moment has been assigned to the reference centroid coordinate position, the first and second centroid coordinate positions are compared: if $dy_{cr} > T_j$, the action of the first user is determined to have a jump trend; if $dy_{cr} < -T_s$, it is determined to have a squat trend. When the action has a jump trend, it is judged whether a jump end flag exists; if it does, the action of the first user is a jump action. The jump end flag is judged as follows: within a certain period of time, the sign of $dy_{cp}$ is positive during the first half and negative during the second half, and finally $dy_{cr} < T_j$. If this is the case, a jump end flag exists, and the action of the first user between the first moment and the second moment is determined to be a jump action.
In addition, after the action between the first moment and the second moment has been determined to be a jump action, the reference coordinates are not replaced.
In a specific implementation, the process of judging a squat action is exactly the mirror image of the process for a jump; a person of ordinary skill in the art can derive it from the jump judgment process, so it is not repeated here.
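The jump judgment can be sketched in the same style. This is one possible reading, stated as an assumption: "positive in the first half, negative in the second half" is interpreted as a rising phase followed by a falling phase, and the jump is confirmed at the first frame where the centroid has fallen back so that `dy_cr < t_j`. The function name `detect_jump` and the threshold value are hypothetical; the Y axis is assumed to be positive upward.

```python
def detect_jump(ys, t_j):
    """Return the frame index at which a jump is confirmed, or None.

    ys[0] is taken as the reference centroid ordinate.
    """
    ref = ys[0]
    trend = False       # dy_cr has exceeded t_j at some point
    seen_rise = False   # a frame with dy_cp > 0 has been observed
    seen_fall = False   # a frame with dy_cp < 0 followed the rise
    for i in range(1, len(ys)):
        dy_cr = ys[i] - ref
        dy_cp = ys[i] - ys[i - 1]
        if dy_cp > 0 and not seen_fall:
            seen_rise = True
        elif dy_cp < 0 and seen_rise:
            seen_fall = True
        if dy_cr > t_j:
            trend = True
        # End flag: rise, then fall, and the centroid is back below t_j.
        if trend and seen_rise and seen_fall and dy_cr < t_j:
            return i
    return None


jump = detect_jump([0, 4, 12, 18, 14, 6, 1], t_j=10)   # up and back down
small = detect_jump([0, 2, 3, 2, 0], t_j=10)            # never exceeds t_j
stays_up = detect_jump([0, 5, 12, 12, 12], t_j=10)      # rises, never falls
```

Because the reference coordinates are not replaced after a jump, the caller would keep `ys[0]` as the reference for the following interval. A squat detector would mirror the signs and use the squat threshold instead.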
In this embodiment of the application, the television further includes:
an area obtaining module, configured to obtain, based on the first image, a first area of a first body-part image of a first body part of the first user at the first moment, and to obtain, based on the second image, a second area of the first body-part image at the second moment.
In a specific implementation, a body part that does not change while the user moves, for example the face, is selected as the first body part. After the first image and the second image have been preprocessed, a first processed image is obtained, and the first area and the second area are obtained from the number of pixels in the foreground image of the processed image: the more pixels the image contains, the larger the area, and conversely, the fewer pixels, the smaller the area.
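Measuring an area by counting foreground pixels can be sketched as below. The application does not say how the body-part region (for example the face) is located, so the `region` bounding box is a hypothetical input; the function name `foreground_area` and the toy masks are likewise assumptions for illustration.

```python
def foreground_area(mask, region=None):
    """Count foreground pixels in a binary mask (rows of 0/1 values).

    `region`, if given, is a hypothetical bounding box
    (top, left, bottom, right) with exclusive bottom/right bounds.
    """
    if region is None:
        rows = mask
    else:
        top, left, bottom, right = region
        rows = [row[left:right] for row in mask[top:bottom]]
    return sum(1 for row in rows for value in row if value)


mask_t1 = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [1, 1, 1, 1],
]
mask_t2 = [
    [1, 1, 1, 1],
    [1, 1, 1, 1],
    [0, 1, 1, 0],
    [1, 1, 1, 1],
]
face_box = (0, 0, 2, 4)                       # top two rows, made-up region
first_area = foreground_area(mask_t1, face_box)
second_area = foreground_area(mask_t2, face_box)
```

Here the body part covers more pixels in the second mask than in the first, which is the kind of change the area comparison relies on.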
In this embodiment of the application, the identification module is specifically configured to:
identify, based on the first centroid coordinate position, the second centroid coordinate position, the first area, and the second area, the action of the first user between the first moment and the second moment.
In a specific implementation, after the action of the first user between the first moment and the second moment has been determined to be a jump action based on the first and second centroid coordinate positions, it is judged whether the first area is greater than the second area. If the first area is greater than the second area, the first user has moved away from the camera, and the action is determined to be a backward jump. If the first area is smaller than the second area, the first user has moved closer to the camera, and the action is determined to be a forward jump.
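The refinement of a detected jump by the area comparison amounts to a two-way test, sketched below. The function name `classify_jump_direction` and the returned labels are hypothetical, and the text does not define the case of equal areas, so returning a plain "jump" there is an assumption.

```python
def classify_jump_direction(first_area, second_area):
    """Refine an already detected jump using the body-part areas."""
    if first_area > second_area:
        # The body part shrank in the image: the user moved away from the camera.
        return "backward jump"
    if first_area < second_area:
        # The body part grew in the image: the user moved toward the camera.
        return "forward jump"
    return "jump"  # equal areas: not covered by the text, assumed unchanged


away = classify_jump_direction(120, 90)
toward = classify_jump_direction(90, 120)
same = classify_jump_direction(100, 100)
```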
In addition, other user actions can also be determined by combining the change in the centroid coordinate position with the area of the body-part image.
In this embodiment of the application, the television further includes:
a matching module, configured to match the action against the standard action models in an action model library to obtain a first matching result;
a display module, configured to display, when the first matching result indicates a successful match, the standard action model corresponding to the action.
In a specific implementation, as shown in Fig. 3(a) to (k), the action model library includes basic models corresponding to actions such as squatting, jumping, moving left, moving right, and no action. The models are built on the following basis: J. H. Yoo et al. established a line-drawing model of the human body from knowledge of human anatomy; assuming that the total height of the body is H, the relative length of each part of the body can be obtained, as shown in Table 1 below:
Table 1 (provided as image imgf000017_0001 in the original filing; the relative lengths of the body parts in terms of H are not reproduced in the text)
Furthermore, based on the user's action and the proportions of the parts of the human body, the coordinates of 17 joint points of the body can be obtained, which gives the user's actual skeleton model during the action. The skeleton model can be displayed on the television, and its data can also be passed to an upper-layer application for game development, so that the skeleton model can be used to control a character on the game screen in a motion-sensing game, giving the player a feeling of immersion in the game.
In addition, from the coordinates of every pixel in the foreground image, the ordinate of the highest point $y_{top}$, the ordinate of the lowest point $y_{bottom}$, the abscissa of the leftmost point $x_{left}$, and the abscissa of the rightmost point $x_{right}$ of the foreground image can be obtained. When the user is standing, the user's actions in the standing state can then be judged from $W_i = x_{right} - x_{left}$, $H_i = y_{top} - y_{bottom}$, the centroid $(x_0, y_0)$, and the standard values $W$ and $H$. For example, if $H_i - H > T_h$, the user is raising a hand. Each point of the foreground image in this state is then tallied to find the highest ordinate for every abscissa value, which yields an array. If the profile formed by this array has two peaks, the action is standing with both hands raised; if it has a single peak, the action is standing with one hand raised, and the sign of $x_{top} - x_0$, where $x_{top}$ is the abscissa of the peak, distinguishes standing with the left hand raised from standing with the right hand raised. If $H_i - H < T_h$ and $|W_i - W| > T_w$, a hand or foot is extended horizontally: if $|x_{right} - x_0| - |x_{left} - x_0| < -T_l$, the action is standing with the right hand or right foot extended; if $|x_{right} - x_0| - |x_{left} - x_0| > T_l$, it is standing with the left hand or left foot extended; otherwise it is standing with both hands extended.
The one or more technical solutions provided in the embodiments of this application have at least the following technical effects or advantages:
1. In the embodiments of the invention, a first image of the first user is obtained at a first moment and a second image of the first user is obtained at a second moment after the first moment; a first centroid coordinate position and a second centroid coordinate position are obtained from the first image and the second image respectively; and the action of the first user between the first moment and the second moment is identified based at least on the first and second centroid coordinate positions. This solves the technical problem in the prior art that the recognition process is complicated because of high computational complexity and a large amount of computation, and achieves the technical effect of recognizing actions with a simple algorithm and a small amount of computation. For example, the prior art must take the person feature data, size, and predetermined display position of a preset initial image as reference data, recognize the currently acquired person image and extract its person feature data, scale it against the reference data, crop out the part of the person image that matches the reference data, and display it at the predetermined position. The present invention only needs to locate the centroid in the captured person image and perform simple difference operations on the centroid coordinates; the person's action can then be recognized from the change in centroid position, so the algorithm is simple and the amount of computation is small;
2. Because actions can be recognized using only the data processing capability and the 2D camera that existing high-end televisions already provide, hardware cost is not increased and the functions of the television are enriched.
Obviously, those skilled in the art can make various changes and variations to the invention without departing from its spirit and scope. Thus, if these modifications and variations fall within the scope of the claims of the invention and their technical equivalents, the invention is also intended to include them.
Those skilled in the art should understand that embodiments of the invention may be provided as a method, a system, or a computer program product. Accordingly, the invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Moreover, the invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage, CD-ROM, and optical storage) containing computer-usable program code.
The invention is described with reference to flowcharts and/or block diagrams of methods, devices (systems), and computer program products according to embodiments of the invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means, the instruction means implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, such that a series of operational steps is performed on the computer or other programmable device to produce computer-implemented processing, and the instructions executed on the computer or other programmable device thereby provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
Although preferred embodiments of the invention have been described, those skilled in the art can make additional changes and modifications to these embodiments once they learn of the basic inventive concept. Therefore, the appended claims are intended to be construed as including the preferred embodiments and all changes and modifications that fall within the scope of the invention.

Claims

1. An action recognition method, applied to an electronic device that has a video playback function and includes a camera, characterized in that the method comprises:
at a first moment, obtaining through the camera a first image of a first region that includes a first user;
at a second moment after the first moment, obtaining through the camera a second image of the first region that includes the first user;
obtaining, based on the first image, a first centroid coordinate position of the first user at the first moment; obtaining, based on the second image, a second centroid coordinate position of the first user at the second moment;
identifying, based at least on the first centroid coordinate position and the second centroid coordinate position, the action of the first user between the first moment and the second moment.
2. The method of claim 1, characterized in that obtaining the first centroid coordinate position of the first user at the first moment specifically comprises:
when the first user is not in the first region, obtaining a first background image of the first region through the camera;
obtaining a first processed image based on the first image and the first background image, wherein the first processed image comprises a foreground image composed of pixels having a first color value and a second background image composed of pixels having a second color value different from the first color value.
3. The method of claim 2, characterized in that, after the first processed image is obtained, the method further comprises:
creating, in the first processed image, a first coordinate system formed by an X axis and a Y axis;
obtaining, based on the first coordinate system, the abscissa value and the ordinate value of each pixel composing the foreground image;
obtaining, based on the abscissa value and the ordinate value of each pixel, a first abscissa value and a first ordinate value of a first centroid of the first user at the first moment, thereby obtaining the first centroid coordinate position.
4. The method of claim 3, characterized in that, when the rightward direction of the X axis of the coordinate system is positive and the upward direction of the Y axis is positive, identifying, based at least on the first centroid coordinate position and the second centroid coordinate position, the action of the first user between the first moment and the second moment specifically comprises:
obtaining a first difference by subtracting the first abscissa value of the first centroid from the second abscissa value of the second centroid;
judging whether the first difference is greater than a first threshold;
when the first difference is greater than the first threshold, determining that the action of the first user between the first moment and the second moment is a right-move action;
when the first difference is not greater than the first threshold, judging whether the first difference is less than a second threshold;
when the first difference is less than the second threshold, determining that the action of the first user between the first moment and the second moment is a left-move action.
5. The method of claim 3, characterized in that, when the rightward direction of the X axis of the coordinate system is positive and the upward direction of the Y axis is positive, identifying, based at least on the first centroid coordinate position and the second centroid coordinate position, the action of the first user between the first moment and the second moment specifically comprises:
obtaining a second difference by subtracting the first ordinate value of the first centroid from the second ordinate value of the second centroid;
judging whether the second difference is greater than a third threshold;
when the second difference is greater than the third threshold, determining that the action of the first user between the first moment and the second moment is a jump action;
when the second difference is not greater than the third threshold, judging whether the second difference is less than a fourth threshold;
when the second difference is less than the fourth threshold, determining that the action of the first user between the first moment and the second moment is a squat action.
6. The method of claim 3, characterized in that, after obtaining, based on the first image, the first centroid coordinate position of the first user at the first moment and obtaining, based on the second image, the second centroid coordinate position of the first user at the second moment, the method further comprises:
obtaining, based on the first image, a first area of a first body-part image of a first body part of the first user at the first moment;
obtaining, based on the second image, a second area of the first body-part image at the second moment;
judging whether the first area is greater than the second area;
when the first area is greater than the second area, determining that the action of the first user between the first moment and the second moment is a backward-move action;
when the first area is smaller than the second area, determining that the action of the first user between the first moment and the second moment is a forward-move action.
7. The method of claim 6, characterized in that identifying, based at least on the first centroid coordinate position and the second centroid coordinate position, the action of the first user between the first moment and the second moment is specifically:
identifying, based on the first centroid coordinate position, the second centroid coordinate position, the first area, and the second area, the action of the first user between the first moment and the second moment.
8. A television, comprising a camera, characterized in that the television comprises:
an image obtaining module, configured to obtain through the camera, at a first moment, a first image of a first region that includes a first user, and to obtain through the camera, at a second moment after the first moment, a second image of the first region that includes the first user;
a first obtaining module, configured to obtain, based on the first image, a first centroid coordinate position of the first user at the first moment, and to obtain, based on the second image, a second centroid coordinate position of the first user at the second moment;
an identification module, configured to identify, based at least on the first centroid coordinate position and the second centroid coordinate position, the action of the first user between the first moment and the second moment.
9. The television of claim 8, characterized in that the television further comprises:
an area obtaining module, configured to obtain, based on the first image, a first area of a first body-part image of a first body part of the first user at the first moment, and to obtain, based on the second image, a second area of the first body-part image at the second moment.
10. The television of claim 9, characterized in that the identification module is specifically configured to:
identify, based on the first centroid coordinate position, the second centroid coordinate position, the first area, and the second area, the action of the first user between the first moment and the second moment.
PCT/CN2012/088111 2012-12-31 2012-12-31 Action recognition method and television WO2014101219A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2012/088111 WO2014101219A1 (en) 2012-12-31 2012-12-31 Action recognition method and television

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2012/088111 WO2014101219A1 (en) 2012-12-31 2012-12-31 Action recognition method and television

Publications (1)

Publication Number Publication Date
WO2014101219A1 (en) 2014-07-03

Family

ID=51019808

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/088111 WO2014101219A1 (en) 2012-12-31 2012-12-31 Action recognition method and television

Country Status (1)

Country Link
WO (1) WO2014101219A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113743282A (en) * 2021-08-30 2021-12-03 深圳Tcl新技术有限公司 Content search method, content search device, electronic equipment and computer-readable storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101477627A (en) * 2009-02-12 2009-07-08 北京像素软件科技股份有限公司 Movement recognition method and system
CN101910781A (en) * 2007-12-25 2010-12-08 丰田自动车株式会社 Moving state estimation device
US20110002544A1 (en) * 2009-07-01 2011-01-06 Fujifilm Corporation Image synthesizer and image synthesizing method
CN102221878A (en) * 2010-04-19 2011-10-19 索尼公司 Image processing system, image processing apparatus, image processing method, and program

Similar Documents

Publication Publication Date Title
US8941687B2 (en) System and method of user interaction for augmented reality
EP2642451B1 (en) Apparatus and method of augmented reality interaction
US9055267B2 (en) System and method of input processing for augmented reality
CN105073210B (en) Extracted using the user's body angle of depth image, curvature and average terminal position
US9236032B2 (en) Apparatus and method for providing content experience service
US9064335B2 (en) System, method, device and computer-readable medium recording information processing program for superimposing information
US20110304611A1 (en) Storage medium having stored thereon image processing program, image processing apparatus, image processing system, and image processing method
CN106201173B (en) A kind of interaction control method and system of user's interactive icons based on projection
CN109448131B (en) Kinect-based virtual piano playing system construction method
CN104781762A (en) Information processing device
JP2010137097A (en) Game machine and information storage medium
CN107115675A (en) A kind of physical fitness games system and implementation method based on Kinect
JP3866474B2 (en) GAME DEVICE AND INFORMATION STORAGE MEDIUM
WO2011158599A1 (en) Video game device, video game control program, and video game control method
CN112973110A (en) Cloud game control method and device, network television and computer readable storage medium
CN108553889A (en) Dummy model exchange method and device
WO2014101219A1 (en) Action recognition method and television
US20120133676A1 (en) Storage medium having stored thereon image processing program, image processing apparatus, image processing system, and image processing method
WO2005065798A1 (en) Information processing system, entertainment system, and information processing system input accepting method
CN115268658A (en) Multi-party remote space delineation marking method based on augmented reality
US20130260885A1 (en) Entertainment system and method of providing entertainment
KR101527188B1 (en) Method for Offering Multiple Player Game using Motion Sensor of Single
KR20120092960A (en) System and method for controlling virtual character
US20200184675A1 (en) Positioning Method and Reality Presenting Device
TWI480080B (en) Game doll recognition system, recognition method and game system using the same

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 12890701; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 EP: PCT application non-entry in European phase (Ref document number: 12890701; Country of ref document: EP; Kind code of ref document: A1)