CN112423059B - Gesture-based video control method, television, and computer-readable storage medium - Google Patents

Gesture-based video control method, television, and computer-readable storage medium Download PDF

Info

Publication number
CN112423059B
CN112423059B CN202011073939.2A CN202011073939A CN112423059B CN 112423059 B CN112423059 B CN 112423059B CN 202011073939 A CN202011073939 A CN 202011073939A CN 112423059 B CN112423059 B CN 112423059B
Authority
CN
China
Prior art keywords
video
gesture
television
determining
video frames
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011073939.2A
Other languages
Chinese (zh)
Other versions
CN112423059A (en
Inventor
吴晓红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen TCL New Technology Co Ltd
Original Assignee
Shenzhen TCL New Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen TCL New Technology Co Ltd filed Critical Shenzhen TCL New Technology Co Ltd
Priority to CN202011073939.2A priority Critical patent/CN112423059B/en
Publication of CN112423059A publication Critical patent/CN112423059A/en
Application granted granted Critical
Publication of CN112423059B publication Critical patent/CN112423059B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42204User interfaces specially adapted for controlling a client device through a remote control device; Remote control devices therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/017Gesture based interaction, e.g. based on a set of recognized hand gestures
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47217End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for controlling playback functions for recorded or on-demand content, e.g. using progress bars, mode or play-point indicators or bookmarks

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a gesture-based video control method, which comprises the following steps: when a gesture is detected and an instruction corresponding to the gesture is a progress operation instruction, acquiring a video currently played by the television and receiving times corresponding to the progress operation instruction stored on the same day of the television, wherein the receiving times are updated when the television detects the progress operation instruction; determining a target video frame in each video frame in the video, and acquiring a direction and an angle corresponding to the gesture, wherein the target video frame is a highlight video frame in the video; determining the playing rate of the video frames in the video according to the target video frames, the angles corresponding to the gestures and the receiving times; and determining the playing sequence of each video frame in the video according to the direction of the gesture, and playing the video according to the playing sequence and the playing speed. The invention also discloses a television and a computer readable storage medium. The invention shortens the time spent by the user in inquiring the favorite video.

Description

Gesture-based video control method, television, and computer-readable storage medium
Technical Field
The present invention relates to the field of television technologies, and in particular, to a gesture-based video control method, a television, and a computer-readable storage medium.
Background
With the popularity of networks, people can watch video resources in the network through televisions.
Because of the numerous video resources, the user inevitably needs to control the progress of the video to quickly browse the content of the video to find the video favored by the user.
Currently, a user can control video fast forward or fast backward through gestures, that is, the user controls the playing rate of the video through gestures. However, the playing rate corresponding to the gesture is preset, which may cause the highlight of the video to be played quickly, so that the user cannot capture the highlight of the video, and the duration of querying the favorite video by the user is too long.
Disclosure of Invention
The invention mainly aims to provide a gesture-based video control method, a television and a computer-readable storage medium, and aims to solve the problem that the duration of a user inquiring favorite video is too long.
In order to achieve the above object, the present invention provides a gesture-based video control method, which includes the steps of:
When a gesture is detected and an instruction corresponding to the gesture is a progress operation instruction, acquiring a video currently played by the television and receiving times corresponding to the progress operation instruction stored in the television on the same day, wherein the receiving times are updated when the television detects the progress operation instruction;
determining a target video frame in each video frame in the video, and acquiring a direction and an angle corresponding to the gesture, wherein the target video frame is a highlight video frame in the video;
determining the playing rate of the video frames in the video according to the target video frames, the angles corresponding to the gestures and the receiving times;
and determining the playing sequence of each video frame in the video according to the direction of the gesture, and playing the video according to the playing sequence and the playing speed.
In an embodiment, the step of determining the playing rate of the video frames in the video according to the playing time point corresponding to the target video frame, the angle corresponding to the gesture, and the receiving times includes:
segmenting the video according to the receiving times to obtain each segmentation time point, wherein the segmentation time point comprises a time point when the sub video obtained by segmenting the video starts to be played and a time point when the video ends to be played;
Determining a time difference value between a playing time point of each target video frame and each segmentation time point, and determining a target difference value in each time difference value, wherein the target difference value is smaller than a preset difference value;
and determining the playing rate of the video according to the ratio between the sum of the target time points corresponding to the target difference values and the angles corresponding to the gestures, wherein the target time points are the earliest time points in the time points corresponding to the target difference values.
In an embodiment, the step of determining the playing rate of the video according to the ratio between the sum of the target time points corresponding to each target difference value and the angle corresponding to the gesture includes:
according to the ratio between the sum of the target time points corresponding to each target difference value and the angle corresponding to the gesture;
and acquiring a preset playing speed of the video, and determining the playing speed of the video according to the product of the ratio and the preset playing speed.
In one embodiment, the step of determining the target video frame for each video frame in the video comprises:
acquiring the total times corresponding to the progress operation instructions stored by the television, wherein the total times are updated when the television detects the progress operation instructions;
When the ratio of the receiving times to the total times is larger than or equal to a first preset ratio, determining an audio image frame in each video frame to be determined as a target video frame, wherein the ratio between a first audio amplitude of the audio image frame and a second audio amplitude of an image frame adjacent to the audio image frame is larger than or equal to a second preset ratio, or the ratio between the second audio amplitude and the first audio amplitude is larger than or equal to a second preset ratio; or alternatively, the first and second heat exchangers may be,
and when the ratio of the receiving times to the total times is smaller than a first preset ratio, determining a moving image frame and the audio image frame in each video frame to be determined as a target video frame, wherein the pixel value of a preset proportion of pixel points in the moving image frame is changed relative to the adjacent image frames of the moving image frame.
In one embodiment, the step of determining an audio image frame in each of the video frames comprises:
acquiring adjacent first video frames and second video frames from the video;
determining an amplitude ratio between a first audio amplitude of the first video frame and a second audio amplitude of the second video frame;
When the amplitude ratio exceeds a preset range, determining the first video frame and the second video frame as audio video frames, wherein a critical value of the preset range is the reciprocal of the second preset ratio and the second preset ratio;
and re-acquiring adjacent first video frames and second video frames from the rest video frames, and returning to the step of determining the amplitude ratio between the first audio amplitude value of the first video frames and the second audio amplitude value of the second video frames.
In one embodiment, the step of determining a moving image frame in each of the video frames comprises:
acquiring adjacent third video frames and fourth video frames from the video, and determining second pixel points corresponding to first pixel points at preset positions in the fourth video frames from the third video frames, wherein the positions of the first pixel points in the fourth video frames are the same as the positions of the second pixel points in the third video frames;
determining pixel difference values between the pixel values of the first pixel points and the pixel values of the second pixel points, and determining the number of the pixel difference values larger than a preset difference value;
When the ratio between the number and the number of the first pixel points is larger than a preset ratio, determining the third video frame and the fourth video frame as motion video frames;
and re-acquiring adjacent third video frames and fourth video frames from the rest video frames, and returning to the step of executing the second pixel point corresponding to the first pixel point of each preset position in the fourth video frames determined in the third video frames.
In an embodiment, after the step of determining the playing rate of the video frames in the video according to the target video frames, the angles corresponding to the gestures, and the receiving times, the method further includes:
acquiring the orientation of the thumb in the gesture;
and determining the direction of the gesture according to the direction of the thumb.
In an embodiment, the television is provided with an image acquisition module, and the gesture-based video control method is further characterized by comprising:
acquiring an image acquired by the image acquisition module in real time;
identifying the hands of the user in the image to obtain gestures;
when the gesture obtained by recognition is matched with a preset gesture, determining that the instruction corresponding to the gesture obtained by recognition is a progress operation instruction, and executing the step of obtaining the video currently played by the television and the times corresponding to the progress operation instruction stored in the television on the same day.
To achieve the above object, the present invention also provides a television including a memory, a processor, and a control program stored in the memory and executable on the processor, which when executed by the processor, implements the steps of the gesture-based video control method as described above.
To achieve the above object, the present invention also provides a readable storage medium storing a control program which, when executed by a processor, implements the steps of the gesture-based video control method described above.
According to the gesture-based video control method, the television and the readable storage medium, when the television detects that the command corresponding to the gesture is the progress control command, the video currently played by the television and the receiving times of the progress operation command stored by the television on the same day are obtained, the highlight video frames are determined to be target video frames in all videos in the video, the playing speed of the video frames in the video is determined according to the target video frames, the angles corresponding to the gesture and the receiving times, the playing sequence of all the video frames in the video is determined according to the direction of the gesture, and finally the video is played according to the playing sequence and the playing speed. The receiving times of the progress operation instructions represent the current progress operation frequency of the user, the angles corresponding to the gestures represent the expected playing speed of the user, namely, the progress operation frequency of the television set user and the highlight video frames are used for correcting the expected playing speed, so that the television can reasonably correct the preset playing speed by combining the highlight part of the video, namely, the television plays the highlight part of the video at the reasonable playing speed, the user is prevented from skipping the highlight part of the video, and the time consumed by the user for inquiring the favorite video is shortened.
Drawings
Fig. 1 is a schematic diagram of a hardware architecture of a television according to an embodiment of the present invention;
FIG. 2 is a flowchart of a first embodiment of a gesture-based video control method according to the present invention;
FIG. 3 is a detailed flowchart of step S30 in a second embodiment of the gesture-based video control method of the present invention;
fig. 4 is a flowchart of step S20 in a third embodiment of the gesture-based video control method of the present invention.
The achievement of the objects, functional features and advantages of the present invention will be further described with reference to the accompanying drawings, in conjunction with the embodiments.
Detailed Description
It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention.
The main solutions of the embodiments of the present invention are: when a gesture is detected and an instruction corresponding to the gesture is a progress operation instruction, acquiring a video currently played by the television and receiving times corresponding to the progress operation instruction stored in the television on the same day, wherein the receiving times are updated when the television detects the progress operation instruction; determining a target video frame in each video frame in the video, and acquiring a direction and an angle corresponding to the gesture, wherein the target video frame is a highlight video frame in the video; determining the playing rate of the video frames in the video according to the target video frames, the angles corresponding to the gestures and the receiving times; and determining the playing sequence of each video frame in the video according to the direction of the gesture, and playing the video according to the playing sequence and the playing speed.
The receiving times of the progress operation instructions represent the current progress operation frequency of the user, the angles corresponding to the gestures represent the expected playing speed of the user, namely, the progress operation frequency of the television set user and the highlight video frames are used for correcting the expected playing speed, so that the television can reasonably correct the preset playing speed by combining the highlight part of the video, namely, the television plays the highlight part of the video at the reasonable playing speed, the user is prevented from skipping the highlight part of the video, and the time consumed by the user for inquiring the favorite video is shortened.
As an implementation, a television may be as shown in fig. 1.
The embodiment of the invention relates to a television, which comprises: a processor 101, such as a CPU, a memory 102, and a communication bus 103. Wherein the communication bus 103 is used to enable connected communication among the components.
The memory 102 may be a high-speed RAM memory or a stable memory (non-volatile memory), such as a disk memory. As shown in fig. 1, a control program may be included in the memory 103 as one type of computer storage medium; and the processor 101 may be configured to call a control program stored in the memory 102 and perform the following operations:
When a gesture is detected and an instruction corresponding to the gesture is a progress operation instruction, acquiring a video currently played by the television and receiving times corresponding to the progress operation instruction stored in the television on the same day, wherein the receiving times are updated when the television detects the progress operation instruction;
determining a target video frame in each video frame in the video, and acquiring a direction and an angle corresponding to the gesture, wherein the target video frame is a highlight video frame in the video;
determining the playing rate of the video frames in the video according to the target video frames, the angles corresponding to the gestures and the receiving times;
and determining the playing sequence of each video frame in the video according to the direction of the gesture, and playing the video according to the playing sequence and the playing speed.
In one embodiment, the processor 101 may be configured to call a control program stored in the memory 102 and perform the following operations:
segmenting the video according to the receiving times to obtain each segmentation time point, wherein the segmentation time point comprises a time point when the sub video obtained by segmenting the video starts to be played and a time point when the video ends to be played;
Determining a time difference value between a playing time point of each target video frame and each segmentation time point, and determining a target difference value in each time difference value, wherein the target difference value is smaller than a preset difference value;
and determining the playing rate of the video according to the ratio between the sum of the target time points corresponding to the target difference values and the angles corresponding to the gestures, wherein the target time points are the earliest time points in the time points corresponding to the target difference values.
In one embodiment, the processor 101 may be configured to call a control program stored in the memory 102 and perform the following operations:
according to the ratio between the sum of the target time points corresponding to each target difference value and the angle corresponding to the gesture;
and acquiring a preset playing speed of the video, and determining the playing speed of the video according to the product of the ratio and the preset playing speed.
In one embodiment, the processor 101 may be configured to call a control program stored in the memory 102 and perform the following operations:
acquiring the total times corresponding to the progress operation instructions stored by the television, wherein the total times are updated when the television detects the progress operation instructions;
When the ratio of the receiving times to the total times is larger than or equal to a first preset ratio, determining an audio image frame in each video frame to be determined as a target video frame, wherein the ratio between a first audio amplitude of the audio image frame and a second audio amplitude of an image frame adjacent to the audio image frame is larger than or equal to a second preset ratio, or the ratio between the second audio amplitude and the first audio amplitude is larger than or equal to a second preset ratio; or alternatively, the first and second heat exchangers may be,
and when the ratio of the receiving times to the total times is smaller than a first preset ratio, determining a moving image frame and the audio image frame in each video frame to be determined as a target video frame, wherein the pixel value of a preset proportion of pixel points in the moving image frame is changed relative to the adjacent image frames of the moving image frame.
In one embodiment, the processor 101 may be configured to call a control program stored in the memory 102 and perform the following operations:
acquiring adjacent first video frames and second video frames from the video;
determining an amplitude ratio between a first audio amplitude of the first video frame and a second audio amplitude of the second video frame;
When the amplitude ratio exceeds a preset range, determining the first video frame and the second video frame as audio video frames, wherein a critical value of the preset range is the reciprocal of the second preset ratio and the second preset ratio;
and re-acquiring adjacent first video frames and second video frames from the rest video frames, and returning to the step of determining the amplitude ratio between the first audio amplitude value of the first video frames and the second audio amplitude value of the second video frames.
In one embodiment, the processor 101 may be configured to call a control program stored in the memory 102 and perform the following operations:
acquiring adjacent third video frames and fourth video frames from the video, and determining second pixel points corresponding to first pixel points at preset positions in the fourth video frames from the third video frames, wherein the positions of the first pixel points in the fourth video frames are the same as the positions of the second pixel points in the third video frames;
determining pixel difference values between the pixel values of the first pixel points and the pixel values of the second pixel points, and determining the number of the pixel difference values larger than a preset difference value;
When the ratio between the number and the number of the first pixel points is larger than a preset ratio, determining the third video frame and the fourth video frame as motion video frames;
and re-acquiring adjacent third video frames and fourth video frames from the rest video frames, and returning to the step of executing the second pixel point corresponding to the first pixel point of each preset position in the fourth video frames determined in the third video frames.
In one embodiment, the processor 101 may be configured to call a control program stored in the memory 102 and perform the following operations:
acquiring the orientation of the thumb in the gesture;
and determining the direction of the gesture according to the direction of the thumb.
In one embodiment, the processor 101 may be configured to call a control program stored in the memory 102 and perform the following operations:
acquiring an image acquired by the image acquisition module in real time;
identifying the hands of the user in the image to obtain gestures;
when the gesture obtained by recognition is matched with a preset gesture, determining that the instruction corresponding to the gesture obtained by recognition is a progress operation instruction, and executing the step of obtaining the video currently played by the television and the times corresponding to the progress operation instruction stored in the television on the same day.
According to the scheme, when the television detects that the command corresponding to the gesture is the progress control command, the video currently played by the television and the receiving times of the progress operation command stored by the television on the same day are obtained, the highlight video frames are determined to be target video frames in all videos in the video, then the playing rate of the video frames in the video is determined according to the target video frames, the angles corresponding to the gesture and the receiving times, the television determines the playing sequence of all the video frames in the video according to the direction of the gesture, and finally the video is played according to the playing sequence and the playing rate. The receiving times of the progress operation instructions represent the current progress operation frequency of the user, the angles corresponding to the gestures represent the expected playing speed of the user, namely, the progress operation frequency of the television set user and the highlight video frames are used for correcting the expected playing speed, so that the television can reasonably correct the preset playing speed by combining the highlight part of the video, namely, the television plays the highlight part of the video at the reasonable playing speed, the user is prevented from skipping the highlight part of the video, and the time consumed by the user for inquiring the favorite video is shortened.
Based on the hardware architecture of the television, the embodiment of the gesture-based video control method is provided.
Referring to fig. 2, fig. 2 is a first embodiment of a gesture-based video control method according to the present invention, the gesture-based video control method includes the following steps:
step S10, when a gesture is detected and an instruction corresponding to the gesture is a progress operation instruction, acquiring a video currently played by the television and receiving times corresponding to the progress operation instruction stored on the same day by the television, wherein the receiving times are updated when the progress operation instruction is detected by the television;
in this embodiment, the television is provided with an image acquisition module, which may be a camera, and the image acquisition module is disposed on a screen of the television to acquire gestures of a user. Of course, the television can be connected with an independent image acquisition device, and the image acquisition angle of the image acquisition device is the same as the video display direction of the television, so that the image of the user in front of the television can be acquired positively.
And after the television plays the video, acquiring the image acquired by the image acquisition module in real time. The television analyzes the acquired image, and if the image comprises the hands of the user, the hands can be identified to obtain gestures. The television further judges whether the identified gesture matches a preset gesture. The preset gesture is a gesture stored in the television. If the gesture matches a preset gesture, determining that the instruction corresponding to the identified gesture is a progress operation instruction. The preset gesture may be set by the user on the television, for example, the user sets the five fingers to open to the preset gesture. If the recognized gesture is the opening of the five fingers, the recognized gesture matches a preset gesture.
The progress operation instruction includes a fast forward instruction and a fast reverse instruction. The progress manipulation instruction may be determined according to an orientation of the thumb in the gesture. If the thumb in the image (in the case of square image) faces to the right, the thumb of the user points to the beginning of the video playing of the television, so the progress operation instruction is a fast-rewinding instruction; if the thumb in the image faces to the left, the thumb of the user points to the end of the video played by the television, and the progress operation instruction is a fast-rewinding instruction.
When the television determines that the instruction corresponding to the gesture is a progress operation instruction, the television acquires the video currently played and the receiving times corresponding to the progress operation instruction stored on the television on the same day. Specifically, the video currently played by the television has a play address, and the television acquires all video frames of the video according to the play address. When the television detects the progress operation instruction, the television records the receiving times of the progress operation instruction, namely the receiving times of +1. The television is connected with the network, so that the television can be synchronized with the actual time, and when the time on the television is updated to a new day, namely to 00:00, the receiving times are reset to zero, namely the television needs to record the receiving times of the progress operation instructions of the current day again.
Step S20, determining a target video frame in each video frame in the video, and acquiring a direction and an angle corresponding to the gesture, wherein the target video frame is a highlight video frame in the video;
the server can mark each highlight video frame in the video frames in advance, and after the television obtains the television, the television can determine each highlight video frame as a target video frame according to the mark. The server may determine the highlight video frame according to a popularity value of each video frame in the video, and the popularity value may be determined by the number of operations performed on the video frame by each network user. Operations of the video by the network user include, but are not limited to, fast forward, pause, fast reverse. The video frames corresponding to the heat value higher than the threshold value can be determined to be highlight video frames.
The television acquires the angle corresponding to the gesture. Specifically, the television may determine the angle of the gesture by using the open finger, for example, the thumb in the gesture must be open, and the television obtains the open finger furthest from the thumb in the gesture, so as to determine the angle according to the finger and the thumb, for example, if the open finger furthest from the thumb is a small finger, the angle corresponding to the gesture is 90 degrees, and if the open finger furthest from the thumb is a middle finger, the angle corresponding to the gesture is 45 degrees. The television also determines the direction corresponding to the gesture, specifically, the television acquires the direction of the thumb in the gesture, so that the direction of the gesture is determined according to the direction of the thumb.
Step S30, determining the playing rate of the video frames in the video according to the target video frames, the angles corresponding to the gestures and the receiving times;
the angle corresponding to the gesture can represent the preset playing rate of the video selected by the user, the number of target video frames can represent the wonderful degree of the video, and the receiving times represent the current progress operation frequency of the user. If the receiving times are more, the video fast forward or fast backward speed is too slow, and the user requires higher playing speed; the more the number of the playing time points is, the higher the highlight degree of the video is, the smaller the fast-rewinding or fast-forwarding speed is required, so that the television can determine the first coefficient according to the number of the playing time points, determine the second coefficient according to the receiving times, and obtain the current playing speed of the video frames in the video by multiplying the first coefficient and the second coefficient by the preset playing speed corresponding to the angle corresponding to the gesture. The first coefficient is related to the number of target video frames, the greater the number, the smaller the first coefficient (the first coefficient is greater than zero); the second coefficient is related to the number of times of reception, and the larger the number of times of reception, the larger the second coefficient.
And step S40, determining the playing sequence of each video frame in the video according to the direction of the gesture, and playing the video according to the playing sequence and the playing speed.
After the television determines the playing speed, the playing sequence of each video frame in the video is determined according to the direction of the gesture. The play order may be characterized as fast-reverse or fast-forward. When the thumb is oriented to the right, the gesture is oriented to the left, that is, the playing sequence is playing from the late to the early according to time, that is, the video is fast-backed. When the thumb is oriented leftwards, the gesture direction is rightwards, namely the playing sequence is playing according to the time from the early to the late, namely the fast forward of the video. After the television determines the playing sequence and the playing rate, the television can play the video according to the playing sequence and the playing rate. If the playing rate is smaller than or equal to the normal playing rate of the video, the video is played back or normally played according to the normal playing rate; and if the playing speed is greater than the normal playing speed of the video, playing the video according to the playing sequence and the playing speed.
In the technical scheme provided by the embodiment, when the television detects that the command corresponding to the gesture is a progress control command, the video currently played by the television and the receiving times of the progress operation command stored by the television on the same day are obtained, the highlight video frames are determined as target video frames in all videos in the video, then the playing speed of the video frames in the video is determined according to the target video frames, the angle corresponding to the gesture and the receiving times, the playing sequence of all the video frames in the video is determined according to the direction of the gesture, and finally the video is played according to the playing sequence and the playing speed. The receiving times of the progress operation instructions represent the current progress operation frequency of the user, the angles corresponding to the gestures represent the expected playing speed of the user, namely, the progress operation frequency of the television set user and the highlight video frames are used for correcting the expected playing speed, so that the television can reasonably correct the preset playing speed by combining the highlight part of the video, namely, the television plays the highlight part of the video at the reasonable playing speed, the user is prevented from skipping the highlight part of the video, and the time consumed by the user for inquiring the favorite video is shortened.
Referring to fig. 3, fig. 3 is a second embodiment of the gesture-based video control method of the present invention, based on the first embodiment, the step S30 includes:
step S31, segmenting the video according to the receiving times to obtain each segmentation time point, wherein the segmentation time points comprise time points of starting playing and time points of ending playing of the sub-video obtained by segmenting the video;
in this embodiment, after the television obtains the number of times of reception, the total duration of the video is obtained, so that the total duration is segmented according to the number of times of reception. Specifically, the total duration is divided by the receiving times to obtain the segment duration, so that the total duration process of the video is segmented according to the segment duration, and further, the playing time point corresponding to the segmentation point is obtained, and the playing time point is the segment time point. That is, the segmentation time point includes a time point of starting playing of the sub-video obtained by video segmentation and a time point of ending playing.
Step S32, determining a time difference value between a playing time point of each target video frame and each segmentation time point, and determining a target difference value in each time difference value, wherein the target difference value is smaller than a preset difference value;
the television acquires the playing time point of each target video frame, and a plurality of time differences can be obtained after determining the time difference between each playing time point and each segmentation time point. The television compares each time difference value with the preset difference value, and determines the time difference value smaller than the preset difference value as a target difference value. When the ratio between the number of target differences and the total number of time differences is above the threshold, the ratio between the number of target differences and the total number of television sets is the threshold. The threshold may be any suitable value, for example, 10%, that is, the television ranks all time differences smaller than the preset difference value from small to large, and the television sequentially extracts the smallest time difference value in the ranking queue as the target difference value until the difference value between the number of target difference values and the total number of time difference values is closest to the threshold and smaller than the threshold.
Step S33, determining a playing rate of the video according to a ratio between a sum of target time points corresponding to each target difference value and an angle corresponding to the gesture, where the target time point is an earliest time point in the time points corresponding to the target difference values.
The target difference value is obtained by a segmentation time point and a playing time point, namely, the target difference value corresponds to two time points, the television takes the earliest time point in the two time points corresponding to the target difference value as a target time point, then the sum of all the target time points is determined, and finally the video playing speed is determined by dividing the sum of the target time points by the angle corresponding to the gesture.
Furthermore, the ratio between the sum of the target points in time and the angle corresponding to the gesture may be characterized as a fast forward ratio or a fast reverse ratio. For example, when the ratio is 2, the fast-reverse ratio or the fast-forward ratio is x 2, that is, 2 times the normal playing rate of the video. Therefore, the television can determine the playing rate of the video according to the product of the ratio between the sum of the target time points and the angles corresponding to the gestures and the preset playing rate, wherein the preset playing rate is the playing rate of the video when the video is normally played. In the technical scheme provided by the embodiment, the television segments the video according to the receiving times to obtain each segment time point, and then determines the time difference between the playing time point of the target video frame and each segment time point to determine the target difference in the time difference, so that the playing rate of the video is accurately determined according to the ratio of the sum of the target time points corresponding to each target difference to the angle corresponding to the gesture.
Referring to fig. 4, fig. 4 is a third embodiment of the gesture-based video control method of the present invention, based on the first or second embodiment, the step S20 includes:
step S21, obtaining the total times corresponding to the progress operation instructions stored by the television, wherein the total times are updated when the television detects the progress operation instructions;
in this embodiment, when the television resets the number of times of reception, the number of times of reception is counted in the total number of times of progress operation instruction. In addition, the time corresponding to the total number of times should not exceed a preset time, for example, the television section records the total number of times of the progress operation instruction of the last week.
Step S22, when the ratio of the number of times of receiving to the total number of times is larger than or equal to a first preset ratio, determining an audio image frame in each video frame to determine the audio image frame as a target video frame, wherein the ratio between a first audio amplitude of the audio image frame and a second audio amplitude of an image frame adjacent to the audio image frame is larger than or equal to a second preset ratio, or the ratio between the second audio amplitude and the first audio amplitude is larger than or equal to a second preset ratio;
and S23, determining a moving image frame and the audio image frame in each video frame to be determined as a target video frame when the ratio of the receiving times to the total times is smaller than a first preset ratio, wherein the pixel values of the pixel points in the preset ratio in the moving image frame are changed relative to the adjacent image frames of the moving image frame.
Gesture control of a television segment is divided into a low-frequency segment control mode and a high-frequency segment control mode. The low frequency segment control mode means that the ratio of the current number of times of reception to the total number of times is smaller than a first preset ratio. The high-frequency segmented control mode refers to that the ratio of the current receiving frequency to the total frequency is greater than or equal to a second preset ratio, and the first preset ratio can be any number of times, for example, the first preset ratio is 0.5. And the control modes corresponding to the gesture control are different, so that the target video frames are different. If the control mode is a low-frequency segmentation control mode, the target video frames are moving image frames and audio image frames; if the control mode is the high-frequency segmentation control mode, the target video frame is an audio image frame.
It is to be noted that, the ratio between the first audio amplitude of the audio image frame and the second audio amplitude of the image frame adjacent to the audio image frame is greater than or equal to a second preset ratio, or the ratio between the second audio amplitude and the first audio amplitude is greater than or equal to a second preset ratio. The second preset ratio may be any suitable value, for example, the second preset ratio may be 2, i.e., the difference in amplitude between the audio amplitude of an audio image frame and the audio amplitude of its neighboring image frame is greater than the smallest of the two audio amplitudes. The moving image frame refers to a pixel value transmission change of a preset proportion of pixel points in the moving image frame with respect to an adjacent image frame of the moving image frame.
And after the total times are obtained, the television determines the control mode in which the gesture control is positioned. I.e. the ratio between the number of receptions and the total number of times the television determines. Determining an audio image frame in each video frame when the ratio is greater than or equal to a first preset ratio, so as to determine the audio image frame as a target video frame; and when the ratio is smaller than a first preset ratio, determining a moving image frame and the audio image frame in each video frame to be determined as a target video frame.
In the technical scheme provided by the embodiment, the television accurately determines the highlight image frames in the video according to the current progress operation frequency of the user on the television.
In one embodiment, step S21 includes:
step C1, acquiring adjacent first video frames and second video frames from the video;
step C2, determining an amplitude ratio between a first audio amplitude of the first video frame and a second audio amplitude of the second video frame;
step C3, when the amplitude ratio exceeds a preset range, determining the first video frame and the second video frame as audio video frames, wherein the critical value of the preset range is the reciprocal of the second preset ratio and the second preset ratio;
And step C4, re-acquiring adjacent first video frames and second video frames from the rest video frames, and returning to the step of determining the amplitude ratio between the first audio amplitude of the first video frames and the second audio amplitude of the second video frames.
In this embodiment, the television segment acquires a first video frame and a second video frame adjacent in playing time from the current video. The television then determines a magnitude ratio between the first audio magnitude of the first video frame and the second audio magnitude of the second video frame. The television is provided with a preset range, the critical value of the preset range is a second preset ratio and the reciprocal of the second preset ratio, namely, when the second preset ratio is 2, the preset range is [0.5,2]. When the amplitude ratio exceeds the preset range, it can be determined that the audio amplitude of the first video frame is more than 2 times of the audio amplitude of the second video frame (the second preset ratio is 2), or that the audio amplitude of the second video frame is more than 2 times of the audio amplitude of the first video frame. For example, the amplitude ratio is less than 0.5, i.e., it can be determined that the audio amplitude of the second video frame is more than 2 times the audio amplitude of the first video frame; the amplitude ratio is greater than 2, and the audio amplitude of the first video frame can be determined to be more than 2 times of the audio amplitude of the second video frame.
And when the television determines that the amplitude ratio exceeds the preset range, the television can determine the first video frame and the second video frame as audio video frames. I.e. the adjacent video frames have a large difference in audio amplitude, which can be regarded as highlight video frames.
The television then re-determines the adjacent first video frame and second video frame from the rest video frames, so as to return to executing the step of determining the amplitude ratio between the first audio amplitude of the first video frame and the second audio amplitude of the second video frame, that is, the television gradually determines the audio video frames by taking the adjacent two video frames as units until all the adjacent video frames are determined. Of course, when the ratio is within the preset range, the television re-determines the adjacent first video frame and second video frame from the rest of each video frame, thereby returning to execute the step of determining the amplitude ratio between the first audio amplitude of the first video frame and the second audio amplitude of the second video frame.
In one embodiment, the step of determining a moving image frame in each of the video frames comprises:
step S1, acquiring adjacent third video frames and fourth video frames in the video, and determining second pixel points corresponding to first pixel points at each preset position in the fourth video frames in the third video frames, wherein the positions of the first pixel points in the fourth video frames are the same as the positions of the second pixel points in the third video frames;
Step S2, determining pixel difference values between the pixel values of the first pixel points and the pixel values of the second pixel points, and determining the number of the pixel difference values larger than a preset difference value;
step S3, when the ratio between the number and the number of the first pixel points is larger than a preset ratio, determining the third video frame and the fourth video frame as motion video frames;
and S4, re-acquiring adjacent third video frames and fourth video frames from the rest video frames, and returning to the step of executing the second pixel point corresponding to the first pixel point of each preset position in the fourth video frames determined in the third video frames.
In this embodiment, the television acquires adjacent third video and fourth video from the videos, and determines, in the third video frame, a second pixel point corresponding to the first pixel point at each preset position in the fourth video frame. The first pixel points corresponding to the preset positions are in the same row or the same column, and can be random positions. The position of the first pixel point in the fourth video frame is the same as the position of the second pixel point in the third video frame, for example, the position of the first pixel point in the fourth video frame is the third row and the third column, and the position of the second pixel point corresponding to the first pixel point in the third video frame is also the third row and the third column.
The television obtains pixel difference values between the pixel values of the first pixel point and the pixel values of the second pixel point, and then determines the number of the pixel difference values larger than the preset difference value. And the television determines the ratio between the number and the number of the first pixel points, and if the ratio is larger than the preset ratio, the third video frame and the fourth video frame are moving image frames. The predetermined ratio may be any number of values, for example, the predetermined ratio is 1:2. It will be appreciated that motion video frames may be considered video frames in which the stress in the video requires a background change, i.e. motion video frames may also be considered highlight video frames.
And the television re-determines adjacent third video frames and fourth video frames from the rest video frames, so that the step of determining the second pixel point corresponding to the first pixel point of each preset position in the fourth video frames in the third video frames is returned, namely, the television gradually determines the motion video frames by taking the adjacent two video frames as units until all the adjacent video frames are determined. Of course, when the ratio is smaller than the preset ratio, the television redetermines the adjacent third video frame and fourth video frame from the remaining video frames.
The present invention also provides a television including a memory, a processor, and a control program stored in the memory and executable on the processor, which when executed by the processor, implements the steps of the gesture-based video control method described in the above embodiments.
The present invention also provides a readable storage medium storing a control program which, when executed by a processor, implements the steps of the gesture-based video control method described in the above embodiments.
The foregoing embodiment numbers of the present invention are merely for the purpose of description, and do not represent the advantages or disadvantages of the embodiments.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
From the above description of the embodiments, it will be clear to those skilled in the art that the above-described embodiment method may be implemented by means of software plus a necessary general hardware platform, but of course may also be implemented by means of hardware, but in many cases the former is a preferred embodiment. Based on such understanding, the technical solution of the present invention may be embodied essentially or in a part contributing to the prior art in the form of a software product stored in a storage medium (e.g. ROM/RAM, magnetic disk, optical disk) as described above, comprising instructions for causing a terminal device (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the method according to the embodiments of the present invention.
The foregoing description is only of the preferred embodiments of the present invention, and is not intended to limit the scope of the invention, but rather is intended to cover any equivalents of the structures or equivalent processes disclosed herein or in the alternative, which may be employed directly or indirectly in other related arts.

Claims (8)

1. A gesture-based video control method, characterized in that the gesture-based video control method comprises the steps of:
When a gesture is detected and an instruction corresponding to the gesture is a progress operation instruction, acquiring a video currently played by a television and receiving times corresponding to the progress operation instruction stored on the same day by the television, wherein the receiving times are updated when the television detects the progress operation instruction;
determining a target video frame in each video frame in the video, and acquiring a direction and an angle corresponding to the gesture, wherein the target video frame is a highlight video frame in the video;
determining the playing rate of video frames in the video according to the number of target video frames, the angles corresponding to the gestures and the receiving times;
and determining the playing sequence of each video frame in the video according to the direction of the gesture, and playing the video according to the playing sequence and the playing speed.
2. The method of claim 1, wherein the step of determining a target video frame for each video frame in the video comprises:
acquiring the total times corresponding to the progress operation instructions stored by the television, wherein the total times are updated when the television detects the progress operation instructions;
When the ratio of the receiving times to the total times is larger than or equal to a first preset ratio, determining an audio image frame in each video frame to be determined as a target video frame, wherein the ratio between a first audio amplitude of the audio image frame and a second audio amplitude of an image frame adjacent to the audio image frame is larger than or equal to a second preset ratio, or the ratio between the second audio amplitude and the first audio amplitude is larger than or equal to a second preset ratio; or alternatively, the first and second heat exchangers may be,
and when the ratio of the receiving times to the total times is smaller than a first preset ratio, determining a moving image frame and the audio image frame in each video frame to be determined as a target video frame, wherein the pixel value of a preset proportion of pixel points in the moving image frame is changed relative to the adjacent image frames of the moving image frame.
3. The method of claim 2, wherein said step of determining an audio image frame in each of said video frames comprises:
acquiring adjacent first video frames and second video frames from the video;
determining an amplitude ratio between a first audio amplitude of the first video frame and a second audio amplitude of the second video frame;
When the amplitude ratio exceeds a preset range, determining the first video frame and the second video frame as audio video frames, wherein a critical value of the preset range is the reciprocal of the second preset ratio and the second preset ratio;
and re-acquiring adjacent first video frames and second video frames from the rest video frames, and returning to the step of determining the amplitude ratio between the first audio amplitude value of the first video frames and the second audio amplitude value of the second video frames.
4. The method of claim 2, wherein said step of determining a moving image frame in each of said video frames comprises:
acquiring adjacent third video frames and fourth video frames from the video, and determining second pixel points corresponding to first pixel points at preset positions in the fourth video frames from the third video frames, wherein the positions of the first pixel points in the fourth video frames are the same as the positions of the second pixel points in the third video frames;
determining pixel difference values between the pixel values of the first pixel points and the pixel values of the second pixel points, and determining the number of the pixel difference values larger than a preset difference value;
When the ratio between the number and the number of the first pixel points is larger than a preset ratio, determining the third video frame and the fourth video frame as motion video frames;
and re-acquiring adjacent third video frames and fourth video frames from the rest video frames, and returning to the step of executing the second pixel point corresponding to the first pixel point of each preset position in the fourth video frames determined in the third video frames.
5. The method according to any one of claims 1-4, further comprising, after the step of determining a play rate of video frames in the video according to the target video frame, the angle corresponding to the gesture, and the number of receptions:
acquiring the orientation of the thumb in the gesture;
and determining the direction of the gesture according to the direction of the thumb.
6. The method of any one of claims 1-4, wherein the television is provided with an image acquisition module, the method further comprising:
acquiring an image acquired by the image acquisition module in real time;
identifying the hands of the user in the image to obtain gestures;
when the gesture obtained by recognition is matched with a preset gesture, determining that the instruction corresponding to the gesture obtained by recognition is a progress operation instruction, and executing the steps of obtaining the video currently played by the television and the times corresponding to the progress operation instruction stored in the television on the same day.
7. A television comprising a memory, a processor, and a control program stored in the memory and executable on the processor, which when executed by the processor, implements the steps of the gesture-based video control method of any one of claims 1-6.
8. A computer readable storage medium storing a control program which when executed by a processor implements the steps of the gesture-based video control method of any one of claims 1-6.
CN202011073939.2A 2020-10-09 2020-10-09 Gesture-based video control method, television, and computer-readable storage medium Active CN112423059B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011073939.2A CN112423059B (en) 2020-10-09 2020-10-09 Gesture-based video control method, television, and computer-readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011073939.2A CN112423059B (en) 2020-10-09 2020-10-09 Gesture-based video control method, television, and computer-readable storage medium

Publications (2)

Publication Number Publication Date
CN112423059A CN112423059A (en) 2021-02-26
CN112423059B true CN112423059B (en) 2023-08-01

Family

ID=74854369

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011073939.2A Active CN112423059B (en) 2020-10-09 2020-10-09 Gesture-based video control method, television, and computer-readable storage medium

Country Status (1)

Country Link
CN (1) CN112423059B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116761040B (en) * 2023-08-22 2023-10-27 超级芯(江苏)智能科技有限公司 VR cloud platform interaction method and interaction system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1148770A (en) * 1995-08-04 1997-04-30 索尼公司 Digital video data coding and decoding device removing caption data noise, and method thereof
CN103299645A (en) * 2010-12-22 2013-09-11 谷歌公司 Video player with assisted seek
WO2017113600A1 (en) * 2015-12-30 2017-07-06 深圳Tcl数字技术有限公司 Video playing method and device
CN107197381A (en) * 2017-06-12 2017-09-22 深圳Tcl新技术有限公司 Temperature curve generation method, device and the readable storage medium storing program for executing of television video
CN107295409A (en) * 2017-08-08 2017-10-24 广东小天才科技有限公司 Method, device, terminal equipment and storage medium for controlling video playing
CN109963196A (en) * 2017-12-26 2019-07-02 深圳Tcl数字技术有限公司 A kind of television program playing method, storage medium and smart television

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100289743A1 (en) * 2009-05-15 2010-11-18 AFA Micro Co. Laser pointer and gesture-based input device
US9430128B2 (en) * 2011-01-06 2016-08-30 Tivo, Inc. Method and apparatus for controls based on concurrent gestures
CN105898460A (en) * 2015-12-10 2016-08-24 乐视网信息技术(北京)股份有限公司 Method and device for adjusting panorama video play visual angle of intelligent TV
US10701431B2 (en) * 2017-11-16 2020-06-30 Adobe Inc. Handheld controller gestures for virtual reality video playback

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1148770A (en) * 1995-08-04 1997-04-30 索尼公司 Digital video data coding and decoding device removing caption data noise, and method thereof
CN103299645A (en) * 2010-12-22 2013-09-11 谷歌公司 Video player with assisted seek
WO2017113600A1 (en) * 2015-12-30 2017-07-06 深圳Tcl数字技术有限公司 Video playing method and device
CN107197381A (en) * 2017-06-12 2017-09-22 深圳Tcl新技术有限公司 Temperature curve generation method, device and the readable storage medium storing program for executing of television video
CN107295409A (en) * 2017-08-08 2017-10-24 广东小天才科技有限公司 Method, device, terminal equipment and storage medium for controlling video playing
CN109963196A (en) * 2017-12-26 2019-07-02 深圳Tcl数字技术有限公司 A kind of television program playing method, storage medium and smart television

Also Published As

Publication number Publication date
CN112423059A (en) 2021-02-26

Similar Documents

Publication Publication Date Title
US10425684B2 (en) System and method to create a media content summary based on viewer annotations
US9118956B2 (en) System and method of displaying content
WO2016026245A1 (en) Method and system for switching video playback resolution
CN108965705B (en) Video processing method and device, terminal equipment and storage medium
US20230269459A1 (en) Photographing processing method and apparatus, electronic device, and readable storage medium
CN108712667B (en) Smart television, screen capture application method and device thereof, and readable storage medium
CN107370768B (en) Intelligent television streaming media preview system and method
US10749923B2 (en) Contextual video content adaptation based on target device
US20160073168A1 (en) Channel of top socially-commented audio visual content
CN112423059B (en) Gesture-based video control method, television, and computer-readable storage medium
US20200275151A1 (en) Television time shifting control method, system and computer-readable storage medium
CN111131883B (en) Video progress adjusting method, television and storage medium
CN111787357A (en) Video processing method and electronic equipment
US9892757B2 (en) Apparatus and method for calculating and virtually displaying football statistics
CN113852756B (en) Image acquisition method, device, equipment and storage medium
US9721615B2 (en) Non-linear video review buffer navigation
CN107040826B (en) Video loading method and device, computer device and computer readable storage medium
CN113766261A (en) Method and device for determining pre-pulling duration, electronic equipment and storage medium
CN113709566A (en) Multimedia content playing method, device, equipment and computer storage medium
EP3616039B1 (en) Display apparatus, method of controlling the same, and computer program product thereof
CN109348280B (en) Network television program switching method, intelligent television and computer readable storage medium
US20170164050A1 (en) Presenting alternate content
CN114745505A (en) Shooting method, shooting device, electronic equipment and readable storage medium
CN108156515B (en) Video playing method, smart television and computer readable storage medium
US20170171269A1 (en) Media content playback method, apparatus and system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant