WO2022078154A1 - Display device and media asset playing method - Google Patents

Display device and media asset playing method

Info

Publication number
WO2022078154A1
Authority
WO
WIPO (PCT)
Prior art keywords
window
media asset
play
target
playback
Prior art date
Application number
PCT/CN2021/119052
Other languages
French (fr)
Chinese (zh)
Inventor
王光强
赖园园
薛梅
刘金刚
Original Assignee
聚好看科技股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN202011102193.3A external-priority patent/CN112272324B/en
Priority claimed from CN202110275148.6A external-priority patent/CN113051435B/en
Priority claimed from CN202110448074.1A external-priority patent/CN113051432B/en
Application filed by 聚好看科技股份有限公司 filed Critical 聚好看科技股份有限公司
Priority to CN202180068337.4A priority Critical patent/CN116324700A/en
Publication of WO2022078154A1 publication Critical patent/WO2022078154A1/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/74 Browsing; Visualisation therefor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481 Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance

Definitions

  • the present application relates to the field of display technologies, and in particular, to a display device and a method for playing media assets.
  • the display ratio of most TVs is 16:9, and the ratio of fitness videos is usually 16:9 as well. If the TV only plays a fitness video, the video can be displayed in full screen. If the fitness video and user images are displayed at the same time, the user image occupies part of the display area of the TV, so the display area of the fitness video may not be 16:9.
  • when the ratio of a video is inconsistent with the ratio of the playback window, the video is usually scaled down so that it fits in the playback window. However, this causes black borders to appear around the video, and the video itself becomes smaller. For fitness videos, the smaller picture makes some fitness movements difficult to see, which seriously affects the user's viewing experience.
  • the present application provides a display device, the display device comprising:
  • controller connected to the display, the controller being configured to:
  • the target video is played in the first playback window
  • the display position of the target video is moved within the first playback window in a direction away from the control, so that the center position of the picture is displayed close to the center position of the target display area, that is, the area of the first play window that is not blocked by the control, wherein the control is opaque and blocks one side of the first play window.
  • the present application provides a method for playing media assets, the method comprising:
  • the target video is played in the first playback window
  • the display position of the target video is moved within the first playback window in a direction away from the control, so that the center position of the picture is displayed close to the center position of the target display area, that is, the area of the first play window that is not blocked by the control, wherein the control is opaque and blocks one side of the first play window.
  • FIG. 1 shows a schematic diagram of an operation scenario between a display device and a control device in some embodiments
  • FIG. 2 shows a schematic diagram of the fitness home page in some embodiments
  • Figure 3 shows a schematic diagram of a media asset details interface in some embodiments
  • FIG. 4 shows a schematic diagram of a playback mode selection interface in some embodiments
  • Figure 5 shows a schematic diagram of a full-screen playback interface in normal mode in some embodiments
  • Figure 6 shows a schematic diagram of a dual-window playback interface in the follow-up mode in some embodiments
  • Figure 7 shows a schematic diagram of a dual-window playback interface in the follow-up mode in some embodiments
  • FIG. 8 shows a schematic diagram of image movement in some embodiments
  • Figure 9 shows a schematic diagram of the effect after the image is moved in some embodiments.
  • Figure 10 shows a schematic diagram of a display interface in some embodiments
  • Figure 11 shows a schematic diagram of the interaction of dots of target media assets in some embodiments
  • Figure 12 shows a schematic diagram of scoring interaction of target media assets in some embodiments
  • Figure 13 shows a flowchart of the scoring method in the follow-up practice process in some embodiments
  • Figure 14 shows a schematic diagram of scoring during follow-up practice in some embodiments
  • Figure 15 shows a schematic diagram of scoring after follow-up practice in some embodiments
  • Figure 16 shows a schematic diagram of an exception handling interface of a display device in some embodiments
  • Figure 17 shows a schematic diagram of an exception handling interface of a display device in some embodiments.
  • FIG. 18 shows a schematic diagram of an exception handling interface of a display device in some embodiments.
  • FIG. 1 is a schematic diagram of an operation scenario between a display device and a control apparatus according to an embodiment. As shown in FIG. 1 , a user can operate the display device 200 through the smart device 300 or the control device 100 .
  • the control apparatus 100 may be a remote controller that communicates with the display device through infrared protocol communication, Bluetooth protocol communication, or other short-range communication methods, and controls the display device 200 wirelessly or by wire.
  • the user can control the display device 200 by inputting user instructions through keys on the remote control, voice input, control panel input, and the like.
  • a smart device 300 (e.g., a mobile terminal, a tablet computer, a computer, or a notebook computer) may also be used to control the display device 200, for example through an application running on the smart device.
  • the display device 200 can also be controlled in a manner other than the control apparatus 100 and the smart device 300.
  • the module for acquiring voice commands configured inside the display device 200 can directly receive the user's voice command for control.
  • the user's voice commands can also be received through a voice control device provided outside the display device 200.
  • the display device 200 is also in data communication with the server 400 .
  • the display device 200 may be allowed to communicate via local area network (LAN), wireless local area network (WLAN), and other networks.
  • the server 400 may provide various contents and interactions to the display device 200 .
  • the server 400 may be a cluster or multiple clusters, and may include one or more types of servers.
  • the user may input user commands on a graphical user interface (GUI) displayed on the display 260, and the user input interface receives the user input commands through the graphical user interface (GUI).
  • the user may input a user command by inputting a specific sound or gesture, and the user input interface recognizes the sound or gesture through a sensor to receive the user input command.
  • a "user interface” is a medium interface for interaction and information exchange between an application program or an operating system and a user, which enables conversion between an internal form of information and a form acceptable to the user.
  • the commonly used form of user interface is the Graphical User Interface (GUI), which refers to a user interface related to computer operations that is displayed graphically. It can consist of icons, windows, controls, and other interface elements displayed on the display screen of the electronic device, where the controls can include visual interface elements such as icons, buttons, menus, tabs, text boxes, dialog boxes, status bars, navigation bars, and widgets.
  • the display device can directly enter the interface of the preset VOD program after startup.
  • the interface of the VOD program can be as shown in FIG. 2 , including at least a navigation bar and a content display area located below the navigation bar.
  • the content displayed in the display area changes with the selected control in the navigation bar.
  • the program in the application layer can be integrated in the video-on-demand program to be displayed through a control in the navigation bar, or it can be further displayed after the application control in the navigation bar is selected.
  • after the display device is started, it can directly enter the display interface of the last selected signal source, or the signal source selection interface. The signal source can be at least one of a preset video-on-demand program, an HDMI interface, a live TV interface, and the like. After the user selects different signal sources, the display can display the content obtained from them.
  • the navigation bar may be provided with multiple controls, such as "My", "Channel", "Video", "Fitness", "VIP", "Education", "Mall", "Games", and applications. Different navigation bar controls correspond to different channel interfaces. If the user wants to exercise, the user can select the "Fitness" control. The interface shown in FIG. 2 is the interface after the "Fitness" control is selected, and the user can select a fitness video in this interface and follow it to exercise.
  • the display device requests the corresponding details page data from the server according to the configuration parameters of the selected video control, and then enters the media asset details interface according to the received details page data.
  • the media asset details interface may or may not display multiple course subsection controls of the fitness video.
  • the display device displays the course subsection controls/details page for the fitness video.
  • the corresponding fitness video and then enter the playback mode selection interface.
  • the played fitness video is also referred to as a target video.
  • the playback mode selection interface can display three mode controls.
  • the playback mode corresponding to the first mode control is the normal mode, which can also be called the first mode
  • the playback mode corresponding to the second mode control is the follow-up mode, which may also be referred to as the second mode
  • the playback mode corresponding to the third mode control is the movie viewing mode, which may also be referred to as the third mode.
  • Each mode control can display an explanation of the playback mode.
  • the explanation of the normal mode can be: “Shield the camera to watch the complete teaching video to become familiar with the training action”
  • the explanation of the follow-up mode can be: "Turn on the camera to compare your actions in real time and make them more standard"
  • the explanation of the movie viewing mode can be: "The effect of exercising while shielding the camera will not be discounted".
  • in the normal mode, the display device does not activate the camera and sets only one playback window on a new interface to play the fitness video.
  • in the follow-up mode, the display device activates the camera and sets two playback windows on the new interface of the display to play the images captured by the camera and the fitness video at the same time.
  • in the movie viewing mode, the display device does not start the camera and sets two playback windows on the new interface of the display to play a fitness video and a movie at the same time.
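  • as an illustrative sketch only, the three playback modes can be summarized as a lookup from mode to camera state and window contents. The names PlaybackMode, ModeLayout, LAYOUTS and layout_for, and the content labels, are hypothetical and do not come from the application:

```python
from dataclasses import dataclass
from enum import Enum
from typing import Tuple


class PlaybackMode(Enum):
    NORMAL = "normal"        # first mode
    FOLLOW_UP = "follow_up"  # second mode
    MOVIE = "movie"          # third mode


@dataclass(frozen=True)
class ModeLayout:
    camera_on: bool            # whether the camera is started
    windows: Tuple[str, ...]   # content shown by each playback window


# Hypothetical table mirroring the description of the three modes.
LAYOUTS = {
    PlaybackMode.NORMAL: ModeLayout(False, ("fitness_video",)),
    PlaybackMode.FOLLOW_UP: ModeLayout(True, ("fitness_video", "camera")),
    PlaybackMode.MOVIE: ModeLayout(False, ("fitness_video", "movie")),
}


def layout_for(mode: PlaybackMode) -> ModeLayout:
    """Return the camera state and window contents for a playback mode."""
    return LAYOUTS[mode]
```

  • only the follow-up mode starts the camera in this sketch, which matches the description of the modes; how an actual device represents the modes is not specified here.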
  • the display device may be provided with a camera, which may be an elevating camera or a non-elevating camera. The camera may capture user images to obtain local camera data, and the controller of the display device may display the local camera data on the display of the display device, so that the user can see his or her actions on the display.
  • the display device is not provided with a camera, but a camera can be connected, such as an external camera connected through USB. The camera is used to capture user images, and the controller of the display device can display the local data captured by the camera on the display of the display device.
  • the user can select a playback mode to watch the fitness video according to the above explanation.
  • if the user clicks the normal mode control on the playback mode selection interface shown in FIG. 4, the display device generates a media asset playback instruction that includes the playback mode and the information of the target video.
  • the playback mode is the normal mode, and the information of the target video includes the playback address of the target video.
  • the display device obtains the video data stream of the target video from the playback address of the target video according to the media asset playback instruction, and generates a first playback window for playing the target video on the new interface according to the normal playback mode.
  • the first playback window may be a full-screen window with a display ratio of 16:9.
  • since the ratio of the fitness video is generally 16:9 and the ratio of the full-screen window is also 16:9, the ratio of the fitness video is consistent with the ratio of the full-screen window.
  • in the normal mode, the display device generates only the first playback window and no other windows, so nothing blocks the display content of the first playback window. Therefore, after generating the first playback window, the display device can, because the playback mode in the media asset playback instruction is the normal mode, directly play the fitness video in full screen in the first playback window.
  • in order to play the fitness video in full screen in the first playback window, the display device may zoom the image of the target video so that the size of the zoomed image is consistent with the size of the full-screen window.
  • the method for scaling the target video by the display device may be: parsing the video data stream of the target video to obtain a video frame sequence of the target video, taking the first frame image in the video frame sequence, obtaining a scaling ratio from the ratio between the image height and the height of the target display area, and then scaling the video frame sequence of the target video according to the scaling ratio.
  • for example, if the image height of the target video is 100 and the height of the full-screen window of the display device is 1000, where the height can be the number of pixels in the vertical direction, the video frame sequence of the target video is enlarged by 10 times, so that the enlarged image of the target video can fill the entire full-screen window.
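  • the height-based scaling in the example above can be sketched as follows. This is a minimal illustration that assumes sizes are given in pixels; the function names scaling_ratio and scaled_size are hypothetical:

```python
from typing import Tuple


def scaling_ratio(image_height: int, area_height: int) -> float:
    """Ratio by which a frame must be scaled to match the display area height."""
    if image_height <= 0 or area_height <= 0:
        raise ValueError("heights must be positive")
    return area_height / image_height


def scaled_size(image_width: int, image_height: int, area_height: int) -> Tuple[int, int]:
    """Scale width and height by the same ratio so the aspect ratio is kept."""
    ratio = scaling_ratio(image_height, area_height)
    return round(image_width * ratio), round(image_height * ratio)
```

  • with an image height of 100 and a window height of 1000, scaling_ratio returns 10.0, which corresponds to the tenfold enlargement described above.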
  • after zooming the target video, the display device obtains the video frame sequence of the zoomed target video and sends it to the first playback window, so that the first playback window can play the video frame sequence continuously.
  • FIG. 5 is a schematic diagram of a full-screen playback interface in the normal playback mode according to some embodiments.
  • in the normal playback mode, the target video can be played in full screen.
  • the character in FIG. 5 can be a fitness coach. It can represent the background of the person.
  • the person is displayed in the center, and the left and right sides behind the person are background images.
  • the user can call up a control list including the switching control, and after the user selects the switching control, the display is switched to the follow-up mode display in FIG. 6 .
  • the display of the follow-up training mode shown in FIG. 6 can be switched by preset key values.
  • if the user clicks the follow-up mode control on the playback mode selection interface shown in FIG. 4, the display device generates a media asset playback instruction that includes the playback mode and the information of the target video.
  • the playback mode is the follow-up mode, and the information of the target video includes the playback address of the target video.
  • the display device obtains the video data stream of the target video from the playback address of the target video according to the media asset playback instruction and, because the playback mode is the follow-up mode, generates on the new interface a first playback window for playing the target video and a second playback window for playing local camera data.
  • the second play window is superimposed above the first play window
  • the second play window and the first play window have the same height, and either the left border of the second play window overlaps the left border of the first play window, or the right border of the second play window overlaps the right border of the first play window.
  • the position of the window can be realized by setting the coordinate parameters of the window in the interface.
  • in the follow-up mode, the first playback window can be a full-screen window with a display ratio of 16:9, and the second playback window can be a window of the same height as the display whose display ratio can be determined according to the shooting parameters of the camera.
  • the second playback window can be displayed on one side of the first playback window in the form of a texture.
  • the second playback window can be displayed on the right side of the first playback window and overlap with the right edge of the first playback window, so that it blocks the display area on the right side of the first playback window.
  • if the display device scales the target video to the same size as the full-screen window and directly displays the scaled image in the full-screen window, then, under the normal window display logic, part of the target video cannot be displayed because it is occluded by the second playback window.
  • the user can only watch the part of the target video image that is not blocked by the second play window.
  • when the target video is a fitness video, the user needs to follow the movements of the fitness coach in the fitness video.
  • the fitness coach is the character in the target video and is usually located in the middle of the fitness video, so the second playback window may block part of the fitness coach's body, which affects the viewing effect of the fitness video.
  • the display device may determine a target display area in the first playback window in the follow-up mode, the target display area being the area that is not blocked by the second playback window. The display device can shift the image of the target video to the left and display it in the target display area, so that the user can see a relatively complete exercise action in the target display area.
  • the target display area is determined according to the position coordinates of the first play window and the position coordinates of the second play window. By subtracting the position coordinates of the first play window from the position coordinates of the second play window, the position coordinates of the target display area can be obtained.
  • the target display area refers to the preferred display area in the first playback window, and the content displayed in the target display area will not be blocked by other images.
  • if, in addition to the first playback window, the interface also has a second playback window that partially blocks the first playback window, the display device can determine the area of the first playback window that is located on the left side of the second playback window and is not blocked by it as the target display area.
  • the second play window is superimposed and displayed on the upper right side of the first play window, and the area on the left side of the first play window that is not blocked by the second play window can be determined as the target display area of the first play window.
  • the first playback window is blocked by the second playback window, and the display device determines the target display area according to the position of the second playback window on the first playback window.
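  • a minimal sketch of determining the target display area from the window coordinates is given below. It assumes that both windows are axis-aligned rectangles of the same height and that the second window is aligned with either the left or the right border of the first window, as described above; the Rect type, the function name and the pixel values in the usage note are hypothetical:

```python
from typing import NamedTuple


class Rect(NamedTuple):
    x: int       # left edge in interface coordinates
    y: int       # top edge in interface coordinates
    width: int
    height: int


def target_display_area(first: Rect, second: Rect) -> Rect:
    """Part of the first window that is not covered by the second window."""
    first_right = first.x + first.width
    second_right = second.x + second.width
    if second_right >= first_right:
        # Second window sits on the right: keep the area to its left.
        return Rect(first.x, first.y, max(0, second.x - first.x), first.height)
    if second.x <= first.x:
        # Second window sits on the left: keep the area to its right.
        return Rect(second_right, first.y, first_right - second_right, first.height)
    raise ValueError("second window must be aligned with one side of the first window")
```

  • for example, a 1920x1080 first window with a 480-pixel-wide second window on its right gives a target display area of Rect(0, 0, 1440, 1080).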
  • the first playback window used for video playback may be blocked not by the second playback window used to play the local data of the camera but by other images, for example, by an opaque control for displaying pictures. In this case, the method provided by the embodiment of the present application can also be applied to determine a target display area and play the video in the target display area, so as to achieve a better playback effect.
  • the target display area can be determined as a rectangular area in the first playback window that is not blocked by the controls. If all the sides of the control do not overlap with the first playback window, at this time, no matter which side of the control the target display area is set to, the size of the target display area will be smaller.
  • the display area is determined as the target display area, that is, the target video is played in full screen in the first play window.
  • the size ratio of the target display area may not be 16:9.
  • the center lines of the display areas are coincident, so that the user can see a relatively complete person image in the target display area.
  • the target display area may be determined, and the first moving distance of the target video may be directly determined according to the position parameters of the first playback window and the second playback window in the playback interface.
  • exemplarily, the target video is scaled by the player during playback.
  • the image is generally scaled in equal proportions in the height and width directions.
  • the general scaling rule is to determine the scaling factor according to the height (or the width). After scaling, the image fills the height (or the width) of the playback window, and black borders can be inserted in the other dimension.
  • the height direction may be used as the reference, or the width direction may be used as the reference.
  • the first distance by which the image of the target video needs to move during display can be determined according to the width parameter of the first playback window and the width parameter of the second playback window, so that the middle of the target video image frame is displayed as far as possible within the unobstructed area of the first playback window.
  • the background image on the right side of the character in the target video can be blocked by the second playback window, while the character in the first playback window is not occluded, which achieves a good display effect.
  • the dislocation display of the first play window and the second play window in FIG. 6 and FIG. 9 is only to express the existence of two independently controlled play windows, and does not represent the actual superimposed display effect.
  • the image of the target video may be zoomed to obtain the image width after the target video is zoomed to the size of the first playback window, which may be referred to as the width to be displayed. The width of the second playback window is then obtained, and the difference between the two widths gives the width of the target display area. Half of the difference between the width to be displayed and the width of the target display area is taken as the first moving distance.
  • the first moving distance D by which the left side of the target video is offset is the same as the width by which the right side of the target video is blocked by the second playback window, so that the center line of the target video and the center line of the target display area coincide.
  • This calculation method may be called an average method, and has the advantages of simple calculation and fast calculation speed, and can quickly determine the size of the first moving distance D.
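  • the average method can be sketched as below. The function name and the pixel values in the usage note are hypothetical; the arithmetic follows the steps described above:

```python
def first_moving_distance(width_to_display: float, second_window_width: float) -> float:
    """Average method: half of (width to be displayed - target display area width)."""
    # Width of the part of the first window not covered by the second window.
    target_width = width_to_display - second_window_width
    return (width_to_display - target_width) / 2
```

  • for a width to be displayed of 1920 and a second playback window width of 480, the target display area is 1440 wide and D is 240. After shifting left by D, the image center lies at 1920 / 2 - 240 = 720, which equals the center of the target display area (1440 / 2), and the part of the image hidden on the right, 480 - 240 = 240, equals D.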
  • the first moving distance may be any distance less than the difference in width of the two playback windows.
  • the first moving distance is not greater than the width of the second playback window.
  • the first moving distance may be directly determined according to the width of the second playing window. Exemplarily, half of the width of the second playing window is used as the distance to be moved, and the first moving distance is not greater than the distance to be moved.
  • the first moving distance may also be obtained according to the difference between the width of the zoomed video and the width of the second playback window, for example, half of the difference may be used as the first moving distance.
  • the determination of the first movement distance is performed according to a position parameter of the playback window.
  • the fitness trainer in the target video may not be located in the middle of the image.
  • in that case, setting the first moving distance D of the left side of the target video equal to the distance by which the right side of the target video is blocked by the second playback window will cause the fitness trainer in the target video to be skewed to the left or right, and the display effect is not good.
  • other methods can also be used to calculate the first moving distance D.
  • human body recognition can also be performed on the image frame of the target video to locate the central axis of the human body.
  • a display area is extended symmetrically to both sides of the central axis of the human body until the width of the display area containing the human body is the same as the width of the target display area. The difference between the width starting point of the display area containing the human body and the width starting point of the first playback window is then used as the first moving distance.
  • the first moving distance obtained by this calculation method may be the same as the first moving distance obtained by the above-mentioned averaging method.
  • if the fitness coach is offset to the left in the media image of the target video, the first moving distance obtained by this calculation method is smaller than that obtained by the above-mentioned averaging method; if the fitness coach is offset to the right, the first moving distance obtained by this calculation method is greater than that obtained by the above-mentioned averaging method.
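  • a sketch of the body-axis method follows. It assumes that the x coordinate of the central axis of the human body has already been detected in the scaled image and that the first playback window starts at x = 0; clamping the result to the image bounds is an added assumption that the description does not state, and the function name is hypothetical:

```python
def first_moving_distance_from_axis(body_axis_x: float,
                                    target_width: float,
                                    image_width: float) -> float:
    """Start of a region of target_width centered on the body axis."""
    start = body_axis_x - target_width / 2
    # Assumption: keep the region inside the image so no empty area is shown.
    return min(max(start, 0.0), image_width - target_width)
```

  • with a 1920-wide image and a 1440-wide target display area, an axis at 960 (centered) gives 240, the same as the averaging method; an axis at 900 (left of center) gives 180, and an axis at 1000 (right of center) gives 280.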
  • the display device may implement the image offset display of the target video by dynamically setting the display of the surfaceView (planar view).
  • the offset output function of the surfaceView can be: layoutParam.setMargins(0-D,0,0,0). As shown in FIG. 9, this call sets the left offset to D, so that the left starting point of the media asset image is (0-D), while the left starting point of the first playback window is 0. Since it is the first playback window that displays the image, the image is displayed from the position where the starting point is 0.
  • when displaying each frame of the target video, the first playback window starts to display the media asset image from the pixel located at the first moving distance D from the left edge of the image.
  • the part of the image within the first moving distance D exceeds the display range of the first playback window and is not displayed, which realizes the display effect of shifting the image of the target video to the left and displaying it in the target display area.
  • on the right side of the image, the first playback window still has part of its display area left over, and this part can display black borders.
  • by placing the second playback window on top, the second playback window can cover part of the background image on the right side of the character in the target video as well as the above black border, so the user will not see them and the viewing experience is not affected.
  • the second playback window may be placed on top by calling setZOrderOnTop(true).
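  • the effect of the left offset can be illustrated without any Android classes by computing which columns end up where. This is a plain-Python sketch with hypothetical names and pixel values, not the surfaceView implementation itself:

```python
def shifted_layout(image_width: int, window_width: int,
                   second_window_width: int, d: int) -> dict:
    """Column ranges (start, end) after shifting the image left by d pixels."""
    # Source columns of the image that remain inside the first window.
    visible_source = (d, min(image_width, d + window_width))
    # Window x coordinate where the shifted image ends.
    image_right = min(image_width - d, window_width)
    # Leftover strip of the first window that shows a black border.
    black_border = (image_right, window_width)
    # Strip covered by the second window, assumed right-aligned and on top.
    covered = (window_width - second_window_width, window_width)
    return {
        "visible_source": visible_source,
        "black_border": black_border,
        "covered_by_second_window": covered,
        "border_hidden": black_border[0] >= covered[0],
    }
```

  • for a 1920-wide image and window, a 480-wide second window and D = 240, the first 240 image columns fall outside the window, the black border occupies window columns 1680 to 1920, and the second window covers columns 1440 to 1920, so the border is fully hidden.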
  • the playback interface of the display device in the follow-up mode is shown in Figure 10.
  • the middle part of the target video image is displayed in the target display area of the first playback window, and the user image is displayed in the second playback window.
  • the display device decodes the target video, scales it according to the parameters of the first playback window, and then displays the image frame at the position obtained by moving it to the left by the first moving distance. Since the image outside the playback area of the first playback window cannot be displayed and the edge of the image frame close to the second playback window is blocked by the second playback window, the central area of the image frame is rendered in the unoccluded area of the first playback window.
  • the second play window plays the acquired video data of the camera.
  • a display area where the target video is not blocked is detected on the display device and determined as the target display area. After the target video is scaled, its image is offset so that it is displayed in the target display area, which solves the problem of poor display effect caused by the target video being blocked by the second playback window when playing in the follow-up mode, and improves the user's viewing experience.
  • The video playback application can also score the user's actions according to preset scoring rules, so that the user can know whether their actions are standard without having to compare them with the actions in the target media asset themselves.
  • One preset scoring rule is: compare the image of the target media asset with the user image in real time, and determine the score of the user action according to the similarity between the user action in the user image and the action in the target media asset. The higher the similarity, the higher the score; the lower the similarity, the lower the score.
  • However, it takes some time for the user to see the action in the target media asset and then perform it. If the user image is captured too early or too late, the user's action score is likely to be low.
  • In addition, the target media asset plays continuously. By the time the user performs an action from the target media asset, the picture may already have switched to another action, which directly leads to a lower user score.
  • Another preset scoring rule is: when the target media asset plays a specific action, the image of the target media asset at that moment is acquired, and then multiple user images are collected continuously. The action in each user image is compared with the action in the image of the target media asset to obtain multiple scores, and the highest score is used as the score of the action, thereby improving scoring accuracy.
  • The specific actions used for scoring in the target media asset may be determined according to an action library, where the action library may include a plurality of sample pictures containing different character actions, together with the action data corresponding to each sample picture.
  • The actions can be common fitness movements, such as squats and hand raises.
  • The action data of the action library may include the coordinate position and type of each skeleton key point of the character in the sample picture, where the skeleton key points may be obtained by a trained skeleton key point detection model.
  • Exemplary skeleton key point types include the nose, neck, left shoulder, left elbow, left wrist, right shoulder, right elbow, right wrist, left hip, left knee, left ankle, right hip, right knee, right ankle, left eye, right eye, left ear, and right ear key points.
  • The skeleton key point detection model can be a model based on a deep neural network.
  • A large number of pictures with manually annotated skeleton key points are input into the deep neural network to train it, so that the network acquires the ability to identify skeleton key points.
  • Skeleton key points can also be obtained by manual annotation.
  • The action data of the action library may further include the positional relationship between adjacent skeleton key points, and different character actions can be distinguished according to these positional relationships.
  • The action data of the action library also includes the action difficulty of the character action in each sample picture, and the action difficulty can be determined by the operator.
  • An exemplary action difficulty range is 0-10, where a larger value means greater difficulty.
  • The action data of the action library further includes an action identifier, and each character action may correspond to a different action identifier. An exemplary action identifier is an action number, by which the other action data and the sample picture corresponding to that number can be quickly found in the action library.
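  • As an illustration only, one entry of such an action library could be modelled as below. This is a minimal sketch: the class name `ActionEntry`, the field names, the coordinates, and the file path are assumptions made for the example and do not come from the application; only the kinds of data (key point types and coordinates, a difficulty in the exemplary 0-10 range, and an action number used for lookup) follow the description above.

```python
from dataclasses import dataclass
from typing import Dict, Tuple


@dataclass
class ActionEntry:
    """One character action in the action library (field names are hypothetical)."""
    action_id: int                              # action number used for lookup
    name: str                                   # e.g. "squat" (illustrative)
    keypoints: Dict[str, Tuple[float, float]]   # key point type -> (x, y) coordinate
    difficulty: int                             # exemplary range 0-10
    sample_picture: str                         # hypothetical path to the sample picture


def build_action_library(entries):
    """Index the entries by action number so that a lookup is a dictionary access."""
    library = {}
    for entry in entries:
        if not 0 <= entry.difficulty <= 10:
            raise ValueError(f"difficulty out of range: {entry.difficulty}")
        library[entry.action_id] = entry
    return library


# Illustrative data only; the coordinates are made up for the example.
squat = ActionEntry(
    action_id=1,
    name="squat",
    keypoints={"left_hip": (0.45, 0.60), "left_knee": (0.44, 0.78)},
    difficulty=4,
    sample_picture="samples/squat.png",
)
library = build_action_library([squat])
```

  • Keying the library by action number mirrors the quick lookup described above; a production library would presumably live in a database rather than in memory.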
  • The process of determining the image frames of the target media asset in which the specific actions used for scoring are located may be referred to as dotting the target media asset. See FIG. 11, which is an interaction diagram of dotting a target media asset according to some embodiments.
  • The operator can dot the target media asset through the first tool processing server, the media asset service server, and the media asset content server.
  • The first tool processing server can dot target media assets, and the action library can be stored in the first tool processing server. The media asset service server may hold the media asset information of each target media asset. The media asset information may be the original information supplied by the provider of the target media asset, such as the media asset playback address, resolution, duration, and type, or it may be media asset information processed by the operator.
  • The processed media asset information may include new information such as a corrected media asset type and media asset labels, with the original media asset type deleted. For example, the corrected media asset type may be fitness.
  • The media asset content server may be the server to which the content provider of the target media asset uploads the video stream file and the original information of the target media asset.
  • The first tool processing server, media asset service server, and media asset content server are distinguished by function. Each server may be deployed on one hardware device or on multiple hardware devices, and all three servers may also be deployed on a single hardware device; this is not specifically limited in the embodiments of the present application.
  • The operator may input a dotting instruction for the target media asset to the first tool processing server, where the dotting instruction may include the media asset ID of the target media asset. Using the media asset ID, the first tool processing server may obtain from the media asset service server the media asset information corresponding to that ID, that is, the media asset information of the target media asset.
  • After a new media asset is uploaded to the media asset content server, the media asset service server may generate corresponding media asset information from the original information of the newly uploaded media asset. The first tool processing server can actively monitor newly generated media asset information on the media asset service server in real time and judge, from the media asset type in the media asset information, whether the newly uploaded media asset should be treated as a target media asset. If the media asset type is a preset dotting type, such as the fitness type, the newly uploaded media asset is treated as a target media asset. If the media asset type is not a preset dotting type, the media asset is skipped, and the server goes on to judge whether the next newly uploaded media asset is a target media asset.
  • In some cases, by the time a new media asset is uploaded to the media asset content server, the content provider has already dotted the media asset and has set a dot label in the original information of the media asset to indicate that the media asset has been dotted.
  • When the media asset service server processes the original information to obtain the media asset information, if it detects a dot label and the dot label conforms to a preset specification, for example the dot label contains timeline-based playback times of video frames, the dot label can be retained in the media asset information; otherwise, the dot label is deleted.
  • Therefore, when processing a target media asset, the first tool processing server can determine whether the media asset information of the target media asset has a dot label. If it does, the first tool processing server can determine that the target media asset has already been dotted; if it does not, the media asset is regarded as a target media asset to be dotted.
  • In some cases, the media asset type in the media asset information generated by the media asset service server initially does not belong to the dotting type, but after a period of time the media asset type changes; for example, a type attribute set to the dotting type is added to the media asset information of the media asset.
  • The first tool processing server can actively monitor changed media asset information on the media asset service server in real time. If the media asset type in the changed media asset information is a dotting type and the media asset information has no dot label, the media asset is determined to be a target media asset to be dotted.
  • After the content provider dots the media asset, it can also generate a dot file, which can be stored in the original information of the media asset. When the media asset service server processes the original information to obtain the media asset information, the dot file can be retained in the media asset information. Therefore, when processing a target media asset, the first tool processing server can determine whether the media asset information of the target media asset contains such a dot file. If it does, the first tool processing server can determine that the target media asset has been dotted; if there is neither a dot file nor a dot label, the media asset can be regarded as a target media asset to be dotted.
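  • The screening described above (asset type first, then existing dot label or dot file) can be sketched as a small function. This is a hedged illustration: the dictionary keys `type`, `dot_label`, and `dot_file`, the function name, and the three return strings are hypothetical; only the fitness type is named in the description as an example of a preset dotting type.

```python
# Preset dotting types; "fitness" is the example given in the description.
DOTTING_TYPES = {"fitness"}


def classify_asset(info):
    """Decide how the first tool processing server treats one media asset.

    `info` is a dict standing in for the media asset information; the keys
    "type", "dot_label" and "dot_file" are hypothetical names.
    """
    if info.get("type") not in DOTTING_TYPES:
        return "skip"                 # not a target media asset
    if info.get("dot_label") or info.get("dot_file"):
        return "already_dotted"       # the content provider dotted it earlier
    return "to_be_dotted"             # target media asset that still needs dotting
```

  • An asset reported as already dotted could still be processed again when the operator issues a re-dotting instruction.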
  • If the target media asset has already been dotted and its media asset information was obtained in response to a dotting instruction, the first tool processing server may generate a prompt message stating that the target media asset has been dotted, so that the operator knows this. If the media asset information of the target media asset was obtained automatically by the first tool processing server from the media asset service server, the target media asset can be skipped and the next target media asset processed.
  • The content provider's dotting method for the target media asset may differ from that of the first tool processing server. Therefore, after learning that the target media asset has been dotted, the operator can input a re-dotting instruction to the first tool processing server so that the first tool processing server dots the target media asset again.
  • For a target media asset to be dotted, the first tool processing server can obtain the video stream file of the target media asset from the media asset content server according to the media asset playback address in the media asset information, parse the video stream file to obtain the video frames of the target media asset, and then examine the video frames one by one, performing character action recognition on each frame. If the recognized character action is one of the character actions in the action library, a dot record is generated, and the dot record includes at least the playback time of the video frame.
  • The first tool processing server may detect the skeleton key points in the video frame with the trained skeleton key point detection model, and then compare the relative positional relationships between adjacent skeleton key points in the video frame with the relative positional relationships between the corresponding skeleton key points in each sample picture of the action library, so as to recognize the character action.
  • For example, the relationship in which the left shoulder, left elbow, and left wrist key points of the video frame lie on a straight line can be compared with the positions of the left shoulder key point and its adjacent key points in the action data corresponding to a sample picture in the action library.
  • When the character action in the video frame matches a character action in the action library, the playback time of the video frame in the target media asset and the action identifier corresponding to the action in the video frame are obtained, and a dot record is generated from them. The dot record may include the playback time of the video frame and the action identifier corresponding to the video frame.
  • The time interval between adjacent video frames is usually on the order of milliseconds. Therefore, the playback time in a dot record can be accurate to the millisecond, which makes it easy to identify the video frame.
  • If the dotted video frames in a target media asset are too dense, the user may not have time to keep up with the actions while the target media asset is playing, which results in low scores.
  • Therefore, when the character action in a video frame is one of the character actions in the action library, it is possible to first judge whether a dotting condition is met. The frame is dotted if the condition is satisfied; otherwise the video frame is skipped and detection continues with the next video frame.
  • An exemplary dotting condition is: when the character action in the video frame is one of the character actions in the action library, if the playback time of the video frame exceeds the playback time of the previous dot record by more than a preset time, the frame can be dotted and a dot record generated. That is, at most one dot is made within the preset time, which can be set to 10 seconds or another duration.
  • Alternatively, to prevent the dotted video frames in the target media asset from being too dense, after a dot is made, no action recognition is performed on the video frames of the target media asset within a preset time after that dot; character action recognition resumes on the video frames after the preset time.
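  • The exemplary dotting condition can be sketched as follows, assuming that character action recognition has already produced, for each video frame, its playback time in milliseconds and either an action identifier or `None`. The function name, the record keys, and the input shape are assumptions made for this illustration; the 10-second gap is the example value from the description.

```python
def generate_dot_records(recognized_frames, min_gap_ms=10_000):
    """Generate dot records from (playback_time_ms, action_id_or_None) pairs.

    A frame is dotted only if it shows an action from the action library and
    its playback time exceeds that of the previous dot record by more than
    `min_gap_ms`, so at most one dot falls within the preset time.
    """
    records = []
    last_time = None
    for time_ms, action_id in recognized_frames:
        if action_id is None:
            continue                              # no library action in this frame
        if last_time is not None and time_ms - last_time <= min_gap_ms:
            continue                              # too close to the previous dot
        records.append({"time_ms": time_ms, "action_id": action_id})
        last_time = time_ms
    return records
```

  • The alternative described above, skipping recognition entirely for the preset time after a dot, yields the same records while avoiding the recognition cost for the skipped frames.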
  • After all the video frames of the target media asset have been examined, or after detection reaches the video frames within a preset time of the last video frame of the target media asset, the dot records can be aggregated with the timeline of the target media asset to generate a dot file and/or a dot label, and the dot file and/or the dot label are stored in the media asset information of the target media asset.
  • Only a dot file may be generated without a dot label, or only a dot label may be generated without a dot file.
  • After generating the dot file and/or the dot label of the target media asset, if the dotting was done in response to a dotting instruction, the first tool processing server may generate a notification message indicating that the target media asset has been dotted, so that the operator knows that dotting of the target media asset is complete. If the target media asset was identified automatically, the next target media asset can be processed.
  • The first tool processing server may further generate a dot library corresponding to the target media asset from the action data of the character actions corresponding to the dot records.
  • The first tool processing server can store the dot library in the media asset information of the target media asset on the media asset service server, and the media asset service server can be configured to send the dot library to the display device when delivering the media asset information of the target media asset.
  • The first tool processing server may also store the dot library locally on the first tool processing server.
  • Based on the dot records obtained in the above embodiments, the display device can capture a user image when playback reaches the playback time of the video frame in a dot record and perform an action comparison on the user action in the user image. The comparison result can also be used to score the user action.
  • FIG. 12 is a schematic diagram of the scoring interaction for a target media asset according to some embodiments. The second tool processing server may interact with the display device, score the user's actions, generate a follow-up record, and feed the follow-up record back to the display device so that the display device can display it.
  • Each server may be deployed on one hardware device or on multiple hardware devices. Both servers may also be deployed on one hardware device, which is not specifically limited in the embodiments of the present application.
  • The display device can detect the dot label in the media asset information, confirm from the dot label that the target media asset supports action scoring, and then obtain the dot file of the target media asset from the media asset information to obtain the dot records of the target media asset.
  • The display device can also examine the media asset information to determine whether it contains a dot file and/or a dot label; if so, the dot records of the target media asset can be obtained from the dot file and/or the dot label.
  • While the target video plays, the user can follow it and make the corresponding actions.
  • When the display device detects that the target video has played to the time corresponding to a dot record, it may acquire the media asset image of the target video at that moment, start collecting multiple user images in time sequence, and send the media asset image and the user images to the second tool processing server.
  • When the target video has played to the time corresponding to a dot record, the display device can upload one user image to the second tool processing server per time interval, uploading a preset number of user images for each dot record. For example, the upload interval may be 100 milliseconds and the preset number 10, or the upload interval may be 50 milliseconds and the preset number 20.
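  • A small sketch of the capture schedule implied by these example values is shown below. Whether the first user image is taken exactly at the dot time or one interval later is not stated in the description; the sketch assumes it is taken at the dot time, and the function name is hypothetical.

```python
def capture_times(dot_time_ms, interval_ms=100, count=10):
    """Playback times (ms) at which user images are captured for one dot record.

    Assumes the first image is captured at the dot time itself.
    """
    return [dot_time_ms + k * interval_ms for k in range(count)]
```

  • Both example settings (100 ms with 10 images, 50 ms with 20 images) span roughly one second after the dot; the second samples the user's movement twice as densely.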
  • The second tool processing server may perform an action comparison between each user image and the media asset image, in the time order of the user images, to obtain the action score of each user image.
  • The action comparison method includes: detecting the skeleton key points in the user image and in the media asset image with the trained skeleton key point detection model, and comparing the relative positions between adjacent skeleton key points in the user image with the relative positions between the corresponding skeleton key points in the media asset image. That is, the action data in the user image is compared with the action data in the media asset image to obtain the relative position errors. From these errors and the action difficulty of the media asset image, the similarity between the user action in the user image and the action in the media asset image is calculated, and the action score of the user action is obtained from the similarity.
  • The mapping between the relative position error and the similarity, and the mapping among the similarity, the action difficulty, and the action score, can be formulated in advance and adjusted. For example, when the number of relative positions whose errors fall within the preset range is held constant, the greater the action difficulty, the higher the action score.
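  • The description leaves the concrete mappings open, so the following is only one possible sketch of such a scoring function. Everything specific in it is an assumption: the two "bones" in `BONES`, the use of the bone direction angle as the relative position, the 15-degree tolerance, and the linear difficulty factor `0.8 + 0.02 * difficulty`. It does preserve the one property stated above: with the number of in-tolerance relative positions held constant, a higher difficulty gives a higher score.

```python
import math

# Hypothetical subset of adjacent key point pairs; a real system would cover the whole skeleton.
BONES = [("left_shoulder", "left_elbow"), ("left_elbow", "left_wrist")]


def bone_angle(pose, a, b):
    """Direction (radians) of the segment from key point a to key point b."""
    (xa, ya), (xb, yb) = pose[a], pose[b]
    return math.atan2(yb - ya, xb - xa)


def action_score(user_pose, ref_pose, difficulty, tolerance_deg=15.0):
    """Score a user pose against a reference pose on a 0-100 scale (assumed scale)."""
    within = 0
    for a, b in BONES:
        diff = abs(bone_angle(user_pose, a, b) - bone_angle(ref_pose, a, b))
        diff = min(diff, 2 * math.pi - diff)      # wrap the difference into [0, pi]
        if math.degrees(diff) <= tolerance_deg:
            within += 1                           # this relative position is within range
    similarity = within / len(BONES)
    # Assumed mapping: difficulty 0 scales the score by 0.8, difficulty 10 by 1.0.
    return 100.0 * similarity * (0.8 + 0.02 * difficulty)
```

  • Comparing bone directions rather than raw coordinates makes the sketch independent of where the person stands in the frame and of their distance from the camera, which is one plausible reason for comparing relative positions between adjacent key points.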
  • The display device may also send a playback instruction for the target media asset to the second tool processing server, so that the second tool processing server, in response to the playback instruction, downloads the action library from the media asset service server or the first tool processing server.
  • In this case, the display device can upload the user image and the action identifier, but not the media asset image.
  • Using the action identifier, the second tool processing server compares the action data in the user image with the action data of the corresponding sample picture in the action library to obtain the action score of the user action.
  • The second tool processing server may instead download the dot library from the first tool processing server in response to a playback instruction for the target media asset, and compare the action data of the user image with the corresponding action data in the dot library of the target media asset to obtain the action score of the user action. This avoids the problems that the action library may be large and that downloading it and searching it for action data are slow.
  • The action library and/or the dot library can also be stored directly on the second tool processing server, which avoids the time spent downloading the action library and/or the dot library.
  • The second tool processing server stops comparing further user images with the media asset image once the termination condition of the current comparison is reached.
  • The termination condition may be that a first preset number of user images have been compared, or that a comparison is needed for the next action (for example, the next media asset image is received), or that a second preset number of consecutive action scores show a downward trend. The first preset number may be 10, and the second preset number may be 3.
  • After seeing the image of the target media asset, the user needs a certain amount of time to perform the action, and after completing the action may return to an initial state, such as standing upright, or proceed to the next action. Therefore, once the time-sequenced user images are scored, the action scores taken in chronological order form an approximately downward-opening parabola. The vertex of the parabola is the highest action score, and this highest score can be used as the follow-up score for the action.
  • The follow-up score can also be determined in other ways. For example, after several of the lowest scores are removed, the average of the remaining scores is used as the follow-up score for the action.
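  • Both ways of deriving the follow-up score from the per-image action scores can be sketched in a few lines. The function name, the `method` strings, and the `drop_lowest` parameter are hypothetical; how many low scores to remove is not specified in the description.

```python
def follow_up_score(scores, method="max", drop_lowest=0):
    """Combine the action scores of one dotted action into a follow-up score.

    method="max"          -> the highest score (the vertex of the score curve)
    method="trimmed_mean" -> mean of the scores left after removing the
                             `drop_lowest` lowest ones
    """
    if not scores:
        raise ValueError("no scores to combine")
    if method == "max":
        return max(scores)
    if method == "trimmed_mean":
        kept = sorted(scores)[drop_lowest:]
        if not kept:
            raise ValueError("drop_lowest removes every score")
        return sum(kept) / len(kept)
    raise ValueError(f"unknown method: {method}")
```

  • For the scores 40, 60, 80, 90, 85, 70, the maximum is 90, while removing the two lowest scores and averaging the rest gives 81.25; the maximum rewards the best moment, and the trimmed mean rewards holding the pose.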
  • FIG. 13 shows a follow-up mode control method. The method is configured in and executed by the controller 250 of the display device, that is, the controller 250 is the execution subject of the method, and the method includes the following program steps:
  • Step S10: in response to receiving an operation to start a training item video, display the training item video in the first window of the follow-up interface, and display in the second window the local image from the video code stream collected and sent by the image collector.
  • This step is the basis and premise of the user's follow-up training, and makes it easy for the user to train under the guidance of the follow-up interface.
  • Step S20: in response to the training item video playing to a key frame, periodically acquire follow-up images corresponding to the key frame from the video code stream.
  • Step S30: compare the follow-up action in each follow-up image with the standard action in the key frame, and obtain the training score of the follow-up action in each follow-up image.
  • If the training item video has not yet played to a key frame at a dot position, it continues to play until a dot is encountered.
  • When a key frame is reached, follow-up images corresponding to the key frame need to be obtained periodically from the video code stream: a preset period can be set, and one frame of follow-up image is obtained every preset period. The follow-up image corresponding to the key frame refers to the image captured while the user imitates the action after watching the standard action in the key frame.
  • For example, one frame of follow-up image is obtained every 100 ms, the follow-up action of the human body identified in the follow-up image is compared with the standard action, and the training score of the follow-up action in each frame of follow-up image is obtained.
  • A time stamp is set for each frame of image in the video code stream collected by the image collector 232. The time stamp of each frame is set by the image collector 232 after applying delay compensation to the acquisition time; the delay compensation is used to eliminate the delay caused by transmitting the image from the image collector 232 to the controller 250.
  • Before periodically obtaining follow-up images from the video code stream, the controller 250 compares the physical time at which playback reached the key frame with the time stamps of the frames in the video code stream to locate the follow-up image corresponding to the key frame, so as to obtain accurate follow-up images.
  • The present application uses physical time, that is, the time of the dot position, rather than the progress bar time to locate and acquire the follow-up image. This matches times more precisely and so improves the accuracy of follow-up image acquisition.
  • The present application also accounts for image transmission delay when setting the time stamp.
  • For example, if the image transmission delay is about 150 ms, that is, an image reaches the controller 250 after a delay of 150 ms, the time stamp of each frame of image in the video code stream can be set to that frame's acquisition time advanced by 150 ms.
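  • The time stamp matching can be sketched as follows. Two points are assumptions of this illustration rather than statements from the application: "advanced by 150 ms" is read as subtracting the delay from the acquisition time, and the follow-up image is taken to be the first frame whose time stamp is not earlier than the physical time at which the key frame was played. The function names are hypothetical.

```python
from bisect import bisect_left


def compensated_timestamp(acquisition_ms, delay_ms=150):
    """Time stamp after delay compensation.

    Assumption: "advanced by 150 ms" means the delay is subtracted.
    """
    return acquisition_ms - delay_ms


def locate_follow_up_frame(timestamps_ms, key_frame_time_ms):
    """Index of the first frame stamped at or after the key frame's physical time.

    `timestamps_ms` must be sorted in ascending order; returns None when no
    such frame has arrived yet.
    """
    i = bisect_left(timestamps_ms, key_frame_time_ms)
    return i if i < len(timestamps_ms) else None
```

  • With frames acquired at 1000, 1100, 1200, and 1300 ms, the compensated time stamps are 850, 950, 1050, and 1150 ms, so a key frame played at physical time 1000 ms maps to the third frame (index 2).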
  • Step S40: calculate the action matching degree between the standard action and the follow-up action according to the maximum of the training scores of the follow-up actions in the follow-up images.
  • Before step S40 is performed, the method further includes: in response to reaching a termination condition, stopping the acquisition of follow-up images from the video code stream.
  • The controller 250 determines that the termination condition is reached in response to the training item video playing to the next key frame. That is, before the standard action at the current dot position switches to the next standard action, acquisition of follow-up images must stop, to ensure that the collected follow-up images relate to the current standard action. In this case, follow-up image acquisition is constrained by the dot positions, and the frames between the two dot positions are acquired.
  • The applicant's research found that, when the user performs follow-up training, the training score of each action approximately follows a downward-opening parabola. That is, as the user gradually adjusts their limbs toward the standard action according to the action standardization prompt information, the training score trends upward.
  • Afterwards, the match between the follow-up action and the standard action gradually decreases, and the training score trends downward.
  • When the training score shows a significant downward trend, or the training score is low, this part of the follow-up images need not be obtained from the video code stream.
  • Each follow-up action score also needs to take the user's reaction time into account.
  • Each time the controller 250 acquires a frame of follow-up image, the acquired frame count is incremented by 1; that is, the count is accurately recorded and updated while follow-up images are being acquired.
  • The second quantity threshold is a preset value, such as 10 frames, which limits the maximum number of follow-up image frames that the controller 250 can obtain for each dot position; the value of the second quantity threshold is not limited.
  • The quantity threshold constrains the acquisition of follow-up images: once the number of follow-up images obtained equals the second quantity threshold, subsequent frames are filtered out. This terminates relatively ineffective follow-up image acquisition and scoring in time, improves how well the score matches the user, improves the efficiency of computing the score and the action matching degree, and improves user experience.
  • Each time the controller 250 acquires a frame of follow-up image, it matches the training score of the corresponding follow-up action, so as to obtain the trend of the training scores.
  • For example, if the i-th frame scores 85 points, the (i+1)-th frame 83 points, and the (i+2)-th frame 80 points, the training score shows a decreasing trend. A lower training score means a lower matching degree between the follow-up action and the standard action, so the higher training scores should be retained and the lower ones filtered out.
  • When the training scores of M consecutive frames show a decreasing trend, the termination condition is reached. M is the first quantity threshold; for example, M can be 3, and the value of the first quantity threshold M is not limited.
  • Using the score trend during the user's follow-up practice terminates relatively ineffective follow-up image acquisition and scoring in time, improves how well the score matches the user, improves the efficiency of computing the score and the action matching degree, and improves user experience.
  • While follow-up images are being acquired, it is necessary to determine whether the termination condition is reached. If the termination condition is not reached, follow-up image frames continue to be acquired every preset period and matched with training scores; if the termination condition is reached, acquisition of follow-up images from the video code stream stops, and step S40 is executed.
  • After the target score, that is, the score the user obtains for following the standard action at the dot position, has been matched, the target score may be recorded to facilitate subsequent calculation of the final score.
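  • The two count-based termination conditions and the choice of the target score can be sketched as below. The function name and input shape are assumptions; `max_frames=10` and `m=3` are the example values of the second and first quantity thresholds, and "M consecutive frames decreasing" is interpreted here as the last M scores being strictly decreasing, as in the 85, 83, 80 example. The third condition, reaching the next key frame, is not modelled.

```python
def collect_training_scores(score_stream, max_frames=10, m=3):
    """Consume per-frame training scores until a termination condition is met.

    Stops when `max_frames` follow-up images have been scored, or when the
    last `m` scores are strictly decreasing (m >= 2 is assumed).
    """
    scores = []
    for score in score_stream:
        scores.append(score)
        if len(scores) >= max_frames:
            break                                  # second quantity threshold reached
        tail = scores[-m:]
        if len(tail) == m and all(a > b for a, b in zip(tail, tail[1:])):
            break                                  # score is on a downward trend
    return scores


collected = collect_training_scores([60, 70, 85, 83, 80, 75, 70])
target = max(collected)   # the target score is the maximum training score
```

  • In this example acquisition stops after the frame scoring 80, because 85, 83, 80 is a run of three decreasing scores, and the target score is 85; the later, lower-scoring frames are never fetched or scored.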
  • Step S50: control the display to show action matching prompt information in the second window according to the action matching degree.
  • The action matching prompt information includes the action accuracy rate. The action matching degree between the standard action and the follow-up action can be calculated from the target score and displayed at a designated position in the second window of the follow-up interface, where it is presented to the user as an accuracy percentage.
  • The action matching prompt information further includes encouraging words. According to the action matching degree, matching encouraging words such as "Good", "Great", and "Perfect" can be displayed in the second window of the follow-up interface. Each encouraging word corresponds to a range of action matching degrees; for example, when the action matching degree is more than 90%, the encouraging word displayed is "Perfect".
  • the action matching prompt information further includes action standardization prompt information
  • the degree of deviation between the follow-up action and the standard action can be measured by the action matching degree. For example, an action matching degree below a preset threshold indicates that the user's follow-up action is not standard, and the user needs to be prompted to correct it.
  • the standard action is displayed in the second window of the follow-up interface.
  • the prompt information helps users understand the deficiencies in their follow-up actions and correct them, so as to raise the training score of the follow-up action until the highest score for the action (i.e., the target score) is reached.
  • the action matching prompt information is not limited to that described in the above embodiments. Any information determined from analysis of the action matching degree belongs to the category of action matching prompt information and can be displayed in the follow-up interface as actually needed.
  • the target score is the maximum value of the training scores in the N frames of training images obtained
  • only the target training image corresponding to the target score may be retained, and the other N-1 training image frames may be deleted.
  • the second window of the follow-up interface displays only the best follow-up action, the one that matches the standard action most closely, and the retained target follow-up image is displayed when the follow-up images for the standard action are viewed.
  • the second window of the follow-up interface displays each follow-up image frame in acquisition order and displays the action standardization prompt information according to the deviation between the follow-up action and the standard action, so that the user can gradually correct the follow-up action until the best follow-up action is reached. After the termination condition is reached, the second window keeps displaying only the target follow-up image corresponding to the best follow-up action (the target score) until the next dotted key frame arrives, at which point the follow-up image acquisition, the UI changes in the second window, and the scoring and selection process described above start again.
  • the follow-up practice for the current training item ends, and the user's final score for the follow-up practice needs to be calculated. Alternatively, the user may exit the training item video after a preset duration has been exceeded, for example after practicing for 2 minutes; in this case the follow-up practice should also end, and the user's final score is calculated.
  • in response to the training item video being played to its end point, the controller 250 calculates the final score. Because the user's follow-up practice has traversed all the dotted standard actions in the video, the training scores (that is, the target scores) of the target follow-up images corresponding to all key frames in the training item video are accumulated to obtain the final score; in other words, the sum of the target scores the user obtained for each standard action is the final score of this follow-up training.
  • in response to an operation of exiting the training item video after the follow-up time exceeds a preset duration, the controller 250 calculates the final score from the part completed. In this case the user has followed only part of the video, for longer than the preset duration, without traversing all the standard actions in the video, so the training scores of the target follow-up images for the key frames traversed so far are accumulated to obtain the final score. For example, if the user quits after watching for 3 minutes and 6 dotted key frames were traversed in those 3 minutes, that is, the user completed the imitation of 6 standard actions before exiting, then the sum of the target scores of these 6 actions is the final score of this follow-up practice.
  • a training report can also be output synchronously, so that the follow-up user can learn the details of the current training.
  • the controller 250 controls the display 275 to display a training report interface as shown in FIG. 17 .
  • the training report interface displays the final score, as well as information such as the energy consumed by the follow-up training, the accuracy rate of the movements, and the training duration to the user.
  • the training report interface can also include retraining controls and switching controls. The retraining controls are used to repeat the training process of the currently ended training item when triggered; the switching controls are used to switch to the next training item list when triggered.
  • the user can click the “return” button on the remote control to exit the training report interface and return to the training item list interface as shown in FIG. 12 .
  • the user can either continue by selecting another video to follow from the training item list, or stop training and exit the fitness application.
  • even when the image collector is not affected by external factors such as power failure, a physical switch being turned off, a fault, or being occupied by another application, the follow-up user is moving and may leave the portrait capture area/aperture during the follow-up practice. As a result, no actual portrait can be recognized in the follow-up images collected by the image collector 232, and the follow-up practice is invalid.
  • the controller 250 controls the display 275 to pause the training item video played in the first window and, at the same time, pauses the statistics accumulated during the follow-up practice, such as the training duration, energy consumption, accuracy rate, target scores corresponding to the dotted key frames, and final score, so that the follow-up mode is suspended while waiting for the user's portrait to be recognized again.
  • the second window is controlled to display prompt information prompting the user to move into the portrait capture area/aperture so that the portrait can be recognized again; the UI at this time is as shown in FIG. 18.
  • the training item video in the first window resumes from the current paused frame, and the training duration, energy consumption, accuracy rate, target score, final score, and so on continue to be accumulated on the basis of the original training data.
  • when the training item video is played to a key frame, one follow-up image frame is acquired every preset period and compared with the key frame to give a training score for the follow-up action in each follow-up image. When the termination condition is met, acquisition of follow-up images from the video stream stops; the highest training score among the acquired follow-up image frames is then selected as the target score corresponding to this key frame, and the action matching degree is calculated from the target score.
  • the acquisition of follow-up images is constrained by the second quantity threshold or by the trend of the training scores, so the acquisition and scoring of invalid follow-up images can be terminated in time, which reduces the processing resources consumed by the controller 250 and improves the scoring efficiency and accuracy of the follow-up mode.
  • this application collects a certain number of follow-up images within a certain period of time and takes the highest score as the target score corresponding to the standard action, so that the score for each action reflects the best matching degree. This avoids the effect of delayed user reaction on the score, improves the user experience, and increases the user's confidence in and enthusiasm for training.
  • the second tool processing server can send the follow-up score to the display device so that the display device displays a score prompt corresponding to the score. Referring to FIG. 14, the score prompt may be "GOOD", and it may be superimposed on the user image.
  • the second tool processing server may calculate the accuracy rate of the user's actions from the follow-up scores accumulated since the target media asset started playing, and send the accuracy rate to the display device so that the display device can display it.
  • if the user wants to stop the follow-up practice, the user can input an instruction to end video playback to the display device.
  • the server sends the information of the end of the follow-up practice
  • the second tool processing server receives the information that the follow-up practice has ended, generates a follow-up practice record from all the follow-up scores, and then sends the record to the display device so that the display device can display it to the user.
  • the display device may send information that the follow-up practice has ended to the second tool processing server; on receiving it, the second tool processing server generates a follow-up practice record based on all the follow-up scores and sends the record to the display device, so that the display device can display the follow-up practice record to the user.
  • the follow-up record can display the training score, energy consumption, accuracy and training duration.
  • the training score can be the average of the follow-up scores, the accuracy can be the average of the similarity values, the training duration is the playback duration of the target media asset, and the energy consumption can be determined according to preset calculation rules.
  • before or during the follow-up practice, the display device can also handle some abnormal situations. For example, when the controller of the display device cannot receive a signal from the camera assembly, the display device can pause playback of the target media asset and display an abnormality prompt. Referring to FIG. 16, the abnormality prompt may include "Camera not detected" and may be displayed in the user image window.
  • the second tool processing server may also handle some abnormal situations. For example, if the second tool processing server does not detect a skeleton key point in the user image, it may send an abnormality prompt and a playback pause instruction to the display device, so that the display device pauses playback of the target media asset according to the playback pause instruction and displays the abnormality prompt.
  • the abnormality prompt may include "There is no one in front of the camera, pause playback" and may be displayed in the user image window.
  • the handling of abnormal situations by the second tool processing server further includes the following: during the follow-up practice, if the second tool processing server detects that the positions of the skeleton key points in the user image do not change within a period of time, it can send an abnormality prompt and a playback pause instruction to the display device, so that the display device pauses playback of the target media asset according to the playback pause instruction and displays the abnormality prompt.
  • the abnormality prompt can be two arrows pointing to the person in the user image, and it can be displayed in the user image window.
  • the second tool processing server can perform both the scoring of user actions and the abnormality handling. Because complex data processing such as skeleton point detection and score calculation is performed by the server, the hardware requirements on the display device are low, which helps the display device run smoothly.
  • the operations performed by the second tool processing server can also be performed by the display device. In that case the display device needs to download the action library or the management library before scoring, and there is no need to interact with the second tool processing server during scoring, which reduces the use of network resources.
  • user images can be scored against the dotted video frames. This solves a problem of real-time comparison, in which the target media asset may already have played on to another action by the time the user performs an action, so that the user's action score is low; the scoring accuracy of the follow-up mode is thereby improved.
  • multiple scores are obtained by comparing multiple user images with the video frame corresponding to the dot record, and the highest score is taken as the follow-up score, which reduces the probability of a low follow-up score.
  • further, when the target media asset is dotted, the dots are spaced a certain number of video frames apart, so that overly dense dotting does not leave the user unable to keep up with each action, which enhances the user experience.
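The per-key-frame scoring flow described in the items above can be illustrated with a minimal Python sketch. It is not the applicant's implementation. It assumes, for illustration only, that the termination condition is met either when the training score has declined for M consecutive frames (the trend rule, with M as the first quantity threshold) or when a maximum number of frames has been acquired (standing in for the second quantity threshold). The function names, parameter names, and sample scores are hypothetical.

```python
def collect_target_score(scores, m=3, max_frames=10):
    """Consume per-frame training scores in acquisition order.

    Stops when the score has declined for `m` consecutive frames
    (assumed trend rule) or when `max_frames` frames have been used
    (assumed upper bound). Returns (target_score, frames_used), where
    target_score is the highest training score seen before stopping.
    """
    best = None       # highest training score so far (the target score)
    previous = None   # score of the previous frame
    declines = 0      # length of the current run of declining scores
    used = 0          # number of follow-up frames scored
    for score in scores:
        used += 1
        if best is None or score > best:
            best = score
        if previous is not None and score < previous:
            declines += 1
        else:
            declines = 0
        previous = score
        if declines >= m or used >= max_frames:
            break  # termination condition reached: stop acquiring frames
    return best, used


def final_score(target_scores):
    """Final score as the sum of the target scores of the traversed key frames."""
    return sum(target_scores)


# Hypothetical scores for one key frame: three consecutive declines end the run.
target, frames = collect_target_score([70, 88, 83, 80, 75, 90], m=3)
print(target, frames)
print(final_score([88, 92, 75]))
```

In this sketch the sixth frame is never scored, because the three consecutive declines after the score of 88 already satisfy the assumed termination rule; this corresponds to stopping relatively ineffective acquisition early while still keeping the best score.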

Abstract

Embodiments of the present application provide a display device and a media asset playing method. The display device comprises: a display; and a controller, connected to the display. The controller is configured to: receive a media asset playing instruction inputted by a user; in response to the media asset playing instruction, obtain a target video corresponding to the media asset playing instruction; when a control is not set above a first playing window corresponding to the target video, play the target video in the first playing window; and when the control is set above the first playing window corresponding to the target video, move a display position of the target video to a direction distant from the control in the first playing window, so that a center position of a picture of the target video is close to a center position of a target display area not shielded by the control in the first playing window for display. The control is non-transparent and shields one side of the first playing window. The present application improves the display effect of media asset playing.

Description

Display Device and Media Asset Playback Method
This application claims priority to Chinese patent application No. 202011102193.3 filed on October 15, 2020, Chinese patent application No. 202110275148.6 filed on March 15, 2021, and Chinese patent application No. 202110448074.1 filed on April 25, 2021, the entire contents of which are incorporated herein by reference.
Technical Field
The present application relates to the field of display technologies, and in particular to a display device and a media asset playback method.
Background
Nowadays, exercising along with fitness videos on a television is a popular way to work out. To help users judge whether their movements are standard, some televisions can capture user images through a camera and play the user images and the fitness video on the television at the same time, so that users can see their own movements on the television and compare them with the movements in the fitness video.
At present, most televisions have a display ratio of 16:9, and fitness videos usually have a 16:9 ratio as well. If the television plays only the fitness video, the video can be displayed in full screen. If the fitness video and the user image are displayed at the same time, however, the user image occupies part of the television's display area, so the display area of the fitness video may not be 16:9. In the related art, when the ratio of a video does not match the ratio of the playback window, the video is usually scaled down so that it fits within the playback window. This leaves black borders around the video and makes the video smaller. For a fitness video, a smaller picture makes some movements hard to see clearly, which seriously affects the user's viewing experience.
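The size loss described above can be checked with a short calculation. The sketch below is illustrative only: the helper name `fit_inside` and the window size of 1440x1080 (a 1920x1080 screen with a hypothetical 480-pixel strip taken by the user image) are assumptions, not values from this application.

```python
from fractions import Fraction


def fit_inside(video_w, video_h, win_w, win_h):
    """Scale a video to fit entirely inside a window, keeping its aspect ratio.

    Returns (scaled_w, scaled_h, bar_x, bar_y), where bar_x and bar_y are the
    widths of the black borders on each horizontal and vertical side.
    """
    scale = min(Fraction(win_w, video_w), Fraction(win_h, video_h))
    scaled_w = int(video_w * scale)
    scaled_h = int(video_h * scale)
    bar_x = (win_w - scaled_w) // 2
    bar_y = (win_h - scaled_h) // 2
    return scaled_w, scaled_h, bar_x, bar_y


# A 16:9 video in a hypothetical 4:3 window left over beside the user image.
print(fit_inside(1920, 1080, 1440, 1080))
```

Under these assumed dimensions the video shrinks to 1440x810 with 135-pixel black borders above and below, so the picture covers only 56.25% of its full-screen area.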
Summary of the Invention
In a first aspect, the present application provides a display device, the display device comprising:
a display; and
a controller connected to the display, the controller being configured to:
receive a media asset playback instruction input by a user;
in response to the media asset playback instruction, obtain a target video corresponding to the media asset playback instruction;
when no control is set above a first playback window corresponding to the target video, play the target video in the first playback window; and
when the control is set above the first playback window corresponding to the target video, move the display position of the target video within the first playback window in a direction away from the control, so that the center of the picture of the target video is displayed close to the center of a target display area of the first playback window that is not blocked by the control, wherein the control is opaque and blocks one side of the first playback window.
In a second aspect, the present application provides a media asset playback method, the method comprising:
receiving a media asset playback instruction input by a user;
in response to the media asset playback instruction, obtaining a target video corresponding to the media asset playback instruction;
when no control is set above a first playback window corresponding to the target video, playing the target video in the first playback window; and
when the control is set above the first playback window corresponding to the target video, moving the display position of the target video within the first playback window in a direction away from the control, so that the center of the picture of the target video is displayed close to the center of a target display area of the first playback window that is not blocked by the control, wherein the control is opaque and blocks one side of the first playback window.
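The position adjustment in the last step can be sketched as a simple offset calculation. This is a minimal illustration under stated assumptions, not the claimed implementation: the control is assumed to span the full height of the window on its left or right side, the video is assumed to fill the window before it is moved, and the shift is chosen so that the two centers coincide exactly (the text above only requires them to be close). The function and parameter names are hypothetical.

```python
def shifted_video_offset(window_w, control_w, control_side="right"):
    """Horizontal offset, in pixels, to apply to a video that fills the window.

    A negative result moves the video left, a positive result moves it right.
    """
    if not 0 <= control_w <= window_w:
        raise ValueError("control width must lie within the window width")
    if control_w == 0:
        return 0  # no control above the window: play the video in place
    if control_side == "right":
        area_left, area_right = 0, window_w - control_w
    elif control_side == "left":
        area_left, area_right = control_w, window_w
    else:
        raise ValueError("control_side must be 'left' or 'right'")
    area_center = (area_left + area_right) / 2  # center of the unblocked area
    video_center = window_w / 2                 # center of the unmoved video
    return area_center - video_center


# Hypothetical 1920-pixel-wide window with a 480-pixel control on the right.
print(shifted_video_offset(1920, 480, "right"))
```

With a control of width c on one side, the offset is c/2 away from the control. The video keeps its full size; the part pushed beyond the far edge of the window is simply not shown in this sketch.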
Brief Description of the Drawings
FIG. 1 is a schematic diagram of an operation scenario between a display device and a control apparatus in some embodiments;
FIG. 2 is a schematic diagram of a fitness home page in some embodiments;
FIG. 3 is a schematic diagram of a media asset details interface in some embodiments;
FIG. 4 is a schematic diagram of a playback mode selection interface in some embodiments;
FIG. 5 is a schematic diagram of a full-screen playback interface in the normal mode in some embodiments;
FIG. 6 is a schematic diagram of a dual-window playback interface in the follow-up mode in some embodiments;
FIG. 7 is a schematic diagram of a dual-window playback interface in the follow-up mode in some embodiments;
FIG. 8 is a schematic diagram of image movement in some embodiments;
FIG. 9 is a schematic diagram of the effect after the image is moved in some embodiments;
FIG. 10 is a schematic diagram of a display interface in some embodiments;
FIG. 11 is a schematic diagram of the dotting interaction for a target media asset in some embodiments;
FIG. 12 is a schematic diagram of the scoring interaction for a target media asset in some embodiments;
FIG. 13 is a flowchart of a scoring method during follow-up practice in some embodiments;
FIG. 14 is a schematic diagram of scoring during follow-up practice in some embodiments;
FIG. 15 is a schematic diagram of scoring after follow-up practice ends in some embodiments;
FIG. 16 is a schematic diagram of an exception handling interface of a display device in some embodiments;
FIG. 17 is a schematic diagram of an exception handling interface of a display device in some embodiments;
FIG. 18 is a schematic diagram of an exception handling interface of a display device in some embodiments.
Detailed Description
To make the purpose and implementation of the present application clearer, the exemplary embodiments of the present application are described clearly and completely below with reference to the accompanying drawings of those embodiments. Obviously, the described exemplary embodiments are only some of the embodiments of the present application, not all of them.
FIG. 1 is a schematic diagram of an operation scenario between a display device and a control apparatus according to an embodiment. As shown in FIG. 1, a user can operate the display device 200 through a smart device 300 or a control apparatus 100.
In some embodiments, the control apparatus 100 may be a remote control. Communication between the remote control and the display device includes infrared protocol communication, Bluetooth protocol communication, and other short-range communication methods, and the display device 200 is controlled wirelessly or by wire. The user can control the display device 200 by entering user instructions through keys on the remote control, voice input, control panel input, and the like.
In some embodiments, a smart device 300 (such as a mobile terminal, tablet computer, computer, or notebook computer) may also be used to control the display device 200, for example through an application running on the smart device.
In some embodiments, the display device 200 may also be controlled in ways other than through the control apparatus 100 and the smart device 300. For example, the user's voice instructions may be received directly by a voice instruction acquisition module configured inside the display device 200, or by a voice control device provided outside the display device 200.
In some embodiments, the display device 200 also performs data communication with a server 400. The display device 200 may be allowed to establish communication connections through a local area network (LAN), a wireless local area network (WLAN), and other networks. The server 400 may provide various content and interactions to the display device 200. The server 400 may be one cluster or multiple clusters, and may include one or more types of servers.
In some embodiments, the user may input a user command on a graphical user interface (GUI) displayed on the display 260, in which case the user input interface receives the command through the GUI. Alternatively, the user may input a user command by making a specific sound or gesture, in which case the user input interface receives the command by recognizing the sound or gesture through a sensor.
In some embodiments, a "user interface" is a medium interface for interaction and information exchange between an application or operating system and a user; it converts between the internal form of information and a form acceptable to the user. A common form of user interface is the graphical user interface (GUI), which is a user interface related to computer operations that is displayed graphically. It may be an interface element such as an icon, window, or control displayed on the display screen of an electronic device, where controls may include visual interface elements such as icons, buttons, menus, tabs, text boxes, dialog boxes, status bars, navigation bars, and widgets.
In some embodiments, after startup the display device may directly enter the interface of a preset video-on-demand program. As shown in FIG. 2, this interface includes at least a navigation bar and a content display area below the navigation bar, and the content shown in the content display area changes as the selected control in the navigation bar changes. Programs in the application layer may be integrated into the video-on-demand program and presented through a control in the navigation bar, or may be further displayed after the application control in the navigation bar is selected.
In some embodiments, after startup the display device may directly enter the display interface of the last selected signal source, or a signal source selection interface. The signal source may be at least one of the preset video-on-demand program, an HDMI interface, a live TV interface, and the like. After the user selects a signal source, the display shows the content obtained from that source.
In some embodiments, as shown in FIG. 2, the navigation bar may be provided with multiple controls, such as "My", "Channel", "Video", "Fitness", "VIP", "Education", "Mall", "Games", and applications. Different navigation bar controls correspond to different channel interfaces. A user who wants to exercise can select the "Fitness" control; the interface shown in FIG. 2 is the interface displayed after the "Fitness" control is selected, and in it the user can select a fitness video to exercise along with.
Referring to FIG. 3, after the user clicks a fitness video on the interface shown in FIG. 2, the display device requests the corresponding details page data from the server according to the configuration parameters of the selected video control, and then enters the media asset details interface shown in FIG. 3 according to the received details page data. As shown in FIG. 3, the media asset details interface may display multiple course section controls for the fitness video, or may include none. After the user clicks one of the course section controls or the start training control, the display device determines the fitness video corresponding to that course section control or details page and then enters the playback mode selection interface. In the following description, the fitness video being played is also referred to as the target video.
Referring to FIG. 4, the playback mode selection interface may present three mode controls. The playback mode corresponding to the first mode control is the normal mode, also called the first mode; the playback mode corresponding to the second mode control is the follow-up mode, also called the second mode; and the playback mode corresponding to the third mode control is the viewing mode, also called the third mode. Each mode control may display an explanation of its playback mode. For example, the explanation of the normal mode may be "Camera off: watch the complete teaching video to become familiar with the training movements"; the explanation of the follow-up mode may be "Camera on: get real-time comparison of your movements to make them more standard"; and the explanation of the viewing mode may be "Camera off: exercise while you watch without compromising your workout".
As these explanations indicate, in some embodiments the modes behave as follows. In the normal mode, the display device does not start the camera and sets only one playback window on a new interface to play the fitness video. In the follow-up mode, the display device starts the camera and sets two playback windows on a new interface of the display, which simultaneously play the images captured by the camera and the fitness video. In the viewing mode, the display device does not start the camera and sets two playback windows on a new interface of the display, which simultaneously play a fitness video and a movie.
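The three playback modes can be summarized as a small lookup table. The sketch below only restates the paragraph above in code form; the type names, field names, and window content labels are hypothetical and are not identifiers from this application.

```python
from dataclasses import dataclass
from enum import Enum


class PlaybackMode(Enum):
    NORMAL = 1     # first mode
    FOLLOW_UP = 2  # second mode
    VIEWING = 3    # third mode


@dataclass(frozen=True)
class ModeLayout:
    camera_on: bool  # whether the display device starts the camera
    windows: tuple   # what each playback window on the new interface plays


LAYOUTS = {
    PlaybackMode.NORMAL: ModeLayout(False, ("fitness_video",)),
    PlaybackMode.FOLLOW_UP: ModeLayout(True, ("fitness_video", "camera_image")),
    PlaybackMode.VIEWING: ModeLayout(False, ("fitness_video", "movie")),
}

for mode, layout in LAYOUTS.items():
    print(mode.name, layout.camera_on, len(layout.windows))
```

Only the follow-up mode starts the camera, and only the normal mode uses a single playback window.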
在一些实施例中,显示设备可设置有摄像头,摄像头可包括升降摄像头或非升降摄像头,摄像头可拍摄用户图像,得到本地摄像头数据,显示设备的控制器可将摄像头拍摄到的本地摄像头数据显示在显示设备的显示器上,使用户在显示器上看到自己的动作。In some embodiments, the display device may be provided with a camera, and the camera may include an elevating camera or a non-elevating camera, the camera may capture user images to obtain local camera data, and the controller of the display device may display the local camera data captured by the camera on the On the display of the display device, the user can see his actions on the display.
In some embodiments, the display device has no built-in camera but can be connected to one, for example an external camera connected via USB, which captures images of the user. The controller of the display device may show the local data captured by that camera on the display of the display device.
Based on the explanations above, the user may select a playback mode in which to watch the fitness video.
In some embodiments, if the user clicks the normal mode control on the playback mode selection interface shown in FIG. 4, the display device generates a media asset playback instruction that includes the playback mode and information about the target video. Here the playback mode is the normal mode, and the information about the target video includes its playback address. According to the instruction, the display device obtains the video data stream of the target video from the playback address and, because the playback mode is the normal mode, generates on a new interface a first playback window for playing the target video. In some embodiments, the first playback window may be a full-screen window with a 16:9 aspect ratio.
In some embodiments, because fitness videos usually have a 16:9 aspect ratio and the full-screen window also has a 16:9 aspect ratio, the two ratios match. In the normal mode, the display device generates only the first playback window and no other window, so nothing occludes the content of the first playback window. After generating the first playback window, the display device can therefore, given that the playback mode in the media asset playback instruction is the normal mode, play the fitness video directly in full screen in the first playback window.
In some embodiments, to play the fitness video in full screen in the first playback window, the display device may scale the images of the target video so that the scaled image size matches the size of the full-screen window.
In some embodiments, the display device may scale the target video as follows: parse the video data stream of the target video to obtain its video frame sequence; take the first frame of the sequence; obtain the scaling ratio from the ratio of that image's height to the height of the target display area; and then scale the video frame sequence of the target video by that ratio. For example, if the image height of the target video is 100 and the height of the full-screen window of the display device is 1000, where height may be measured as the number of pixels in the vertical direction, the ratio is 100:1000 = 1:10. Enlarging the video frame sequence of the target video 10 times according to this ratio lets the enlarged image fill the entire full-screen window.
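The height-based scaling above can be illustrated with a short sketch. The function names `scale_factor` and `scaled_size` are hypothetical, and rounding to whole pixels is an assumption made for the illustration.

```python
def scale_factor(image_height: int, window_height: int) -> float:
    """Factor by which each frame is enlarged so its height matches the window."""
    if image_height <= 0 or window_height <= 0:
        raise ValueError("heights must be positive")
    return window_height / image_height


def scaled_size(image_width: int, image_height: int, window_height: int) -> tuple:
    """Scaled (width, height) using the same factor in both directions.

    Rounding to whole pixels is an assumption of this sketch.
    """
    factor = scale_factor(image_height, window_height)
    return round(image_width * factor), round(image_height * factor)
```

With the figures from the text, an image of height 100 in a window of height 1000 gives a factor of 10, and the width is enlarged by the same factor so the picture is not distorted.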
In some embodiments, after scaling the target video, the display device obtains the scaled video frame sequence and sends it to the first playback window so that the first playback window can play the sequence continuously.
FIG. 5 is a schematic diagram of the full-screen playback interface in the normal playback mode according to some embodiments. As shown in FIG. 5, in the normal playback mode the target video can be played in full screen. The person in FIG. 5 may be a fitness coach, and the lines may represent the background. Typically, the person is centered in the images of the target video, with background on both the left and the right behind the person.
In some embodiments, in the display interface of FIG. 5 the user can call up a control list that contains a switching control; after the user selects the switching control, the display switches to the follow-along mode of FIG. 6. Alternatively, while the interface of FIG. 5 is displayed, a preset key can switch the display to the follow-along mode of FIG. 6. The display can likewise be switched from FIG. 6 back to FIG. 5 by the same means.
In some embodiments, if the user clicks the follow-along mode control on the playback mode selection interface shown in FIG. 4, the display device generates a media asset playback instruction that includes the playback mode and information about the target video. Here the playback mode is the follow-along mode, and the information about the target video includes its playback address. According to the instruction, the display device obtains the video data stream of the target video from the playback address and, because the playback mode is the follow-along mode, generates on a new interface a first playback window for playing the target video and a second playback window for playing the local camera data.
In some embodiments, the second playback window is superimposed on top of the first playback window.
In some embodiments, the second playback window has the same height as the first playback window, and either the left border of the second playback window coincides with the left border of the first playback window, or the right border of the second playback window coincides with the right border of the first playback window.
In some embodiments, a window is positioned by setting its coordinate parameters in the interface.
In some embodiments, referring to FIG. 6, in the follow-along mode the first playback window may be a full-screen window with a 16:9 aspect ratio, and the second playback window may be a window as tall as the display, with an aspect ratio determined by the capture parameters of the camera. The second playback window may be overlaid, as a texture, on one side of the first playback window. For example, it may be displayed on the right side of the first playback window with its right edge coinciding with that of the first playback window, thereby occluding the right-hand display area of the first playback window. If the display device scaled the target video to the size of the full-screen window and displayed the scaled image directly in that window, then under normal window display logic the part of the image behind the second playback window could not be seen, and the user would see only the part that the second playback window does not occlude. When the target video is a fitness video, the user needs to follow the movements of the fitness coach, who is the person in the target video and is usually located in the middle of the image. The second playback window may occlude part of the coach's body, which degrades the viewing experience.
In some embodiments, to reduce the probability that the fitness coach's body is occluded, the display device may, in the follow-along mode, determine a target display area in the first playback window. The target display area is the area not occluded by the second playback window. The display device may display the image of the target video shifted to the left in the target display area, so that the user can see a relatively complete view of the exercise movements there.
In some embodiments, the target display area is determined from the position coordinates of the first playback window and those of the second playback window. Subtracting the position coordinates of the second playback window from those of the first playback window yields the position coordinates of the target display area.
It should be noted that the target display area is the preferred display area within the first playback window: content displayed there is not occluded by other images. In the follow-along mode, the display device shows a second playback window in addition to the first, and the second playback window partially occludes the first. The display device may therefore take the area of the first playback window that lies to the left of the second playback window and is not occluded by it as the target display area. Referring to FIG. 7, the second playback window is superimposed on the right side of the first playback window, and the area on the left side of the first playback window that is not occluded by the second playback window can be determined as the target display area of the first playback window.
It should also be noted that, in the follow-along mode, the first playback window is occluded by the second playback window, and the display device determines the target display area from the position of the second playback window on the first playback window. In some video playback scenarios, however, the first playback window used for video playback may be occluded not by a second playback window that plays local camera data but by some other image, for example an opaque control that displays a picture. In that case, the method provided by the embodiments of the present application can likewise be used to determine a target display area and play the video within it, achieving a better playback effect. If the control occludes the first playback window in the same position as the second playback window would, that is, it covers one side of the first playback window and one width edge of the control coincides with one width edge of the first playback window, the target display area can be determined as the rectangular area of the first playback window that is not occluded by the control. If none of the control's edges coincides with the first playback window, the target display area would be small no matter which side of the control it were placed on. In that case, the entire display area of the first playback window can be taken as the target display area, that is, the target video is played in full screen in the first playback window.
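The rule for choosing the target display area can be sketched as follows. This is an illustration under stated assumptions: rectangles are given as (left, top, right, bottom) pixel coordinates, the occluding window or control is treated as "docked" when it spans the full height of the first playback window and reaches its left or right edge, and the names `Rect` and `target_display_area` are hypothetical.

```python
from typing import NamedTuple


class Rect(NamedTuple):
    left: int
    top: int
    right: int
    bottom: int


def target_display_area(window: Rect, overlay: Rect) -> Rect:
    """Unoccluded rectangle of `window` when `overlay` is docked on one side.

    If the overlay is not docked on the left or right edge (for example a
    floating control), the whole window is returned, as in the text.
    """
    full_height = overlay.top <= window.top and overlay.bottom >= window.bottom
    if full_height and overlay.right >= window.right \
            and window.left < overlay.left < window.right:
        # Overlay docked on the right: keep the strip to its left.
        return Rect(window.left, window.top, overlay.left, window.bottom)
    if full_height and overlay.left <= window.left \
            and window.left < overlay.right < window.right:
        # Overlay docked on the left: keep the strip to its right.
        return Rect(overlay.right, window.top, window.right, window.bottom)
    return window
```

For a 1920x1080 first playback window and a 480-pixel-wide second playback window docked on the right, the sketch returns the left 1440 pixels as the target display area.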
In some embodiments, in the follow-along mode the target display area is only part of the display area of the first playback window, so its aspect ratio may not be 16:9. To obtain a better playback effect, the target video can first be scaled, and the scaled image can then be shifted to the left so that the center line of the shifted image is close to, or coincides with, the center line of the target display area. The user can then see a relatively complete image of the person within the target display area.
In some embodiments, the software need not determine the target display area during execution; instead, it directly determines the first moving distance of the target video from the position parameters of the first and second playback windows in the playback interface. For example, the target video is scaled while the player plays it. To keep the displayed content undistorted, the image is generally scaled by the same factor in the height and width directions. The usual rule is to derive the scaling factor from the height (or the width) and then scale the image so that the scaled image fills the playback window in that dimension; black bars may be inserted in the other dimension. Either the height direction or the width direction may serve as the reference. The first distance by which the image of the target video needs to be moved during display can then be determined from the width parameter of the first player and the width parameter of the second player, so that the middle of each target video frame is displayed, as far as possible, in the unoccluded area of the first video window.
Referring to FIG. 8, after the image of the target video is shifted to the left by the first moving distance D, the person in the target video is closer to the left side of the picture than before the shift.
Referring to FIG. 9, after the image of the target video is shifted to the left by the first moving distance D, the background to the right of the person in the target video can be covered by the second playback window. The second playback window then does not occlude the person in the first playback window, which gives a good display effect.
In some embodiments, the staggered depiction of the first and second playback windows in FIG. 6 and FIG. 9 merely conveys that there are two independently controlled playback windows; it does not represent the actual display effect after superimposition.
To calculate the first moving distance D, in some embodiments the image of the target video is scaled and the image width after scaling to the size of the first playback window is obtained; this width may be called the width to be displayed. The width of the second playback window is then obtained, and subtracting it from the width to be displayed gives the width of the target display area. Half of the difference between the width to be displayed and the width of the target display area is taken as the first moving distance. With this method, the first moving distance D by which the target video is shifted to the left equals the width on the right side of the target video that is occluded by the second playback window, so the center line of the target video coincides with the center line of the target display area. This method may be called the averaging method; it is simple and fast, and quickly yields the first moving distance D.
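The averaging method can be written out in a few lines. This is a minimal sketch; the function name and parameter names are hypothetical, and the widths are assumed to be in pixels.

```python
def first_moving_distance(display_width: float, second_window_width: float) -> float:
    """Averaging method: half the gap between the width to be displayed
    and the width of the target display area."""
    target_width = display_width - second_window_width
    return (display_width - target_width) / 2
```

For a width to be displayed of 1920 and a second playback window 480 wide, the target display area is 1440 wide and D is 240. The center line of the shifted image then sits at 1920 / 2 - 240 = 720, which is also the center line of the 1440-wide target display area.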
In some embodiments, the first moving distance may be any distance smaller than the difference between the widths of the two playback windows.
In some embodiments, the first moving distance is not greater than the width of the second playback window.
In some embodiments, the first moving distance may be determined directly from the width of the second playback window. For example, half of the width of the second playback window is taken as the distance to be moved, and the first moving distance is not greater than the distance to be moved.
In some embodiments, the first moving distance may also be obtained from the difference between the width of the scaled video and the width of the second playback window; for example, half of that difference may be taken as the first moving distance.
In some embodiments, the first moving distance is determined from the position parameters of the playback windows.
In some embodiments, the fitness coach in the target video may not be located in the middle of the image. If the first moving distance D were still set equal to the width on the right side of the target video occluded by the second playback window, the coach would appear too far to the left or right, giving a poor display effect. To solve this problem, other methods can be used to calculate the first moving distance D. For example, human body recognition can be performed on the image frames of the target video. Once a human body, that is, the fitness coach, is recognized, a region is extended symmetrically to both sides of the body's central axis until the width of the region containing the body equals the width of the target display area. The difference between the starting point of that region along the width direction and the starting point of the first playback window along the width direction is then taken as the first moving distance. The first moving distance obtained in this way may differ from that obtained by the averaging method above: if the coach is to the left in the media asset image of the target video, it is smaller than the distance given by the averaging method; if the coach is to the right, it is larger.
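The person-centered calculation can be sketched as below, taking the horizontal position of the body's central axis as an input; how that axis is detected is outside the sketch. The function name is hypothetical, and clamping the result to the range from 0 to the second playback window's width is an assumption added here so that the region stays inside the scaled image.

```python
def person_centered_distance(axis_x: float, display_width: float,
                             second_window_width: float) -> float:
    """First moving distance that centers the detected person in the target area.

    axis_x is the x coordinate of the body's central axis in the scaled image.
    """
    target_width = display_width - second_window_width
    # Left edge of a region of target_width centered on the body axis.
    region_left = axis_x - target_width / 2
    # Assumption of this sketch: keep the region inside the scaled image.
    return min(max(region_left, 0.0), float(second_window_width))
```

With a 1920-wide image and a 480-wide second playback window, a centered coach (axis at 960) gives 240, the same as the averaging method; an axis at 800 gives 80 and an axis at 1100 gives 380, matching the smaller and larger cases described in the text.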
After the first moving distance D is obtained, in some embodiments the display device may display the image of the target video with an offset by dynamically configuring how the surfaceView is presented. The offset call for the surfaceView may be: layoutParam.setMargins(0-D,0,0,0). As shown in FIG. 9, this call sets a left offset of D, so that the left starting point of the media asset image is (0-D), while the left starting point within the first playback window is 0; when displaying an image, the first playback window starts from position 0. Consequently, for every frame of the target video, the first playback window starts displaying the media asset image from the pixels to the right of the first moving distance D. The part of the image within the first moving distance D falls outside the display range of the first playback window and is not shown, which produces the effect of displaying the image of the target video shifted to the left. On the right side of the image, part of the first playback window's display area is left empty and may show a black bar. By placing the second playback window on top, the second playback window covers both the part of the background to the right of the person in the target video and the black bar, so the user sees neither and the viewing experience is not affected. The second playback window may be placed on top by calling setZOrderOnTop(true).
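The effect of the negative left margin can be checked with a small model of the layout arithmetic. The sketch below does not use any Android API; it only assumes that the scaled image is as wide as the first playback window, that the second playback window is docked on the right, and that 0 <= D <= the second window's width. Both function names are hypothetical.

```python
def visible_source_columns(display_width: int, second_window_width: int, d: int) -> tuple:
    """Half-open range of scaled-image columns shown in the unoccluded area.

    With a left margin of -d, window column x shows image column x + d.
    """
    if not 0 <= d <= second_window_width:
        raise ValueError("d must lie between 0 and the second window's width")
    target_width = display_width - second_window_width
    return d, d + target_width


def black_bar_hidden(display_width: int, second_window_width: int, d: int) -> bool:
    """True if the black bar left by the shift lies fully under the second window."""
    black_bar_start = display_width - d             # window column where the image ends
    overlay_start = display_width - second_window_width
    return black_bar_start >= overlay_start
```

For a 1920-wide window, a 480-wide second playback window, and D = 240, image columns 240 to 1679 appear in the target display area, and the black bar starting at window column 1680 lies entirely under the second playback window.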
With the surfaceView configured dynamically as described, the playback interface of the display device in the follow-along mode can appear as shown in FIG. 10: the middle part of the target video image is displayed in the target display area of the first playback window, and the user image is displayed in the second playback window.
In some embodiments, after receiving the target video, the display device decodes it and scales it according to the parameters of the first playback window. During display, each image frame is then shown at the position obtained by moving the frame to the left by the first distance. The part of the image moved outside the playback area of the first playback window cannot be displayed, and the edge of the image frame near the second playback window is occluded by the second playback window, so the central area of the image frame is presented in the unoccluded area of the first playback window.
In some embodiments, the second playback window plays the video data acquired from the camera.
As the above embodiments show, the embodiments of the present application detect the display area in which the target video is not occluded on the display device, determine it as the target display area, and then, after scaling the target video, display the image of the target video with an offset in the target display area. This solves the problem of a poor display effect caused by the second playback window occluding the target video during playback in the follow-along mode, and improves the user's viewing experience.
In some embodiments, the video playback application may also score the user's movements according to preset scoring rules, so that users can learn whether their movements are correct without having to compare them with the movements in the target media asset themselves.
In some embodiments, one preset scoring rule is to compare the image of the target media asset with the user image in real time and to determine the score of the user's movement from the similarity between the movement in the user image and the movement in the target media asset: the higher the similarity, the higher the score, and the lower the similarity, the lower the score.
However, while the target media asset is playing, the user needs some time between seeing a movement in the target media asset and performing it. If the user image is captured too early or too late, the score for the user's movement tends to be low. Moreover, the target media asset plays continuously: if the picture has already moved on to another movement by the time the user performs the movement, the user's score will be low.
To solve this problem, another preset scoring rule is as follows: when the target media asset reaches a specific movement, the image of the target media asset at that moment is acquired, and several user images are then captured in succession. The movement in each user image is compared with the movement in the image of the target media asset to obtain several scores, and the highest score is taken as the score for that movement, which improves scoring accuracy.
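The "best of several frames" rule can be sketched as follows. The text does not specify how similarity is computed, so the metric here (100 minus a scaled mean distance between matching keypoints in normalized coordinates) is purely an assumption for illustration, as are the function names.

```python
import math


def similarity_score(reference: dict, user: dict) -> float:
    """Score in [0, 100] from the mean distance between shared keypoints.

    Keypoints map a name to an (x, y) pair in normalized coordinates.
    This metric is an illustrative assumption, not the application's rule.
    """
    shared = reference.keys() & user.keys()
    if not shared:
        return 0.0
    mean_dist = sum(math.dist(reference[k], user[k]) for k in shared) / len(shared)
    return max(0.0, 1.0 - mean_dist) * 100.0


def action_score(reference: dict, user_frames: list) -> float:
    """Score of one movement: the highest score over the captured user frames."""
    return max((similarity_score(reference, f) for f in user_frames), default=0.0)
```

Taking the maximum means that a frame captured while the user is still catching up does not drag down the score, as long as at least one of the captured frames shows the completed movement.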
In some embodiments, the specific movements in the target media asset that are used for scoring may be determined from an action library. The action library may include multiple sample pictures showing different human movements, together with the action data corresponding to each sample picture. The movements in the sample pictures may be common fitness movements such as squats, arm raises, and so on.
In some embodiments, the action data in the action library may include the coordinates and types of the skeleton key points of the person in each sample picture. The skeleton key points may be obtained with a trained skeleton key point detection model. An exemplary set of skeleton key point types includes the nose, neck, left shoulder, left elbow, left wrist, right shoulder, right elbow, right wrist, left hip, left knee, left ankle, right hip, right knee, right ankle, left eye, right eye, left ear, and right ear key points. The skeleton key point detection model may be based on a deep neural network: a large number of pictures with manually annotated skeleton key points are fed to the deep neural network, which is then trained so that it can recognize skeleton key points. Skeleton key points can of course also be obtained by manual annotation.
In some embodiments, the action data in the action library may further include the positional relationships between adjacent skeleton key points, by which different human movements can be distinguished.
In some embodiments, the action data in the action library further includes the difficulty of the movement in each sample picture. The difficulty may be set by an operator. An exemplary difficulty range is 0 to 10, where a larger value means a more difficult movement.
In some embodiments, the action data in the action library further includes an action identifier, with each movement corresponding to a different identifier. An exemplary action identifier is an action number, by which the other action data and the sample picture for that movement can be looked up quickly in the action library.
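One way to picture an action library entry is as a record keyed by its action number. The sketch below is illustrative only: the class and field names, the file name in the usage note, and the choice of an in-memory dictionary are assumptions; only the listed kinds of data (sample picture, key point coordinates and types, difficulty from 0 to 10, action number) come from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ActionRecord:
    action_id: int      # action number used for lookup
    sample_image: str   # path or name of the sample picture (hypothetical)
    keypoints: dict     # key point type -> (x, y) coordinates
    difficulty: int     # 0 to 10, larger means harder

    def __post_init__(self):
        if not 0 <= self.difficulty <= 10:
            raise ValueError("difficulty must be between 0 and 10")


class ActionLibrary:
    """In-memory library indexed by action number."""

    def __init__(self):
        self._records = {}

    def add(self, record: ActionRecord) -> None:
        self._records[record.action_id] = record

    def get(self, action_id: int) -> ActionRecord:
        return self._records[action_id]
```

A squat could, for instance, be stored as `ActionRecord(1, "squat.png", {"left_knee": (0.4, 0.7)}, 3)` and retrieved later with `library.get(1)`.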
In some embodiments, the process of determining the image frames of the target media asset that contain the specific movements used for scoring may be called marking the target media asset. FIG. 11 is a schematic diagram of the marking interaction for a target media asset according to some embodiments.
As shown in FIG. 11, an operator can mark the target media asset using the first tool processing server, the media asset service server, and the media asset content server. The first tool processing server performs the marking of the target media asset, and the action library may be stored on it. The media asset service server may hold the media asset information of each target media asset. This information may be the original information supplied by the provider of the target media asset, such as the playback address, resolution, duration, and type of the media asset, or it may be the information produced after operators have processed the original information. For example, the processed media asset information may contain new items such as a corrected media asset type and media asset tags, with the original media asset type deleted; if the media asset type in the original information is "sports", the corrected type may be "fitness". The media asset content server may be the server to which the content provider of the target media asset uploads the video stream file and the original information of the target media asset.
In FIG. 11, the first tool processing server, the media asset service server and the media asset content server are distinguished by function. In an actual implementation, each server may be deployed on one hardware device or across multiple hardware devices, and all three servers may also be deployed on a single hardware device; the embodiments of this application do not specifically limit this.
In some embodiments, the operator may input a marking instruction for the target media asset to the first tool processing server. The marking instruction may include the media asset ID of the target media asset, and the first tool processing server may use this ID to obtain the corresponding media asset information, that is, the media asset information of the target media asset, from the media asset service server.
In some embodiments, after a content provider uploads a new media asset to the media asset content server, the media asset service server may generate the corresponding media asset information from the original information of the newly uploaded asset. The first tool processing server may actively monitor, in real time, the media asset information newly generated on the media asset service server and decide from the media asset type in that information whether to treat the newly uploaded asset as a target media asset. If the media asset type is a preset marking type, such as the fitness type, the newly uploaded asset may be treated as a target media asset. If the media asset type is not a preset marking type, the newly uploaded asset is not treated as a target media asset to be marked; it is skipped, and the server goes on to check whether the next newly uploaded asset is a target media asset.
In some embodiments, a content provider has already marked a new media asset that it uploads to the media asset content server and has set a marking tag in the original information of the asset to indicate that the asset has been marked. When the media asset service server processes the original information to obtain the media asset information, if it detects the marking tag and the tag conforms to a preset specification (for example, the tag contains the timeline-based playback times of the marked video frames), it may retain the marking tag in the media asset information; otherwise, it deletes the marking tag. Therefore, when processing a target media asset, the first tool processing server may check whether the media asset information of that asset contains a marking tag. If it does, the first tool processing server may determine that the target media asset has already been marked; if there is no marking tag, the asset is treated as a target media asset to be marked.
In some embodiments, after a content provider uploads a new media asset to the media asset content server, the media asset type in the media asset information generated by the media asset service server is not a marking type; some time later, however, the media asset service server re-checks the media asset information and adds a type attribute of the marking type to it. To handle this case, the first tool processing server may actively monitor, in real time, the media asset information that changes on the media asset service server. If the media asset type in the changed information is a marking type and the information contains no marking tag, the asset is determined to be a target media asset to be marked.
In some embodiments, after marking a media asset, the content provider may also generate a marking file, which may be stored in the original information of the asset. When the media asset service server processes the original information to obtain the media asset information, it may retain the marking file in the media asset information. Therefore, when processing a target media asset, the first tool processing server may check whether the media asset information of that asset contains such a marking file. If it does, the first tool processing server may determine that the target media asset has already been marked; if there is neither a marking file nor a marking tag, the asset may be treated as a target media asset to be marked.
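The selection logic described in the preceding embodiments can be sketched as follows. This is only an illustrative sketch: the class `MediaAssetInfo`, its field names, the function `needs_marking` and the set `MARKING_TYPES` are hypothetical names that do not appear in the application, and the marking tag and marking file are represented as optional strings purely for simplicity.

```python
from dataclasses import dataclass
from typing import Optional

# Assumed set of preset marking types; the text gives "fitness" as an example.
MARKING_TYPES = {"fitness"}


@dataclass
class MediaAssetInfo:
    """Hypothetical view of the media asset information held by the service server."""
    asset_id: str
    asset_type: str
    marking_tag: Optional[str] = None   # retained only if it met the preset specification
    marking_file: Optional[str] = None  # marking file supplied by the content provider


def needs_marking(info: MediaAssetInfo) -> bool:
    """Return True if the asset should be treated as a target media asset to be marked."""
    if info.asset_type not in MARKING_TYPES:
        return False  # not a marking type: skip this asset
    already_marked = info.marking_tag is not None or info.marking_file is not None
    return not already_marked
```

An explicit re-marking instruction from the operator, as described in a later embodiment, would simply bypass this check.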
In some embodiments, after the first tool processing server determines that the target media asset has already been marked, if it had obtained the media asset information of that asset in response to a marking instruction, it may generate a prompt indicating that the target media asset is already marked, so that the operator is informed; if it had obtained the media asset information automatically from the media asset service server, it may simply skip the target media asset and continue with the next one.
In some embodiments, the method the content provider used to mark the target media asset may differ from the marking method of the first tool processing server. Therefore, after learning that the target media asset has already been marked, the operator may input a re-marking instruction to the first tool processing server so that the first tool processing server marks the target media asset itself.
In some embodiments, after confirming that the target media asset needs to be marked, the first tool processing server may obtain the video stream file of the target media asset from the media asset content server according to the playback address in the media asset information, and parse the video stream file to obtain the video frames of the target media asset. It then examines the video frames one by one and performs character action recognition on each frame. If the recognized character action is one of the character actions in the action library, it generates a marking record, which includes at least the playback time of the video frame.
In some embodiments, the first tool processing server may detect the skeleton key points in a video frame with a trained skeleton key point detection model, and then compare the relative positional relationships between adjacent skeleton key points in the video frame with the relative positional relationships between the corresponding skeleton key points in each sample picture in the action library. If the errors in the relative positional relationships are within a preset range, the character action in the video frame is determined to be the character action in that sample picture. For example, if the left shoulder, left elbow and left wrist key points lie on a straight line in a video frame of the target media asset, and they also lie on a straight line in the action data of a sample picture in the action library, the action in the video frame can be identified as extending the left hand.
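One possible way to compare relative positions between adjacent key points is to compare the direction of each "bone" (a pair of adjacent key points) in the frame with the same bone in the sample picture. The sketch below assumes this angle-based comparison, 2D key points given as `(x, y)` tuples, and a tolerance of 0.26 rad (roughly 15 degrees); the application does not specify the error metric, the key point names or the tolerance, so all of these are illustrative assumptions.

```python
import math

# Hypothetical pairs of adjacent skeleton key points ("bones") for the left arm.
BONES = [("left_shoulder", "left_elbow"), ("left_elbow", "left_wrist")]


def bone_angles(keypoints):
    """Direction (in radians) of each bone, independent of where the person stands."""
    angles = []
    for start, end in BONES:
        (sx, sy), (ex, ey) = keypoints[start], keypoints[end]
        angles.append(math.atan2(ey - sy, ex - sx))
    return angles


def angle_diff(a, b):
    """Smallest absolute difference between two angles."""
    d = abs(a - b) % (2 * math.pi)
    return min(d, 2 * math.pi - d)


def matches_sample(frame_kp, sample_kp, tolerance_rad=0.26):
    """True if every bone direction in the frame is within tolerance of the sample."""
    pairs = zip(bone_angles(frame_kp), bone_angles(sample_kp))
    return all(angle_diff(f, s) <= tolerance_rad for f, s in pairs)
```

With a straight left arm in both the frame and the sample picture, all bone directions agree and the frame is treated as showing the sample's action; a bent elbow changes one bone direction and the match fails.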
In some embodiments, after detecting that the character action in a video frame of the target media asset is one of the character actions in the action library, the playback time of the video frame within the target media asset and the action identifier corresponding to the action in the frame may be obtained, and a marking record may then be generated from them. The marking record may include the playback time of the video frame and the corresponding action identifier. Because the interval between adjacent video frames is typically on the order of milliseconds, the playback time in the marking record may be accurate to the millisecond so that the video frame can be identified.
In some embodiments, if the marked video frames in a target media asset are too dense, the user may be unable to keep up with the actions during playback and may therefore receive low scores. Accordingly, after detecting that the character action in a video frame of the target media asset is one of the character actions in the action library, a marking condition may be checked first: the frame is marked only if the condition is satisfied; otherwise the frame is skipped and detection continues with the next video frame. An exemplary marking condition is as follows: when the character action in the video frame is one of the character actions in the action library, a marking record is generated if the playback time of the video frame is more than a preset time later than the playback time of the previous marking record. In other words, at most one mark is made within any preset time, which may be set to 10 seconds or another duration.
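The marking condition above can be sketched as a simple filter over the recognition results. The names `MarkingRecord` and `build_marking_records` are hypothetical, and the recognition step itself is assumed to have already produced, for each frame, its playback time in milliseconds and either an action identifier or `None`.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Tuple


@dataclass
class MarkingRecord:
    play_time_ms: int  # playback time of the marked frame, millisecond precision
    action_id: str     # identifier of the recognized action in the action library


def build_marking_records(
    recognized: Iterable[Tuple[int, Optional[str]]],
    min_gap_ms: int = 10_000,  # the "preset time"; 10 seconds is the example in the text
) -> List[MarkingRecord]:
    """Keep a frame only if it is more than min_gap_ms after the previous mark."""
    records: List[MarkingRecord] = []
    for play_time_ms, action_id in recognized:
        if action_id is None:
            continue  # no action from the action library in this frame
        if records and play_time_ms - records[-1].play_time_ms <= min_gap_ms:
            continue  # too close to the previous mark: skip this frame
        records.append(MarkingRecord(play_time_ms, action_id))
    return records
```

The first recognized action is always marked because there is no previous record to compare against; that choice is an assumption of this sketch.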
In some embodiments, to prevent the marked video frames in the target media asset from being too dense, character action recognition may also be suspended after each mark for the video frames that fall within the preset time following that mark, and resumed for the video frames after the preset time.
In some embodiments, after all video frames of the target media asset have been examined, or after detection has reached the video frames that lie within the preset time of the last video frame of the target media asset, the marking records may be aggregated. A marking file and/or a marking tag is generated from all the marking records and the timeline of the target media asset, and is stored in the media asset information of the target media asset.
In some embodiments, only a marking file may be generated without a marking tag, or only a marking tag may be generated without a marking file.
In some embodiments, after the first tool processing server generates the marking file and/or marking tag of the target media asset, if the marking was performed in response to a marking instruction, it may generate a prompt indicating that marking of the target media asset is complete, so that the operator is informed; if the target media asset was identified automatically, it may continue with the next target media asset.
In some embodiments, the first tool processing server may further generate a marking library for the target media asset from the action data of the character actions corresponding to the marking records. The first tool processing server may store the marking library in the media asset information of the target media asset on the media asset service server, and the media asset service server may be configured to deliver the marking library when it delivers the media asset information of the target media asset to the display device. Alternatively, the first tool processing server may store the marking library locally.
When the user watches the target media asset in follow-along mode, the display device may use the marking records obtained in the above embodiments: when playback reaches the playback time of a video frame in a marking record, the display device captures user images, and the user action in the user images is compared against the reference action. After the comparison, the user action may also be scored.
FIG. 12 is a schematic diagram of the scoring interaction for a target media asset according to some embodiments. As shown in FIG. 12, when the user watches, on the display device, a target media asset that originates from the media asset content server, a second tool processing server may interact with the display device to score the user's actions, generate a follow-along record and return the follow-along record to the display device so that the display device can display it.
In FIG. 12, the second tool processing server and the media asset content server are distinguished by function. In an actual implementation, each server may be deployed on one hardware device or across multiple hardware devices, and both servers may also be deployed on a single hardware device; the embodiments of this application do not specifically limit this.
In some embodiments, the display device may detect the marking tag in the media asset information, confirm from the marking tag that the target media asset supports action scoring, and then obtain the marking file of the target media asset from the media asset information to get the marking records of the target media asset.
In some embodiments, the display device may also inspect the media asset information to determine whether it contains a marking file and/or a marking tag; if so, the marking records of the target media asset can be obtained from the marking file and/or marking tag.
While the target video is playing, the user can follow it and perform the corresponding actions.
In some embodiments, when the display device detects that the target video has played to the time corresponding to a marking record, it may obtain the media asset image of the target video at that moment, start capturing multiple user images in time sequence, and send the media asset image and the user images to the second tool processing server. Illustratively, once the target video reaches the time corresponding to a marking record, the display device may upload one user image to the second tool processing server at regular intervals, up to a preset number of user images per marking record. The upload interval may be 100 milliseconds with a preset number of 10 images, or the interval may be 50 milliseconds with a preset number of 20 images.
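As a small illustration of the two example configurations, the capture times for one marking record can be computed as below. The function name `capture_times_ms` is hypothetical, and the sketch assumes the first user image is taken at the marked time itself, which the text does not specify.

```python
from typing import List


def capture_times_ms(mark_time_ms: int, interval_ms: int = 100, count: int = 10) -> List[int]:
    """Playback times at which user images are captured for one marking record."""
    if interval_ms <= 0 or count <= 0:
        raise ValueError("interval_ms and count must be positive")
    return [mark_time_ms + i * interval_ms for i in range(count)]
```

Both example configurations (100 ms with 10 images, 50 ms with 20 images) cover roughly one second after the marked frame; the second simply samples the user's movement more finely.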
In some embodiments, after receiving the media asset image and the user images, the second tool processing server may compare each user image with the media asset image, in the time order of the user images, to obtain an action score for each user image. Illustratively, the action comparison method includes: detecting the skeleton key points in the user image and in the media asset image with a trained skeleton key point detection model; comparing the relative positions between adjacent skeleton key points in the user image with the relative positions between the corresponding skeleton key points in the media asset image, that is, comparing the action data of the user image with the action data of the media asset image, to obtain the relative position errors; calculating the similarity between the user action in the user image and the action in the media asset image from all the relative position errors between the two images and the difficulty of the action in the media asset image; and obtaining the action score of the user action from the similarity. The mapping between relative position error and similarity, and the mapping among similarity, action difficulty and action score, may be defined in advance and may be adjusted. For example, for a given number of relative positions whose errors fall within the preset range, the more difficult the action, the higher the action score.
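Since the application leaves the mappings open, the following is only one conceivable instance, not the method of the application. It assumes that similarity is the fraction of relative positions whose error is within a tolerance, and that difficulty enters as a multiplicative factor on a 0 to 100 scale; the names `similarity`, `action_score`, `tolerance` and `difficulty_factor` are all hypothetical.

```python
from typing import Sequence


def similarity(errors: Sequence[float], tolerance: float) -> float:
    """Fraction of relative positions whose error is within the tolerance."""
    if not errors:
        raise ValueError("at least one relative-position error is required")
    within = sum(1 for e in errors if e <= tolerance)
    return within / len(errors)


def action_score(
    errors: Sequence[float],
    tolerance: float = 0.26,         # assumed preset error range
    difficulty_factor: float = 1.0,  # assumed: larger for more difficult actions
) -> float:
    """Map similarity and difficulty to a score capped at 100."""
    raw = 100.0 * similarity(errors, tolerance) * difficulty_factor
    return min(100.0, raw)
```

With the same set of errors, a larger `difficulty_factor` yields a higher score, which matches the example in the text that a more difficult action scores higher for the same number of in-range relative positions.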
In some embodiments, to improve scoring efficiency and reduce the amount of data uploaded by the display device, the display device may also send the playback instruction for the target media asset to the second tool processing server, so that the second tool processing server can, in response to that playback instruction, download the action library from the media asset service server or the first tool processing server. The display device may then upload the user images and the action identifier without uploading the media asset image. Using the action identifier, the second tool processing server compares the action data of the user image with the action data of the corresponding sample picture in the action library to obtain the action score of the user action.
In some embodiments, the second tool processing server may, in response to the playback instruction for the target media asset, download the marking library from the first tool processing server. The second tool processing server may then compare the action data of the user image with the corresponding action data in the marking library of the target media asset to obtain the action score of the user action. This avoids the problem that the action library may be large, which would make downloading it and looking up action data in it slow.
In some embodiments, the action library and/or the marking library may also be stored directly on the second tool processing server, so that the second tool processing server does not need to spend time downloading them.
In some embodiments, when the termination condition for the current comparison is reached, the second tool processing server stops comparing further user images with the media asset image. Illustratively, the termination condition may be that a first preset number of user images have been compared; that the comparison for the next action needs to begin, for example because the next media asset image has been received; or that a second preset number of consecutive action scores show a downward trend. The first preset number may be 10 and the second preset number may be 3.
After seeing an image of the target media asset, the user needs some time to perform the action shown in it. After completing the action, the user may return to an initial state, such as standing at attention, or go on to the next action. Therefore, when the time-ordered user images are scored, the action scores plotted in time order approximate a downward-opening parabola whose vertex is the highest action score, and this highest score may be taken as the follow-along score for the action. The follow-along score may also be determined in other ways; for example, after discarding several of the lowest scores, the average of the remaining scores may be used as the follow-along score for the action.
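The two ways of deriving a follow-along score mentioned above can be sketched as follows. The function names and the default of discarding two scores are illustrative assumptions; the text only says "several".

```python
from typing import Sequence


def peak_score(scores: Sequence[float]) -> float:
    """Follow-along score as the highest action score (the vertex of the curve)."""
    if not scores:
        raise ValueError("scores must not be empty")
    return max(scores)


def trimmed_mean_score(scores: Sequence[float], drop_lowest: int = 2) -> float:
    """Follow-along score as the mean after discarding the lowest scores."""
    if drop_lowest < 0 or drop_lowest >= len(scores):
        raise ValueError("drop_lowest must be in [0, len(scores))")
    kept = sorted(scores)[drop_lowest:]
    return sum(kept) / len(kept)
```

The peak rewards the user's best moment, while the trimmed mean also reflects how long the user held a pose close to the reference action.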
In some embodiments, FIG. 13 shows a follow-along mode control method. The controller 250 in the display device is configured to execute the method, that is, the controller 250 is the executing entity of the method. The method includes the following steps:
Step S10: in response to receiving an operation to start a training item video, display the training item video in a first window of the follow-along interface, and display in a second window the local images from the video stream captured and sent by the image collector. This step is the basis and prerequisite for the user's follow-along training and lets the user train under the guidance of the follow-along interface.
Step S20: in response to the training item video playing to a key frame, periodically acquire follow-along images corresponding to the key frame from the video stream.
Step S30: compare the follow-along action in each follow-along image with the standard action in the key frame to obtain a training score for the follow-along action in each follow-along image.
If the training item video has not yet reached the key frame at a marked position, playback continues until a mark is reached. In this application, each time the training item video plays to a key frame, follow-along images corresponding to that key frame are acquired periodically from the video stream. For example, a preset period may be configured, and one follow-along image is acquired every preset period. Here, a follow-along image corresponding to the key frame is an image captured while the user, having seen the standard action in the key frame, imitates it by performing the follow-along action. The preset period may be, for example, 100 ms: one follow-along image is acquired every 100 ms, the follow-along action of the human body recognized in the image is compared with the standard action, and a training score is obtained for the follow-along action in each follow-along image.
In some embodiments, a timestamp is set for each frame in the video stream captured by the image collector 232. The image collector 232 sets the timestamp of each frame by applying delay compensation to the capture time, where the delay compensation is used to cancel the delay incurred in transmitting the image from the image collector 232 to the controller 250. Before periodically acquiring follow-along images from the video stream, the controller 250 compares the physical time at which playback reached the key frame with the timestamps of the frames in the video stream to locate the follow-along image corresponding to the key frame, so that follow-along images are acquired accurately. This application uses physical time, that is, the time of the marked position, rather than progress bar time to locate and acquire follow-along images, which gives more precise time matching and therefore more accurate acquisition. The image transmission delay is taken into account when the timestamps are set. For example, if the image transmission delay is about 150 ms, that is, an image reaches the controller 250 about 150 ms after capture, the timestamp of each frame in the video stream may be set 150 ms ahead of that frame's capture time.
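The locating step can be sketched as a search over the compensated timestamps. This sketch assumes the timestamps are already delay-compensated and sorted in ascending order, and that the frame chosen is the first one whose timestamp is at or after the physical time at which the key frame was reached; the function name `locate_frame` and that selection rule are illustrative assumptions.

```python
from bisect import bisect_left
from typing import Optional, Sequence


def locate_frame(timestamps_ms: Sequence[int], key_time_ms: int) -> Optional[int]:
    """Index of the first frame whose compensated timestamp is >= key_time_ms.

    Returns None if no such frame has arrived yet.
    """
    i = bisect_left(timestamps_ms, key_time_ms)
    return i if i < len(timestamps_ms) else None
```

Subsequent follow-along images would then be taken from this index onward, one per preset period.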
Step S40: calculate the action matching degree between the standard action and the follow-along action according to the maximum of the training scores of the follow-along actions in the follow-along images.
In some embodiments, before step S40 is performed, the method further includes: in response to a termination condition being reached, stopping the acquisition of follow-along images from the video stream.
In some embodiments, the controller 250 determines that the termination condition is reached in response to the training item video playing to the next key frame. That is, acquisition of follow-along images must stop before the standard action at the current marked position switches to the next standard action, which ensures that the acquired follow-along images relate to the current standard action. In this case, the marked positions bound the acquisition of follow-along images, and all sampled frames between two marked positions are acquired.
There is a time interval between the key frames at two adjacent marked positions. For example, the "1/4 lunge squat" action needs to be held for 20 seconds before switching to the next standard action, so the two marked positions are 20 seconds apart. If one follow-along image were acquired every 100 ms over that interval, a very large number of follow-along images would be acquired (about 200 in this example), and a training score would have to be computed for the follow-along action in each of them. This wastes the computing and processing resources of the controller 250 and makes the calculation of action scores and the action matching degree inefficient.
In addition, the applicant found that during follow-along training, the training scores for each action approximate a downward-opening parabola. While the user gradually adjusts the limbs toward the standard action according to the action standardization prompts, the training score rises; when the user becomes tired or wants to switch actions, the follow-along action matches the standard action less and less, and the training score falls. When the training score is falling markedly or is low, there is no need to acquire the corresponding follow-along images from the video stream. The scoring of each follow-along action also needs to take the user's reaction time into account.
Therefore, in some embodiments, each time the controller 250 acquires a follow-along image, it increments the acquired frame count by 1, so that the count is accurately recorded and updated throughout acquisition. When the acquired frame count is detected to equal a second quantity threshold, the termination condition is determined to be reached. The second quantity threshold is a preset value, such as 10 frames, that limits the maximum number of follow-along images the controller 250 can acquire for each marked position; its value is not otherwise limited. In this case, the quantity threshold bounds the acquisition of follow-along images: acquisition stops once the number of acquired images equals the second quantity threshold, so that subsequent frames are filtered out and relatively ineffective image acquisition and scoring are terminated in time. This improves how well the score matches the user's performance, improves the efficiency of calculating the scores and the action matching degree, and improves the user experience.
Alternatively, in some embodiments, each time the controller 250 acquires a frame of follow-up image, it matches the training score of the corresponding follow-up action and thereby obtains the trend of the training scores. For example, if frame i scores 85, frame i+1 scores 83 and frame i+2 scores 80, the training score is clearly decreasing. Since a lower training score indicates a lower match between the follow-up action and the standard action, it is desirable to retain the higher training scores and filter out the lower ones. Accordingly, if the training scores of M consecutive frames of follow-up images show a decreasing trend, the termination condition is determined to be reached, where M is the first quantity threshold. For example, M may be 3; the value of M is not limited. In this case the score trend/trajectory during the user's follow-up is used to terminate the relatively ineffective image acquisition and score determination in time, which improves how well the user's score matches the action, improves the computational efficiency of scoring and action matching, and improves the user experience.
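By way of a hedged example, the trend-based termination condition could be checked as sketched below. The function name `declining_run_reached` is hypothetical, and treating "decreasing trend" as strictly decreasing scores over the last M frames is an assumption; the text does not specify how ties are handled.

```python
def declining_run_reached(scores, m=3):
    """Return True if the most recent m training scores are strictly decreasing.

    `scores` holds the per-frame training scores in acquisition order and
    `m` is the first quantity threshold (3 is the example value in the text).
    """
    if len(scores) < m:
        return False
    tail = scores[-m:]
    # Compare each score in the tail with the one that follows it.
    return all(earlier > later for earlier, later in zip(tail, tail[1:]))
```

For the example scores 85, 83 and 80 with M = 3, the check returns True, so acquisition would stop after the third frame.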
On the basis of the above embodiments of the termination condition, whether the termination condition has been reached needs to be determined while follow-up images are being acquired. If it has not been reached, follow-up image frames continue to be acquired every preset period and matched to training scores. If it has been reached, acquisition of follow-up images from the video stream stops and step S40 is executed.
Whichever termination condition is adopted, suppose that N frames of follow-up images have been acquired when the termination condition is reached and that the training score of the follow-up action in frame j is Score_j, 1 ≤ j ≤ N. In the present application, the score of the follow-up action that the user performs in imitation of the standard action in the key frame is called the target score, where target score = max{Score_j, 1 ≤ j ≤ N}. That is, the target score is the maximum of the training scores of the follow-up actions in the N acquired frames.
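The acquisition loop and the selection of the target score described above may be sketched as follows. This is only a minimal illustration under stated assumptions: the function name `select_target_score` is hypothetical, the per-frame scores are assumed to arrive as an iterable, and both termination conditions are checked together here for brevity, whereas the text presents them as alternative embodiments.

```python
def select_target_score(score_stream, max_frames=10, m=3):
    """Consume per-frame training scores until a termination condition is
    reached, then return the maximum score seen (the target score).

    max_frames: second quantity threshold (frame-count limit).
    m: first quantity threshold (length of a strictly decreasing run).
    Returns None if no frame was scored.
    """
    scores = []
    for score in score_stream:
        scores.append(score)
        tail = scores[-m:]
        declining = len(tail) == m and all(a > b for a, b in zip(tail, tail[1:]))
        if len(scores) == max_frames or declining:
            break  # termination condition reached: stop acquiring frames
    return max(scores) if scores else None
```

For a score sequence 60, 72, 85, 83, 80, 95 the loop stops after the fifth frame because the last three scores are decreasing, and the target score is 85.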
In some embodiments, after the target score for the user's imitation of the standard action at a dotting position has been matched, the target score may be recorded to facilitate subsequent calculation of the final score.
Step S50: according to the action matching degree, control the display to display action matching prompt information in the second window.
In some embodiments, the action matching prompt information includes the accuracy rate of the action. The action matching degree between the standard action and the follow-up action can be calculated from the target score and displayed at a designated position in the second window of the follow-up interface, for example presented to the user as an accuracy percentage.
In some embodiments, the action matching prompt information further includes an encouragement phrase. Based on the action matching degree, a matching encouragement phrase such as "Good", "Great" or "Perfect" can be displayed in the second window of the follow-up interface, each phrase corresponding to a range of action matching degrees. For example, when the action matching degree is 90% or above, the encouragement phrase displayed is "Perfect".
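As a non-limiting illustration, the mapping from action matching degree to encouragement phrase might be sketched as below. Only the "Perfect" band (90% or above) is taken from the text; the 75% and 60% boundaries, the behaviour below the lowest band, and the function name `encouragement` are assumptions chosen for the example.

```python
def encouragement(match_pct):
    """Map an action matching degree (0 to 100) to an encouragement phrase."""
    if match_pct >= 90:
        return "Perfect"  # band stated in the text
    if match_pct >= 75:
        return "Great"    # assumed band
    if match_pct >= 60:
        return "Good"     # assumed band
    return None           # assumed: no phrase is shown below the lowest band
```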
In some embodiments, the action matching prompt information further includes action standardization prompt information. The action matching degree measures how far the follow-up action deviates from the standard action. For example, an action matching degree below a preset threshold indicates that the user's follow-up action is not standard, so the user needs to be prompted to correct it. While follow-up images are displayed in the second window, action standardization prompt information is displayed in the second window of the follow-up interface according to the deviation in position, limb posture and the like between the standard action and the follow-up action in each frame. This lets the user see where their follow-up action falls short and correct and adjust it, raising the training score of the follow-up action until the highest score for the action (i.e. the target score) is reached.
It should be noted that the action matching prompt information is not limited to that described in the above embodiments. Any information content determined by analysis of the action matching degree falls within the scope of action matching prompt information and may be displayed in the follow-up interface as actually needed.
In some embodiments, since the target score is the maximum of the training scores of the N acquired frames of follow-up images, only the target follow-up image corresponding to the target score may be retained and the other N-1 frames deleted. Thus, after acquisition of follow-up images is terminated, the second window of the follow-up interface shows only the best follow-up action, i.e. the one that best matches the standard action, and the retained target follow-up image is displayed when the follow-up image for the standard action is viewed.
Before acquisition of follow-up images is terminated, the second window of the follow-up interface displays each frame of follow-up image in capture order and displays action standardization prompt information according to the deviation between the follow-up action and the standard action, so that the user's follow-up action is gradually corrected until the best follow-up action is reached. After the termination condition is reached, the second window keeps displaying only the target follow-up image corresponding to the best follow-up action/target score until the next dotted key frame arrives, at which point follow-up image acquisition, the UI change of the second window, and the scoring and score selection processes described above start again.
In some embodiments, when the training item video plays to its end, i.e. when the progress bar reaches the end, the follow-up for the current training item ends and the user's final score for this follow-up needs to be calculated. Alternatively, if the user exits the training item video after the follow-up time exceeds a preset duration, for example after following for 2 minutes, this follow-up also ends and the user's final score for it is calculated.
In some embodiments, in response to the training item video playing to its end, since the user's follow-up has traversed all dotted standard actions in the video, the controller 250 collects the training scores (i.e. target scores) of the target follow-up images corresponding to all key frames in the training item video and accumulates them with weighting to obtain the final score. That is, the target scores obtained by the user for each standard action are added together to give the final score of this follow-up.
In some embodiments, the controller 250 responds to an operation of exiting the training item video after the follow-up time exceeds the preset duration. In this case the user has followed only part of the video, longer than the preset duration, without traversing all standard actions in the video, so the controller collects the training scores of the target follow-up images of the key frames traversed so far and accumulates them with weighting to obtain the final score. For example, if the user exits this follow-up after watching for 3 minutes, and 6 dotted key frames were traversed within those 3 minutes (i.e. the user had completed the imitation of 6 standard actions when exiting), the target scores of those 6 exercises are added together to give the final score of this follow-up.
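The accumulation of the final score may be illustrated with the short sketch below. The text mentions weighted accumulation but describes the result as a cumulative sum, so equal weights of 1 are assumed by default; the function name `final_score` and the optional `weights` parameter are hypothetical.

```python
def final_score(target_scores, weights=None):
    """Accumulate the target scores of the key frames traversed so far.

    With no weights given, every key frame counts equally, which reduces
    to the plain cumulative sum described in the example.
    """
    if weights is None:
        weights = [1.0] * len(target_scores)
    return sum(w * s for w, s in zip(weights, target_scores))
```

For instance, if the user exits after completing 6 standard actions with target scores 80, 90, 70, 85, 95 and 60, the final score under equal weights is 480.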
In some embodiments, after the final score of each follow-up has been calculated, a training report may also be output so that the user can review the details of the training. The controller 250 controls the display 275 to display the training report interface shown in FIG. 17, which presents the final score together with information such as the energy consumed during follow-up, the accuracy rate of the actions, and the training duration. The training report interface may also include a retrain control and a switch control. When triggered, the retrain control repeats the follow-up process of the training item that has just ended. When triggered, the switch control switches to the next training item video in the training item list and starts the control flow of the follow-up mode for that video.
In some embodiments, while the training report interface is displayed, the user may press the "Back" key on the remote control to exit the training report interface and return to the training item list interface shown in FIG. 12. The user may then either select another video to follow from the training item list or stop training and exit the fitness application.
Even when the image collector is not affected by external factors such as power loss, a physical switch being turned off, a fault, or being occupied by another application, the follow-up user is in motion and may move outside the portrait capture area/aperture during follow-up. In that case no actual portrait can be recognized or detected in the follow-up images captured by the image collector 232, and the follow-up is invalid.
In some embodiments, to avoid invalid follow-up, the controller 250 controls the display 275 to pause the training item video playing in the first window and at the same time pauses the statistics for training duration, energy consumption, accuracy rate, the target score of the key frame at the dotting position, the final score accumulated during follow-up, and similar data. The follow-up mode is thus paused while waiting for the user to go through portrait recognition again. Meanwhile, the second window is controlled to display prompt information prompting the user to move into the portrait capture area/aperture for portrait recognition; the UI at this point is as shown in FIG. 18. Once the user's portrait is recognized successfully, the training item video in the first window resumes from the frame at which it was paused, and statistics for training duration, energy consumption, accuracy rate, target score, final score and the like continue on the basis of the existing training data.
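A minimal sketch of the pause-and-resume behaviour described above is given below, purely for illustration. The class name `FollowUpSession`, the per-frame `tick` method, the frame-based duration counter and the prompt wording are all assumptions; the text does not specify how the controller 250 tracks this state.

```python
class FollowUpSession:
    """Tracks playback position and statistics for one follow-up session."""

    def __init__(self):
        self.position = 0   # index of the next video frame to play
        self.duration = 0   # frames counted toward the training duration
        self.paused = False

    def tick(self, person_detected):
        """Advance by one frame period and return the prompt to show, if any."""
        if not person_detected:
            self.paused = True  # freeze both the video and the statistics
            return "Please move into the capture area"  # hypothetical wording
        self.paused = False     # recognition succeeded: resume from the paused frame
        self.position += 1
        self.duration += 1
        return None
```

Because neither `position` nor `duration` advances while paused, playback and statistics pick up from the paused frame once the portrait is recognized again.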
As can be seen from the technical solutions of the above embodiments, in the present application, each time the training item video plays to a key frame, one frame of follow-up image is acquired every preset period and compared with the key frame to give a training score for the follow-up action in each follow-up image. When the termination condition is met, acquisition of follow-up images from the video stream stops; the highest training score among the acquired frames is then selected as the target score for that key frame, and the action matching degree is calculated from the target score. In the termination condition, constraining acquisition by the second quantity threshold or by the trend/trajectory of the training score allows acquisition and scoring of invalid follow-up images to be terminated in time, which reduces the processing resources consumed by the controller 250 and improves the scoring efficiency and accuracy of the follow-up mode. By collecting follow-up images of a certain number or within a certain time period and taking the highest score among them as the target score for the standard action, the present application keeps each action score at the best matching degree and avoids low scores and poor matching caused by factors such as user reaction delay and image acquisition delay, thereby improving the user experience and helping to build the user's confidence and enthusiasm for training.
In some embodiments, after obtaining a follow-up score, the second tool processing server may send the follow-up score to the display device so that the display device can display a score prompt corresponding to the score. Referring to FIG. 14, one score prompt may be "GOOD", and the score prompt may be superimposed over the user image.
In some embodiments, after obtaining a follow-up score, the second tool processing server may calculate the accuracy rate of the user's actions from the follow-up scores accumulated since the target media asset started playing, and send the accuracy rate to the display device so that the display device can display it.
In some embodiments, if the user wants to stop following during playback of the target media asset, the user may input an instruction to the display device to end video playback. The display device ends playback of the target media asset according to the instruction and sends follow-up end information to the second tool processing server. On receiving the follow-up end information, the second tool processing server generates a follow-up record from all the follow-up scores and sends the follow-up record to the display device so that the display device can present it to the user.
In some embodiments, after playback of the target media asset ends, the display device may send follow-up end information to the second tool processing server. On receiving the follow-up end information, the second tool processing server generates a follow-up record from all the follow-up scores and sends the follow-up record to the display device so that the display device can present it to the user.
FIG. 15 is a schematic diagram of an exemplary follow-up record interface. As shown in FIG. 15, the follow-up record may display the training score, energy consumption, accuracy and training duration. By way of example, the training score may be the average of the follow-up scores, the accuracy may be the average of the similarities, the training duration is the playback duration of the target media asset, and the energy consumption may be determined according to preset calculation rules.
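As a hedged illustration of how the follow-up record described above might be assembled, consider the sketch below. The function name `follow_up_record`, the dictionary keys and the use of seconds for duration are assumptions; energy consumption is omitted because the text leaves its calculation rules unspecified.

```python
def _mean(values):
    """Average of a list, or 0.0 for an empty list."""
    return sum(values) / len(values) if values else 0.0


def follow_up_record(follow_up_scores, similarities, play_seconds):
    """Summarize one follow-up session from its per-action results."""
    return {
        "training_score": _mean(follow_up_scores),  # average follow-up score
        "accuracy": _mean(similarities),            # average similarity
        "duration_s": play_seconds,                 # playback duration of the media asset
    }
```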
In some embodiments, before or during follow-up, the display device may also handle certain abnormal situations. For example, when the controller of the display device receives no signal from the camera assembly, the display device may pause playback of the target media asset and display an abnormality prompt. Referring to FIG. 16, the abnormality prompt may include "Camera not detected" and may be displayed in the user image window.
In some embodiments, during follow-up, the second tool processing server may also handle certain abnormal situations. For example, if the second tool processing server detects no skeleton key points in the user image, it may send an abnormality prompt and a pause instruction to the display device, so that the display device pauses playback of the target media asset according to the pause instruction and displays the abnormality prompt. Referring to FIG. 17, the abnormality prompt may include "No one in front of the camera, playback paused" and may be displayed in the user image window.
In some embodiments, the handling of abnormal situations by the second tool processing server further includes the following: during follow-up, if the second tool processing server detects that the positions of the skeleton key points in the user image have not changed for a period of time, it may send an abnormality prompt and a pause instruction to the display device, so that the display device pauses playback of the target media asset according to the pause instruction and displays the abnormality prompt. Referring to FIG. 18, the abnormality prompt may be two arrows pointing at the person in the user image and may be displayed in the user image window.
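One possible way to detect that skeleton key points have not changed for a period of time is sketched below, for illustration only. The function name `is_stationary`, the representation of each frame as a list of (x, y) key points, the 30-frame window and the 2-pixel tolerance are all assumptions; the text specifies neither the length of the period nor any tolerance.

```python
def is_stationary(keypoint_history, window=30, tolerance=2.0):
    """Return True if no key point moved more than `tolerance` pixels
    (per axis) relative to the first of the last `window` frames.

    keypoint_history: list of frames, each a list of (x, y) key points.
    """
    if len(keypoint_history) < window:
        return False  # not enough frames observed yet
    recent = keypoint_history[-window:]
    reference = recent[0]
    for frame in recent[1:]:
        for (x0, y0), (x1, y1) in zip(reference, frame):
            if abs(x1 - x0) > tolerance or abs(y1 - y0) > tolerance:
                return False
    return True
```

A small tolerance is used rather than exact equality because detected key point coordinates typically jitter slightly even when the person is still.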
It can be seen that, during follow-up, the scoring of user actions and the handling of abnormal situations may be performed by the second tool processing server. In this mode of operation, the display device does not need to store the action library or the dotting library, nor to perform complex data processing such as skeleton point detection and score calculation. The hardware requirements on the display device are therefore low, which helps the display device run smoothly. In some embodiments, when the hardware level of the display device is high, the operations performed by the second tool processing server may instead be performed by the display device. In this case the display device needs to download the action library or dotting library before scoring and does not need to interact with the second tool processing server during scoring, which reduces the use of network resources.
As can be seen from the above embodiments, the embodiments of the present application dot the target media asset in advance so that, during scoring, user images can be scored against the dotted video frames. This solves the problem in real-time comparison that, by the time the user performs an action, the target media asset may already have moved on to another action, resulting in a low action score, and thereby improves the scoring accuracy of the follow-up mode. In addition, multiple scores are obtained by comparing multiple user images with the video frame corresponding to a dotting record, and the highest score is taken as the follow-up score, which reduces the chance of a low follow-up score. Further, when the target media asset is dotted, dots are placed at intervals of a certain number of video frames, which avoids dotting so densely that the user cannot keep up with every action and improves the user experience.
For convenience of explanation, the above description has been given in conjunction with specific embodiments. However, the above discussion of some embodiments is not intended to be exhaustive or to limit the embodiments to the specific forms disclosed. Many modifications and variations are possible in light of the above teachings. The embodiments were chosen and described to better explain the principles and practical applications, so that those skilled in the art can make better use of the embodiments and of the various modified embodiments suited to the particular use contemplated.

Claims (10)

  1. A display device, comprising:
    a display;
    a controller connected to the display, the controller being configured to:
    receive a media asset play instruction input by a user;
    in response to the media asset play instruction, acquire a target video corresponding to the media asset play instruction;
    when no control is provided above a first play window corresponding to the target video, play the target video in the first play window;
    when the control is provided above the first play window corresponding to the target video, move the display position of the target video within the first play window in a direction away from the control, so that the center of the picture of the target video is displayed close to the center of a target display area of the first play window that is not blocked by the control, wherein the control is opaque and blocks one side of the first play window.
  2. The display device according to claim 1, wherein the control comprises a second play window, the second play window comprises a window generated by the controller in response to the media asset play instruction, and the second play window is used to play received local camera data.
  3. The display device according to claim 1, wherein the controller is further configured to, before moving the display position of the target video within the first play window in a direction away from the control:
    determine a to-be-displayed width of the target video when the target video is fully displayed in the first play window in the height direction;
    determine a first moving distance according to the to-be-displayed width and the width of the target display area, the first moving distance being the distance by which the display position of the target video needs to be moved within the first play window in the direction away from the control.
  4. The display device according to claim 3, wherein the controller is configured to determine the to-be-displayed width of the target video when the target video is fully displayed in the first play window in the height direction by:
    obtaining a scaling ratio from the ratio of the image height of the target video to the height of the target display area;
    obtaining the to-be-displayed width of the target video from the image width of the target video and the scaling ratio.
  5. The display device according to claim 3, wherein the controller is further configured to determine the first moving distance according to the to-be-displayed width and the width of the target display area by:
    taking half of the difference between the to-be-displayed width and the width of the target display area as the first moving distance.
  6. The display device according to claim 1, wherein the target display area is determined according to the position coordinates of the first play window and the position coordinates of the control.
  7. The display device according to claim 1, wherein the height of the control is the same as the height of the first play window.
  8. The display device according to claim 1, wherein the right side of the control coincides with the right side of the first play window.
  9. The display device according to claim 1, wherein the controller is further configured to, after acquiring the target video corresponding to the media asset play instruction in response to the media asset play instruction:
    in response to the play mode corresponding to the media asset play instruction being a first mode, load a first play page, wherein the first play page includes a first play window and does not include a second play window, the first play window is used to play the target video, and the control comprises the second play window;
    in response to the play mode corresponding to the media asset play instruction being a second mode, load a second play page, wherein the first play page includes a first play window and a second play window located above the first play window, the first play window is used to play the target video, and the second play window is used to play received local camera data.
  10. A media asset playing method, comprising:
    receiving a media asset play instruction input by a user;
    in response to the media asset play instruction, acquiring a target video corresponding to the media asset play instruction;
    when no control is provided above a first play window corresponding to the target video, playing the target video in the first play window;
    when the control is provided above the first play window corresponding to the target video, moving the display position of the target video within the first play window in a direction away from the control, so that the center of the picture of the target video is displayed close to the center of a target display area of the first play window that is not blocked by the control, wherein the control is opaque and blocks one side of the first play window.
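For illustration only, and without limiting the claims, the width and distance calculation recited in claims 3 to 5 may be sketched as follows. The function name `first_move_distance` and its parameter names are hypothetical. The claims state only that the to-be-displayed width is obtained "from" the image width and the scaling ratio; dividing the image width by the ratio is an assumption made here so that the scaled video fits the target display area in height.

```python
def first_move_distance(image_w, image_h, area_w, area_h):
    """Compute the first moving distance from the video and area sizes.

    image_w, image_h: image width and height of the target video.
    area_w, area_h: width and height of the target display area.
    """
    scale = image_h / area_h         # claim 4: ratio of image height to area height
    shown_w = image_w / scale        # assumed: image width divided by the same ratio
    return (shown_w - area_w) / 2    # claim 5: half of the width difference
```

Under these assumptions, a 1920x1080 video and a 1440x1080 target display area give a scaling ratio of 1, a to-be-displayed width of 1920 and a first moving distance of 240.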
PCT/CN2021/119052 2020-10-15 2021-09-17 Display device and media asset playing method WO2022078154A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202180068337.4A CN116324700A (en) 2020-10-15 2021-09-17 Display equipment and media asset playing method

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
CN202011102193.3A CN112272324B (en) 2020-10-15 2020-10-15 Follow-up mode control method and display device
CN202011102193.3 2020-10-15
CN202110275148.6 2021-03-15
CN202110275148.6A CN113051435B (en) 2021-03-15 2021-03-15 Server and medium resource dotting method
CN202110448074.1A CN113051432B (en) 2021-04-25 2021-04-25 Display device and media asset playing method
CN202110448074.1 2021-04-25

Publications (1)

Publication Number Publication Date
WO2022078154A1 true WO2022078154A1 (en) 2022-04-21

Family

ID=81207701

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/119052 WO2022078154A1 (en) 2020-10-15 2021-09-17 Display device and media asset playing method

Country Status (2)

Country Link
CN (1) CN116324700A (en)
WO (1) WO2022078154A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115640414A (en) * 2022-08-10 2023-01-24 荣耀终端有限公司 Image display method and electronic equipment

Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW200633514A (en) * 2005-03-08 2006-09-16 Teco Elec & Machinery Co Ltd Method and apparatus for adding background of auxiliary information of TV program
US20070028183A1 (en) * 2005-07-27 2007-02-01 Microsoft Corporation Media user interface layers and overlays
US20100299630A1 (en) * 2009-05-22 2010-11-25 Immersive Media Company Hybrid media viewing application including a region of interest within a wide field of view
US20130082957A1 (en) * 2011-09-27 2013-04-04 Z124 Gallery video player movement iconography
CN104904126A (en) * 2013-01-07 2015-09-09 三星电子株式会社 Method and mobile device for displaying image
CN105808040A (en) * 2014-12-30 2016-07-27 华为终端(东莞)有限公司 Display method of graphical user interface, and mobile terminal
CN105872710A (en) * 2016-03-31 2016-08-17 乐视控股(北京)有限公司 Video playing method, device and client
CN105898397A (en) * 2015-12-14 2016-08-24 乐视网信息技术(北京)股份有限公司 Multimedia play method and device and mobile terminal equipment based on Android platform
CN106648375A (en) * 2016-12-30 2017-05-10 合网络技术(北京)有限公司 Transverse video playing page operating method and system of mobile terminal
CN112162672A (en) * 2020-10-19 2021-01-01 腾讯科技(深圳)有限公司 Information flow display processing method and device, electronic equipment and storage medium
CN112272324A (en) * 2020-10-15 2021-01-26 聚好看科技股份有限公司 Follow-up mode control method and display device
CN112423086A (en) * 2020-11-10 2021-02-26 北京达佳互联信息技术有限公司 Video display method and device, electronic equipment and storage medium
CN113051435A (en) * 2021-03-15 2021-06-29 聚好看科技股份有限公司 Server and media asset dotting method
CN113051432A (en) * 2021-04-25 2021-06-29 聚好看科技股份有限公司 Display device and media asset playing method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115640414A (en) * 2022-08-10 2023-01-24 荣耀终端有限公司 Image display method and electronic equipment
CN115640414B (en) * 2022-08-10 2023-09-26 荣耀终端有限公司 Image display method and electronic device

Also Published As

Publication number Publication date
CN116324700A (en) 2023-06-23

Similar Documents

Publication Publication Date Title
CN112272324B (en) Follow-up mode control method and display device
CN102402382A (en) Information processing device and information processing method
WO2021032092A1 (en) Display device
US20130265448A1 (en) Analyzing Human Gestural Commands
CN108462729B (en) Method and device for realizing interaction of terminal equipment, terminal equipment and server
CN103139481A (en) Camera device and camera method
US20190110002A1 (en) Method for Using Deep Learning for Facilitating Real-Time View Switching and Video Editing on Computing Devices
US20230209204A1 (en) Display apparatus and camera tracking method
WO2022078154A1 (en) Display device and media asset playing method
WO2022100262A1 (en) Display device, human body posture detection method, and application
KR102355008B1 (en) Method of providing personal training service and recording medium thereof
CN113051435B (en) Server and medium resource dotting method
CA3185967A1 (en) Systems and methods for personalized exercise protocols and tracking thereof
CN109739414A (en) A kind of image processing method, mobile terminal, computer readable storage medium
US20180369678A1 (en) System and Apparatus for Sports Training
CN111857338A (en) Method suitable for using mobile application on large screen
US20220366811A1 (en) Systems and methods for sports and movement training
CN114339149A (en) Electronic device and learning supervision method
CN115244503A (en) Display device
CN105491418A (en) Remote control device and method, and electronic device
CN113678137A (en) Display device
WO2024055661A1 (en) Display device and display method
CN113055707A (en) Video display method and device
WO2022135177A1 (en) Control method and electronic device
KR102589169B1 (en) Method for golf lesson using motion picture

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21879200

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the addressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 090823)

122 Ep: pct application non-entry in european phase

Ref document number: 21879200

Country of ref document: EP

Kind code of ref document: A1