WO2016192013A1 - Multimedia processing method and apparatus - Google Patents

Multimedia processing method and apparatus

Info

Publication number
WO2016192013A1
Authority
WO
WIPO (PCT)
Prior art keywords
multimedia
behavior
viewer
changes
screen
Prior art date
Application number
PCT/CN2015/080518
Other languages
English (en)
French (fr)
Inventor
刘洁
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Priority to EP15893672.4A priority Critical patent/EP3306463A4/en
Priority to PCT/CN2015/080518 priority patent/WO2016192013A1/zh
Priority to CN201580080439.2A priority patent/CN107615236A/zh
Priority to US15/578,566 priority patent/US20180160174A1/en
Publication of WO2016192013A1 publication Critical patent/WO2016192013A1/zh

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/44213Monitoring of end-user related data
    • H04N21/44218Detecting physical presence or behaviour of the user, e.g. using sensors to detect if the user is leaving the room or changes his face expression during a TV program
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/11Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information not detectable on the record carrier
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/14Digital output to display device ; Cooperation and interconnection of the display device with other functional units
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/422Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/4223Cameras
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/466Learning process for intelligent management, e.g. learning user preferences for recommending movies
    • H04N21/4667Processing of monitored end-user data, e.g. trend analysis based on the log file of viewer selections
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47217End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for controlling playback functions for recorded or on-demand content, e.g. using progress bars, mode or play-point indicators or bookmarks
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85Assembly of content; Generation of multimedia applications
    • H04N21/854Content authoring
    • H04N21/8549Creating video summaries, e.g. movie trailer

Definitions

  • Embodiments of the present invention relate to communication technologies, and in particular to a multimedia processing method and apparatus.
  • In the prior art, a video website provides the user with a single video viewing mode. For example, when watching a live broadcast online, video that has already played cannot be viewed again, so a user who steps away may miss highlight clips. When watching a replay online, or a video resource downloaded to local storage over the network, a user who wants to watch selectively (for example, only the highlights) can only fast-forward or drag the progress bar, and cannot jump directly to the highlights.
  • the prior art multimedia processing method provides a single viewing mode and the human-computer interaction is not intelligent enough.
  • the embodiment of the invention provides a multimedia processing method and device to improve the intelligence of human-computer interaction.
  • a first aspect of the embodiments of the present invention provides a multimedia processing method, including:
  • the specific content is a first part content or a second part content; the processing the specific content of the multimedia, including:
  • the second portion of content of the multimedia is tagged according to a change in behavior of the multimedia viewer.
  • the storing of the first part content of the multimedia according to the behavior change of the multimedia viewer includes:
  • it is monitored at a second time that the multimedia viewer changes from not viewing multimedia to viewing multimedia; if the time interval between the first time and the second time is greater than the first preset threshold, the content of the multimedia played from the first time to the second time is stored.
  • the monitoring that the multimedia viewer changes from viewing multimedia behavior to not viewing multimedia behavior includes:
  • the monitoring that the multimedia viewer changes from not viewing multimedia to viewing multimedia includes:
  • before the monitoring, at the second time, that the multimedia viewer changes from not viewing multimedia to viewing multimedia, the method further includes:
  • the monitoring that the multimedia viewer changes from viewing multimedia behavior to not viewing multimedia behavior includes:
  • the multimedia viewer is determined, according to the number of people, to change from viewing multimedia to not viewing multimedia;
  • the monitoring that the multimedia viewer changes from not viewing multimedia to viewing multimedia includes:
  • the multimedia viewer is determined, according to the number of people, to change from not viewing multimedia to viewing multimedia;
  • if it is detected that the distance between a facial image among the multimedia viewers and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance, determining that the multimedia viewer changes from not viewing multimedia to viewing multimedia;
  • if it is detected that the number of people among the multimedia viewers whose line of sight changes from outside the display area of the screen to within the display area of the screen is greater than the preset number of people, determining that the multimedia viewer changes from not viewing multimedia to viewing multimedia.
  • the marking of the second part content of the multimedia according to the behavior change of the multimedia viewer includes:
  • the content of the multimedia played from the third time to the fourth time is marked as candidate highlight content
  • the marking of the candidate highlight content as highlight content includes:
  • a thumbnail of the multimedia within the time interval is generated according to the time interval that the highlight content occupies in the playback progress of the multimedia.
  • a second aspect of the embodiments of the present invention provides a multimedia processing apparatus, including:
  • a monitoring module for monitoring behavior changes of a multimedia viewer
  • An identification module configured to identify a specific content of the multimedia according to a behavior change of the multimedia viewer
  • a processing module configured to process the specific content of the multimedia.
  • the specific content is a first part content or a second part content;
  • the processing module includes:
  • a storage unit configured to store the first part of the content of the multimedia according to the behavior change of the multimedia viewer
  • a marking unit configured to mark the second part of the content of the multimedia according to the behavior change of the multimedia viewer.
  • the storage unit is specifically configured to: if the monitoring module monitors, at a first time, that the multimedia viewer changes from viewing multimedia to not viewing multimedia, and monitors, at a second time, that the multimedia viewer changes from not viewing multimedia to viewing multimedia, and the time interval between the first time and the second time is greater than the first preset threshold, store the content of the multimedia played from the first time to the second time.
  • the monitoring module is specifically configured to: if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to the first preset angle to be greater than the first preset angle, determine the multimedia viewer from Watching multimedia behavior changes to not watching multimedia behavior;
  • the monitoring module is specifically configured to determine a multimedia viewer if the distance between the facial image of the multimedia viewer and the screen of the multimedia player is changed from less than or equal to a first preset distance to greater than a first preset distance. Change from watching multimedia behavior to not watching multimedia behavior;
  • the monitoring module is specifically configured to: if it is detected that the line of sight of the multimedia viewer changes from being within the display area of the screen to being outside the display area of the screen, determining that the multimedia viewer changes from viewing the multimedia behavior to not viewing Multimedia behavior;
  • the monitoring module is specifically configured to: if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than a first preset angle to less than or equal to the first preset angle, determine that the multimedia viewer changes from not viewing multimedia to viewing multimedia;
  • the monitoring module is specifically configured to: if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than a first preset distance to less than or equal to the first preset distance, determine that the multimedia viewer changes from not viewing multimedia to viewing multimedia;
  • the monitoring module is specifically configured to: if it is detected that the line of sight of the multimedia viewer changes from outside the display area of the screen to within the display area of the screen, determine that the multimedia viewer changes from not viewing multimedia to viewing multimedia.
  • the monitoring module is further configured to determine that the multimedia viewer monitored at the first time and the multimedia viewer monitored at the second time are the same viewer.
  • the monitoring module is specifically configured to: if it is detected that the number of people among the multimedia viewers whose facial image's angle to the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle is greater than the preset number of people, determine that the multimedia viewer changes from viewing multimedia to not viewing multimedia;
  • the monitoring module is specifically configured to: if it is detected that the number of people among the multimedia viewers whose facial image's distance from the screen of the multimedia player changes from less than or equal to the first preset distance to greater than the first preset distance is greater than the preset number of people, determine that the multimedia viewer changes from viewing multimedia to not viewing multimedia;
  • the monitoring module is specifically configured to: if it is detected that the number of people among the multimedia viewers whose line of sight changes from within the display area of the screen to outside the display area of the screen is greater than the preset number of people, determine that the multimedia viewer changes from viewing multimedia to not viewing multimedia;
  • the monitoring module is specifically configured to: if it is detected that the number of people among the multimedia viewers whose facial image's angle to the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle is greater than the preset number of people, determine that the multimedia viewer changes from not viewing multimedia to viewing multimedia;
  • the monitoring module is specifically configured to: if it is detected that the number of people among the multimedia viewers whose facial image's distance from the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance is greater than the preset number of people, determine that the multimedia viewer changes from not viewing multimedia to viewing multimedia;
  • the monitoring module is specifically configured to: if it is detected that the number of people among the multimedia viewers whose line of sight changes from outside the display area of the screen to within the display area of the screen is greater than the preset number of people, determine that the multimedia viewer changes from not viewing multimedia to viewing multimedia.
  • the marking unit is specifically configured to: if the monitoring module monitors, at a third time, that the multimedia viewer changes from a calm expression to a non-calm expression, and monitors, at a fourth time, that the multimedia viewer changes from a non-calm expression to a calm expression, mark the content of the multimedia played from the third time to the fourth time as candidate highlight content; and if the number of times the content played from the third time to the fourth time is marked as candidate highlight content is greater than or equal to the second preset threshold, mark the candidate highlight content as highlight content.
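The promotion from candidate highlight content to highlight content can be sketched as a simple count against the second preset threshold. This is a minimal illustration, not the patent's implementation: the function name `mark_highlights`, the representation of a segment as a `(third_time, fourth_time)` pair in seconds, and the assumption that repeated markings share identical boundaries are all hypothetical simplifications.

```python
from collections import Counter
from typing import Iterable, List, Tuple

# Hypothetical representation: (third_time, fourth_time) in seconds of playback.
Segment = Tuple[int, int]


def mark_highlights(candidates: Iterable[Segment],
                    second_preset_threshold: int) -> List[Segment]:
    """Return the segments marked as candidates at least the threshold number of times.

    Assumption: two markings refer to the same content only when their
    boundaries are identical; a real system would likely merge overlapping
    segments instead.
    """
    counts = Counter(candidates)
    return sorted(seg for seg, n in counts.items()
                  if n >= second_preset_threshold)
```

For instance, with a threshold of 2, a segment marked twice is promoted while a segment marked once stays a candidate.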
  • the marking unit is specifically configured to generate a playback heat curve of the multimedia according to the time interval that the highlight content occupies in the playback progress of the multimedia; or
  • the marking unit is specifically configured to generate a thumbnail of the multimedia within that time interval according to the time interval that the highlight content occupies in the playback progress of the multimedia.
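One way to picture the playback heat curve is as a per-second count of how many highlight intervals cover each point in the playback progress. The text does not say how the curve is computed, so the sketch below is an assumption: `playback_heat_curve`, the one-second resolution, and the half-open `[start, end)` intervals are illustrative choices only.

```python
from typing import List, Tuple


def playback_heat_curve(duration_s: int,
                        highlights: List[Tuple[int, int]]) -> List[int]:
    """Build a per-second heat value from highlight intervals.

    Each interval is (start_s, end_s), treated as half-open [start, end).
    The heat at second t is the number of intervals covering t (assumed metric).
    """
    heat = [0] * duration_s
    for start, end in highlights:
        # Clamp to the playback range so out-of-range marks are ignored.
        for t in range(max(0, start), min(duration_s, end)):
            heat[t] += 1
    return heat
```

A player could draw this list above the progress bar so that peaks indicate where highlight content sits.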
  • the multimedia processing method and apparatus provided by the embodiments of the present invention monitor the behavior change of the multimedia viewer, identify the specific content of the multimedia according to that behavior change, and process the specific content of the multimedia. The specific content of the multimedia can thus be processed according to the behavior change of the multimedia viewer, thereby providing a variety of viewing modes and improving the intelligence of human-computer interaction.
  • FIG. 1 is a schematic flowchart of Embodiment 1 of a multimedia processing method according to the present invention.
  • FIG. 2 is a schematic flowchart of Embodiment 2 of a multimedia processing method according to the present invention.
  • FIG. 3 is a schematic flowchart of Embodiment 3 of a multimedia processing method according to the present invention.
  • FIG. 4 is a schematic structural diagram of Embodiment 1 of a multimedia processing apparatus according to the present invention.
  • FIG. 5 is a schematic structural diagram of Embodiment 2 of a multimedia processing apparatus according to the present invention.
  • FIG. 6 is a schematic structural diagram of Embodiment 3 of a multimedia processing apparatus according to the present invention.
  • the multimedia in the present invention may be a text, a picture/photo, or a video or the like.
  • the present invention uses a video as an example to describe the embodiment, and other multimedia forms are similar to the video scene, and are not described again.
  • the present invention applies to video viewing scenarios, for example watching a live broadcast, watching a replay online, or watching a local video.
  • to address the problem that the prior art provides the video viewer with a single video viewing mode, the present invention monitors the behavior change of the video viewer, identifies the specific content of the video according to that behavior change, and processes the specific content accordingly. For example, the content of the video is stored according to the behavior change of the video viewer; or the content of the video is marked according to the behavior change of the video viewer. This provides a plurality of viewing modes and improves the intelligence of human-computer interaction.
  • FIG. 1 is a schematic flowchart diagram of Embodiment 1 of a multimedia processing method according to the present invention. As shown in FIG. 1, the video processing method in this embodiment includes:
  • the multimedia in this embodiment is played by a multimedia player
  • the multimedia player may be a user terminal having a multimedia playing function, such as a mobile phone, a tablet computer, and a vehicle-mounted computer.
  • the behavior of the multimedia viewer can be monitored by the front camera of the multimedia player.
  • the behavior change of the multimedia viewer is a behavior change made by the multimedia viewer during multimedia viewing. It may be that the multimedia viewer changes from viewing multimedia to not viewing multimedia, or from not viewing multimedia to viewing multimedia; or that, while viewing the multimedia, the multimedia viewer changes from a calm expression to a non-calm expression, or from a non-calm expression to a calm expression.
  • S102: Identify the specific content of the multimedia according to the behavior change of the multimedia viewer.
  • the multimedia player can determine the specific content of the multimedia based on changes in the behavior of the multimedia viewer.
  • the specific content of the multimedia may be content that the multimedia viewer missed, or highlight content. For example, if it is detected that the multimedia viewer changes from a calm expression to a non-calm expression and then from a non-calm expression back to a calm expression, the multimedia content played in that time period is identified as specific content.
  • the multimedia player processes the specific content of the multimedia, for example, storing or marking the determined specific content of the multimedia.
  • the multimedia processing method provided in this embodiment monitors the behavior change of the multimedia viewer, identifies the specific content of the multimedia according to the behavior change of the multimedia viewer, and processes the specific content of the multimedia, and can change the multimedia according to the behavior of the multimedia viewer. Specific content is processed accordingly, thereby providing a variety of viewing modes and improving the intelligence of human-computer interaction.
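The monitor, identify, process flow of this embodiment can be sketched as follows. This is an illustrative outline under stated assumptions rather than the patented implementation: the `BehaviorChange` record, the event names (`stop_viewing`, `start_viewing`, `excited`, `calm`), and the `store`/`mark` action labels are all hypothetical, and the monitoring step is assumed to have already produced a time-ordered list of behavior changes.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class BehaviorChange:
    time_s: float  # playback time at which the change was monitored
    kind: str      # hypothetical labels: stop_viewing, start_viewing, excited, calm


# Each closing change maps to (opening change, label of the enclosed content).
CLOSERS = {"start_viewing": ("stop_viewing", "missed"),
           "calm": ("excited", "highlight")}
OPENERS = {"stop_viewing", "excited"}


def identify_specific_content(
        changes: List[BehaviorChange]) -> List[Tuple[str, float, float]]:
    """Identify step: pair each opening change with its closing change."""
    open_at: Dict[str, float] = {}
    found = []
    for change in changes:
        if change.kind in OPENERS:
            open_at[change.kind] = change.time_s
        elif change.kind in CLOSERS:
            opener, label = CLOSERS[change.kind]
            if opener in open_at:
                found.append((label, open_at.pop(opener), change.time_s))
    return found


def process_specific_content(
        found: List[Tuple[str, float, float]]) -> List[Tuple[str, float, float]]:
    """Process step: store missed content, mark highlight content."""
    return [("store" if label == "missed" else "mark", start, end)
            for label, start, end in found]
```

Keeping identification separate from processing mirrors the split between the identification module and the processing module described for the apparatus.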
  • S103 specifically includes: storing specific content of the multimedia according to a behavior change of the multimedia viewer; or marking the specific content of the multimedia according to a behavior change of the multimedia viewer.
  • storing the specific content of the multimedia according to the behavior change of the multimedia viewer can be applied to the scenario of watching live multimedia online, for example watching a live video. When the multimedia viewer changes from viewing multimedia to not viewing multimedia and later changes from not viewing multimedia to viewing multimedia, the multimedia played during the viewer's absence is stored, and when the multimedia viewer resumes viewing, the viewer is prompted as to whether to view the stored multimedia. This solves the problem of content being missed because the viewer leaves while watching a live program.
  • the specific implementation is shown in Figure 2.
  • FIG. 2 is a schematic flowchart diagram of Embodiment 2 of a multimedia processing method according to the present invention.
  • the multimedia processing method of this embodiment includes:
  • whether the multimedia viewer changes from viewing multimedia to not viewing multimedia can be monitored through a change in the angle between the multimedia viewer's facial image and the screen of the multimedia player, a change in the distance between the multimedia viewer's facial image and the screen, or a change in the multimedia viewer's line of sight.
  • if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle, it is determined that the multimedia viewer changes from viewing multimedia to not viewing multimedia.
  • the angle between the face image of the multimedia viewer and the screen of the multimedia player is the angle between the plane in which the face image of the multimedia viewer is located and the plane in which the screen of the multimedia player is located.
  • the first preset angle is preset in the multimedia player: an angle less than or equal to the first preset angle indicates that the multimedia viewer can see the multimedia played on the screen of the multimedia player, and an angle greater than the first preset angle indicates that the multimedia viewer cannot see the multimedia.
  • the first preset angle can be obtained through empirical data, or can be obtained experimentally.
  • for example, the multimedia viewer can adjust the angle between the facial image and the screen in front of the screen of the multimedia player; the angle between the facial image and the screen at which the multimedia viewer can only just see the multimedia is the first preset angle.
  • if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to the first preset distance to greater than the first preset distance, it is determined that the multimedia viewer changes from viewing multimedia to not viewing multimedia.
  • the first preset distance is preset in the multimedia player: a distance less than or equal to the first preset distance indicates that the multimedia viewer can see the multimedia played on the screen of the multimedia player, and a distance greater than the first preset distance indicates that the multimedia viewer cannot see the multimedia.
  • the distance of the multimedia viewer's facial image from the screen can be measured by the distance sensor.
  • if it is detected that the multimedia viewer's line of sight changes from within the display area of the screen of the multimedia player to outside the display area of the screen, it is determined that the multimedia viewer changes from viewing multimedia to not viewing multimedia.
  • a line of sight located within the display area of the screen of the multimedia player indicates that the multimedia viewer can see the multimedia played on the screen, and a line of sight located outside the display area of the screen indicates that the multimedia viewer cannot see the multimedia.
  • the multimedia viewer's line of sight can be monitored by tracking the multimedia viewer's eye movements.
  • the above three cases satisfy any one, that is, the multimedia viewer is considered to change from watching multimedia behavior to not watching multimedia behavior.
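The three single-viewer conditions above can be sketched as a comparison between two consecutive observations, where any one crossing is enough. The `Observation` record, its field names and units, and the function name are assumptions for illustration; how the angle, distance, and gaze values are actually measured (camera, distance sensor, eye tracking) is outside this sketch.

```python
from dataclasses import dataclass


@dataclass
class Observation:
    angle_deg: float      # angle between the face plane and the screen plane
    distance_m: float     # distance from the facial image to the screen
    gaze_on_screen: bool  # True if the line of sight is within the display area


def changed_to_not_viewing(prev: Observation, cur: Observation,
                           first_preset_angle: float,
                           first_preset_distance: float) -> bool:
    """Return True if any one of the three signals crosses out of range."""
    # Angle goes from <= preset to > preset.
    angle_crossed = prev.angle_deg <= first_preset_angle < cur.angle_deg
    # Distance goes from <= preset to > preset.
    distance_crossed = prev.distance_m <= first_preset_distance < cur.distance_m
    # Line of sight moves from inside the display area to outside it.
    gaze_left = prev.gaze_on_screen and not cur.gaze_on_screen
    return angle_crossed or distance_crossed or gaze_left
```

The reverse transition (not viewing to viewing) would compare the same signals in the opposite direction.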
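  • when the multimedia viewer is at least two people, whether the multimedia viewers change from viewing multimedia to not viewing multimedia can be monitored through the number of people whose facial image's angle to the screen changes, the number of people whose facial image's distance from the screen changes, or the number of people whose line of sight changes.
  • if it is detected that the number of people among the multimedia viewers whose facial image's angle to the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle is greater than the preset number of people, it is determined that the multimedia viewer changes from viewing multimedia to not viewing multimedia.
  • if it is detected that the number of people among the multimedia viewers whose facial image's distance from the screen of the multimedia player changes from less than or equal to the first preset distance to greater than the first preset distance is greater than the preset number of people, it is determined that the multimedia viewer changes from viewing multimedia to not viewing multimedia.
  • if it is detected that the number of people among the multimedia viewers whose line of sight changes from within the display area of the screen to outside the display area of the screen is greater than the preset number of people, it is determined that the multimedia viewer changes from viewing multimedia to not viewing multimedia.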
  • the preset number of people may be a fixed number of people set by the multimedia player, the number of multimedia viewers who initially watch multimedia, or the number of people based on a ratio of the number of people who initially watched the multimedia.
  • when the preset number of people is 1, it means that if the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle, the multimedia viewer is considered to change from viewing multimedia to not viewing multimedia.
  • when the preset number of people is 0, it means that if the angle between the facial image of one or more multimedia viewers and the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle, the multimedia viewer is considered to change from viewing multimedia to not viewing multimedia.
  • for example, the preset number of people may be 7, which means that only when it is detected that, for more than 7 of 21 multimedia viewers, the angle between the facial image and the screen changes from less than or equal to the first preset angle to greater than the first preset angle, the distance between the facial image and the screen changes from less than or equal to the first preset distance to greater than the first preset distance, or the line of sight changes from within the display area of the screen to outside the display area of the screen, is the multimedia viewer considered to change from viewing multimedia to not viewing multimedia.
  • the angle between the facial image of each of the at least two multimedia viewers and the screen, the distance of each facial image from the screen, or the line of sight of each multimedia viewer is monitored.
  • the above three cases satisfy any one, that is, the multimedia viewer is considered to change from watching multimedia behavior to not watching multimedia behavior.
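For several viewers, the decision reduces to counting how many individuals show the change and comparing that count with the preset number of people. The sketch below assumes the per-viewer result has already been computed (for example by a single-viewer check on angle, distance, or line of sight); the function name and the boolean-list input are hypothetical.

```python
from typing import Sequence


def group_changed_to_not_viewing(per_viewer_changed: Sequence[bool],
                                 preset_number_of_people: int) -> bool:
    """Return True when strictly more viewers than the preset number changed.

    per_viewer_changed[i] is True if viewer i's angle, distance, or line of
    sight crossed from the viewing range to the not-viewing range.
    """
    return sum(per_viewer_changed) > preset_number_of_people
```

With a preset number of 0, a single viewer looking away is enough; with a preset number of 7 and 21 viewers, at least 8 must look away.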
  • S202: Monitor, at a second time, that the multimedia viewer changes from not viewing multimedia to viewing multimedia.
  • when the multimedia viewer is one person, specifically, if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle, it is determined that the multimedia viewer changes from not viewing multimedia to viewing multimedia.
  • the default is that the angle between the multimedia viewer and the screen is greater than the first preset angle.
  • if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance, it is determined that the multimedia viewer changes from not viewing multimedia to viewing multimedia.
  • the default is that the distance between the multimedia viewer and the screen is greater than the first preset distance.
  • if it is detected that the multimedia viewer's line of sight changes from outside the display area of the screen to within the display area of the screen, it is determined that the multimedia viewer changes from not viewing multimedia to viewing multimedia.
  • the default is that the multimedia viewer's line of sight falls outside the display area of the screen.
  • when the multimedia viewer is at least two people, specifically, if it is detected that the number of people among the multimedia viewers whose facial image's angle to the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle is greater than the preset number of people, it is determined that the multimedia viewer changes from not viewing multimedia to viewing multimedia.
  • if it is detected that the number of people among the multimedia viewers whose facial image's distance from the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance is greater than the preset number of people, it is determined that the multimedia viewer changes from not viewing multimedia to viewing multimedia.
  • if it is detected that the number of people among the multimedia viewers whose line of sight changes from outside the display area of the screen to within the display area of the screen is greater than the preset number of people, it is determined that the multimedia viewer changes from not viewing multimedia to viewing multimedia.
  • the preset number of people in this step may be the same as or different from the preset number of people in S201.
  • specifically, when the time interval between the first time and the second time is greater than the first preset threshold, the multimedia viewer is considered to have missed a segment of the multimedia, and the playback content of the multimedia in the period from the first time to the second time is stored.
  • the multimedia player can store this content by downloading the multimedia from the first time to the second time from the multimedia website server.
  • if the time interval between the first time and the second time is less than the first preset threshold: when there is one multimedia viewer, it is assumed that the viewer can infer the missed content from the context of the multimedia; when there are at least two multimedia viewers, it is assumed that a viewer can learn the missed content from the explanations of the other viewers.
  • the playback content of the multimedia may also be stored starting from the first time; if the time interval between the first time and the second time turns out to be less than the first preset threshold, the playback content stored since the first time is deleted.
  • the playback content stored for the period from the first time to the second time is played to the multimedia viewer after it is detected that the viewer has changed from not watching the multimedia to watching the multimedia. Specifically, the viewer can be asked whether he or she wants to watch the playback content stored for the period from the first time to the second time.
  • when the multimedia viewer chooses to watch the playback content stored for the period from the first time to the second time, the stored content is played. It can be played in a separate window so as not to affect normal viewing of the multimedia.
  • the multimedia processing method provided in this embodiment can be used in a scenario in which live multimedia is viewed online.
  • when it is detected at the first time that the multimedia viewer changes from watching the multimedia to not watching the multimedia, it is detected at the second time that the multimedia viewer changes from not watching the multimedia to watching the multimedia, and the time interval between the first time and the second time is greater than the first preset threshold, the playback content of the multimedia in the period from the first time to the second time is stored. This solves the problem of missing part of a live online multimedia stream because the multimedia viewer leaves for a period of time, and improves the intelligence of human-computer interaction.
  • when the multimedia viewer is one person, before S202 the method further includes: determining that the multimedia viewer detected at the second time is the same viewer as the multimedia viewer detected at the first time.
  • when there are at least two multimedia viewers, before S202 the method further includes: determining that the viewers detected at the second time as changing from not watching the multimedia to watching the multimedia are the same group as the viewers detected at the first time as changing from watching the multimedia to not watching the multimedia. Specifically, whether they are the same viewer or the same group of viewers can be determined by comparing how closely the facial features of the viewers detected at the second time match those of the viewers detected at the first time.
  • FIG. 3 is a schematic flowchart of Embodiment 3 of a multimedia processing method according to the present invention.
  • the multimedia processing method of this embodiment includes:
  • detecting that the multimedia viewer changes from a calm expression to a non-calm expression can be implemented by monitoring whether the viewer's facial flushing exceeds a third preset threshold, or whether the degree of change of the viewer's eye movement trajectory exceeds a fourth preset threshold. When the facial flushing changes from less than or equal to the third preset threshold to greater than the third preset threshold, or the degree of change of the eye movement trajectory changes from less than or equal to the fourth preset threshold to greater than the fourth preset threshold, it is determined that the multimedia viewer changes from a calm expression to a non-calm expression.
  • when the facial flushing of the multimedia viewer changes from greater than the third preset threshold to less than or equal to the third preset threshold, or the degree of change of the eye movement trajectory changes from greater than the fourth preset threshold to less than or equal to the fourth preset threshold, it is determined that the multimedia viewer changes from a non-calm expression to a calm expression.
  • when there are at least two multimedia viewers, the playback content of the multimedia in the period from the third time to the fourth time is marked once for each viewer who is detected to change from a calm expression to a non-calm expression at the third time and back to a calm expression at the fourth time.
  • the second preset threshold may be determined according to the total number of times the multimedia has been played.
  • for example, the second preset threshold may be one half of the total number of times the multimedia has been played; that is, if the number of times the playback content in the period from the third time to the fourth time is marked as candidate highlight content is greater than one half of the total number of plays, the candidate highlight content is marked as highlight content.
  • the candidate highlight content may be marked as highlight content by generating a playback heat curve of the multimedia according to the time interval of the highlight content in the playback progress of the multimedia, or by generating thumbnails of the multimedia within that time interval.
  • the playback heat curve may be generated with the playback progress of the multimedia as the abscissa and the number of times the multimedia is marked as candidate highlight content as the ordinate. It may also be generated with the playback progress as the abscissa and whether the content is marked as highlight content as the ordinate, according to the time interval of the highlight content in the playback progress.
  • alternatively, thumbnails of the multimedia are generated for the time interval that the highlight content occupies in the playback progress, that is, thumbnails of the highlight content are generated.
  • the multimedia processing method provided in this embodiment may be used in scenarios of watching replayed multimedia online or watching local multimedia.
  • when used for watching replayed multimedia online, the multimedia player marks the playback content in the period from the third time to the fourth time as candidate highlight content according to the detected behavior of the multimedia viewer, and sends the marking data to the multimedia website server. The website server marks the candidate highlight content as highlight content according to the data sent by the multimedia player and the second preset threshold, by generating the playback heat curve or by generating thumbnails of the multimedia within the time interval. The more times the multimedia is played, the more accurate the statistics-based marking of highlight content becomes.
  • in the multimedia processing method provided by this embodiment, when it is detected at the third time that the multimedia viewer changes from a calm expression to a non-calm expression, and it is detected at the fourth time that the viewer changes from a non-calm expression to a calm expression, the playback content in the period from the third time to the fourth time is marked as candidate highlight content; if the number of times that content is marked as candidate highlight content is greater than or equal to the second preset threshold, the candidate highlight content is marked as highlight content. Multimedia viewers can thus quickly locate the highlights of the multimedia, which improves the intelligence of human-computer interaction.
  • FIG. 4 is a schematic structural diagram of Embodiment 1 of a multimedia processing apparatus according to the present invention.
  • the multimedia processing apparatus provided in this embodiment includes: a monitoring module 41, configured to monitor a behavior change of a multimedia viewer; an identification module 42, configured to identify specific content of the multimedia according to the behavior change of the multimedia viewer; and a processing module 43, configured to process the specific content of the multimedia.
  • the apparatus provided in this embodiment can be used to perform the technical solution of the method embodiment shown in FIG. 1; the implementation principle is similar, and details are not described herein again.
  • in the multimedia processing apparatus provided in this embodiment, the monitoring module monitors the behavior change of the multimedia viewer, the identification module determines the specific content of the multimedia, and the processing module processes that specific content, so that the specific content can be processed in a manner corresponding to the viewer's behavior change. This provides multiple viewing modes and improves the intelligence of human-computer interaction.
  • FIG. 5 is a schematic structural diagram of Embodiment 2 of a multimedia processing apparatus according to the present invention.
  • the processing module 43 includes: a storage unit 501, configured to store specific content of the multimedia according to the behavior change of the multimedia viewer.
  • the storage unit 501 is specifically configured to: if the monitoring module 41 detects at the first time that the multimedia viewer changes from watching the multimedia to not watching the multimedia, detects at the second time that the multimedia viewer changes from not watching the multimedia to watching the multimedia, and the time interval between the first time and the second time is greater than the first preset threshold, store the playback content of the multimedia in the period from the first time to the second time.
  • when the multimedia viewer is one person, the monitoring module 41 is specifically configured to: if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle, determine that the multimedia viewer changes from watching the multimedia to not watching the multimedia.
  • alternatively, the monitoring module 41 is specifically configured to: if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to the first preset distance to greater than the first preset distance, determine that the multimedia viewer changes from watching the multimedia to not watching the multimedia.
  • alternatively, the monitoring module 41 is specifically configured to: if it is detected that the line of sight of the multimedia viewer changes from inside the display area of the screen to outside the display area of the screen, determine that the multimedia viewer changes from watching the multimedia to not watching the multimedia.
  • the monitoring module 41 is specifically configured to: if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle, determine that the multimedia viewer changes from not watching the multimedia to watching the multimedia.
  • alternatively, the monitoring module 41 is specifically configured to: if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance, determine that the multimedia viewer changes from not watching the multimedia to watching the multimedia.
  • alternatively, the monitoring module 41 is specifically configured to: if it is detected that the line of sight of the multimedia viewer changes from outside the display area of the screen to inside the display area of the screen, determine that the multimedia viewer changes from not watching the multimedia to watching the multimedia.
  • the monitoring module 41 is further configured to determine that the multimedia viewer detected at the second time and the multimedia viewer detected at the first time are the same viewer.
  • when there are at least two multimedia viewers, the monitoring module 41 is specifically configured to: if it is detected that the number of viewers for whom the angle between the facial image and the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle is greater than the preset number of people, determine that the multimedia viewers change from watching the multimedia to not watching the multimedia.
  • alternatively, the monitoring module 41 is specifically configured to: if it is detected that the number of viewers for whom the distance between the facial image and the screen of the multimedia player changes from less than or equal to the first preset distance to greater than the first preset distance is greater than the preset number of people, determine that the multimedia viewers change from watching the multimedia to not watching the multimedia.
  • alternatively, the monitoring module 41 is specifically configured to: if it is detected that the number of viewers whose line of sight changes from inside the display area of the screen to outside the display area of the screen is greater than the preset number of people, determine that the multimedia viewers change from watching the multimedia to not watching the multimedia.
  • the monitoring module 41 is specifically configured to: if it is detected that the number of viewers for whom the angle between the facial image and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle is greater than the preset number of people, determine that the multimedia viewers change from not watching the multimedia to watching the multimedia.
  • alternatively, the monitoring module 41 is specifically configured to: if it is detected that the number of viewers for whom the distance between the facial image and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance is greater than the preset number of people, determine that the multimedia viewers change from not watching the multimedia to watching the multimedia.
  • alternatively, the monitoring module 41 is specifically configured to: if it is detected that the number of viewers whose line of sight changes from outside the display area of the screen to inside the display area of the screen is greater than the preset number of people, determine that the multimedia viewers change from not watching the multimedia to watching the multimedia.
  • the apparatus provided in this embodiment is correspondingly used to implement the technical solution of the method embodiment shown in FIG. 2, and the implementation principle is similar, and details are not described herein again.
  • the multimedia processing apparatus may be used in a scenario in which live multimedia is watched online. The storage unit is specifically configured to: when the monitoring module detects at the first time that the multimedia viewer changes from watching the multimedia to not watching the multimedia, and detects at the second time that the multimedia viewer changes from not watching the multimedia to watching the multimedia, store the playback content of the multimedia in the period from the first time to the second time if the time interval between the first time and the second time is greater than the first preset threshold. This solves the problem of missing part of a live online multimedia stream because the multimedia viewer leaves for a period of time, and improves the intelligence of human-computer interaction.
  • FIG. 6 is a schematic structural diagram of Embodiment 3 of a multimedia processing apparatus according to the present invention.
  • the processing module 43 includes a marking unit 601 for marking the specific content of the multimedia according to the behavior change of the multimedia viewer.
  • the marking unit 601 is specifically configured to: if the monitoring module 41 detects at the third time that the multimedia viewer changes from a calm expression to a non-calm expression, and detects at the fourth time that the multimedia viewer changes from a non-calm expression to a calm expression, mark the playback content of the multimedia in the period from the third time to the fourth time as candidate highlight content; and if the number of times that playback content is marked as candidate highlight content is greater than or equal to the second preset threshold, mark the candidate highlight content as highlight content.
  • the marking unit 601 is specifically configured to generate a playback heat curve of the multimedia according to the time interval of the highlight content in the playback progress of the multimedia.
  • alternatively, the marking unit 601 is specifically configured to generate thumbnails of the multimedia for the time interval that the highlight content occupies in the playback progress of the multimedia.
  • the device provided in this embodiment is correspondingly used to implement the technical solution of the method embodiment shown in FIG. 3, and the implementation principle is similar, and details are not described herein again.
  • in the multimedia processing apparatus provided in this embodiment, the marking unit is specifically configured to: when the monitoring module detects at the third time that the multimedia viewer changes from a calm expression to a non-calm expression, and detects at the fourth time that the multimedia viewer changes from a non-calm expression to a calm expression, mark the playback content of the multimedia in the period from the third time to the fourth time as candidate highlight content; and if the number of times that playback content is marked as candidate highlight content is greater than or equal to the second preset threshold, mark the candidate highlight content as highlight content. Multimedia viewers can thus quickly locate the highlights of the multimedia, which improves the intelligence of human-computer interaction.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Social Psychology (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Security & Cryptography (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

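Embodiments of the present invention provide a multimedia processing method and apparatus. By monitoring a behavior change of a multimedia viewer, identifying specific content of the multimedia according to that behavior change, and processing the specific content, the specific content of the multimedia can be processed in a manner corresponding to the viewer's behavior change. This provides multiple viewing modes and improves the intelligence of human-computer interaction.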

Description

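Multimedia Processing Method and Apparatus
Technical Field
Embodiments of the present invention relate to communications technologies, and in particular to a multimedia processing method and apparatus.
Background
With the rapid development of computer network technologies, people are increasingly accustomed to watching multimedia over the network. Video viewing scenarios include, for example, watching a live broadcast online, watching a replay online, or downloading video resources over the network for local viewing.
In the prior art, video websites offer users only a single way of watching video. For example, when a live broadcast is watched online, video that has already been played cannot be watched again, so a user who steps away may miss some highlights. When a replay is watched online or video resources are downloaded for local viewing, a user who wants to watch selectively, for example only the highlights, can do so only by fast-forwarding or dragging the progress bar and cannot locate the highlights directly.
Therefore, the multimedia processing methods of the prior art offer only a single viewing mode, and human-computer interaction is not intelligent enough.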
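Summary
Embodiments of the present invention provide a multimedia processing method and apparatus to improve the intelligence of human-computer interaction.
A first aspect of the embodiments of the present invention provides a multimedia processing method, including:
monitoring a behavior change of a multimedia viewer;
identifying specific content of the multimedia according to the behavior change of the multimedia viewer; and
processing the specific content of the multimedia.
With reference to the first aspect, in a first possible implementation of the first aspect, the specific content is first partial content or second partial content, and the processing of the specific content of the multimedia includes:
storing the first partial content of the multimedia according to the behavior change of the multimedia viewer; or
marking the second partial content of the multimedia according to the behavior change of the multimedia viewer.
With reference to the first possible implementation of the first aspect, in a second possible implementation of the first aspect, the storing of the first partial content of the multimedia according to the behavior change of the multimedia viewer includes:
if it is detected at a first time that the multimedia viewer changes from watching the multimedia to not watching the multimedia, it is detected at a second time that the multimedia viewer changes from not watching the multimedia to watching the multimedia, and the time interval between the first time and the second time is greater than a first preset threshold, storing the playback content of the multimedia in the period from the first time to the second time.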
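With reference to the second possible implementation of the first aspect, in a third possible implementation of the first aspect, when the multimedia viewer is one person:
the detecting that the multimedia viewer changes from watching the multimedia to not watching the multimedia includes:
if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to a first preset angle to greater than the first preset angle, determining that the multimedia viewer changes from watching the multimedia to not watching the multimedia;
or,
if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to a first preset distance to greater than the first preset distance, determining that the multimedia viewer changes from watching the multimedia to not watching the multimedia;
or,
if it is detected that the line of sight of the multimedia viewer changes from inside the display area of the screen to outside the display area of the screen, determining that the multimedia viewer changes from watching the multimedia to not watching the multimedia;
the detecting that the multimedia viewer changes from not watching the multimedia to watching the multimedia includes:
if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle, determining that the multimedia viewer changes from not watching the multimedia to watching the multimedia;
or,
if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance, determining that the multimedia viewer changes from not watching the multimedia to watching the multimedia;
or,
if it is detected that the line of sight of the multimedia viewer changes from outside the display area of the screen to inside the display area of the screen, determining that the multimedia viewer changes from not watching the multimedia to watching the multimedia.
With reference to the third possible implementation of the first aspect, in a fourth possible implementation of the first aspect, before it is detected at the second time that the multimedia viewer changes from not watching the multimedia to watching the multimedia, the method further includes:
determining that the multimedia viewer detected at the second time and the multimedia viewer detected at the first time are the same viewer.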
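With reference to the second possible implementation of the first aspect, in a fifth possible implementation of the first aspect, when there are at least two multimedia viewers,
the detecting that the multimedia viewers change from watching the multimedia to not watching the multimedia includes:
if it is detected that the number of multimedia viewers for whom the angle between the facial image and the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle is greater than a preset number of people, determining that the multimedia viewers change from watching the multimedia to not watching the multimedia;
or,
if it is detected that the number of multimedia viewers for whom the distance between the facial image and the screen of the multimedia player changes from less than or equal to the first preset distance to greater than the first preset distance is greater than the preset number of people, determining that the multimedia viewers change from watching the multimedia to not watching the multimedia;
or,
if it is detected that the number of multimedia viewers whose line of sight changes from inside the display area of the screen to outside the display area of the screen is greater than the preset number of people, determining that the multimedia viewers change from watching the multimedia to not watching the multimedia;
the detecting that the multimedia viewers change from not watching the multimedia to watching the multimedia includes:
if it is detected that the number of multimedia viewers for whom the angle between the facial image and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle is greater than the preset number of people, determining that the multimedia viewers change from not watching the multimedia to watching the multimedia;
or,
if it is detected that the number of multimedia viewers for whom the distance between the facial image and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance is greater than the preset number of people, determining that the multimedia viewers change from not watching the multimedia to watching the multimedia;
or,
if it is detected that the number of multimedia viewers whose line of sight changes from outside the display area of the screen to inside the display area of the screen is greater than the preset number of people, determining that the multimedia viewers change from not watching the multimedia to watching the multimedia.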
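With reference to the first possible implementation of the first aspect, in a sixth possible implementation of the first aspect, the marking of the second partial content of the multimedia according to the behavior change of the multimedia viewer includes:
if it is detected at a third time that the multimedia viewer changes from a calm expression to a non-calm expression, and it is detected at a fourth time that the multimedia viewer changes from a non-calm expression to a calm expression, marking the playback content of the multimedia in the period from the third time to the fourth time as candidate highlight content; and
if the number of times the playback content of the multimedia in the period from the third time to the fourth time is marked as candidate highlight content is greater than or equal to a second preset threshold, marking the candidate highlight content as highlight content.
With reference to the sixth possible implementation of the first aspect, in a seventh possible implementation of the first aspect, the marking of the candidate highlight content as highlight content includes:
generating a playback heat curve of the multimedia according to the time interval of the highlight content in the playback progress of the multimedia; or
generating thumbnails of the multimedia for the time interval that the highlight content occupies in the playback progress of the multimedia.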
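A second aspect of the embodiments of the present invention provides a multimedia processing apparatus, including:
a monitoring module, configured to monitor a behavior change of a multimedia viewer;
an identification module, configured to identify specific content of the multimedia according to the behavior change of the multimedia viewer; and
a processing module, configured to process the specific content of the multimedia.
With reference to the second aspect, in a first possible implementation of the second aspect, the specific content is first partial content or second partial content, and the processing module includes:
a storage unit, configured to store the first partial content of the multimedia according to the behavior change of the multimedia viewer; or
a marking unit, configured to mark the second partial content of the multimedia according to the behavior change of the multimedia viewer.
With reference to the first possible implementation of the second aspect, in a second possible implementation of the second aspect, the storage unit is specifically configured to: if the monitoring module detects at a first time that the multimedia viewer changes from watching the multimedia to not watching the multimedia, detects at a second time that the multimedia viewer changes from not watching the multimedia to watching the multimedia, and the time interval between the first time and the second time is greater than a first preset threshold, store the playback content of the multimedia in the period from the first time to the second time.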
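With reference to the second possible implementation of the second aspect, in a third possible implementation of the second aspect, when the multimedia viewer is one person:
the monitoring module is specifically configured to: if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to a first preset angle to greater than the first preset angle, determine that the multimedia viewer changes from watching the multimedia to not watching the multimedia;
or,
the monitoring module is specifically configured to: if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to a first preset distance to greater than the first preset distance, determine that the multimedia viewer changes from watching the multimedia to not watching the multimedia;
or,
the monitoring module is specifically configured to: if it is detected that the line of sight of the multimedia viewer changes from inside the display area of the screen to outside the display area of the screen, determine that the multimedia viewer changes from watching the multimedia to not watching the multimedia;
the monitoring module is specifically configured to: if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle, determine that the multimedia viewer changes from not watching the multimedia to watching the multimedia;
or,
the monitoring module is specifically configured to: if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance, determine that the multimedia viewer changes from not watching the multimedia to watching the multimedia;
or,
the monitoring module is specifically configured to: if it is detected that the line of sight of the multimedia viewer changes from outside the display area of the screen to inside the display area of the screen, determine that the multimedia viewer changes from not watching the multimedia to watching the multimedia.
With reference to the third possible implementation of the second aspect, in a fourth possible implementation of the second aspect, the monitoring module is further configured to determine that the multimedia viewer detected at the second time and the multimedia viewer detected at the first time are the same viewer.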
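With reference to the second possible implementation of the second aspect, in a fifth possible implementation of the second aspect, when there are at least two multimedia viewers,
the monitoring module is specifically configured to: if it is detected that the number of multimedia viewers for whom the angle between the facial image and the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle is greater than a preset number of people, determine that the multimedia viewers change from watching the multimedia to not watching the multimedia;
or,
the monitoring module is specifically configured to: if it is detected that the number of multimedia viewers for whom the distance between the facial image and the screen of the multimedia player changes from less than or equal to the first preset distance to greater than the first preset distance is greater than the preset number of people, determine that the multimedia viewers change from watching the multimedia to not watching the multimedia;
or,
the monitoring module is specifically configured to: if it is detected that the number of multimedia viewers whose line of sight changes from inside the display area of the screen to outside the display area of the screen is greater than the preset number of people, determine that the multimedia viewers change from watching the multimedia to not watching the multimedia;
the monitoring module is specifically configured to: if it is detected that the number of multimedia viewers for whom the angle between the facial image and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle is greater than the preset number of people, determine that the multimedia viewers change from not watching the multimedia to watching the multimedia;
or,
the monitoring module is specifically configured to: if it is detected that the number of multimedia viewers for whom the distance between the facial image and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance is greater than the preset number of people, determine that the multimedia viewers change from not watching the multimedia to watching the multimedia;
or,
the monitoring module is specifically configured to: if it is detected that the number of multimedia viewers whose line of sight changes from outside the display area of the screen to inside the display area of the screen is greater than the preset number of people, determine that the multimedia viewers change from not watching the multimedia to watching the multimedia.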
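With reference to the first possible implementation of the second aspect, in a sixth possible implementation of the second aspect, the marking unit is specifically configured to: if the monitoring module detects at a third time that the multimedia viewer changes from a calm expression to a non-calm expression, and detects at a fourth time that the multimedia viewer changes from a non-calm expression to a calm expression, mark the playback content of the multimedia in the period from the third time to the fourth time as candidate highlight content; and if the number of times that playback content is marked as candidate highlight content is greater than or equal to a second preset threshold, mark the candidate highlight content as highlight content.
With reference to the sixth possible implementation of the second aspect, in a seventh possible implementation of the second aspect, the marking unit is specifically configured to generate a playback heat curve of the multimedia according to the time interval of the highlight content in the playback progress of the multimedia; or
the marking unit is specifically configured to generate thumbnails of the multimedia for the time interval that the highlight content occupies in the playback progress of the multimedia.
In the multimedia processing method and apparatus provided in the embodiments of the present invention, a behavior change of a multimedia viewer is monitored, specific content of the multimedia is identified according to that behavior change, and the specific content is processed, so that the specific content of the multimedia can be processed in a manner corresponding to the viewer's behavior change. This provides multiple viewing modes and improves the intelligence of human-computer interaction.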
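Brief Description of Drawings
To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the following briefly introduces the accompanying drawings needed for describing the embodiments or the prior art. Apparently, the accompanying drawings in the following description show only some embodiments of the present invention, and a person of ordinary skill in the art may derive other drawings from them without creative effort.
FIG. 1 is a schematic flowchart of Embodiment 1 of the multimedia processing method according to the present invention;
FIG. 2 is a schematic flowchart of Embodiment 2 of the multimedia processing method according to the present invention;
FIG. 3 is a schematic flowchart of Embodiment 3 of the multimedia processing method according to the present invention;
FIG. 4 is a schematic structural diagram of Embodiment 1 of the multimedia processing apparatus according to the present invention;
FIG. 5 is a schematic structural diagram of Embodiment 2 of the multimedia processing apparatus according to the present invention;
FIG. 6 is a schematic structural diagram of Embodiment 3 of the multimedia processing apparatus according to the present invention.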
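Detailed Description
The following clearly and completely describes the technical solutions in the embodiments of the present invention with reference to the accompanying drawings. Apparently, the described embodiments are only some rather than all of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
The multimedia in the present invention may be text, pictures/photos, video, or the like. The embodiments are described using video as an example; other forms of multimedia are similar to the video scenario and are not described again. The invention can be applied to video viewing scenarios such as watching a live broadcast online, watching a replay online, or watching local video. To solve the problem that video websites offer only a single viewing mode, a behavior change of the video viewer is monitored, specific content of the video is identified according to that behavior change, and the specific content is processed accordingly, for example by storing video content according to the viewer's behavior change or by marking video content according to the viewer's behavior change. This provides multiple viewing modes and improves the intelligence of human-computer interaction.
The technical solutions of the present invention are described in detail below using specific embodiments. The following specific embodiments may be combined with each other, and the same or similar concepts or processes may not be repeated in some embodiments.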
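FIG. 1 is a schematic flowchart of Embodiment 1 of the multimedia processing method according to the present invention. As shown in FIG. 1, the video processing method of this embodiment includes:
S101: Monitor a behavior change of a multimedia viewer.
Specifically, the multimedia in this embodiment is played by a multimedia player, which may be a user terminal with a multimedia playback function, for example a mobile phone, a tablet computer, or an in-vehicle computer.
Optionally, the behavior change of the multimedia viewer may be monitored through the front-facing camera of the multimedia player.
The behavior change of the multimedia viewer is a change in behavior made by the viewer while watching the multimedia. It may be a change from watching the multimedia to not watching it or from not watching it to watching it, or a change, while watching, from a calm expression to a non-calm expression or from a non-calm expression to a calm expression.
S102: Identify specific content of the multimedia according to the behavior change of the multimedia viewer.
Specifically, the multimedia player can determine the specific content of the multimedia according to the behavior change of the multimedia viewer. The specific content may be content that the viewer missed or highlight content. For example, if it is detected that the viewer's expression changes from calm to non-calm and then from non-calm back to calm, the multimedia content in that period is identified as specific content.
S103: Process the specific content of the multimedia.
Specifically, the multimedia player processes the specific content of the multimedia, for example by storing or marking the identified specific content.
In the multimedia processing method provided in this embodiment, a behavior change of a multimedia viewer is monitored, specific content of the multimedia is identified according to that behavior change, and the specific content is processed, so that the specific content of the multimedia can be processed in a manner corresponding to the viewer's behavior change. This provides multiple viewing modes and improves the intelligence of human-computer interaction.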
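The three steps S101 to S103 can be pictured as a small pipeline. The following is only a minimal sketch under stated assumptions: the `BehaviorChange` record, the `"left"`/`"returned"` labels, and the functions `run_pipeline` and `identify_absence` are hypothetical names introduced for illustration and do not come from the patent.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Optional, Tuple

Segment = Tuple[float, float]  # (start, end) playback positions in seconds


@dataclass
class BehaviorChange:
    time_s: float  # playback position at which the change was observed
    kind: str      # hypothetical labels: "left" or "returned"


def identify_absence(history: List[BehaviorChange]) -> Optional[Segment]:
    """S102 (sketch): a 'left' followed by a 'returned' delimits missed content."""
    if len(history) >= 2 and history[-2].kind == "left" and history[-1].kind == "returned":
        return (history[-2].time_s, history[-1].time_s)
    return None


def run_pipeline(changes: Iterable[BehaviorChange],
                 identify: Callable[[List[BehaviorChange]], Optional[Segment]],
                 process: Callable[[Segment], None]) -> List[Segment]:
    """Feed monitored changes (S101) to identify (S102) and process (S103)."""
    history: List[BehaviorChange] = []
    handled: List[Segment] = []
    for change in changes:
        history.append(change)
        segment = identify(history)
        if segment is not None:
            process(segment)  # e.g. store or mark the segment
            handled.append(segment)
    return handled
```

Passing `identify` and `process` as callables mirrors the way the embodiments below swap in different processing (storing missed content, or marking highlights) behind the same three steps.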
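Further, on the basis of the foregoing embodiment, S103 specifically includes: storing the specific content of the multimedia according to the behavior change of the multimedia viewer; or marking the specific content of the multimedia according to the behavior change of the multimedia viewer.
Storing the specific content of the multimedia according to the behavior change of the multimedia viewer can be applied to scenarios in which live multimedia is watched online, for example watching live video. When the multimedia viewer changes from watching the multimedia to not watching it, and then from not watching it to watching it, the multimedia played while the viewer was away is stored, and when the viewer resumes watching, the viewer is asked whether to watch the stored multimedia. This solves the problem of missing video because the viewer leaves while watching a live program online. A specific implementation is shown in FIG. 2.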
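FIG. 2 is a schematic flowchart of Embodiment 2 of the multimedia processing method according to the present invention. The multimedia processing method of this embodiment includes:
S201: It is detected at a first time that the multimedia viewer changes from watching the multimedia to not watching the multimedia.
When the multimedia viewer is one person, the change from watching the multimedia to not watching it can be detected by monitoring a change in the angle between the viewer's facial image and the screen of the multimedia player, a change in the distance between the viewer's facial image and the screen, or a change in the viewer's line of sight.
Specifically, if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to a first preset angle to greater than the first preset angle, it is determined that the multimedia viewer changes from watching the multimedia to not watching the multimedia. The angle between the facial image of the multimedia viewer and the screen of the multimedia player is the angle between the plane in which the viewer's facial image lies and the plane in which the screen lies. The first preset angle is set in the multimedia player in advance: an angle less than or equal to the first preset angle indicates that the viewer can see the multimedia played on the screen clearly, and an angle greater than the first preset angle indicates that the viewer cannot. The first preset angle may be obtained from empirical data or by experiment. For example, the viewer may adjust the angle between the face and the screen in front of the multimedia player, and the angle at which the viewer just changes from being able to see the multimedia to being unable to see it is taken as the first preset angle.
Alternatively, if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to a first preset distance to greater than the first preset distance, it is determined that the multimedia viewer changes from watching the multimedia to not watching the multimedia. The first preset distance is set in the multimedia player in advance: a distance less than or equal to the first preset distance indicates that the viewer can see the multimedia played on the screen clearly, and a distance greater than the first preset distance indicates that the viewer cannot. The distance between the viewer's facial image and the screen can be measured by a distance sensor.
Alternatively, if it is detected that the line of sight of the multimedia viewer changes from inside the display area of the screen of the multimedia player to outside the display area of the screen, it is determined that the multimedia viewer changes from watching the multimedia to not watching the multimedia. A line of sight inside the display area indicates that the viewer can see the multimedia played on the screen clearly, and a line of sight outside the display area indicates that the viewer cannot. The viewer's line of sight can be monitored by tracking the viewer's eye movement trajectory.
If any one of the above three cases is satisfied, the multimedia viewer is considered to have changed from watching the multimedia to not watching the multimedia.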
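The three single-viewer criteria above (angle, distance, line of sight) can be checked side by side. The sketch below is illustrative only: the `Observation` record, the function names, and the units are assumptions, and treating an undetected viewer as failing every criterion is a simplifying choice of this sketch rather than something stated at this point in the text.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Observation:
    angle_deg: float      # angle between the face plane and the screen plane
    distance_m: float     # distance from the face to the screen
    gaze_on_screen: bool  # True if the line of sight falls inside the display area


def within_limits(obs: Optional[Observation],
                  max_angle_deg: float,
                  max_distance_m: float) -> Tuple[bool, bool, bool]:
    """Return one flag per criterion; an undetected viewer fails all three."""
    if obs is None:
        return (False, False, False)
    return (obs.angle_deg <= max_angle_deg,
            obs.distance_m <= max_distance_m,
            obs.gaze_on_screen)


def stopped_watching(prev: Optional[Observation],
                     curr: Optional[Observation],
                     max_angle_deg: float,
                     max_distance_m: float) -> bool:
    """True if any single criterion went from satisfied to violated."""
    before = within_limits(prev, max_angle_deg, max_distance_m)
    after = within_limits(curr, max_angle_deg, max_distance_m)
    return any(b and not a for b, a in zip(before, after))
```

The reverse transition (not watching to watching) would be the mirror image, testing for a criterion that goes from violated to satisfied.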
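When there are at least two multimedia viewers, the change from watching the multimedia to not watching it can be detected by monitoring the number of viewers whose facial-image angle to the screen changes, the number whose facial-image distance to the screen changes, or the number whose line of sight changes.
Specifically, if it is detected that the number of multimedia viewers for whom the angle between the facial image and the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle is greater than a preset number of people, it is determined that the multimedia viewers change from watching the multimedia to not watching the multimedia.
Alternatively, if it is detected that the number of multimedia viewers for whom the distance between the facial image and the screen of the multimedia player changes from less than or equal to the first preset distance to greater than the first preset distance is greater than the preset number of people, it is determined that the multimedia viewers change from watching the multimedia to not watching the multimedia.
Alternatively, if it is detected that the number of multimedia viewers whose line of sight changes from inside the display area of the screen to outside the display area of the screen is greater than the preset number of people, it is determined that the multimedia viewers change from watching the multimedia to not watching the multimedia.
It should be noted that the preset number of people may be a fixed number set in the multimedia player, the number of viewers who were watching the multimedia at the beginning, or a number set as a proportion of the number of viewers who were watching at the beginning. When the preset number is 1, the viewers are considered to have changed from watching the multimedia to not watching it if, for two or more of them, the angle between the facial image and the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle. When the preset number is 0, the viewers are considered to have changed from watching the multimedia to not watching it if this happens for one or more of them.
For example, when 21 people are watching the multimedia at the beginning, the preset number of people may be 7. This means that the 21 viewers are considered to have changed from watching the multimedia to not watching it only when, for more than 7 of them, the angle between the facial image and the screen changes from less than or equal to the first preset angle to greater than the first preset angle, the distance between the facial image and the screen changes from less than or equal to the first preset distance to greater than the first preset distance, or the line of sight changes from inside the display area of the screen to outside it.
The angle between the facial image of each of the at least two multimedia viewers and the screen, the distance of each facial image from the screen, or the line of sight of each multimedia viewer is monitored.
If any one of the above three cases is satisfied, the multimedia viewers are considered to have changed from watching the multimedia to not watching the multimedia.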
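For the multi-viewer case, the decision reduces to counting how many viewers crossed a limit and comparing that count with the preset number of people. A minimal sketch, assuming per-viewer boolean flags for one criterion (angle, distance, or line of sight) sampled before and after; the function names are hypothetical:

```python
from typing import Sequence


def count_changed(before: Sequence[bool], after: Sequence[bool]) -> int:
    """Count viewers who satisfied the criterion before and violate it now."""
    return sum(1 for b, a in zip(before, after) if b and not a)


def group_stopped_watching(before: Sequence[bool],
                           after: Sequence[bool],
                           preset_number: int) -> bool:
    """The group stops watching when strictly more than preset_number viewers changed."""
    return count_changed(before, after) > preset_number
```

With 21 viewers and a preset number of 7, seven viewers turning away is not enough to trigger the change, while eight is, which matches the strict "greater than" comparison in the text.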
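S202: It is detected at a second time that the multimedia viewer changes from not watching the multimedia to watching the multimedia.
When the multimedia viewer is one person, specifically, if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle, it is determined that the multimedia viewer changes from not watching the multimedia to watching the multimedia.
It should be noted that if no multimedia viewer is detected, the angle between the multimedia viewer and the screen is taken by default to be greater than the first preset angle.
Alternatively, if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance, it is determined that the multimedia viewer changes from not watching the multimedia to watching the multimedia.
It should be noted that if no multimedia viewer is detected, the distance between the multimedia viewer and the screen is taken by default to be greater than the first preset distance.
Alternatively, if it is detected that the line of sight of the multimedia viewer changes from outside the display area of the screen to inside the display area of the screen, it is determined that the multimedia viewer changes from not watching the multimedia to watching the multimedia.
It should be noted that if no multimedia viewer is detected, the viewer's line of sight is taken by default to fall outside the display area of the screen.
When there are at least two multimedia viewers, specifically, if it is detected that the number of multimedia viewers for whom the angle between the facial image and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle is greater than the preset number of people, it is determined that the multimedia viewers change from not watching the multimedia to watching the multimedia.
Alternatively, if it is detected that the number of multimedia viewers for whom the distance between the facial image and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance is greater than the preset number of people, it is determined that the multimedia viewers change from not watching the multimedia to watching the multimedia.
Alternatively, if it is detected that the number of multimedia viewers whose line of sight changes from outside the display area of the screen to inside the display area of the screen is greater than the preset number of people, it is determined that the multimedia viewers change from not watching the multimedia to watching the multimedia.
It should be noted that the preset number of people in this step may be the same as or different from the preset number of people in S201.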
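S203: If the time interval between the first time and the second time is greater than a first preset threshold, store the playback content of the multimedia in the period from the first time to the second time.
Specifically, when the time interval between the first time and the second time is greater than the first preset threshold, the multimedia viewer is considered to have missed a segment of the multimedia, and the playback content of the multimedia in the period from the first time to the second time is stored. The multimedia player can store this content by downloading the multimedia from the first time to the second time from the multimedia website server.
If the time interval between the first time and the second time is less than the first preset threshold: when there is one multimedia viewer, it is assumed that the viewer can infer the content missed during the interval from the context of the multimedia; when there are at least two multimedia viewers, it is assumed that a viewer can learn the content missed during the interval from the explanations of the other viewers.
It should be noted that the playback content of the multimedia may also be stored starting from the first time; if the time interval between the first time and the second time turns out to be less than the first preset threshold, the playback content stored since the first time is deleted.
The stored playback content for the period from the first time to the second time is played to the multimedia viewer after it is detected that the viewer has changed from not watching the multimedia to watching the multimedia. Specifically, the viewer can be asked whether he or she wants to watch the playback content stored for the period from the first time to the second time. When the viewer chooses to watch it, the stored multimedia content is played. It can be played in a separate window so as not to affect normal viewing of the multimedia.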
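The storage rule of S203 can be sketched as a small recorder that remembers when the viewer left and keeps the segment only if the absence exceeds the first preset threshold. This is an illustrative sketch, not the patent's implementation: the class name `MissedSegmentRecorder`, its methods, and the use of seconds are assumptions, and actual downloading or buffering of media is left out.

```python
from typing import List, Optional, Tuple


class MissedSegmentRecorder:
    """Tracks absences and keeps those longer than min_gap_s (first preset threshold)."""

    def __init__(self, min_gap_s: float) -> None:
        self.min_gap_s = min_gap_s
        self.left_at: Optional[float] = None
        self.segments: List[Tuple[float, float]] = []

    def on_left(self, t: float) -> None:
        """First time: the viewer changed from watching to not watching."""
        self.left_at = t

    def on_returned(self, t: float) -> Optional[Tuple[float, float]]:
        """Second time: return the segment to store, or None if the gap was short."""
        if self.left_at is None:
            return None
        start, self.left_at = self.left_at, None
        if t - start > self.min_gap_s:
            segment = (start, t)
            self.segments.append(segment)
            return segment
        return None  # short absence: nothing is kept
```

A caller that started buffering at `on_left` would delete its buffer when `on_returned` yields `None`, which corresponds to the variant in which storage begins at the first time and is discarded for short absences.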
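The multimedia processing method provided in this embodiment can be used in scenarios in which live multimedia is watched online. When it is detected at the first time that the multimedia viewer changes from watching the multimedia to not watching it, it is detected at the second time that the viewer changes from not watching the multimedia to watching it, and the time interval between the first time and the second time is greater than the first preset threshold, the playback content of the multimedia in the period from the first time to the second time is stored. This solves the problem of missing part of a live online multimedia stream because the viewer leaves for a period of time, and improves the intelligence of human-computer interaction.
Further, on the basis of Embodiment 2, when the multimedia viewer is one person, before S202 the method further includes: determining that the multimedia viewer detected at the second time and the multimedia viewer detected at the first time are the same viewer. When there are at least two multimedia viewers, before S202 the method further includes: determining that the viewers detected at the second time as changing from not watching the multimedia to watching the multimedia are the same group as the viewers detected at the first time as changing from watching the multimedia to not watching the multimedia. Specifically, whether they are the same viewer or the same group of viewers can be determined by comparing how closely the facial features of the viewers detected at the second time match those of the viewers detected at the first time.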
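The text does not say how the match between facial features is computed. As one possible illustration, the sketch below assumes each viewer is represented by a numeric feature vector (produced by some face-analysis step that is out of scope here) and uses cosine similarity with a hypothetical threshold of 0.9; both the representation and the threshold are assumptions, not the patent's method.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def is_same_viewer(first: Sequence[float],
                   second: Sequence[float],
                   min_similarity: float = 0.9) -> bool:
    """Single viewer: features at the second time match those at the first time."""
    return cosine_similarity(first, second) >= min_similarity


def is_same_group(first: Sequence[Sequence[float]],
                  second: Sequence[Sequence[float]],
                  min_similarity: float = 0.9) -> bool:
    """Group: every returning viewer matches someone who left at the first time."""
    return bool(second) and all(
        any(is_same_viewer(f, s, min_similarity) for f in first) for s in second
    )
```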
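Marking the specific content of the multimedia according to the behavior change of the multimedia viewer can be applied to scenarios of watching replayed multimedia online or watching local multimedia. While watching, a viewer behaves differently depending on the content; for example, if the content is exciting, the viewer's facial expression tends to be more animated. The corresponding content is marked according to the viewer's behavior, and later viewers can choose what to watch according to the marking results, which solves the problem that viewers cannot locate highlights directly. As shown in FIG. 3, FIG. 3 is a schematic flowchart of Embodiment 3 of the multimedia processing method according to the present invention. The multimedia processing method of this embodiment includes:
S301: It is detected at a third time that the multimedia viewer changes from a calm expression to a non-calm expression.
Specifically, the change from a calm expression to a non-calm expression can be detected by monitoring whether the viewer's facial flushing exceeds a third preset threshold, or whether the degree of change of the viewer's eye movement trajectory exceeds a fourth preset threshold. When the viewer's facial flushing changes from less than or equal to the third preset threshold to greater than the third preset threshold, or the degree of change of the viewer's eye movement trajectory changes from less than or equal to the fourth preset threshold to greater than the fourth preset threshold, it is determined that the multimedia viewer changes from a calm expression to a non-calm expression.
S302: It is detected at a fourth time that the multimedia viewer changes from a non-calm expression to a calm expression, and the playback content of the multimedia in the period from the third time to the fourth time is marked as candidate highlight content.
Specifically, when it is detected that the viewer's facial flushing changes from greater than the third preset threshold to less than or equal to the third preset threshold, or that the degree of change of the viewer's eye movement trajectory changes from greater than the fourth preset threshold to less than or equal to the fourth preset threshold, it is determined that the multimedia viewer changes from a non-calm expression to a calm expression. The playback content in the period from the third time to the fourth time is then marked as candidate highlight content: when the multimedia viewer is one person, the content is marked once; when there are at least two multimedia viewers, the content is marked once for each viewer who is detected to change from a calm expression to a non-calm expression at the third time and back to a calm expression at the fourth time.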
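Steps S301 and S302 amount to finding the intervals during which a viewer's measurements stay above the thresholds. The sketch below assumes time-ordered samples of the form `(time, flushing, eye_change)` on arbitrary numeric scales; the sample format, the function name, and the simplification of treating the viewer as non-calm whenever either measurement exceeds its threshold are assumptions for illustration.

```python
from typing import Iterable, List, Tuple


def find_candidate_intervals(samples: Iterable[Tuple[float, float, float]],
                             flush_threshold: float,
                             eye_threshold: float) -> List[Tuple[float, float]]:
    """Return (third_time, fourth_time) pairs delimiting non-calm periods."""
    intervals: List[Tuple[float, float]] = []
    start = None
    for t, flushing, eye_change in samples:
        non_calm = flushing > flush_threshold or eye_change > eye_threshold
        if non_calm and start is None:
            start = t                     # calm -> non-calm: third time
        elif not non_calm and start is not None:
            intervals.append((start, t))  # non-calm -> calm: fourth time
            start = None
    return intervals
```

Each returned interval would count as one candidate-highlight mark for that viewer; an interval still open when the samples end is not reported because the fourth time has not been observed.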
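S303: If the number of times the content played from the third time to the fourth time has been marked as candidate highlight content is greater than or equal to a second preset threshold, mark the candidate highlight content as highlight content.
Specifically, the second preset threshold may be determined from the total number of times the multimedia has been played. For example, the second preset threshold may be one half of the total number of plays; that is, if the number of times the content played from the third time to the fourth time has been marked as candidate highlight content is greater than or equal to one half of the total number of plays, the candidate highlight content is marked as highlight content.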
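As a worked example of S303, the sketch below promotes candidate spans to highlight content using the "one half of total plays" example value and the "greater than or equal to" comparison stated in S303. The function name and the sample counts are hypothetical.

```python
from typing import Dict, List, Tuple

Span = Tuple[int, int]  # (third_time, fourth_time) in seconds of playback progress


def select_highlights(mark_counts: Dict[Span, int], total_plays: int) -> List[Span]:
    """Keep spans whose candidate-mark count reaches the second preset threshold."""
    second_threshold = total_plays / 2  # example value given in the description
    return sorted(span for span, count in mark_counts.items()
                  if count >= second_threshold)


# Hypothetical statistics after 10 plays of the same multimedia.
mark_counts = {(10, 20): 6, (40, 50): 3, (70, 80): 5}
highlight_spans = select_highlights(mark_counts, total_plays=10)
```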
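The candidate highlight content may be marked as highlight content by generating a playback heat curve of the multimedia according to the time interval that the highlight content occupies in the playback progress of the multimedia, or by generating thumbnails of the multimedia within that time interval.
Specifically, the playback heat curve may be generated with the playback progress of the multimedia on the horizontal axis and the number of times the multimedia was marked as candidate highlight content on the vertical axis. Alternatively, it may be generated with the playback progress on the horizontal axis and whether the content is marked as highlight content on the vertical axis, according to the time interval that the highlight content occupies in the playback progress.
Alternatively, thumbnails of the multimedia are generated within the time interval that the highlight content occupies in the playback progress; that is, thumbnails of the highlight content are generated.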
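The first form of heat curve described above (playback progress on the horizontal axis, number of candidate marks on the vertical axis) can be sketched as a per-second tally. One-second resolution, integer timestamps, and the function name are assumptions made for this example.

```python
from typing import Iterable, List, Tuple


def heat_curve(marked_spans: Iterable[Tuple[int, int]], duration_s: int) -> List[int]:
    """Return curve[s] = number of candidate marks covering second s of playback.

    Each span is (start, end) in whole seconds; the end second is exclusive.
    """
    curve = [0] * duration_s
    for start, end in marked_spans:
        for second in range(max(0, start), min(duration_s, end)):
            curve[second] += 1
    return curve


# Two marks that overlap between seconds 2 and 4 of a 6-second clip.
curve = heat_curve([(1, 4), (2, 5)], duration_s=6)
```

The index of each list element is the playback position and its value is the heat, so the list can be handed directly to any plotting routine.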
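The multimedia processing method provided in this embodiment can be applied to the scenario of watching replayed multimedia online or watching local multimedia. In the online replay scenario, the multimedia player marks the content played from the third time to the fourth time as candidate highlight content according to the detected behavior of the multimedia viewer and sends the marking data to the multimedia website server. The website server then uses the data sent by the multimedia player and the second preset threshold to mark candidate highlight content as highlight content, either by generating a playback heat curve of the multimedia or by generating thumbnails of the multimedia within the time interval. The more times the multimedia has been played, the more accurate the statistically based marking of highlight content becomes.
In the multimedia processing method provided in this embodiment, when it is detected at the third time that the multimedia viewer changes from a calm expression to a non-calm expression and it is detected at the fourth time that the viewer changes from a non-calm expression to a calm expression, the content played from the third time to the fourth time is marked as candidate highlight content. If the number of times that content has been marked as candidate highlight content is greater than or equal to the second preset threshold, the candidate highlight content is marked as highlight content. Viewers can therefore quickly locate the highlight content of the multimedia, which makes human-computer interaction more intelligent.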
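FIG. 4 is a schematic structural diagram of Embodiment 1 of the multimedia processing apparatus of the present invention. As shown in FIG. 4, the multimedia processing apparatus provided in this embodiment includes: a monitoring module 41, configured to monitor changes in the behavior of a multimedia viewer; an identification module 42, configured to identify specific content of the multimedia according to the changes in the behavior of the multimedia viewer; and a processing module 43, configured to process the specific content of the multimedia.
Specifically, the apparatus provided in this embodiment can be used to carry out the technical solution of the method embodiment shown in FIG. 1; the implementation principle is similar and is not repeated here.
In the multimedia processing apparatus provided in this embodiment, the monitoring module monitors changes in the behavior of the multimedia viewer, the identification module determines the specific content of the multimedia, and the processing module processes that content. The apparatus can thus process specific content of the multimedia according to changes in the viewer's behavior, which offers multiple ways of viewing and makes human-computer interaction more intelligent.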
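The three-module structure can be pictured as a simple pipeline in which each module is a pluggable callable. This is only a structural sketch: the class name, the callable signatures, and the string labels are assumptions, and the patent does not specify how the modules are implemented.

```python
from typing import Callable, Optional, Tuple

Span = Tuple[float, float]  # (start, end) of the specific content, in seconds


class MultimediaProcessingApparatus:
    """Chains a monitoring module, an identification module, and a processing module."""

    def __init__(self,
                 monitor: Callable[[], Optional[str]],
                 identify: Callable[[str], Optional[Span]],
                 process: Callable[[Span], str]) -> None:
        self.monitor = monitor      # stands in for monitoring module 41
        self.identify = identify    # stands in for identification module 42
        self.process = process      # stands in for processing module 43

    def step(self) -> Optional[str]:
        """Run one monitor -> identify -> process pass; None if nothing to do."""
        change = self.monitor()
        if change is None:
            return None
        span = self.identify(change)
        if span is None:
            return None
        return self.process(span)


apparatus = MultimediaProcessingApparatus(
    monitor=lambda: "returned",
    identify=lambda change: (10.0, 95.0) if change == "returned" else None,
    process=lambda span: f"stored {span[0]}-{span[1]}",
)
result = apparatus.step()
```

Swapping the `process` callable for a storing or a marking function corresponds to the storage unit and marking unit variants of the processing module.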
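FIG. 5 is a schematic structural diagram of Embodiment 2 of the multimedia processing apparatus of the present invention. As shown in FIG. 5, on the basis of Embodiment 1, the processing module 43 includes: a storage unit 501, configured to store the specific content of the multimedia according to the change in behavior of the multimedia viewer.
Specifically, the storage unit 501 is configured to: if the monitoring module 41 detects at a first time that the multimedia viewer changes from the watching behavior to the non-watching behavior and detects at a second time that the viewer changes from the non-watching behavior to the watching behavior, and the interval between the first time and the second time is greater than the first preset threshold, store the content played from the first time to the second time.
When there is one multimedia viewer: the monitoring module 41 is specifically configured to determine that the multimedia viewer has changed from the watching behavior to the non-watching behavior if it detects that the angle between the viewer's facial image and the screen of the multimedia player changes from less than or equal to a first preset angle to greater than the first preset angle.
Alternatively, the monitoring module 41 is specifically configured to determine that the multimedia viewer has changed from the watching behavior to the non-watching behavior if it detects that the distance between the viewer's facial image and the screen of the multimedia player changes from less than or equal to a first preset distance to greater than the first preset distance.
Alternatively, the monitoring module 41 is specifically configured to determine that the multimedia viewer has changed from the watching behavior to the non-watching behavior if it detects that the viewer's line of sight changes from falling inside the display area of the screen to falling outside the display area of the screen.
The monitoring module 41 is specifically configured to determine that the multimedia viewer has changed from the non-watching behavior to the watching behavior if it detects that the angle between the viewer's facial image and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle.
Alternatively, the monitoring module 41 is specifically configured to determine that the multimedia viewer has changed from the non-watching behavior to the watching behavior if it detects that the distance between the viewer's facial image and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance.
Alternatively, the monitoring module 41 is specifically configured to determine that the multimedia viewer has changed from the non-watching behavior to the watching behavior if it detects that the viewer's line of sight changes from falling outside the display area of the screen to falling inside the display area of the screen.
The monitoring module 41 is further configured to determine that the multimedia viewer detected at the second time and the multimedia viewer detected at the first time are the same viewer.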
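When there are at least two multimedia viewers, the monitoring module 41 is specifically configured to determine that the multimedia viewers have changed from the watching behavior to the non-watching behavior if it detects that the number of viewers for whom the angle between the facial image and the screen of the multimedia player changes from less than or equal to the first preset angle to greater than the first preset angle is greater than a preset number of people.
Alternatively, the monitoring module 41 is specifically configured to determine that the multimedia viewers have changed from the watching behavior to the non-watching behavior if it detects that the number of viewers for whom the distance between the facial image and the screen of the multimedia player changes from less than or equal to the first preset distance to greater than the first preset distance is greater than the preset number of people.
Alternatively, the monitoring module 41 is specifically configured to determine that the multimedia viewers have changed from the watching behavior to the non-watching behavior if it detects that the number of viewers whose line of sight changes from falling inside the display area of the screen to falling outside the display area of the screen is greater than the preset number of people.
The monitoring module 41 is specifically configured to determine that the multimedia viewers have changed from the non-watching behavior to the watching behavior if it detects that the number of viewers for whom the angle between the facial image and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle is greater than the preset number of people.
Alternatively, the monitoring module 41 is specifically configured to determine that the multimedia viewers have changed from the non-watching behavior to the watching behavior if it detects that the number of viewers for whom the distance between the facial image and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance is greater than the preset number of people.
Alternatively, the monitoring module 41 is specifically configured to determine that the multimedia viewers have changed from the non-watching behavior to the watching behavior if it detects that the number of viewers whose line of sight changes from falling outside the display area of the screen to falling inside the display area of the screen is greater than the preset number of people.
Specifically, the apparatus provided in this embodiment can be used to carry out the technical solution of the method embodiment shown in FIG. 2; the implementation principle is similar and is not repeated here.
The multimedia processing apparatus provided in this embodiment can be applied to the scenario of watching live multimedia online. If the monitoring module detects at the first time that the multimedia viewer changes from the watching behavior to the non-watching behavior and detects at the second time that the viewer changes from the non-watching behavior to the watching behavior, and the interval between the first time and the second time is greater than the first preset threshold, the storage unit stores the content played from the first time to the second time. This solves the problem that a viewer who leaves for a while during an online live broadcast misses part of the multimedia, and makes human-computer interaction more intelligent.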
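FIG. 6 is a schematic structural diagram of Embodiment 3 of the multimedia processing apparatus of the present invention. As shown in FIG. 6, on the basis of Embodiment 1, the processing module 43 includes: a marking unit 601, configured to mark the specific content of the multimedia according to the change in behavior of the multimedia viewer.
Specifically, the marking unit 601 is configured to: if the monitoring module 41 detects at a third time that the multimedia viewer changes from a calm expression to a non-calm expression and detects at a fourth time that the viewer changes from a non-calm expression to a calm expression, mark the content played from the third time to the fourth time as candidate highlight content; and if the number of times the content played from the third time to the fourth time has been marked as candidate highlight content is greater than or equal to the second preset threshold, mark the candidate highlight content as highlight content.
The marking unit 601 is specifically configured to generate a playback heat curve of the multimedia according to the time interval that the highlight content occupies in the playback progress of the multimedia.
Alternatively, the marking unit 601 is specifically configured to generate thumbnails of the multimedia within the time interval that the highlight content occupies in the playback progress of the multimedia.
The apparatus provided in this embodiment can be used to carry out the technical solution of the method embodiment shown in FIG. 3; the implementation principle is similar and is not repeated here.
In the multimedia processing apparatus provided in this embodiment, if the monitoring module detects at the third time that the multimedia viewer changes from a calm expression to a non-calm expression and detects at the fourth time that the viewer changes from a non-calm expression to a calm expression, the marking unit marks the content played from the third time to the fourth time as candidate highlight content. If the number of times that content has been marked as candidate highlight content is greater than or equal to the second preset threshold, the marking unit marks the candidate highlight content as highlight content. Viewers can therefore quickly locate the highlight content of the multimedia, which makes human-computer interaction more intelligent.
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that the technical solutions described in those embodiments may still be modified, or some or all of their technical features may be replaced by equivalents, and that such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the scope of the technical solutions of the embodiments of the present invention.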

Claims (16)

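  1. A multimedia processing method, comprising:
    monitoring a change in behavior of a multimedia viewer;
    identifying specific content of multimedia according to the change in behavior of the multimedia viewer; and
    processing the specific content of the multimedia.
  2. The method according to claim 1, wherein the processing the specific content of the multimedia comprises:
    storing the specific content of the multimedia according to the change in behavior of the multimedia viewer; or
    marking the specific content of the multimedia according to the change in behavior of the multimedia viewer.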
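  3. The method according to claim 2, wherein the storing the specific content of the multimedia according to the change in behavior of the multimedia viewer comprises:
    if it is detected at a first time that the multimedia viewer changes from a watching behavior to a non-watching behavior, it is detected at a second time that the multimedia viewer changes from the non-watching behavior to the watching behavior, and an interval between the first time and the second time is greater than a first preset threshold, storing the multimedia content in the period from the first time to the second time.
  4. The method according to claim 3, wherein, when there is one multimedia viewer:
    the detecting that the multimedia viewer changes from the watching behavior to the non-watching behavior comprises:
    if it is detected that an angle between a facial image of the multimedia viewer and a screen of a multimedia player changes from less than or equal to a first preset angle to greater than the first preset angle, determining that the multimedia viewer changes from the watching behavior to the non-watching behavior;
    or,
    if it is detected that a distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to a first preset distance to greater than the first preset distance, determining that the multimedia viewer changes from the watching behavior to the non-watching behavior;
    or,
    if it is detected that a line of sight of the multimedia viewer changes from falling inside a display area of the screen to falling outside the display area of the screen, determining that the multimedia viewer changes from the watching behavior to the non-watching behavior; and
    the detecting that the multimedia viewer changes from the non-watching behavior to the watching behavior comprises:
    if it is detected that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle, determining that the multimedia viewer changes from the non-watching behavior to the watching behavior;
    or,
    if it is detected that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance, determining that the multimedia viewer changes from the non-watching behavior to the watching behavior;
    or,
    if it is detected that the line of sight of the multimedia viewer changes from falling outside the display area of the screen to falling inside the display area of the screen, determining that the multimedia viewer changes from the non-watching behavior to the watching behavior.
  5. The method according to claim 4, further comprising, before the detecting at the second time that the multimedia viewer changes from the non-watching behavior to the watching behavior:
    determining that the multimedia viewer detected at the second time and the multimedia viewer detected at the first time are the same viewer.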
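  6. The method according to claim 3, wherein, when there are at least two multimedia viewers,
    the detecting that the multimedia viewers change from the watching behavior to the non-watching behavior comprises:
    if it is detected that the number of the multimedia viewers for whom an angle between a facial image and a screen of a multimedia player changes from less than or equal to a first preset angle to greater than the first preset angle is greater than a preset number of people, determining that the multimedia viewers change from the watching behavior to the non-watching behavior;
    or,
    if it is detected that the number of the multimedia viewers for whom a distance between the facial image and the screen of the multimedia player changes from less than or equal to a first preset distance to greater than the first preset distance is greater than the preset number of people, determining that the multimedia viewers change from the watching behavior to the non-watching behavior;
    or,
    if it is detected that the number of the multimedia viewers whose line of sight changes from falling inside a display area of the screen to falling outside the display area of the screen is greater than the preset number of people, determining that the multimedia viewers change from the watching behavior to the non-watching behavior; and
    the detecting that the multimedia viewers change from the non-watching behavior to the watching behavior comprises:
    if it is detected that the number of the multimedia viewers for whom the angle between the facial image and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle is greater than the preset number of people, determining that the multimedia viewers change from the non-watching behavior to the watching behavior;
    or,
    if it is detected that the number of the multimedia viewers for whom the distance between the facial image and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance is greater than the preset number of people, determining that the multimedia viewers change from the non-watching behavior to the watching behavior;
    or,
    if it is detected that the number of the multimedia viewers whose line of sight changes from falling outside the display area of the screen to falling inside the display area of the screen is greater than the preset number of people, determining that the multimedia viewers change from the non-watching behavior to the watching behavior.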
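  7. The method according to claim 2, wherein the marking the specific content of the multimedia according to the change in behavior of the multimedia viewer comprises:
    if it is detected at a third time that the multimedia viewer changes from a calm expression to a non-calm expression and it is detected at a fourth time that the multimedia viewer changes from the non-calm expression to the calm expression, marking the multimedia content in the period from the third time to the fourth time as candidate highlight content; and
    if the number of times the multimedia content in the period from the third time to the fourth time has been marked as candidate highlight content is greater than or equal to a second preset threshold, marking the candidate highlight content as highlight content.
  8. The method according to claim 7, wherein the marking the candidate highlight content as highlight content comprises:
    generating a playback heat curve of the multimedia according to a time interval that the highlight content occupies in the playback progress of the multimedia; or
    generating, within the time interval that the highlight content occupies in the playback progress of the multimedia, thumbnails of the multimedia in the time interval.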
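  9. A multimedia processing apparatus, comprising:
    a monitoring module, configured to monitor a change in behavior of a multimedia viewer;
    an identification module, configured to identify specific content of multimedia according to the change in behavior of the multimedia viewer; and
    a processing module, configured to process the specific content of the multimedia.
  10. The apparatus according to claim 9, wherein the processing module comprises:
    a storage unit, configured to store the specific content of the multimedia according to the change in behavior of the multimedia viewer; or
    a marking unit, configured to mark the specific content of the multimedia according to the change in behavior of the multimedia viewer.
  11. The apparatus according to claim 10, wherein the storage unit is specifically configured to: if the monitoring module detects at a first time that the multimedia viewer changes from a watching behavior to a non-watching behavior and detects at a second time that the multimedia viewer changes from the non-watching behavior to the watching behavior, and an interval between the first time and the second time is greater than a first preset threshold, store the multimedia content in the period from the first time to the second time.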
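  12. The apparatus according to claim 11, wherein, when there is one multimedia viewer:
    the monitoring module is specifically configured to determine that the multimedia viewer changes from the watching behavior to the non-watching behavior if it detects that an angle between a facial image of the multimedia viewer and a screen of a multimedia player changes from less than or equal to a first preset angle to greater than the first preset angle;
    or,
    the monitoring module is specifically configured to determine that the multimedia viewer changes from the watching behavior to the non-watching behavior if it detects that a distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from less than or equal to a first preset distance to greater than the first preset distance;
    or,
    the monitoring module is specifically configured to determine that the multimedia viewer changes from the watching behavior to the non-watching behavior if it detects that a line of sight of the multimedia viewer changes from falling inside a display area of the screen to falling outside the display area of the screen;
    the monitoring module is specifically configured to determine that the multimedia viewer changes from the non-watching behavior to the watching behavior if it detects that the angle between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle;
    or,
    the monitoring module is specifically configured to determine that the multimedia viewer changes from the non-watching behavior to the watching behavior if it detects that the distance between the facial image of the multimedia viewer and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance;
    or,
    the monitoring module is specifically configured to determine that the multimedia viewer changes from the non-watching behavior to the watching behavior if it detects that the line of sight of the multimedia viewer changes from falling outside the display area of the screen to falling inside the display area of the screen.
  13. The apparatus according to claim 12, wherein the monitoring module is further configured to determine that the multimedia viewer detected at the second time and the multimedia viewer detected at the first time are the same viewer.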
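  14. The apparatus according to claim 11, wherein, when there are at least two multimedia viewers,
    the monitoring module is specifically configured to determine that the multimedia viewers change from the watching behavior to the non-watching behavior if it detects that the number of the multimedia viewers for whom an angle between a facial image and a screen of a multimedia player changes from less than or equal to a first preset angle to greater than the first preset angle is greater than a preset number of people;
    or,
    the monitoring module is specifically configured to determine that the multimedia viewers change from the watching behavior to the non-watching behavior if it detects that the number of the multimedia viewers for whom a distance between the facial image and the screen of the multimedia player changes from less than or equal to a first preset distance to greater than the first preset distance is greater than the preset number of people;
    or,
    the monitoring module is specifically configured to determine that the multimedia viewers change from the watching behavior to the non-watching behavior if it detects that the number of the multimedia viewers whose line of sight changes from falling inside a display area of the screen to falling outside the display area of the screen is greater than the preset number of people;
    the monitoring module is specifically configured to determine that the multimedia viewers change from the non-watching behavior to the watching behavior if it detects that the number of the multimedia viewers for whom the angle between the facial image and the screen of the multimedia player changes from greater than the first preset angle to less than or equal to the first preset angle is greater than the preset number of people;
    or,
    the monitoring module is specifically configured to determine that the multimedia viewers change from the non-watching behavior to the watching behavior if it detects that the number of the multimedia viewers for whom the distance between the facial image and the screen of the multimedia player changes from greater than the first preset distance to less than or equal to the first preset distance is greater than the preset number of people;
    or,
    the monitoring module is specifically configured to determine that the multimedia viewers change from the non-watching behavior to the watching behavior if it detects that the number of the multimedia viewers whose line of sight changes from falling outside the display area of the screen to falling inside the display area of the screen is greater than the preset number of people.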
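  15. The apparatus according to claim 10, wherein the marking unit is specifically configured to: if the monitoring module detects at a third time that the multimedia viewer changes from a calm expression to a non-calm expression and detects at a fourth time that the multimedia viewer changes from the non-calm expression to the calm expression, mark the multimedia content in the period from the third time to the fourth time as candidate highlight content; and if the number of times the multimedia content in the period from the third time to the fourth time has been marked as candidate highlight content is greater than or equal to a second preset threshold, mark the candidate highlight content as highlight content.
  16. The apparatus according to claim 15, wherein the marking unit is specifically configured to generate a playback heat curve of the multimedia according to a time interval that the highlight content occupies in the playback progress of the multimedia; or
    the marking unit is specifically configured to generate, within the time interval that the highlight content occupies in the playback progress of the multimedia, thumbnails of the multimedia in the time interval.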
PCT/CN2015/080518 2015-06-01 2015-06-01 Method and device for processing multimedia WO2016192013A1 (zh)

Priority Applications (4)

Application Number Priority Date Filing Date Title
EP15893672.4A EP3306463A4 (en) 2015-06-01 2015-06-01 Method and device for processing multimedia
PCT/CN2015/080518 WO2016192013A1 (zh) 2015-06-01 2015-06-01 Method and device for processing multimedia
CN201580080439.2A CN107615236A (zh) 2015-06-01 2015-06-01 Method and device for processing multimedia
US15/578,566 US20180160174A1 (en) 2015-06-01 2015-06-01 Method and device for processing multimedia

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2015/080518 WO2016192013A1 (zh) 2015-06-01 2015-06-01 Method and device for processing multimedia

Publications (1)

Publication Number Publication Date
WO2016192013A1 true WO2016192013A1 (zh) 2016-12-08

Family

ID=57439708

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/080518 WO2016192013A1 (zh) 2015-06-01 2015-06-01 Method and device for processing multimedia

Country Status (4)

Country Link
US (1) US20180160174A1 (zh)
EP (1) EP3306463A4 (zh)
CN (1) CN107615236A (zh)
WO (1) WO2016192013A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114999534A (zh) * 2022-06-10 2022-09-02 中国第一汽车股份有限公司 Playback control method, apparatus, device and storage medium for in-vehicle music

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118072765B (zh) * 2024-04-24 2024-09-06 合众新能源汽车股份有限公司 Method and apparatus for determining human-computer interaction

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
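CN101604491A (zh) * 2009-07-06 2009-12-16 北京派瑞根科技开发有限公司 Electronic picture that changes with environment and expression
CN102473264A (zh) * 2009-06-30 2012-05-23 Eastman Kodak Company Method and apparatus for image display control according to viewer factors and responses
CN102611909A (zh) * 2011-02-08 2012-07-25 Microsoft Corporation Three-dimensional display with motion parallax
CN103118297A (zh) * 2013-01-22 2013-05-22 广东星海数字家庭产业技术研究院有限公司 Anti-fatigue digital television system based on motion recognition
CN103649904A (zh) * 2011-05-10 2014-03-19 NDS Limited Adaptive content presentation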

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6585521B1 (en) * 2001-12-21 2003-07-01 Hewlett-Packard Development Company, L.P. Video indexing based on viewers' behavior and emotion feedback
US7233684B2 (en) * 2002-11-25 2007-06-19 Eastman Kodak Company Imaging method and system using affective information
US20070154163A1 (en) * 2005-12-29 2007-07-05 United Video Properties, Inc. Systems and methods for creating aggregations of episodes of series programming in order
GB2459707B (en) * 2008-05-01 2010-08-11 Sony Computer Entertainment Inc Media recorder, audio visual entertainment system and method
JP2010016482A (ja) * 2008-07-01 2010-01-21 Sony Corp Information processing apparatus and information processing method
JP2013017105A (ja) * 2011-07-06 2013-01-24 Hitachi Consumer Electronics Co Ltd Content display device, content output device, and content display method
JP2014016965A (ja) * 2012-07-11 2014-01-30 Toshiba Corp Image processing apparatus, image processing method and program, and imaging apparatus
US20140153900A1 (en) * 2012-12-05 2014-06-05 Samsung Electronics Co., Ltd. Video processing apparatus and method
CN103826160A (zh) * 2014-01-09 2014-05-28 广州三星通信技术研究有限公司 Method and device for obtaining video information, and method and device for playing video
CN103916711A (zh) * 2014-03-31 2014-07-09 小米科技有限责任公司 Method and apparatus for playing a video signal

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3306463A4 *

Also Published As

Publication number Publication date
EP3306463A1 (en) 2018-04-11
US20180160174A1 (en) 2018-06-07
CN107615236A (zh) 2018-01-19
EP3306463A4 (en) 2018-06-06

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application. Ref document number: 15893672; Country of ref document: EP; Kind code of ref document: A1
WWE Wipo information: entry into national phase. Ref document number: 15578566; Country of ref document: US
NENP Non-entry into the national phase. Ref country code: DE
WWE Wipo information: entry into national phase. Ref document number: 2015893672; Country of ref document: EP