WO2022077977A1 - Video conversion method and video conversion apparatus - Google Patents

Video conversion method and video conversion apparatus Download PDF

Info

Publication number
WO2022077977A1
WO2022077977A1 PCT/CN2021/106338 CN2021106338W WO2022077977A1 WO 2022077977 A1 WO2022077977 A1 WO 2022077977A1 CN 2021106338 W CN2021106338 W CN 2021106338W WO 2022077977 A1 WO2022077977 A1 WO 2022077977A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
information
cut
frame
user interface
Prior art date
Application number
PCT/CN2021/106338
Other languages
French (fr)
Chinese (zh)
Inventor
宋玉岩
徐宁
Original Assignee
北京达佳互联信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京达佳互联信息技术有限公司 filed Critical 北京达佳互联信息技术有限公司
Publication of WO2022077977A1 publication Critical patent/WO2022077977A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234363Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the spatial resolution, e.g. for clients with a lower screen resolution
    • H04N21/234372Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by altering the spatial resolution, e.g. for clients with a lower screen resolution for performing aspect ratio conversion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/426Internal components of the client ; Characteristics thereof
    • H04N21/42653Internal components of the client ; Characteristics thereof for processing graphics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44008Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440263Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering the spatial resolution, e.g. for displaying on a connected PDA
    • H04N21/440272Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering the spatial resolution, e.g. for displaying on a connected PDA for performing aspect ratio conversion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/485End-user interface for client configuration
    • H04N21/4854End-user interface for client configuration for modifying image parameters, e.g. image brightness, contrast

Definitions

  • the present disclosure relates to the technical field of video processing, and in particular, to a video conversion method, device, system, and storage medium.
  • Video or similar media recorded in a wide aspect ratio may be designed to be viewed on a desktop or in landscape orientation. Therefore, when a user uses a mobile terminal to watch a horizontal screen video, in order to obtain a good visual experience, the terminal screen is generally converted to a horizontal screen position to play the video.
  • the present disclosure provides a video conversion method, device, system and storage medium.
  • a video conversion method may include: acquiring a first video in a first orientation and converting the first video into a second video in a second orientation cutting information; generating and displaying a user interface for adjusting the cutting information based on the cutting information; receiving user input for adjusting the cutting information via the user interface; and generating a second video according to the adjusted cutting information.
  • a video conversion apparatus may include: an interface module configured to receive a first video in a first orientation; an analysis module configured to obtain a video for converting The first video is converted into cut information of the second video in the second orientation, and based on the cut information, a user interface for adjusting the cut information is generated and displayed; the display module is configured to display the user interface, wherein, using a user input for adjusting the cut information is received via the user interface; and an editing module configured to generate a second video according to the adjusted cut information.
  • a video conversion device may include: a display; a transceiver for receiving a first video in a first orientation; and a processor for: acquiring for converting the first video into the cut information of the second video in the second orientation, generating and displaying a user interface for adjusting the cut information based on the cut information, controlling the display to display the user interface, and controlling the transceiver via the user interface User input for adjusting the cut information is received, and a second video is generated according to the adjusted cut information.
  • an electronic device may include: a processor; a memory storing instructions for execution by the processor, wherein execution of the instructions causes the processor Perform the video conversion method as described above.
  • a non-volatile computer-readable storage medium having stored thereon instructions for execution by a processor, wherein execution of the instructions causes the processor to execute the above-described Video conversion method.
  • FIG. 1 is a diagram of an application environment for converting video from one orientation to another, provided according to an embodiment of the present disclosure
  • FIG. 2 is a flowchart of a video conversion method according to an embodiment of the present disclosure
  • FIG. 3 is a diagram of a user interface for adjusting a clipping window according to an embodiment of the present disclosure
  • FIG. 4 is a schematic flowchart of obtaining clipping window information of a single frame according to an embodiment of the present disclosure
  • FIG. 5 is a schematic diagram of a marked area according to an embodiment of the present disclosure.
  • FIG. 6 is a schematic diagram of a user interface for adjusting information weights according to an embodiment of the present disclosure
  • FIG. 7 is a block diagram of a video conversion apparatus according to an embodiment of the present disclosure.
  • FIG. 8 is a flowchart of a video conversion method according to another embodiment of the present disclosure.
  • FIG. 9 is a block diagram of a video conversion apparatus according to an embodiment of the present disclosure.
  • FIG. 10 is a block diagram of an electronic device according to an embodiment of the present disclosure.
  • the video cutting of the related art is fully automatic, and the automatically cut video may not achieve the user's expected cutting effect, but the user cannot make further cutting adjustments to the final cutting result.
  • automatic video cutting the user cannot adjust the importance of each information flow in the video scene. As a result, the cut video scene may not meet user expectations.
  • the present disclosure can provide users with the functions of parameter adjustment before video cutting processing and adjustment of the cutting area after processing, so that users can obtain video cutting results that they are satisfied with.
  • FIG. 1 is a diagram of an application environment for converting video from one orientation to another, provided according to an embodiment of the present disclosure.
  • orientation is landscape or portrait relative to the device/device.
  • the application environment 100 includes a terminal 110 and a media server system 120 .
  • the terminal 110 is a terminal where the user is located, and the terminal 110 may be at least one of a smart phone, a tablet computer, a portable computer, a desktop computer, and the like. Although this embodiment only shows one terminal 110 for description, those skilled in the art may know that the number of the above-mentioned terminals may be two or more. This embodiment of the present disclosure does not impose any limitation on the number of terminals and device types.
  • the terminal 110 may be installed with a target application for providing the video to be cut and converted to the media server system 120 , and the target application may be a multimedia application, a social application or an information application or the like.
  • the terminal 110 may be a terminal used by a user, and the user's account is logged in an application running in the terminal 110 .
  • the terminal 110 can be connected to the media server system 120 through a wireless network or a wired network, so that data interaction can be performed between the terminal 110 and the media server system 120 .
  • a network may include a local area network (LAN), a wide area network (WAN), a telephone network, a wireless link, an intranet, the Internet, combinations thereof, and the like.
  • the media server system 120 may be a server system for cut-converting video.
  • media server system 120 may include one or more processing processors and memory.
  • the memory may include one or more programs for performing the above video conversion method.
  • the media server system 120 may also include a power supply assembly configured to perform power management of the media server system 120, a wired or wireless network interface configured to connect the media server system 120 to a network, and an input output (I/O) interface .
  • the media server system 120 may operate based on an operating system stored in memory, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM, and the like.
  • Windows ServerTM Windows ServerTM
  • Mac OS XTM UnixTM
  • LinuxTM FreeBSDTM
  • the devices included in the media server system 120 described above are only exemplary, and the present disclosure is not limited thereto.
  • the media server system 120 can cut and convert the input video, and then deliver the converted video to the terminal 110 or publish it to the media platform via a wireless network or a wired network.
  • the media server system 120 may acquire cut information for converting the first video into the second video of the second orientation, generate and display a user interface for adjusting the cut information based on the cut information, and receive via the user interface User input for adjusting cut information, and then adjust the previously cut video again according to the adjusted cut information.
  • the terminal 110 may be installed with an application program implementing the video conversion method of the present disclosure, and the terminal 110 may realize the cut conversion of the video.
  • the memory of the terminal 110 may store one or more programs for performing the above video conversion method.
  • the processor of the terminal 110 may implement the cut and convert of the video by running related programs/algorithms.
  • the terminal 110 may then upload the cut and converted video to the media server system 120 via a wireless network or a wired network, or may store the converted video in the memory of the terminal 110 .
  • the terminal 110 may transmit the horizontal video obtained locally or externally to the media server system 120 via a wireless or wired network, and the media server system 120 may cut and convert the horizontal video into a vertical video according to the video conversion method of the present disclosure, and then The converted vertical video is delivered to the terminal 110 via a wireless or wired network.
  • the terminal 110 may convert a locally or externally acquired horizontal video into a vertical video screen according to the video conversion method of the present disclosure, and then upload the vertical video to the media server system 120 via a wireless or wired network.
  • the media server system 120 may distribute the vertical video to other electronic devices.
  • the method of the present disclosure can similarly be used to cut a portrait video into a landscape video.
  • FIG. 2 is a flowchart of a video conversion method according to an embodiment of the present disclosure.
  • the video conversion method of the embodiment of the present disclosure may be executed by the media server system 120 or an electronic device having a video cut conversion function.
  • a first video in a first orientation and cut information for converting the first video into a second video in a second orientation are acquired.
  • the cut information may include a cut window for cutting the first video into the second video.
  • the first video of the first orientation may refer to a landscape video.
  • a video smart crop tool such as Google Autoflip, can be used to directly obtain crop information for converting a video in one orientation to a video in another. That is to say, the clipping information for clipping the first video can be obtained from the relevant video intelligent clipping tool.
  • the cut information can be obtained by: analyzing each frame of the first video to determine at least one kind of information of each frame, generating an annotation map of the corresponding frame based on the at least one kind of information,
  • the focus of the corresponding frame is obtained by calculating the moment of the annotation map, the focus is taken as the center of the clipping window for clipping the frame, and the clipping window is generated according to the focus and the specified aspect ratio.
  • the cut information may be obtained by: analyzing each frame of the first video to determine at least one kind of information of each frame, and generating and displaying the cutout information for each frame based on the analysis result
  • a user interface for adjusting the weight of at least one information in the case of video orientation conversion, receiving user input for adjusting the weight of the at least one information through the user interface, and generating based on the weighted at least one information
  • the focus of the corresponding frame is obtained by calculating the moment of the annotation map, the focus is taken as the center of the clipping window used to clip the frame, and the clipping window is generated according to the focus and the specified aspect ratio.
  • a user interface before acquiring the cut information, can be set so that the user can adjust the proportion of each information stream in the converted video result according to their own needs, so that the important information defined by the user is retained in the cut process. .
  • the distribution of key information in each frame is more prominent, and by fitting the trajectory of the focus of each frame, better clipping information can be provided, and the fit between frames can be increased. , to improve the user experience.
  • the acquired cut information may be result information after the video is cut.
  • the clipping information may be clipping window information calculated during information analysis for the video frame. That is to say, the acquired clipping information may be information after video clipping processing, or may be pre-analysis information before video clipping processing.
  • a user interface for adjusting the cut information is generated and displayed based on the cut information.
  • a cutting window for cutting the frame into a corresponding frame of the second video may be displayed on the frame. For example, refer to FIG. 3 .
  • a user input for adjusting the clipping information is received via the user interface.
  • the user input may be one of a touch input, a key input, a hovering input, and the like. Different types of user input can be implemented depending on the capabilities of the display device.
  • a second video is generated according to the adjusted cut information.
  • the cutting window of the first video may be adaptively adjusted according to the adjusted cutting information, and then the first video may be cut using the adaptively adjusted cutting window to obtain the second video. Through adaptive adjustment, better clipping information can be provided and the fit between frames can be increased.
  • At least one key frame of the first video may be determined, and then a user interface for adjusting cut information of each key frame in the at least one key frame is generated and displayed. After adjusting the cutting window of the key frame of the first video, the cutting window of the relevant frame of the first video can be automatically adjusted adaptively. After the adjustment of the whole video is completed, the user can export the cut video .
  • users are allowed to have a more comprehensive grasp of the entire cutting process flow before and after the video cutting process, and finally obtain a cutting result that they are satisfied with.
  • the video conversion method according to the present disclosure can better handle video scene switching, user-specified area change, or lost scenes.
  • FIG. 3 is a diagram of a user interface for adjusting a clipping window according to an embodiment of the present disclosure.
  • the user interface of FIG. 3 may be displayed on a partial area of a display such as a terminal or a server, or on the display in full screen.
  • the cutting information of each frame can be provided to the user, and the cutting information can be reflected in the user interface for adjusting the cutting window later.
  • the user can adjust the clipping window for a certain frame through the user interface 301 .
  • the user can move the clipping window up, down, left and right to adjust to the area of interest.
  • the user interface 301 is displayed on the touch screen, the user can touch the cutout window to move accordingly. Or you can drag the clipping window to the area of interest by mouse, keyboard, etc.
  • the above-described examples are merely exemplary, and the present disclosure is not limited thereto.
  • the user can selectively adjust the clipping window for some frames. For example, in the user interface 301, the user can select a frame of interest to the user by dragging the slide bar of the video, and then adjust the clipping window of the frame.
  • a “next frame” button (not shown) may be set on the user interface 301, and after the user adjusts the clipping window of the current frame, the adjustment interface of the next frame can be switched by clicking the “next frame” button .
  • buttons with different functions can be set on the user interface according to actual requirements.
  • the cutout window of each key frame may be displayed on the user interface 301, so that the user can adjust the cutout window of the keyframe of the video. After adjusting the cutout window of the keyframe, the user can Click the "Export” button on the user interface to export the adjusted video.
  • the user interface according to the embodiment of the present disclosure is simple, easy for the user to operate, and improves the efficiency of the user to adjust the cut information.
  • FIG. 4 is a schematic flowchart of acquiring clipping window information of a single frame according to an embodiment of the present disclosure.
  • the method for acquiring the cut window information of a single frame according to the embodiment of the present disclosure may be executed by the media server system 120 or an electronic device having a video cut conversion function.
  • the image 401 is analyzed to determine M kinds of information of the image 401 , where M is a positive integer.
  • the analysis of each type of information may be implemented by using a corresponding analysis method, that is, the image 401 may be analyzed using M types of analysis methods to determine M types of information.
  • the face information of the image 401 can be analyzed using a face analysis method.
  • M corresponding marked regions can be generated, that is, for each type of information analyzed, an information distribution map corresponding to the image 401 is generated. For example, when analyzing face information, a pixel-based labeling area of the face information of the image 401 is generated, and then the pixel-based labeling area is converted into an information distribution labeling area.
  • the overall marked area of the image 401 is calculated according to the weighted M marked areas.
  • the overall labeled regions of the image 401 can be obtained by summing the weighted M labeled regions.
  • the labeling map of the image 401 may be generated based on the overall labeling area. Since the weighting of each annotated area was performed before, the annotation map can show the importance of each annotated area.
  • the focus of the image 401 is obtained by computing the moments of the annotation map.
  • the focus of image 401 can be obtained by calculating the geometric center point of the annotation map. Generates a clipping window using the position of the focus and the specified aspect ratio.
  • the clipping window information for converting a video in one orientation to a video in another orientation can also be obtained from a video intelligent cropping tool (such as Google Autoflip).
  • a video intelligent cropping tool such as Google Autoflip
  • the clipping information such as the center position, size, aspect ratio, etc., of the clipping window of each frame can be obtained in a similar manner as described above.
  • FIG. 5 is a schematic diagram of a marked area according to an embodiment of the present disclosure.
  • (a) of FIG. 5 is a certain frame of the first video
  • (b) of FIG. 5 shows the marked area of important information (such as motion information) in the frame
  • the white area in (b) is Label area.
  • important information such as motion information
  • FIG. 6 is a schematic diagram of a user interface for adjusting information weights according to an embodiment of the present disclosure. After analyzing various information of a frame, a user interface associated with the various information may be displayed accordingly.
  • a slider bar may be configured for each type of information (such as the first information, the second information, etc.), and the slider bar may be used to adjust the weight of the corresponding information.
  • the range of the slider can be set to [0, 1].
  • click the "OK” button to complete the setting of the weight of each information flow in a frame.
  • the weight information input by the user may be transmitted to the processor of the electronic device for subsequent cut conversion.
  • the corresponding cutout window may be presented on the corresponding frame, so as to show the user the cutout position of the cutout window on the frame.
  • the user interface of FIG. 6 is merely exemplary, and elements in the user interface may be presented in other forms.
  • a text input box may be configured for each type of information, and the user may assign weights to the corresponding information through the text input box.
  • the above-described examples are merely exemplary, and the present disclosure is not limited thereto.
  • the user interface can be displayed on a partial area of the display of the electronic device (such as the terminal 110 or the media server system 120 ), or displayed on the display in a full screen, and those skilled in the art can make display settings according to actual needs.
  • the user before the video cutting process, the user is allowed to adjust the information flow weight of each frame, so that the important information defined by the user is preserved in the cutting process.
  • the video conversion device 700 may be implemented as a terminal 110 or as a media server system 120, or any other device.
  • a video conversion apparatus 700 may include a transceiver 701 , a display 702 and a processor 703 .
  • the transceiver 701 can receive the first video in the first orientation.
  • the processor 703 may use a video smart cropping tool (such as Google Autoflip) to obtain the cropping window information for converting a video in one orientation to a video in another orientation.
  • a video smart cropping tool such as Google Autoflip
  • the processor may use the algorithm for obtaining cut information of embodiments of the present disclosure (eg, the method shown in FIG. 4 ) to obtain the data for converting the first video into the second video in the second orientation Cut information.
  • the processor 703 may generate and display a user interface for adjusting the clipping information based on the clipping information, and control the display 702 to display the user interface. For example, the user interface shown in FIG. 3 may be displayed.
  • the user interface may include graphics, text, icons, video, and any combination thereof associated with the analysis information.
  • the display 702 is a touch display screen
  • the display 702 also has the ability to acquire touch signals on or over the surface of the display 702 .
  • the touch signal may be input to the processor 701 as a control signal for processing.
  • the display 702 may also be used to provide virtual buttons and/or virtual keyboards, also referred to as soft buttons and/or soft keyboards.
  • the number of displays 702 may be one, which is arranged on the front panel of the video conversion device 700; in other embodiments, the number of displays 702 may be at least two, which are respectively arranged on different surfaces of the video conversion device 700 or folded.
  • display 702 may be a flexible display screen disposed on a curved or folded surface of video conversion device 700 .
  • the display 702 can be prepared by using materials such as LCD (Liquid Crystal Display, liquid crystal display), OLED (Organic Light-Emitting Diode, organic light emitting diode).
  • LCD Liquid Crystal Display, liquid crystal display
  • OLED Organic Light-Emitting Diode, organic light emitting diode
  • the processor 703 can control the transceiver 701 to receive a user input for adjusting the clipping information via the user interface. After the clipping window is adjusted, the processor 703 can automatically adjust the clipping window for the relevant frame to ensure that the frames are consistent with each other. compatibility between.
  • the processor 703 may adaptively adjust the cutting window of the first video according to the adjusted cutting information, and then use the adaptively adjusted cutting window to cut the first video to obtain the second video . After the final second video is obtained, the second video can be output to other devices via the transceiver 701 .
  • the user can make further adjustments to the final cut result.
  • the function of adjusting the trimming area after the video trimming process can be provided to the user, but also the parameter adjustment before the video trimming process can be provided to the user, so that the user can obtain the trimming result they are satisfied with.
  • the processor 703 may analyze each frame of the first video to determine at least one kind of information for each frame, and based on the analysis result, generate a method for adjusting the at least one kind of information for each frame in the case of video orientation conversion.
  • Weight UI For example, the user interface shown in FIG. 6 may be displayed.
  • the processor 703 may control the transceiver 701 to receive, through the user interface of FIG. 6, a user input for adjusting the weight of at least one kind of information of each frame, and to generate a clipping for each frame based on the at least one kind of information whose weight is adjusted. Cut window information. After generating the clipping window information of each frame, the processor 703 may generate a user interface according to the clipping window information to visually display to the user how each frame is clipped.
  • the processor 703 may generate, based on the analysis of the at least one type of information, each marked area of the corresponding frame corresponding to the at least one type of information, where the marked area is an area representing the distribution of information, wherein the corresponding frame Each annotated region of is given a weight entered by the user.
  • the processor 703 may calculate the overall labeling area of the corresponding frame according to each labeling area whose weights are adjusted, and calculate the focus of the corresponding frame based on the overall labeling area, Generates a clipping window for the corresponding frame based on the focus and the specified aspect ratio.
  • the size of the cutting window may be preset, or the size of the cutting window may be adaptively adjusted.
  • the processor 703 may obtain the fitted focus of the corresponding frame by fitting the focus of each frame, and then generate the corresponding frame based on the fitted focus and the specified aspect ratio. frame clipping window.
  • the processor may generate an annotation map for the corresponding frame based on the overall annotation area, and obtain the focus of the corresponding frame by calculating a moment of the annotation map.
  • the video conversion apparatus 700 may include a memory that may store the original input video and the converted video. Additionally, the memory may include one or more computer-readable storage media, which may be non-transitory. Memory may also include high-speed random access memory, as well as non-volatile memory, such as one or more magnetic disk storage devices, flash storage devices. In some embodiments, a non-transitory computer-readable storage medium in memory is used to store at least one instruction for execution by processor 703 .
  • the video conversion device 700 further includes: a peripheral device interface and at least one peripheral device.
  • the processor 703 and the peripheral device interface can be connected through a bus or a signal line.
  • Each peripheral device can be connected to the peripheral device interface through bus, signal line or circuit board.
  • the peripheral devices may include at least one of radio frequency circuits, touch screen displays, cameras, audio circuits, positioning components, power supplies, and the like.
  • the video conversion device 700 may also include one or more sensors.
  • the one or more sensors include, but are not limited to, acceleration sensors, gyroscope sensors, pressure sensors, fingerprint sensors, optical sensors, and proximity sensors.
  • the processor 703 may receive an indication of an orientation change from one or more sensors, thereby recommending a video of the corresponding orientation to the user.
  • FIG. 8 is a flowchart of a video conversion method according to another embodiment of the present disclosure.
  • a first video of a first orientation is acquired.
  • the first video in the first orientation may be a landscape video.
  • step S802 at least one kind of information of each frame of the first video is analyzed.
  • At least one type of information of each frame may include key area information, for example, may include at least one of face information, human body information, main object information, motion scene information, and video boundary information.
  • the face information may include face recognition information and face tracking information, etc.
  • the main object information may include object identification information and object tracking information.
  • the above examples are merely exemplary, and the present disclosure may analyze any amount and kind of information in a frame.
  • Analysis algorithms for main information, key information or information of interest to the user may be stored in advance to analyze the information contained in the frame.
  • a face recognition algorithm can be used to analyze the face information in a frame
  • an optical flow algorithm can be used to analyze the motion scene information in a frame.
  • the above-described examples are merely exemplary, and the present disclosure is not limited thereto.
  • each annotated area corresponding to the at least one type of information of the corresponding frame is generated based on the analysis of the at least one type of information.
  • the labeled area may refer to an area representing the distribution of information.
  • the frame may include a variety of information, each time one kind of information in the frame is analyzed, an information distribution map corresponding to the frame can be generated.
  • an information distribution map corresponding to the frame can be generated.
  • multiple annotations can be generated area.
  • a pixel-based annotation area (mask) corresponding to the face information of this frame can be generated, and then the pixel-based annotation area can be converted into an annotation of information distribution area.
  • a user interface for adjusting the weight occupied by each marked region in the video cutting is generated and displayed.
  • the user interface may include a slider bar or text input box for adjusting the weight for each of the at least one information.
  • a user interface for that frame can be generated, and the user interface can include a user interface for adjusting the weight of the information contained in the frame.
  • the user interface may include slider bars or text entry boxes for adjusting each type of information.
  • the above-described examples are merely exemplary, and the present disclosure is not limited thereto.
  • a user input for adjusting the weight of each marked region is received through the user interface.
  • Weights input by the user may be assigned to each annotated region of the corresponding frame.
  • Users can set the weight of the information they want to keep through the user interface according to their needs. For example, if the user wants to focus on protecting the face part from being cut off, the user can increase the weighting ratio of the marked area of the face information, and reduce the weighted ratio of the marked area of other information.
  • the user can interactively adjust the weighting parameters. By weighting each labeled area, the information/area that the user pays more attention to can be highlighted.
  • each kind of information corresponds to one kind of information labeling area, and weighting each kind of information can be interpreted as the weighting of the information labeling area.
  • step S806 for each frame of the first video, the overall labeling area of the corresponding frame is calculated according to each labeling area whose weight is adjusted. For example, the weighted regions can be summed to obtain the overall annotated region of a frame.
  • an annotation map for the corresponding frame is generated based on the overall annotation area.
  • the annotation map may be an information distribution image for each annotation area.
  • step S808 the focus of the corresponding frame is obtained by calculating the moment of the annotation map.
  • the focus can reflect the distribution of important information in a frame.
  • the geometric center point of the annotation map can be calculated as the focal point of a frame.
  • a clipping window of the corresponding frame is generated based on the focus and the specified aspect ratio. For example, after obtaining the focus of a frame, the focus is set as the center of the clipping window, and the layout and size of the clipping window are set according to the specified aspect ratio.
  • the aspect ratio of the second video may be used as the specified aspect ratio, but the present disclosure is not limited thereto.
  • a fitted focus of the corresponding frame may be obtained by fitting the focus of each frame, and a crop of the corresponding frame may be generated based on the fitted focus and a specified aspect ratio window.
  • step S810 cut information for converting the first video into the second video in the second orientation is obtained. For example, after the clipping window information of each frame is obtained according to steps S802 to S809, the clipping window information of all frames is obtained, which is used for further adjustment of the clipping window subsequently.
  • a user interface for adjusting the cut information is generated and displayed based on the cut information.
  • a cutting window for cutting the frame into a corresponding frame of the second video may be displayed on the frame. For example, refer to FIG. 3 .
  • step S812 a user input for adjusting the cut information is received via the user interface.
  • the cutting window of the first video may be adaptively adjusted according to the adjusted cutting information. For example, after the user further adjusts the video frame, a fitting process may be performed on the further adjusted clipping window, so that the final presented video is smoother.
  • step S814 the first video is cut using the adaptively adjusted cut window to obtain a further adjusted second video.
  • the embodiments of the present disclosure can provide the user with the functions of parameter adjustment before video cutting processing and adjustment of the cutting area after processing, so that the user can have a more comprehensive grasp of the entire cutting processing flow before and after the video cutting processing, and In the end, he was satisfied with the cutting result.
  • FIG. 9 is a block diagram of a video conversion apparatus according to an embodiment of the present disclosure.
  • the video conversion apparatus 900 may include an interface module 901 , an analysis module 902 , a display module 903 and an editing module 904 .
  • Each module in the video conversion apparatus 900 may be implemented by one or more modules, and the name of the corresponding module may vary according to the type of the module. In various embodiments, some modules in the video conversion apparatus 900 may be omitted, or additional modules may also be included.
  • modules/elements according to various embodiments of the present disclosure may be combined to form a single entity, and thus may equivalently perform the functions of the corresponding modules/elements prior to combination.
  • the interface module 901 may be configured to receive the first video in the first orientation and user input.
  • the analysis module 902 may be configured to analyze each frame of the first video to determine at least one kind of information for each frame, and to generate, for each frame, a method for adjusting the at least one kind of information in the video orientation transition based on the analysis result.
  • User interface for case weights.
  • At least one type of information may include key area information.
  • the key area information may include at least one of face information, human body information, significant object information, motion scene information, and video boundary information.
  • the display module 903 may be configured to display a user interface for adjusting the weight of at least one kind of information.
  • the user interface may include a user interface for adjusting the weight for each of the at least one information.
  • the editing module 904 may be configured to generate cut window information to cut the first video based on at least one type of information whose weights are adjusted, and to generate a second video in a second orientation based on the cut first video.
  • the analysis module 902 may generate, based on the analysis of the at least one type of information, each marked area of the corresponding frame corresponding to the at least one type of information, where the marked area is an area representing the distribution of information, wherein the corresponding frame Each annotated region of is given a weight entered by the user.
  • the editing module 904 may calculate the overall labeling area of the corresponding frame according to each labeling area whose weight is adjusted; calculate the focus of the corresponding frame based on the overall labeling area, Generates a clipping window for the corresponding frame based on the focus and the specified aspect ratio.
  • the editing module 904 may obtain the fitted focus of the corresponding frame by fitting the focus of each frame, and generate the corresponding frame based on the fitted focus and the specified aspect ratio clipping window.
  • the editing module 904 may generate an annotation map for the corresponding frame based on the overall annotation area, and obtain the focus of the corresponding frame by calculating the moment of the annotation map.
  • the video conversion apparatus 900 can provide the user with the function of adjusting the cut area after the video cutting process, so that the user can obtain a cutting result that they are satisfied with.
  • the analysis module 902 may obtain cut information for converting the first video into the second video in the second orientation, and generate and display a user interface for adjusting the cut information based on the cut information. User input for adjusting clipping information may be received via the user interface.
  • the cut information may include a cut window for cutting the first video into the second video.
  • the analysis module 902 may display a cutout window on the frame for cutting the frame into a corresponding frame of the second video.
  • the video after adjusting the weight of each piece of information in each frame, the video can be cut according to the adjusted pieces of information, and then, the cut information that has been cut before can be presented to the user again, so that the user can Adjust the cut window again for the cut video.
  • the video after adjusting the weight of each information of each frame, the video is not cut at this time, but the cut information generated according to the adjusted information is presented to the user through the user interface , the user can adjust the clipping window as a whole, and then use the final adjusted clipping window for clipping processing.
  • the analysis module 902 may determine at least one key frame of the first video, and generate and display a user interface for adjusting cut information of each of the at least one key frame.
  • the editing module 904 may adaptively adjust the cutting window of the first video according to the adjusted cutting information, and use the adaptively adjusted cutting window to cut the first video Cut to get the second video.
  • an electronic device can be provided.
  • 10 is a block diagram of an electronic device according to an embodiment of the present disclosure
  • the electronic device 1000 includes at least one memory 1002 and at least one processor 1001, the at least one memory 1002 stores a set of computer-executable instructions, when the computer can execute the instructions When the collection is executed by at least one processor 1001, the video conversion method according to the embodiment of the present disclosure is executed.
  • the electronic device 1000 may be a PC computer, a tablet device, a personal digital assistant, a smart phone, or any other device capable of executing the above set of instructions.
  • the electronic device 1000 is not necessarily a single electronic device, but can also be a collection of any device or circuit capable of executing the above-mentioned instructions (or instruction sets) individually or jointly.
  • Electronic device 1000 may also be part of an integrated control system or system manager, or may be configured as a portable electronic device that interfaces locally or remotely (eg, via wireless transmission).
  • the processor 1001 may include a central processing unit (CPU), a graphics processing unit (GPU), a programmable logic device, a special purpose processor system, a microcontroller or a microprocessor.
  • processor 1001 may also include analog processors, digital processors, microprocessors, multi-core processors, processor arrays, network processors, and the like.
  • the processor 1001 may execute instructions or code stored in memory, which may also store data. Instructions and data may also be sent and received over a network via a network interface device, which may employ any known transport protocol.
  • the memory 1002 may be integrated with the processor, eg, RAM or flash memory arranged within an integrated circuit microprocessor or the like.
  • the memory may comprise a separate device such as an external disk drive, a storage array, or any other storage device that may be used by a database system.
  • the memory and the processor may be operatively coupled, or may communicate with each other, eg, through I/O ports, network connections, etc., to enable the processor to read files stored in the memory.
  • the electronic device 1000 may also include a video display (such as a liquid crystal display) and a user interaction interface (such as a keyboard, mouse, touch input device, etc.). All components of the electronic device 1000 may be connected to each other via a bus and/or a network.
  • a video display such as a liquid crystal display
  • a user interaction interface such as a keyboard, mouse, touch input device, etc.
  • a computer-readable storage medium storing instructions, wherein the instructions, when executed by at least one processor, cause the at least one processor to perform the video conversion method according to the present disclosure.
  • Examples of computer-readable storage media herein include: Read Only Memory (ROM), Random Access Programmable Read Only Memory (PROM), Electrically Erasable Programmable Read Only Memory (EEPROM), Random Access Memory (RAM) , dynamic random access memory (DRAM), static random access memory (SRAM), flash memory, non-volatile memory, CD-ROM, CD-R, CD+R, CD-RW, CD+RW, DVD-ROM , DVD-R, DVD+R, DVD-RW, DVD+RW, DVD-RAM, BD-ROM, BD-R, BD-R LTH, BD-RE, Blu-ray or Optical Disc Storage, Hard Disk Drive (HDD), Solid State Hard disk (SSD), card memory (such as a multimedia card, Secure Digital (SD) card, or Extreme Digital (XD
  • the computer program in the above-mentioned computer-readable storage medium can run in an environment deployed in a computer device such as a client, a host, an agent device, a server, etc.
  • a computer device such as a client, a host, an agent device, a server, etc.
  • the computer program and any associated data, data files and data structures are distributed over networked computer systems so that the computer programs and any associated data, data files and data structures are stored, accessed and executed in a distributed fashion by one or more processors or computers.
  • a computer program product can also be provided, and instructions in the computer program product can be executed by a processor of a computer device to complete the above-mentioned video conversion method.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Graphics (AREA)
  • Human Computer Interaction (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The present application provides a video conversion method, apparatus, and system, and a storage medium. The video conversion method may comprise the following steps: obtaining a first video in a first orientation and cutting information which is used for converting the first video into a second video in a second orientation; on the basis of the cutting information, generating and displaying a user interface used for adjusting the cutting information; receiving, by means of the user interface, user input used for adjusting the cutting information; and generating the second video according to the adjusted cutting information.

Description

视频转换方法及视频转换装置Video conversion method and video conversion device
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请基于申请号为202011086867.5、申请日为2020年10年12日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本申请作为参考。This application is based on the Chinese patent application with the application number of 202011086867.5 and the filing date of 2020.10.12, and claims the priority of the Chinese patent application. The entire content of the Chinese patent application is incorporated herein by reference.
技术领域technical field
本公开涉及视频处理技术领域,尤其涉及一种视频转换方法、装置、系统及存储介质。The present disclosure relates to the technical field of video processing, and in particular, to a video conversion method, device, system, and storage medium.
背景技术Background technique
目前,大多数视频和影视作品在拍摄过程中会采用宽的宽高比(即横屏),诸如4:3、16:9。以宽的宽高比录制的视频或类似媒体可能会被设计为在桌面上或横屏取向上观看。因此,用户在使用移动终端观看横屏视频时,为了获得良好的视觉体验,一般要把终端屏转换到横屏位置来播放视频。At present, most video and film and television works will use a wide aspect ratio (ie landscape), such as 4:3 and 16:9, during the shooting process. Video or similar media recorded in a wide aspect ratio may be designed to be viewed on a desktop or in landscape orientation. Therefore, when a user uses a mobile terminal to watch a horizontal screen video, in order to obtain a good visual experience, the terminal screen is generally converted to a horizontal screen position to play the video.
然而,越来越多的用户,特别是手机用户,更习惯于观看高的宽高比(即竖屏)的视频。垂直取向的媒体已成为用于在许多应用中观看和显示媒体的流行格式。However, more and more users, especially mobile phone users, are more accustomed to watching videos in high aspect ratio (ie, portrait). Vertically oriented media has become a popular format for viewing and displaying media in many applications.
发明内容SUMMARY OF THE INVENTION
本公开提供一种视频转换方法、装置、系统及存储介质。The present disclosure provides a video conversion method, device, system and storage medium.
根据本公开实施例的第一方面,提供一种视频转换方法,所述视频转换方法可以包括:获取第一取向的第一视频以及用于将第一视频转换为第二取向的第二视频的剪切信息;基于剪切信息生成并显示用于调整剪切信息的用户界面;经由用户界面接收用于调整剪切信息的用户输入;以及根据调整后的剪切信息来生成第二视频。According to a first aspect of the embodiments of the present disclosure, there is provided a video conversion method, the video conversion method may include: acquiring a first video in a first orientation and converting the first video into a second video in a second orientation cutting information; generating and displaying a user interface for adjusting the cutting information based on the cutting information; receiving user input for adjusting the cutting information via the user interface; and generating a second video according to the adjusted cutting information.
根据本公开实施例的第二方面,提供一种视频转换装置,所述视频转换装置可以包括:接口模块,被配置为接收第一取向的第一视频;分析模块,被配置为获取用于将第一视频转换为第二取向的第二视频的剪切信息,并且基于剪切信息生成并显示用于调整剪切信息的用户界面;显示模块,被配置为显示所述用户界面,其中,用于调整剪切信息的用户输入经由所述用户界面被接收;以及编辑模块,被配置为根据调整后的剪切信息来生成第二视频。According to a second aspect of the embodiments of the present disclosure, there is provided a video conversion apparatus, the video conversion apparatus may include: an interface module configured to receive a first video in a first orientation; an analysis module configured to obtain a video for converting The first video is converted into cut information of the second video in the second orientation, and based on the cut information, a user interface for adjusting the cut information is generated and displayed; the display module is configured to display the user interface, wherein, using a user input for adjusting the cut information is received via the user interface; and an editing module configured to generate a second video according to the adjusted cut information.
根据本公开实施例的第三方面,提供一种视频转换设备,所述视频转换设备可以包括:显示器;收发器,用于接收第一取向的第一视频;以及处理器,用于:获取用于将第一视频转换为第二取向的第二视频的剪切信息,基于剪切信息生成并显示用于调整剪切信息的用户界面,控制显示器显示所述用户界面,控制收发器经由用户界面接收用于调整剪切信息的用户输入,并且根据调整后的剪切信息来生成第二视频。According to a third aspect of the embodiments of the present disclosure, there is provided a video conversion device, the video conversion device may include: a display; a transceiver for receiving a first video in a first orientation; and a processor for: acquiring for converting the first video into the cut information of the second video in the second orientation, generating and displaying a user interface for adjusting the cut information based on the cut information, controlling the display to display the user interface, and controlling the transceiver via the user interface User input for adjusting the cut information is received, and a second video is generated according to the adjusted cut information.
根据本公开实施例的第四方面,提供一种电子设备,所述电子设备可以包括:处理器;存储器,存储由所述处理器执行的指令,其中,所述指令的执行促使所述处理器执行如上所述的视频转换方法。According to a fourth aspect of embodiments of the present disclosure, there is provided an electronic device, the electronic device may include: a processor; a memory storing instructions for execution by the processor, wherein execution of the instructions causes the processor Perform the video conversion method as described above.
根据本公开实施例的第五方面,提供一种非易失性计算机可读存储介质,其上存储由处理器执行的指令,其中,所述指令的执行促使所述处理器执行如上所述的视频转换方法。According to a fifth aspect of embodiments of the present disclosure, there is provided a non-volatile computer-readable storage medium having stored thereon instructions for execution by a processor, wherein execution of the instructions causes the processor to execute the above-described Video conversion method.
应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本公开。It is to be understood that the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the present disclosure.
附图说明Description of drawings
此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本公开的实施例,并与说明书一起用于解释本公开的原理,并不构成对本公开的不当限定。The accompanying drawings, which are incorporated into and constitute a part of this specification, illustrate embodiments consistent with the present disclosure, and together with the description, serve to explain the principles of the present disclosure and do not unduly limit the present disclosure.
图1是根据本公开实施例提供的将视频从一个取向转换为另一个取向的应用环境的示图;1 is a diagram of an application environment for converting video from one orientation to another, provided according to an embodiment of the present disclosure;
图2是根据本公开实施例的视频转换方法的流程图;2 is a flowchart of a video conversion method according to an embodiment of the present disclosure;
图3是根据本公开实施例的调整剪切窗口的用户界面的示图;3 is a diagram of a user interface for adjusting a clipping window according to an embodiment of the present disclosure;
图4是根据本公开实施例的获取单个帧的剪切窗口信息的流程示意图;4 is a schematic flowchart of obtaining clipping window information of a single frame according to an embodiment of the present disclosure;
图5是根据本公开实施例的标注区域的示意图;5 is a schematic diagram of a marked area according to an embodiment of the present disclosure;
图6是根据本公开实施例的调整信息权重的用户界面的示意图;6 is a schematic diagram of a user interface for adjusting information weights according to an embodiment of the present disclosure;
图7是根据本公开实施例的视频转换设备的框图;7 is a block diagram of a video conversion apparatus according to an embodiment of the present disclosure;
图8是根据本公开另一实施例的视频转换方法的流程图;8 is a flowchart of a video conversion method according to another embodiment of the present disclosure;
图9是根据本公开实施例的视频转换装置的框图;9 is a block diagram of a video conversion apparatus according to an embodiment of the present disclosure;
图10是根据本公开实施例的电子设备的框图。10 is a block diagram of an electronic device according to an embodiment of the present disclosure.
在整个附图中,应注意,相同的参考标号用于表示相同或相似的元件、特征和结构。Throughout the drawings, it should be noted that the same reference numerals are used to refer to the same or similar elements, features and structures.
具体实施方式Detailed ways
提供参照附图的以下描述以帮助对由权利要求及其等同物限定的本公开的实施例的全面理解。包括各种特定细节以帮助理解,但这些细节仅被视为是示例性的。因此,本领域的普通技术人员将认识到在不脱离本公开的范围和精神的情况下,可对描述于此的实施例进行各种改变和修改。此外,为了清楚和简洁,省略对公知的功能和结构的描述。The following description with reference to the accompanying drawings is provided to assist in a comprehensive understanding of embodiments of the present disclosure as defined by the claims and their equivalents. Various specific details are included to aid in that understanding, but are to be regarded as exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present disclosure. Also, descriptions of well-known functions and constructions are omitted for clarity and conciseness.
需要说明的是,本公开的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本公开的实施例能够以除了在这里图示或描述的那些以外的顺序实施。以下实施例中所描述的实施方式并不代表与本公开相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本公开的一些方面相一致的装置和方法的例子。It should be noted that the terms "first", "second" and the like in the description and claims of the present disclosure and the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence. It is to be understood that the data so used may be interchanged under appropriate circumstances such that the embodiments of the disclosure described herein can be practiced in sequences other than those illustrated or described herein. The implementations described in the following examples are not intended to represent all implementations consistent with this disclosure. Rather, they are merely examples of apparatus and methods consistent with some aspects of the present disclosure as recited in the appended claims.
在此需要说明的是,在本公开中出现的“若干项之中的至少一项”均表示包含“该若干项中的任意一项”、“该若干项中的任意多项的组合”、“该若干项的全体”这三类并列的情况。例如“包括A和B之中的至少一个”即包括如下三种并列的情况:(1)包括A;(2) 包括B;(3)包括A和B。又例如“执行步骤一和步骤二之中的至少一个”,即表示如下三种并列的情况:(1)执行步骤一;(2)执行步骤二;(3)执行步骤一和步骤二。It should be noted here that "at least one of several items" in the present disclosure all means including "any one of the several items", "a combination of any of the several items", The three categories of "the whole of the several items" are juxtaposed. For example, "including at least one of A and B" includes the following three parallel situations: (1) including A; (2) including B; (3) including A and B. Another example is "execute at least one of step 1 and step 2", which means the following three parallel situations: (1) execute step 1; (2) execute step 2; (3) execute step 1 and step 2.
相关技术的视频剪切均为全自动实现,自动剪切后的视频可能没有达到用户的预期剪切效果,但是用户并不能对最终剪切结果做进一步的剪切调整。此外,在视频自动剪切中,用户也无法调节视频场景中各信息流的重要性。这导致剪切出来的视频场景也可能不符合用户预期。The video cutting of the related art is fully automatic, and the automatically cut video may not achieve the user's expected cutting effect, but the user cannot make further cutting adjustments to the final cutting result. In addition, in automatic video cutting, the user cannot adjust the importance of each information flow in the video scene. As a result, the cut video scene may not meet user expectations.
本公开可以向用户提供视频剪切处理前的参数调整和处理后的剪切区域调整的功能,让用户得到他们满意的视频剪切结果。The present disclosure can provide users with the functions of parameter adjustment before video cutting processing and adjustment of the cutting area after processing, so that users can obtain video cutting results that they are satisfied with.
在下文中,根据本公开的各种实施例,将参照附图对本公开的方法、装置以及系统进行详细描述。Hereinafter, according to various embodiments of the present disclosure, the method, apparatus, and system of the present disclosure will be described in detail with reference to the accompanying drawings.
图1是根据本公开实施例提供的将视频从一个取向转换为另一个取向的应用环境的示图。在本公开中,取向是相对于设备/装置的横向或竖向。FIG. 1 is a diagram of an application environment for converting video from one orientation to another, provided according to an embodiment of the present disclosure. In this disclosure, orientation is landscape or portrait relative to the device/device.
参照图1,该应用环境100包括终端110和媒体服务器系统120。Referring to FIG. 1 , the application environment 100 includes a terminal 110 and a media server system 120 .
终端110为用户所在终端,终端110可以是智能手机、平板电脑、便携式计算机和台式计算机等中的至少一种。虽然本实施例仅示出一个终端110进行说明,但是本领域技术人员可以知晓,上述终端的数量可以为两个或更多个。本公开实施例不对终端的数量和设备类型进行任何限定。The terminal 110 is a terminal where the user is located, and the terminal 110 may be at least one of a smart phone, a tablet computer, a portable computer, a desktop computer, and the like. Although this embodiment only shows one terminal 110 for description, those skilled in the art may know that the number of the above-mentioned terminals may be two or more. This embodiment of the present disclosure does not impose any limitation on the number of terminals and device types.
终端110可以安装有目标应用,用于向媒体服务器系统120提供将被剪切和转换的视频,该目标应用可以是多媒体类应用、社交类应用或资讯类应用等。例如,终端110可以是用户使用的终端,在终端110中运行的应用内登录有用户的账户。The terminal 110 may be installed with a target application for providing the video to be cut and converted to the media server system 120 , and the target application may be a multimedia application, a social application or an information application or the like. For example, the terminal 110 may be a terminal used by a user, and the user's account is logged in an application running in the terminal 110 .
终端110可以通过无线网络或有线网络与媒体服务器系统120连接,使得终端110与媒体服务器系统120之间可以进行数据交互。例如,网络可以包含局域网(LAN)、广域网(WAN)、电话网络、无线链路、内联网、互联网或其组合等。The terminal 110 can be connected to the media server system 120 through a wireless network or a wired network, so that data interaction can be performed between the terminal 110 and the media server system 120 . For example, a network may include a local area network (LAN), a wide area network (WAN), a telephone network, a wireless link, an intranet, the Internet, combinations thereof, and the like.
媒体服务器系统120可以是用于对视频进行剪切转换的服务器系统。例如,媒体服务器系统120可以包括一个或多个处理处理器以及存储器。存储器可以包括用于执行以上的视频转换方法的一个或一个以上的程序。媒体服务器系统120还可以包括一个电源组件被配置为执行媒体服务器系统120的电源管理,一个有线或无线网络接口被配置为将媒体服务器系统120连接到网络,和一个输入输出(I/O)接口。媒体服务器系统120可以操作基于存储在存储器的操作系统,例如Windows ServerTM、Mac OS XTM、UnixTM、LinuxTM、FreeBSDTM等。然而,上述媒体服务器系统120包含的装置仅是示例性的,本公开不限于此。The media server system 120 may be a server system for cut-converting video. For example, media server system 120 may include one or more processing processors and memory. The memory may include one or more programs for performing the above video conversion method. The media server system 120 may also include a power supply assembly configured to perform power management of the media server system 120, a wired or wireless network interface configured to connect the media server system 120 to a network, and an input output (I/O) interface . The media server system 120 may operate based on an operating system stored in memory, such as Windows Server™, Mac OS X™, Unix™, Linux™, FreeBSD™, and the like. However, the devices included in the media server system 120 described above are only exemplary, and the present disclosure is not limited thereto.
媒体服务器系统120可以对输入的视频进行剪切和转换,然后经由无线网络或有线网络将转换好的视频下发给终端110或发布到媒体平台上。The media server system 120 can cut and convert the input video, and then deliver the converted video to the terminal 110 or publish it to the media platform via a wireless network or a wired network.
进一步地,媒体服务器系统120可以获取用于将第一视频转换为第二取向的第二视频的剪切信息,基于剪切信息生成并显示用于调整剪切信息的用户界面,经由用户界面接收用于调整剪切信息的用户输入,然后根据调整后的剪切信息来对之前剪切的视频再次进行调整。Further, the media server system 120 may acquire cut information for converting the first video into the second video of the second orientation, generate and display a user interface for adjusting the cut information based on the cut information, and receive via the user interface User input for adjusting cut information, and then adjust the previously cut video again according to the adjusted cut information.
在一些实施例中,终端110可以安装有实施本公开的视频转换方法的应用程序,终端110可以实现对视频的剪切转换。例如,终端110的存储器可以存储用于执行以上的视频转换方法的一个或一个以上的程序。终端110的处理器可以通过运行相关的程序/算法来实现对视频的剪切转换。然后终端110可以经由无线网络或有线网络将剪切转换好的视频上传至媒体服务器系统120,或者可以将转换好的视频存储在终端110的存储器中。In some embodiments, the terminal 110 may be installed with an application program implementing the video conversion method of the present disclosure, and the terminal 110 may realize the cut conversion of the video. For example, the memory of the terminal 110 may store one or more programs for performing the above video conversion method. The processor of the terminal 110 may implement the cut and convert of the video by running related programs/algorithms. The terminal 110 may then upload the cut and converted video to the media server system 120 via a wireless network or a wired network, or may store the converted video in the memory of the terminal 110 .
作为示例,终端110可以将本地或外部获取的横向视频经由无线或有线网络传输给媒体服务器系统120,媒体服务器系统120可以根据本公开的视频转换方法将横向视频剪切转换为竖向视频,然后经由无线或有线网络将转换好的竖向视频下发给终端110。As an example, the terminal 110 may transmit the horizontal video obtained locally or externally to the media server system 120 via a wireless or wired network, and the media server system 120 may cut and convert the horizontal video into a vertical video according to the video conversion method of the present disclosure, and then The converted vertical video is delivered to the terminal 110 via a wireless or wired network.
作为另一示例,终端110可以根据本公开的视频转换方法将本地或外部获取的横向视频转换为竖向视屏,然后经由无线或有线网络将竖向视频上传至媒体服务器系统120。媒体服务器系统120可以将该竖向视频分发给其他的电子设备。As another example, the terminal 110 may convert a locally or externally acquired horizontal video into a vertical video screen according to the video conversion method of the present disclosure, and then upload the vertical video to the media server system 120 via a wireless or wired network. The media server system 120 may distribute the vertical video to other electronic devices.
虽然实施例举例说明将横向视频转换为竖向视频,但是也可以采用本公开方法类似地,将竖向视频剪切转换为横向视频。Although the embodiments illustrate converting a landscape video to a portrait video, the method of the present disclosure can similarly be used to cut a portrait video into a landscape video.
图2是根据本公开实施例的视频转换方法的流程图。本公开实施例的视频转换方法可以由媒体服务器系统120执行或具有视频剪切转换功能的电子设备执行。FIG. 2 is a flowchart of a video conversion method according to an embodiment of the present disclosure. The video conversion method of the embodiment of the present disclosure may be executed by the media server system 120 or an electronic device having a video cut conversion function.
在步骤S201,获取第一取向的第一视频以及用于将第一视频转换为第二取向的第二视频的剪切信息。剪切信息可以包括用于将第一视频剪切为第二视频的剪切窗口。这里,第一取向的第一视频可以指横向视频。In step S201, a first video in a first orientation and cut information for converting the first video into a second video in a second orientation are acquired. The cut information may include a cut window for cutting the first video into the second video. Here, the first video of the first orientation may refer to a landscape video.
可以使用视频智能裁剪工具(诸如Google Autoflip)来直接获取将一个取向的视频转换为另一个取向的视频的剪切信息。也就是说,可以从相关视频智能裁剪工具获取对第一视频进行剪切的剪切信息。A video smart crop tool, such as Google Autoflip, can be used to directly obtain crop information for converting a video in one orientation to a video in another. That is to say, the clipping information for clipping the first video can be obtained from the relevant video intelligent clipping tool.
根据本公开的实施例,可以通过以下方式获取剪切信息:对第一视频的每个帧进行分析以确定每个帧的至少一种信息,基于至少一种信息来生成相应帧的标注图,通过计算标注图的矩来获得相应帧的焦点,将该焦点作为用于剪切该帧的剪切窗口的中心,根据焦点以及指定宽高比来生成剪切窗口。According to an embodiment of the present disclosure, the cut information can be obtained by: analyzing each frame of the first video to determine at least one kind of information of each frame, generating an annotation map of the corresponding frame based on the at least one kind of information, The focus of the corresponding frame is obtained by calculating the moment of the annotation map, the focus is taken as the center of the clipping window for clipping the frame, and the clipping window is generated according to the focus and the specified aspect ratio.
根据本公开的另一实施例,可以通过以下方式获取剪切信息:对第一视频的每个帧进行分析以确定每个帧的至少一种信息,基于分析结果生成并显示针对每个帧的用于调整至少一种信息在视频取向转换的情况下的权重的用户界面,通过该用户界面来接收用于调整至少一种信息的权重的用户输入,基于权重被调整的至少一种信息来生成相应帧的标注图,通过计算标注图的矩来获得相应帧的焦点,将该焦点作为用于剪切该帧的剪切窗口的中心,根据焦点以及指定宽高比来生成剪切窗口。According to another embodiment of the present disclosure, the cut information may be obtained by: analyzing each frame of the first video to determine at least one kind of information of each frame, and generating and displaying the cutout information for each frame based on the analysis result A user interface for adjusting the weight of at least one information in the case of video orientation conversion, receiving user input for adjusting the weight of the at least one information through the user interface, and generating based on the weighted at least one information For the annotation map of the corresponding frame, the focus of the corresponding frame is obtained by calculating the moment of the annotation map, the focus is taken as the center of the clipping window used to clip the frame, and the clipping window is generated according to the focus and the specified aspect ratio.
根据本公开的实施例,在获取剪切信息之前,可以通过设置用户界面使用户可以根据自己需求来调节各信息流在转换视频结果中的比例,使得在剪切处理中保留用户定义的重要信息。According to the embodiments of the present disclosure, before acquiring the cut information, a user interface can be set so that the user can adjust the proportion of each information stream in the converted video result according to their own needs, so that the important information defined by the user is retained in the cut process. .
此外,通过计算每帧图像的焦点,使得更加突出每帧重点信息的分布情况,并且通过对每帧焦点的轨迹拟合,能够提供更好的剪切信息,增加帧与帧之间的契合度,提高用户体验。In addition, by calculating the focus of each frame of image, the distribution of key information in each frame is more prominent, and by fitting the trajectory of the focus of each frame, better clipping information can be provided, and the fit between frames can be increased. , to improve the user experience.
然而,上述剪切信息的获取仅是示例性的,本公开不限于此。However, the above-mentioned acquisition of the clipping information is only exemplary, and the present disclosure is not limited thereto.
在本公开中,获取的剪切信息可以是在对视频进行剪切之后的结果信息。在一些实施例中,剪切信息可以是在针对视频帧进行信息分析时计算的剪切窗口信息。也就是说,获取的剪切信息可以是在视频剪切处理后的信息,也可以是视频剪切处理前的预分析信息。In the present disclosure, the acquired cut information may be result information after the video is cut. In some embodiments, the clipping information may be clipping window information calculated during information analysis for the video frame. That is to say, the acquired clipping information may be information after video clipping processing, or may be pre-analysis information before video clipping processing.
在步骤S202,基于剪切信息生成并显示用于调整剪切信息的用户界面。在用户界面中,针对第一视频的一帧,在该帧上可以显示用于将该帧剪切为第二视频的对应帧的剪切窗口。例如,参照图3所示。In step S202, a user interface for adjusting the cut information is generated and displayed based on the cut information. In the user interface, for a frame of the first video, a cutting window for cutting the frame into a corresponding frame of the second video may be displayed on the frame. For example, refer to FIG. 3 .
在步骤S203,经由用户界面接收用于调整剪切信息的用户输入。这里,用户输入可以是触摸输入、键输入、悬停输入等中的一种。可以根据显示设备的性能来实现不同类型的用户输入。At step S203, a user input for adjusting the clipping information is received via the user interface. Here, the user input may be one of a touch input, a key input, a hovering input, and the like. Different types of user input can be implemented depending on the capabilities of the display device.
在步骤S204,根据调整后的剪切信息来生成第二视频。可以根据调整后的剪切信息自适应地对第一视频的剪切窗口进行调整,然后利用自适应调整后的剪切窗口对第一视频进行剪切以获得第二视频。通过自适应调整,能够提供更好的剪切信息,增加帧与帧之间的契合度。In step S204, a second video is generated according to the adjusted cut information. The cutting window of the first video may be adaptively adjusted according to the adjusted cutting information, and then the first video may be cut using the adaptively adjusted cutting window to obtain the second video. Through adaptive adjustment, better clipping information can be provided and the fit between frames can be increased.
在一种可能的实现方式中,可以确定第一视频的至少一个关键帧,然后生成并显示用于调整至少一个关键帧中的每个关键帧的剪切信息的用户界面。在对第一视频的关键帧的剪切窗口的调整后,可以自动对第一视频的相关帧的剪切窗口自适应地进行调整,在整个视频调整完成后,用户可以导出剪切后的视频。In a possible implementation manner, at least one key frame of the first video may be determined, and then a user interface for adjusting cut information of each key frame in the at least one key frame is generated and displayed. After adjusting the cutting window of the key frame of the first video, the cutting window of the relevant frame of the first video can be automatically adjusted adaptively. After the adjustment of the whole video is completed, the user can export the cut video .
根据本公开的实施例,允许用户在视频剪切处理前后对整个剪切处理流程有更加全面的把握,并最终得到他们满意的剪切结果。According to the embodiments of the present disclosure, users are allowed to have a more comprehensive grasp of the entire cutting process flow before and after the video cutting process, and finally obtain a cutting result that they are satisfied with.
此外,根据本公开的视频转换方法能够更好地处理视频场景切换、用户指定区域变更或丢失的场景。In addition, the video conversion method according to the present disclosure can better handle video scene switching, user-specified area change, or lost scenes.
图3是根据本公开实施例的调整剪切窗口的用户界面的示图。图3的用户界面可以被显示在诸如终端或服务器的显示器的部分区域上,或者以全屏显示在显示器上。FIG. 3 is a diagram of a user interface for adjusting a clipping window according to an embodiment of the present disclosure. The user interface of FIG. 3 may be displayed on a partial area of a display such as a terminal or a server, or on the display in full screen.
根据本公开的实施例,在自动剪切流程后,可以提供给用户每一帧的剪切信息,并且将剪切信息反映在后期用于调整剪切窗口的用户界面中。According to the embodiments of the present disclosure, after the automatic cutting process, the cutting information of each frame can be provided to the user, and the cutting information can be reflected in the user interface for adjusting the cutting window later.
参照图3,用户可以通过用户界面301针对某一帧做剪切窗口的调整。用户可以对剪切窗口进行上、下、左、右移动以调整至自己关注的区域。当用户界面301显示在触摸屏上时,用户可以触摸剪切窗口来进行相应地移动。或者可以通过鼠标、键盘等来拖动剪切窗口至关注区域。然而,上述示例仅是示例性的,本公开不限于此。Referring to FIG. 3 , the user can adjust the clipping window for a certain frame through the user interface 301 . The user can move the clipping window up, down, left and right to adjust to the area of interest. When the user interface 301 is displayed on the touch screen, the user can touch the cutout window to move accordingly. Or you can drag the clipping window to the area of interest by mouse, keyboard, etc. However, the above-described examples are merely exemplary, and the present disclosure is not limited thereto.
用户可以选择性地对一些帧的剪切窗口进行调整。例如,用户可以在用户界面301中通过拖动视频的滑动条来选择用户感兴趣的帧,然后对该帧的剪切窗口进行调整。或者,可以在用户界面301上设置“下一帧”按钮(未示出),在用户调整完当前帧的剪切窗口后,通过点击“下一帧”按钮来切换值下一帧的调整界面。The user can selectively adjust the clipping window for some frames. For example, in the user interface 301, the user can select a frame of interest to the user by dragging the slide bar of the video, and then adjust the clipping window of the frame. Alternatively, a “next frame” button (not shown) may be set on the user interface 301, and after the user adjusts the clipping window of the current frame, the adjustment interface of the next frame can be switched by clicking the “next frame” button .
此外,在整个视频调整完成后,用户可以通过点击用户界面上的“导出”按钮(未示出),即可导出剪切后的视频。上述按钮示例仅是示例性的,可以根据实际需求在用户界面上设置不同功能的按钮。In addition, after the adjustment of the entire video is completed, the user can export the cut video by clicking an "export" button (not shown) on the user interface. The above button examples are only exemplary, and buttons with different functions can be set on the user interface according to actual requirements.
在一些实施例中,可以在用户界面301上显示每个关键帧的剪切窗口,使得用户对视频的关键帧的剪切窗口进行调整,在调整完关键帧的剪切窗口之后,用户可以通过点击用户界面上的“导出”按钮来导出调整后的视频。In some embodiments, the cutout window of each key frame may be displayed on the user interface 301, so that the user can adjust the cutout window of the keyframe of the video. After adjusting the cutout window of the keyframe, the user can Click the "Export" button on the user interface to export the adjusted video.
根据本公开实施例的用户界面简单,易于用户操作,提高用户调整剪切信息的效率。The user interface according to the embodiment of the present disclosure is simple, easy for the user to operate, and improves the efficiency of the user to adjust the cut information.
图4是根据本公开实施例的获取单个帧的剪切窗口信息的流程示意图。本公开实施例的获取单个帧的剪切窗口信息的方法可以由媒体服务器系统120执行或具有视频剪切转换功能的电子设备执行。FIG. 4 is a schematic flowchart of acquiring clipping window information of a single frame according to an embodiment of the present disclosure. The method for acquiring the cut window information of a single frame according to the embodiment of the present disclosure may be executed by the media server system 120 or an electronic device having a video cut conversion function.
参照图4,在获取图像401后,对图像401进行分析以确定图像401的M种信息,M为正整数。其中,对于每一种信息的分析,可以使用对应的分析方法来实现,也就是说,可以使用M种分析方法对图像401进行分析以确定M种信息。例如,可以使用人脸分析方法对图像401的人脸信息进行分析。Referring to FIG. 4 , after the image 401 is acquired, the image 401 is analyzed to determine M kinds of information of the image 401 , where M is a positive integer. The analysis of each type of information may be implemented by using a corresponding analysis method, that is, the image 401 may be analyzed using M types of analysis methods to determine M types of information. For example, the face information of the image 401 can be analyzed using a face analysis method.
通过对M种信息的分析可以生成M个对应的标注区域,即每分析一种信息,都会产生一个图像401相对应的信息分布图。例如,在分析人脸信息时,会生成一个图像401的人脸信息的基于像素的标注区域,然后将基于像素的标注区域转化为信息分布的标注区域。By analyzing the M types of information, M corresponding marked regions can be generated, that is, for each type of information analyzed, an information distribution map corresponding to the image 401 is generated. For example, when analyzing face information, a pixel-based labeling area of the face information of the image 401 is generated, and then the pixel-based labeling area is converted into an information distribution labeling area.
用户可以根据自己的需求对M个标注区域分别赋予权重,以突出自己关注的部分。例如,如果想重点保护人脸部分不被剪切掉,可以提高人脸信息的标注区域的加权比例并且降低其他信息的标注区域的加权比例。Users can assign weights to the M marked regions according to their own needs to highlight the parts they are concerned about. For example, if you want to focus on protecting the face part from being cut off, you can increase the weighting ratio of the marked area of the face information and reduce the weighted ratio of the marked area of other information.
根据加权后的M个标注区域来计算图像401的整体标注区域。例如,可以通过对加权的M个标注区域求和来获得图像401的整体标注区域。The overall marked area of the image 401 is calculated according to the weighted M marked areas. For example, the overall labeled regions of the image 401 can be obtained by summing the weighted M labeled regions.
在获得整体标注区域后,可以基于整体标注区域来生成图像401的标注图。由于之前对各个标注区域进行了加权处理,所以标注图可以显示出每个标注区域的重要性。After the overall labeling area is obtained, the labeling map of the image 401 may be generated based on the overall labeling area. Since the weighting of each annotated area was performed before, the annotation map can show the importance of each annotated area.
通过计算标注图的矩来获得图像401的焦点。例如,可以通过计算标注图的几何中心点来获得图像401的焦点。利用焦点的位置和指定宽高比来生成一个剪切窗口。The focus of the image 401 is obtained by computing the moments of the annotation map. For example, the focus of image 401 can be obtained by calculating the geometric center point of the annotation map. Generates a clipping window using the position of the focus and the specified aspect ratio.
然而,上述示例仅是示例性的,也可以从视频智能裁剪工具(诸如Google Autoflip)来获取将一个取向的视频转换为另一个取向的视频的剪切窗口信息。在从其他剪切工具或软件获得剪切信息后,可以按照上述类似的方式来获得每个帧的剪切窗口的中心位置、尺寸、宽高比等剪切信息。However, the above examples are only exemplary, and the clipping window information for converting a video in one orientation to a video in another orientation can also be obtained from a video intelligent cropping tool (such as Google Autoflip). After obtaining the clipping information from other clipping tools or software, the clipping information such as the center position, size, aspect ratio, etc., of the clipping window of each frame can be obtained in a similar manner as described above.
图5是根据本公开实施例的标注区域的示意图。FIG. 5 is a schematic diagram of a marked area according to an embodiment of the present disclosure.
参照图5,图5的(a)为第一视频的某一帧,图5的(b)示出了该帧中重要信息(例如运动信息)的标注区域,(b)中的白色区域为标注区域。然而,上述示例仅是示例性的,本公开不限于此。Referring to FIG. 5 , (a) of FIG. 5 is a certain frame of the first video, (b) of FIG. 5 shows the marked area of important information (such as motion information) in the frame, and the white area in (b) is Label area. However, the above-described examples are merely exemplary, and the present disclosure is not limited thereto.
图6是根据本公开实施例的调整信息权重的用户界面的示意图。在分析出一帧的各种信息后,可以相应地显示与各种信息相关联的用户界面。FIG. 6 is a schematic diagram of a user interface for adjusting information weights according to an embodiment of the present disclosure. After analyzing various information of a frame, a user interface associated with the various information may be displayed accordingly.
参照图6,在用户界面601中,针对每种信息(诸如第一信息、第二信息等)可以配置有一个滑动条,该滑动条可以用于调整对应信息的权重。例如,可以将滑动条的范围设置为[0,1]。在针对每种信息设置相应权重后,点击“确定”按钮来完成对一帧中的各个信息流的权重的设置。例如,在点击“确定”按钮后,可以将用户输入的权重信息传输给电子设备的处理器,以进行后续的剪切转换。或者可以在点击“确定”按钮后,将相应的剪切窗口呈现在相应的帧上,以向用户展示剪切窗口在帧上的剪切位置。Referring to FIG. 6 , in the user interface 601, a slider bar may be configured for each type of information (such as the first information, the second information, etc.), and the slider bar may be used to adjust the weight of the corresponding information. For example, the range of the slider can be set to [0, 1]. After setting the corresponding weight for each kind of information, click the "OK" button to complete the setting of the weight of each information flow in a frame. For example, after clicking the "OK" button, the weight information input by the user may be transmitted to the processor of the electronic device for subsequent cut conversion. Alternatively, after clicking the "OK" button, the corresponding cutout window may be presented on the corresponding frame, so as to show the user the cutout position of the cutout window on the frame.
然而,图6的用户界面仅是示例性的,用户界面中的元素也可以以其他形式展示。However, the user interface of FIG. 6 is merely exemplary, and elements in the user interface may be presented in other forms.
在一些实施例中,可以针对每种信息配置一个文本输入框,用户可以通过文本输入框对相应信息赋予权重。然而,上述示例仅是示例性的,本公开不限于此。In some embodiments, a text input box may be configured for each type of information, and the user may assign weights to the corresponding information through the text input box. However, the above-described examples are merely exemplary, and the present disclosure is not limited thereto.
用户界面可以显示在电子设备(诸如终端110或媒体服务器系统120)的显示器部分区域上,或者以全屏显示在显示器上,本领域技术人员可以根据实际需求进行显示设置。The user interface can be displayed on a partial area of the display of the electronic device (such as the terminal 110 or the media server system 120 ), or displayed on the display in a full screen, and those skilled in the art can make display settings according to actual needs.
根据本公开的实施例,在视频剪切处理前,允许用户调整每个帧的信息流权重,使得在剪切处理中保留用户定义的重要信息。According to an embodiment of the present disclosure, before the video cutting process, the user is allowed to adjust the information flow weight of each frame, so that the important information defined by the user is preserved in the cutting process.
图7是根据本公开实施例的视频转换设备的框图。该视频转换设备700可以被实施为终端110或者被实施为媒体服务器系统120,或者任意其他的设备。7 is a block diagram of a video conversion apparatus according to an embodiment of the present disclosure. The video conversion device 700 may be implemented as a terminal 110 or as a media server system 120, or any other device.
参照图7,视频转换设备700可以包括收发器701、显示器702和处理器703。7 , a video conversion apparatus 700 may include a transceiver 701 , a display 702 and a processor 703 .
收发器701可以接收第一取向的第一视频。The transceiver 701 can receive the first video in the first orientation.
处理器703可以使用视频智能裁剪工具(诸如Google Autoflip)来获取将一个取向的视频转换为另一个取向的视频的剪切窗口信息。在一些实施例中,处理器可以使用本公开实施例的用于获取剪切信息的算法(例如图4所示的方法)来获得用于将第一视频转换为第二取向的第二视频的剪切信息。The processor 703 may use a video smart cropping tool (such as Google Autoflip) to obtain the cropping window information for converting a video in one orientation to a video in another orientation. In some embodiments, the processor may use the algorithm for obtaining cut information of embodiments of the present disclosure (eg, the method shown in FIG. 4 ) to obtain the data for converting the first video into the second video in the second orientation Cut information.
处理器703可以基于剪切信息生成并显示用于调整剪切信息的用户界面,并且控制显示器702显示该用户界面。例如,可以显示图3所示的用户界面。The processor 703 may generate and display a user interface for adjusting the clipping information based on the clipping information, and control the display 702 to display the user interface. For example, the user interface shown in FIG. 3 may be displayed.
用户界面可以包括与分析信息相关联的图形、文本、图标、视频及其它们的任意组合。当显示器702是触摸显示屏时,显示器702还具有采集在显示器702的表面或表面上方的触摸信号的能力。该触摸信号可以作为控制信号输入至处理器701进行处理。此时,显示器702还可以用于提供虚拟按钮和/或虚拟键盘,也称软按钮和/或软键盘。在一些实施例中,显示器702可以为一个,设置在视频转换设备700的前面板;在另一些实施例中,显示器702可以为至少两个,分别设置在视频转换设备700的不同表面或呈折叠设计;在再一些实施例中,显示器702可以是柔性显示屏,设置在视频转换设备700的弯曲表面上或折叠面上。显示器702可以采用LCD(Liquid Crystal Display,液晶显示屏)、OLED(Organic Light-Emitting Diode,有机发光二极管)等材质制备。然而,上述示例仅是示例性的,本公开不限于此。The user interface may include graphics, text, icons, video, and any combination thereof associated with the analysis information. When the display 702 is a touch display screen, the display 702 also has the ability to acquire touch signals on or over the surface of the display 702 . The touch signal may be input to the processor 701 as a control signal for processing. At this time, the display 702 may also be used to provide virtual buttons and/or virtual keyboards, also referred to as soft buttons and/or soft keyboards. In some embodiments, the number of displays 702 may be one, which is arranged on the front panel of the video conversion device 700; in other embodiments, the number of displays 702 may be at least two, which are respectively arranged on different surfaces of the video conversion device 700 or folded. Design; In still other embodiments, display 702 may be a flexible display screen disposed on a curved or folded surface of video conversion device 700 . The display 702 can be prepared by using materials such as LCD (Liquid Crystal Display, liquid crystal display), OLED (Organic Light-Emitting Diode, organic light emitting diode). However, the above-described examples are merely exemplary, and the present disclosure is not limited thereto.
处理器703可以控制收发器701经由用户界面接收用于调整剪切信息的用户输入,在剪切窗口调整后,处理器703可以自动对相关帧做剪切窗口的调整,以确保帧与帧之间的契合度。The processor 703 can control the transceiver 701 to receive a user input for adjusting the clipping information via the user interface. After the clipping window is adjusted, the processor 703 can automatically adjust the clipping window for the relevant frame to ensure that the frames are consistent with each other. compatibility between.
作为示例,处理器703可以根据调整后的剪切信息自适应地对第一视频的剪切窗口进行调整,然后利用自适应调整后的剪切窗口对第一视频进行剪切以获得第二视频。在获得最终的第二视频后,可以经由收发器701向其他设备输出第二视频。As an example, the processor 703 may adaptively adjust the cutting window of the first video according to the adjusted cutting information, and then use the adaptively adjusted cutting window to cut the first video to obtain the second video . After the final second video is obtained, the second video can be output to other devices via the transceiver 701 .
通过设置剪切后的视频帧的剪切窗口的调整选项,用户可以对最终剪切结果做进一步的调整。By setting the adjustment options of the cut window of the cut video frame, the user can make further adjustments to the final cut result.
根据本公开的实施例,不仅可以向用户提供视频剪切处理后的剪切区域调整的功能,还可以向用户提供视频剪切处理前的参数调整,让用户得到他们满意的剪切结果。According to the embodiments of the present disclosure, not only the function of adjusting the trimming area after the video trimming process can be provided to the user, but also the parameter adjustment before the video trimming process can be provided to the user, so that the user can obtain the trimming result they are satisfied with.
处理器703可以对第一视频的每个帧进行分析以确定每个帧的至少一种信息,并且基于分析结果生成针对每个帧的用于调整至少一种信息在视频取向转换的情况下的权重的用户界面。例如,可以显示图6所示的用户界面。The processor 703 may analyze each frame of the first video to determine at least one kind of information for each frame, and based on the analysis result, generate a method for adjusting the at least one kind of information for each frame in the case of video orientation conversion. Weight UI. For example, the user interface shown in FIG. 6 may be displayed.
处理器703可以控制收发器701通过图6的用户界面来接收用于调整每个帧的至少一种信息的权重的用户输入,基于权重被调整的至少一种信息来生成针对每个帧的剪切窗口信息。在生成每个帧的剪切窗口信息后,处理器703可以根据剪切窗口信息来生成用户界面,以向用户直观地显示每一帧是如何被剪切的。The processor 703 may control the transceiver 701 to receive, through the user interface of FIG. 6, a user input for adjusting the weight of at least one kind of information of each frame, and to generate a clipping for each frame based on the at least one kind of information whose weight is adjusted. Cut window information. After generating the clipping window information of each frame, the processor 703 may generate a user interface according to the clipping window information to visually display to the user how each frame is clipped.
在一种可能的实现方式中,处理器703可以基于对至少一种信息的分析来生成相应帧的与至少一种信息对应的各个标注区域,标注区域是表示信息分布的区域,其中,相应帧的各个标注区域被赋予由用户输入的权重。In a possible implementation manner, the processor 703 may generate, based on the analysis of the at least one type of information, each marked area of the corresponding frame corresponding to the at least one type of information, where the marked area is an area representing the distribution of information, wherein the corresponding frame Each annotated region of is given a weight entered by the user.
在一种可能的实现方式中,对于第一视频的每个帧,处理器703可以根据权重被调整的各个标注区域来计算相应帧的整体标注区域,基于整体标注区域来计算相应帧的焦点,基于焦点和指定宽高比来生成相应帧的剪切窗口。此外,可以预先设置剪切窗口的尺寸,或可以自适应地调整剪切窗口的尺寸。In a possible implementation manner, for each frame of the first video, the processor 703 may calculate the overall labeling area of the corresponding frame according to each labeling area whose weights are adjusted, and calculate the focus of the corresponding frame based on the overall labeling area, Generates a clipping window for the corresponding frame based on the focus and the specified aspect ratio. In addition, the size of the cutting window may be preset, or the size of the cutting window may be adaptively adjusted.
在一种可能的实现方式中,处理器703可以通过对每个帧的焦点进行拟合来获得相应帧的拟合后的焦点,然以基于拟合后的焦点和指定宽高比来生成相应帧的剪切窗口。In a possible implementation manner, the processor 703 may obtain the fitted focus of the corresponding frame by fitting the focus of each frame, and then generate the corresponding frame based on the fitted focus and the specified aspect ratio. frame clipping window.
在一种可能的实现方式中,处理器可以基于整体标注区域来生成针对相应帧的标注图,并通过计算标注图的矩来获得相应帧的焦点。In a possible implementation manner, the processor may generate an annotation map for the corresponding frame based on the overall annotation area, and obtain the focus of the corresponding frame by calculating a moment of the annotation map.
在一些实施例中,视频转换设备700可以包括存储器,存储器可以存储原始输入视频和转换后的视频。此外,存储器可以包括一个或多个计算机可读存储介质,该计算机可读存储介质可以是非暂态的。存储器还可包括高速随机存取存储器,以及非易失性存储器,比如一个或多个磁盘存储设备、闪存存储设备。在一些实施例中,存储器中的非暂态的计算机可读存储介质用于存储至少一个指令,该至少一个指令用于被处理器703运行。In some embodiments, the video conversion apparatus 700 may include a memory that may store the original input video and the converted video. Additionally, the memory may include one or more computer-readable storage media, which may be non-transitory. Memory may also include high-speed random access memory, as well as non-volatile memory, such as one or more magnetic disk storage devices, flash storage devices. In some embodiments, a non-transitory computer-readable storage medium in memory is used to store at least one instruction for execution by processor 703 .
在一些实施例中,视频转换设备700还包括有:外围设备接口和至少一个外围设备。处理器703和外围设备接口之间可以通过总线或信号线相连。各个外围设备可以通过总线、信号线或电路板与外围设备接口相连。在一些实施例中,外围设备可以包括射频电路、触摸显示屏、摄像头、音频电路、定位组件和电源等中的至少一种。In some embodiments, the video conversion device 700 further includes: a peripheral device interface and at least one peripheral device. The processor 703 and the peripheral device interface can be connected through a bus or a signal line. Each peripheral device can be connected to the peripheral device interface through bus, signal line or circuit board. In some embodiments, the peripheral devices may include at least one of radio frequency circuits, touch screen displays, cameras, audio circuits, positioning components, power supplies, and the like.
在一些实施例中,视频转换设备700还可以包括有一个或多个传感器。该一个或多个传感器包括但不限于加速度传感器、陀螺仪传感器、压力传感器、指纹传感器、光学传感器以 及接近传感器。例如,处理器703可以从一个或多个传感器接收取向变化的指示,从而向用户推荐相应取向的视频。In some embodiments, the video conversion device 700 may also include one or more sensors. The one or more sensors include, but are not limited to, acceleration sensors, gyroscope sensors, pressure sensors, fingerprint sensors, optical sensors, and proximity sensors. For example, the processor 703 may receive an indication of an orientation change from one or more sensors, thereby recommending a video of the corresponding orientation to the user.
图8是根据本公开另一实施例的视频转换方法的流程图。FIG. 8 is a flowchart of a video conversion method according to another embodiment of the present disclosure.
参照图8,在步骤S801,获取第一取向的第一视频。例如,第一取向的第一视频可以是横向视频。Referring to FIG. 8, in step S801, a first video of a first orientation is acquired. For example, the first video in the first orientation may be a landscape video.
在步骤S802,对第一视频的每个帧的至少一种信息进行分析。In step S802, at least one kind of information of each frame of the first video is analyzed.
这里,每个帧的至少一种信息可以包括关键区域信息,例如,可以包括人脸信息、人体信息、主要物体信息、运动场景信息和视频边界信息等中的至少一种。其中,人脸信息可以包括人脸识别信息和人脸跟踪信息等,主要物体信息可以包括物体识别信息和物体跟踪信息等。然而,上述示例仅是示例性的,本公开可以分析一帧中的任意数量和种类的信息。Here, at least one type of information of each frame may include key area information, for example, may include at least one of face information, human body information, main object information, motion scene information, and video boundary information. The face information may include face recognition information and face tracking information, etc., and the main object information may include object identification information and object tracking information. However, the above examples are merely exemplary, and the present disclosure may analyze any amount and kind of information in a frame.
可以预先存储针对主要信息、关键信息或用户感兴趣的信息的分析算法来实现对帧内包含的信息进行分析。例如,可以利用人脸识别算法来分析一帧中的人脸信息,可以利用光流算法来分析一帧中的运动场景信息。然而,上述示例仅是示例性的,本公开不限于此。Analysis algorithms for main information, key information or information of interest to the user may be stored in advance to analyze the information contained in the frame. For example, a face recognition algorithm can be used to analyze the face information in a frame, and an optical flow algorithm can be used to analyze the motion scene information in a frame. However, the above-described examples are merely exemplary, and the present disclosure is not limited thereto.
在步骤S803,基于对至少一种信息的分析来生成相应帧的与至少一种信息对应的各个标注区域。这里,标注区域可以指表示信息分布的区域。针对一帧,该帧可能包括多种信息,每分析该帧中的一种信息,可以生成该帧对应的一个信息分布图,相应地,如果分析一帧中的多种信息可以生成多个标注区域。In step S803, each annotated area corresponding to the at least one type of information of the corresponding frame is generated based on the analysis of the at least one type of information. Here, the labeled area may refer to an area representing the distribution of information. For a frame, the frame may include a variety of information, each time one kind of information in the frame is analyzed, an information distribution map corresponding to the frame can be generated. Correspondingly, if a variety of information in a frame is analyzed, multiple annotations can be generated area.
作为示例,在分析一帧中的人脸信息时,可以生成一个相对应这一帧的人脸信息的基于像素的标注区域(mask),然后可以将基于像素的标注区域转化为信息分布的标注区域。As an example, when analyzing the face information in a frame, a pixel-based annotation area (mask) corresponding to the face information of this frame can be generated, and then the pixel-based annotation area can be converted into an annotation of information distribution area.
在步骤S804,基于分析结果生成用于调整各个标注区域在视频剪切时所占权重的用户界面并且显示用户界面。用户界面可以包括针对至少一种信息中的每种信息的用于调整权重的滑动条或文本输入框。In step S804, based on the analysis result, a user interface for adjusting the weight occupied by each marked region in the video cutting is generated and displayed. The user interface may include a slider bar or text input box for adjusting the weight for each of the at least one information.
在每分析完一帧内包含的信息,可以生成一个针对该帧的用户界面,该用户界面可以包括用于调整该帧中包含的信息的权重的用户接口。例如,用户界面可以包括用于调整每种信息的滑动条或文本输入框。然而,上述示例仅是示例性的,本公开不限于此。After analyzing the information contained in a frame, a user interface for that frame can be generated, and the user interface can include a user interface for adjusting the weight of the information contained in the frame. For example, the user interface may include slider bars or text entry boxes for adjusting each type of information. However, the above-described examples are merely exemplary, and the present disclosure is not limited thereto.
在步骤S805,通过用户界面来接收用于调整各个标注区域的权重的用户输入。可以对相应帧的各个标注区域赋予由用户输入的权重。用户可以根据自己的需求通过用户界面来设置想要保留的信息的权重。例如,如果用户想重点保护人脸部分不被剪切掉,用户可以提高人脸信息的标注区域的加权比例,并且降低其他信息的标注区域的加权比例。用户可以互动地调节加权参数。通过对各个标注区域进行加权,可以突出用户更加关注的信息/区域。In step S805, a user input for adjusting the weight of each marked region is received through the user interface. Weights input by the user may be assigned to each annotated region of the corresponding frame. Users can set the weight of the information they want to keep through the user interface according to their needs. For example, if the user wants to focus on protecting the face part from being cut off, the user can increase the weighting ratio of the marked area of the face information, and reduce the weighted ratio of the marked area of other information. The user can interactively adjust the weighting parameters. By weighting each labeled area, the information/area that the user pays more attention to can be highlighted.
这里,每种信息对应于一种信息标注区域,对每种信息加权可以解释为对信息标注区域的加权。Here, each kind of information corresponds to one kind of information labeling area, and weighting each kind of information can be interpreted as the weighting of the information labeling area.
通过针对每一帧设置用户界面,可以实现用户对一帧中的各个信息流在后续剪切转换操作中的权重。By setting the user interface for each frame, it is possible to realize the user's weight on each information flow in a frame in the subsequent cut and transform operation.
在步骤S806,对于第一视频的每个帧,根据权重被调整的各个标注区域来计算相应帧的整体标注区域。例如,可以对加权后的各个区域进行求和来获得一帧的整体标注区域。In step S806, for each frame of the first video, the overall labeling area of the corresponding frame is calculated according to each labeling area whose weight is adjusted. For example, the weighted regions can be summed to obtain the overall annotated region of a frame.
在步骤S807,基于整体标注区域来生成针对相应帧的标注图。这里,标注图可以是针对各个标注区域的信息分布图像。In step S807, an annotation map for the corresponding frame is generated based on the overall annotation area. Here, the annotation map may be an information distribution image for each annotation area.
在步骤S808,通过计算标注图的矩来获得相应帧的焦点。这里,焦点可以反应一帧中的重要信息的分布状况。例如,可以计算标注图的几何中心点作为一帧的焦点。In step S808, the focus of the corresponding frame is obtained by calculating the moment of the annotation map. Here, the focus can reflect the distribution of important information in a frame. For example, the geometric center point of the annotation map can be calculated as the focal point of a frame.
在步骤S809,基于焦点和指定宽高比来生成相应帧的剪切窗口。例如,在获得一帧的焦点后,将该焦点作为剪切窗口的中心,并且按照指定宽高比来设置剪切窗口的布局和尺寸。这里,可以将第二视频的宽高比作为指定宽高比,然而本公开不限于此。In step S809, a clipping window of the corresponding frame is generated based on the focus and the specified aspect ratio. For example, after obtaining the focus of a frame, the focus is set as the center of the clipping window, and the layout and size of the clipping window are set according to the specified aspect ratio. Here, the aspect ratio of the second video may be used as the specified aspect ratio, but the present disclosure is not limited thereto.
在一种可能的实现方式中,可以通过对每个帧的焦点进行拟合来获得相应帧的拟合后的焦点,并且基于拟合后的焦点和指定宽高比来生成相应帧的剪切窗口。通过根据当前场景的一些列帧的焦点,对当前场景剪切区域进行拟合来达到帧与帧之间的更加流畅的剪切效果。In one possible implementation, a fitted focus of the corresponding frame may be obtained by fitting the focus of each frame, and a crop of the corresponding frame may be generated based on the fitted focus and a specified aspect ratio window. By fitting the clipping region of the current scene according to the focus of some series of frames of the current scene, a smoother clipping effect between frames is achieved.
在步骤S810,获取用于将第一视频转换为第二取向的第二视频的剪切信息。例如,在按照步骤S802至S809获得每个帧的剪切窗口信息后,获取全部帧的剪切窗口信息,以用于后续对剪切窗口的进一步调整。In step S810, cut information for converting the first video into the second video in the second orientation is obtained. For example, after the clipping window information of each frame is obtained according to steps S802 to S809, the clipping window information of all frames is obtained, which is used for further adjustment of the clipping window subsequently.
在步骤S811,基于剪切信息生成并显示用于调整剪切信息的用户界面。在用户界面中,针对第一视频的一帧,在该帧上可以显示用于将该帧剪切为第二视频的对应帧的剪切窗口。例如,参照图3所示。In step S811, a user interface for adjusting the cut information is generated and displayed based on the cut information. In the user interface, for a frame of the first video, a cutting window for cutting the frame into a corresponding frame of the second video may be displayed on the frame. For example, refer to FIG. 3 .
在步骤S812,经由用户界面接收用于调整剪切信息的用户输。In step S812, a user input for adjusting the cut information is received via the user interface.
在步骤S813,可以根据调整后的剪切信息自适应地对第一视频的剪切窗口进行调整。例如,在用户对视频帧进行进一步调整之后,可以对被进一步调整的剪切窗口进行拟合处理,使得最终呈现的视频更加流畅。In step S813, the cutting window of the first video may be adaptively adjusted according to the adjusted cutting information. For example, after the user further adjusts the video frame, a fitting process may be performed on the further adjusted clipping window, so that the final presented video is smoother.
在步骤S814,利用自适应调整后的剪切窗口对第一视频进行剪切以获得进一步被调整后的第二视频。In step S814, the first video is cut using the adaptively adjusted cut window to obtain a further adjusted second video.
根据本公开的实施例能够提供给用户视频剪切处理前的参数调整和处理后的剪切区域调整的功能,使得用户在视频剪切处理前后对整个剪切处理流程有更加全面的把握,并最终得到他满意的剪切结果。The embodiments of the present disclosure can provide the user with the functions of parameter adjustment before video cutting processing and adjustment of the cutting area after processing, so that the user can have a more comprehensive grasp of the entire cutting processing flow before and after the video cutting processing, and In the end, he was satisfied with the cutting result.
图9是根据本公开实施例的视频转换装置的框图。FIG. 9 is a block diagram of a video conversion apparatus according to an embodiment of the present disclosure.
参照图9,视频转换装置900可以包括接口模块901、分析模块902、显示模块903以及编辑模块904。视频转换装置900中的每个模块可以由一个或多个模块来实现,并且对应模块的名称可根据模块的类型而变化。在各种实施例中,可以省略视频转换装置900中的一些模块,或者还可包括另外的模块。此外,根据本公开的各种实施例的模块/元件可以被组合以形成单个实体,并且因此可等效地执行相应模块/元件在组合之前的功能。9 , the video conversion apparatus 900 may include an interface module 901 , an analysis module 902 , a display module 903 and an editing module 904 . Each module in the video conversion apparatus 900 may be implemented by one or more modules, and the name of the corresponding module may vary according to the type of the module. In various embodiments, some modules in the video conversion apparatus 900 may be omitted, or additional modules may also be included. Furthermore, modules/elements according to various embodiments of the present disclosure may be combined to form a single entity, and thus may equivalently perform the functions of the corresponding modules/elements prior to combination.
接口模块901可以被配置为接收第一取向的第一视频以及用户输入。The interface module 901 may be configured to receive the first video in the first orientation and user input.
分析模块902可以被配置为对第一视频的每个帧进行分析以确定每个帧的至少一种信息,并且基于分析结果生成针对每个帧的用于调整至少一种信息在视频取向转换的情况下的权重的用户界面。The analysis module 902 may be configured to analyze each frame of the first video to determine at least one kind of information for each frame, and to generate, for each frame, a method for adjusting the at least one kind of information in the video orientation transition based on the analysis result. User interface for case weights.
在一种可能的实现方式中,至少一种信息可以包括关键区域信息。In one possible implementation, at least one type of information may include key area information.
在一种可能的实现方式中,关键区域信息可以包括人脸信息、人体信息、显要物体信息、运动场景信息和视频边界信息中的至少一种。In a possible implementation manner, the key area information may include at least one of face information, human body information, significant object information, motion scene information, and video boundary information.
显示模块903可以被配置为显示用于调整至少一种信息的权重的用户界面。The display module 903 may be configured to display a user interface for adjusting the weight of at least one kind of information.
在一种可能的实现方式中,用户界面可以包括针对至少一种信息中的每种信息的用于调整权重的用户接口。In one possible implementation, the user interface may include a user interface for adjusting the weight for each of the at least one information.
编辑模块904可以被配置为基于权重被调整的至少一种信息来生成剪切窗口信息以对第一视频进行剪切,并且基于剪切后的第一视频生成第二取向的第二视频。The editing module 904 may be configured to generate cut window information to cut the first video based on at least one type of information whose weights are adjusted, and to generate a second video in a second orientation based on the cut first video.
在一种可能的实现方式中,分析模块902可以基于对至少一种信息的分析来生成相应帧的与至少一种信息对应的各个标注区域,标注区域是表示信息分布的区域,其中,相应帧的各个标注区域被赋予由用户输入的权重。In a possible implementation manner, the analysis module 902 may generate, based on the analysis of the at least one type of information, each marked area of the corresponding frame corresponding to the at least one type of information, where the marked area is an area representing the distribution of information, wherein the corresponding frame Each annotated region of is given a weight entered by the user.
在一种可能的实现方式中,对于第一视频的每个帧,编辑模块904可以根据权重被调整的各个标注区域来计算相应帧的整体标注区域;基于整体标注区域来计算相应帧的焦点,基于焦点和指定宽高比来生成相应帧的剪切窗口。In a possible implementation manner, for each frame of the first video, the editing module 904 may calculate the overall labeling area of the corresponding frame according to each labeling area whose weight is adjusted; calculate the focus of the corresponding frame based on the overall labeling area, Generates a clipping window for the corresponding frame based on the focus and the specified aspect ratio.
在一种可能的实现方式中,编辑模块904可以通过对每个帧的焦点进行拟合来获得相应帧的拟合后的焦点,并且基于拟合后的焦点和指定宽高比来生成相应帧的剪切窗口。In one possible implementation, the editing module 904 may obtain the fitted focus of the corresponding frame by fitting the focus of each frame, and generate the corresponding frame based on the fitted focus and the specified aspect ratio clipping window.
在一种可能的实现方式中,编辑模块904可以基于整体标注区域来生成针对相应帧的标注图,并通过计算标注图的矩来获得相应帧的焦点。In a possible implementation manner, the editing module 904 may generate an annotation map for the corresponding frame based on the overall annotation area, and obtain the focus of the corresponding frame by calculating the moment of the annotation map.
此外,视频转换装置900可以向用户提供视频剪切处理后的剪切区域调整的功能,让用户得到他们满意的剪切结果。In addition, the video conversion apparatus 900 can provide the user with the function of adjusting the cut area after the video cutting process, so that the user can obtain a cutting result that they are satisfied with.
分析模块902可以获取用于将第一视频转换为第二取向的第二视频的剪切信息,并且基于剪切信息生成并显示用于调整剪切信息的用户界面。用于调整剪切信息的用户输入可以经由该用户界面被接收。The analysis module 902 may obtain cut information for converting the first video into the second video in the second orientation, and generate and display a user interface for adjusting the cut information based on the cut information. User input for adjusting clipping information may be received via the user interface.
在一种可能的实现方式中,剪切信息可以包括用于将第一视频剪切为第二视频的剪切窗口。In a possible implementation manner, the cut information may include a cut window for cutting the first video into the second video.
在一种可能的实现方式中,针对第一视频的一帧,分析模块902可以使在该帧上显示用于将该帧剪切为第二视频的对应帧的剪切窗口。In a possible implementation manner, for a frame of the first video, the analysis module 902 may display a cutout window on the frame for cutting the frame into a corresponding frame of the second video.
作为示例,可以在对每一帧的各个信息的权重进行调整之后,根据调整后的各个信息来剪切视频,之后,可以再次将之前剪切处理后的剪切信息呈现给用户,使得用户可以对剪切后的视频再次进行剪切窗口的调整。在一些实施例中,可以在对每一帧的各个信息的权重进行调整之后,此时并不对视频进行剪切,而是将由根据调整后的各个信息产生的剪切信息通过用户界面呈现给用户,用户可以从整体来调整剪切窗口,然后使用最终调整好的剪切窗口进行剪切处理。As an example, after adjusting the weight of each piece of information in each frame, the video can be cut according to the adjusted pieces of information, and then, the cut information that has been cut before can be presented to the user again, so that the user can Adjust the cut window again for the cut video. In some embodiments, after adjusting the weight of each information of each frame, the video is not cut at this time, but the cut information generated according to the adjusted information is presented to the user through the user interface , the user can adjust the clipping window as a whole, and then use the final adjusted clipping window for clipping processing.
在一种可能的实现方式中,分析模块902可以确定第一视频的至少一个关键帧,并且生成并显示用于调整所述至少一个关键帧中的每个关键帧的剪切信息的用户界面。In one possible implementation, the analysis module 902 may determine at least one key frame of the first video, and generate and display a user interface for adjusting cut information of each of the at least one key frame.
在一种可能的实现方式中,编辑模块904可以根据调整后的剪切信息自适应地对第一视频的剪切窗口进行调整,并且利用自适应调整后的剪切窗口对第一视频进行剪切以获得第二视频。In a possible implementation manner, the editing module 904 may adaptively adjust the cutting window of the first video according to the adjusted cutting information, and use the adaptively adjusted cutting window to cut the first video Cut to get the second video.
本实施例的视频转换装置,通过采用上述模块实现视频转换的实现原理以及技术效果与上述相关方法实施例相同,详细可以参考上述相关方法实施例的记载,在此不再赘述。In the video conversion apparatus of this embodiment, the implementation principle and technical effect of video conversion by using the above-mentioned modules are the same as those of the above-mentioned related method embodiments.
根据本公开的实施例,可以提供一种电子设备。图10是根据本公开实施例的电子设备的框图,该电子设备1000包括至少一个存储器1002和至少一个处理器1001,所述至少一个存储器1002中存储有计算机可以执行指令集合,当计算机可以执行指令集合被至少一个处理器1001执行时,执行根据本公开实施例的视频转换方法。According to an embodiment of the present disclosure, an electronic device can be provided. 10 is a block diagram of an electronic device according to an embodiment of the present disclosure, the electronic device 1000 includes at least one memory 1002 and at least one processor 1001, the at least one memory 1002 stores a set of computer-executable instructions, when the computer can execute the instructions When the collection is executed by at least one processor 1001, the video conversion method according to the embodiment of the present disclosure is executed.
作为示例,电子设备1000可以是PC计算机、平板装置、个人数字助理、智能手机、或其他能够执行上述指令集合的装置。这里,电子设备1000并非必须是单个的电子设备,还可以是任何能够单独或联合执行上述指令(或指令集)的装置或电路的集合体。电子设备1000还可以是集成控制系统或系统管理器的一部分,或者可以被配置为与本地或远程(例如,经由无线传输)以接口互联的便携式电子设备。As an example, the electronic device 1000 may be a PC computer, a tablet device, a personal digital assistant, a smart phone, or any other device capable of executing the above set of instructions. Here, the electronic device 1000 is not necessarily a single electronic device, but can also be a collection of any device or circuit capable of executing the above-mentioned instructions (or instruction sets) individually or jointly. Electronic device 1000 may also be part of an integrated control system or system manager, or may be configured as a portable electronic device that interfaces locally or remotely (eg, via wireless transmission).
在电子设备1000中,处理器1001可以包括中央处理器(CPU)、图形处理器(GPU)、可以编程逻辑装置、专用处理器系统、微控制器或微处理器。作为示例而非限制,处理器1001还可以包括模拟处理器、数字处理器、微处理器、多核处理器、处理器阵列、网络处理器等。In the electronic device 1000, the processor 1001 may include a central processing unit (CPU), a graphics processing unit (GPU), a programmable logic device, a special purpose processor system, a microcontroller or a microprocessor. By way of example and not limitation, processor 1001 may also include analog processors, digital processors, microprocessors, multi-core processors, processor arrays, network processors, and the like.
处理器1001可以运行存储在存储器中的指令或代码,其中,存储器还可以存储数据。指令和数据还可以经由网络接口装置而通过网络被发送和接收,其中,网络接口装置可以采用任何已知的传输协议。The processor 1001 may execute instructions or code stored in memory, which may also store data. Instructions and data may also be sent and received over a network via a network interface device, which may employ any known transport protocol.
存储器1002可以与处理器集成为一体,例如,将RAM或闪存布置在集成电路微处理器等之内。此外,存储器可以包括独立的装置,诸如,外部盘驱动、存储阵列或任何数据库系统可以使用的其他存储装置。存储器和处理器可以在操作上进行耦合,或者可以例如通过I/O端口、网络连接等互相通信,使得处理器能够读取存储在存储器中的文件。The memory 1002 may be integrated with the processor, eg, RAM or flash memory arranged within an integrated circuit microprocessor or the like. In addition, the memory may comprise a separate device such as an external disk drive, a storage array, or any other storage device that may be used by a database system. The memory and the processor may be operatively coupled, or may communicate with each other, eg, through I/O ports, network connections, etc., to enable the processor to read files stored in the memory.
此外,电子设备1000还可以包括视频显示器(诸如,液晶显示器)和用户交互接口(诸如,键盘、鼠标、触摸输入装置等)。电子设备1000的所有组件可以经由总线和/或网络而彼此连接。In addition, the electronic device 1000 may also include a video display (such as a liquid crystal display) and a user interaction interface (such as a keyboard, mouse, touch input device, etc.). All components of the electronic device 1000 may be connected to each other via a bus and/or a network.
根据本公开的实施例,还可以提供一种存储指令的计算机可以读存储介质,其中,当指令被至少一个处理器运行时,促使至少一个处理器执行根据本公开的视频转换方法。这里的计算机可以读存储介质的示例包括:只读存储器(ROM)、随机存取可以编程只读存储器(PROM)、电可以擦除可以编程只读存储器(EEPROM)、随机存取存储器(RAM)、动态随机存取存储器(DRAM)、静态随机存取存储器(SRAM)、闪存、非易失性存储器、CD-ROM、CD-R、CD+R、CD-RW、CD+RW、DVD-ROM、DVD-R、DVD+R、DVD-RW、DVD+RW、DVD-RAM、BD-ROM、BD-R、BD-R LTH、BD-RE、蓝光或光盘存储器、硬盘驱动器(HDD)、固态硬盘(SSD)、卡式存储器(诸如,多媒体卡、安全数字(SD)卡或极速数字(XD)卡)、磁带、软盘、磁光 数据存储装置、光学数据存储装置、硬盘、固态盘以及任何其他装置,所述任何其他装置被配置为以非暂时性方式存储计算机程序以及任何相关联的数据、数据文件和数据结构并将所述计算机程序以及任何相关联的数据、数据文件和数据结构提供给处理器或计算机使得处理器或计算机能执行所述计算机程序。上述计算机可以读存储介质中的计算机程序可以在诸如客户端、主机、代理装置、服务器等计算机设备中部署的环境中运行,此外,在一个示例中,计算机程序以及任何相关联的数据、数据文件和数据结构分布在联网的计算机系统上,使得计算机程序以及任何相关联的数据、数据文件和数据结构通过一个或多个处理器或计算机以分布式方式存储、访问和执行。According to an embodiment of the present disclosure, there may also be provided a computer-readable storage medium storing instructions, wherein the instructions, when executed by at least one processor, cause the at least one processor to perform the video conversion method according to the present disclosure. Examples of computer-readable storage media herein include: Read Only Memory (ROM), Random Access Programmable Read Only Memory (PROM), Electrically Erasable Programmable Read Only Memory (EEPROM), Random Access Memory (RAM) , dynamic random access memory (DRAM), static random access memory (SRAM), flash memory, non-volatile memory, CD-ROM, CD-R, CD+R, CD-RW, CD+RW, DVD-ROM , DVD-R, DVD+R, DVD-RW, DVD+RW, DVD-RAM, BD-ROM, BD-R, BD-R LTH, BD-RE, Blu-ray or Optical Disc Storage, Hard Disk Drive (HDD), Solid State Hard disk (SSD), card memory (such as a multimedia card, Secure Digital (SD) card, or Extreme Digital (XD) card), magnetic tape, floppy disk, magneto-optical data storage device, optical data storage device, hard disk, solid state disk, and any other apparatuses configured to store, in a non-transitory manner, a computer program and any associated data, data files and data structures and to provide said computer program and any associated data, data files and data structures The computer program is given to a processor or computer so that the processor or computer can execute the computer program. The computer program in the above-mentioned computer-readable storage medium can run in an environment deployed in a computer device such as a client, a host, an agent device, a server, etc. In addition, in one example, the computer program and any associated data, data files and data structures are distributed over networked computer systems so that the computer programs and any associated data, data files and data structures are stored, accessed and executed in a distributed fashion by one or more processors or computers.
根据本公开的实施例中,还可以提供一种计算机程序产品,该计算机程序产品中的指令可以由计算机设备的处理器执行以完成上述视频转换方法。According to an embodiment of the present disclosure, a computer program product can also be provided, and instructions in the computer program product can be executed by a processor of a computer device to complete the above-mentioned video conversion method.
本公开所有实施例均可以单独被执行,也可以与其他实施例相结合被执行,均视为本公开要求的保护范围。All the embodiments of the present disclosure can be implemented independently or in combination with other embodiments, which are all regarded as the protection scope required by the present disclosure.
本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本公开的其它实施方案。本申请旨在涵盖本公开的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本公开的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本公开的真正范围和精神由下面的权利要求指出。Other embodiments of the present disclosure will readily occur to those skilled in the art upon consideration of the specification and practice of the invention disclosed herein. This application is intended to cover any variations, uses, or adaptations of the present disclosure that follow the general principles of the present disclosure and include common knowledge or techniques in the technical field not disclosed by the present disclosure . The specification and examples are to be regarded as exemplary only, with the true scope and spirit of the disclosure being indicated by the following claims.
应当理解的是,本公开并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本公开的范围仅由所附的权利要求来限制。It is to be understood that the present disclosure is not limited to the precise structures described above and illustrated in the accompanying drawings, and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims (23)

  1. 一种视频转换方法,其中,所述视频转换方法包括:A video conversion method, wherein the video conversion method comprises:
    获取第一取向的第一视频以及用于将第一视频转换为第二取向的第二视频的剪切信息;obtaining a first video in a first orientation and clipping information for converting the first video into a second video in a second orientation;
    基于剪切信息生成并显示用于调整剪切信息的用户界面;Generate and display a user interface for adjusting the clipping information based on the clipping information;
    经由用户界面接收用于调整剪切信息的用户输入;以及receiving, via the user interface, user input for adjusting the clipping information; and
    根据调整后的剪切信息来生成第二视频。A second video is generated according to the adjusted cut information.
  2. 根据权利要求1所述的视频转换方法,其中,剪切信息包括用于将第一视频剪切为第二视频的剪切窗口。The video conversion method of claim 1, wherein the cut information includes a cut window for cutting the first video into the second video.
  3. 根据权利要求1所述的视频转换方法,其中,基于剪切信息生成并显示用于调整剪切信息的用户界面的步骤包括:The video conversion method according to claim 1, wherein the step of generating and displaying a user interface for adjusting the cut information based on the cut information comprises:
    针对第一视频的一帧,在该帧上显示用于将该帧剪切为第二视频的对应帧的剪切窗口。For a frame of the first video, a cutout window for cutting the frame into a corresponding frame of the second video is displayed on the frame.
  4. 根据权利要求1所述的视频转换方法,其中,所述视频转换方法包括:The video conversion method according to claim 1, wherein the video conversion method comprises:
    确定第一视频的至少一个关键帧,determining at least one key frame of the first video,
    其中,生成并显示用于调整剪切信息的用户界面的步骤包括:Wherein, the step of generating and displaying a user interface for adjusting the clipping information includes:
    生成并显示用于调整所述至少一个关键帧中的每个关键帧的剪切信息的用户界面。A user interface for adjusting cut information for each of the at least one keyframe is generated and displayed.
  5. 根据权利要求1所述的视频转换方法,其中,根据调整后的剪切信息来生成第二视频的步骤包括:The video conversion method according to claim 1, wherein the step of generating the second video according to the adjusted cut information comprises:
    根据调整后的剪切信息自适应地对第一视频的剪切窗口进行调整;adaptively adjust the cut window of the first video according to the adjusted cut information;
    利用自适应调整后的剪切窗口对第一视频进行剪切以获得第二视频。Cut the first video by using the adaptively adjusted cut window to obtain the second video.
  6. 根据权利要求1所述的视频转换方法,其中,获取第一视频转换为第二取向的第二视频的剪切信息的步骤包括:The video conversion method according to claim 1, wherein the step of acquiring cut information for converting the first video into the second video in the second orientation comprises:
    对第一视频的每个帧进行分析以确定每个帧的至少一种信息;analyzing each frame of the first video to determine at least one information for each frame;
    基于分析结果生成并显示针对每个帧的用于调整所述至少一种信息在视频取向转换的情况下的权重的另一用户界面;generating and displaying, for each frame, another user interface for adjusting the weight of the at least one information in the case of a video orientation transition based on the analysis results;
    通过所述另一用户界面来接收用于调整所述至少一种信息的权重的用户输入;receiving, through the another user interface, user input for adjusting the weight of the at least one information;
    基于权重被调整的所述至少一种信息来生成剪切信息。Clipping information is generated based on the at least one kind of information whose weights are adjusted.
  7. 根据权利要求6所述的视频转换方法,其中,基于权重被调整的所述至少一种信息来生成剪切信息的步骤包括:The video conversion method according to claim 6, wherein the step of generating cut information based on the at least one type of information whose weights are adjusted comprises:
    基于权重被调整的所述至少一种信息来生成相应帧的标注图;generating an annotation map of the corresponding frame based on the at least one kind of information whose weights are adjusted;
    通过计算标注图的矩来获得相应帧的焦点;Obtain the focus of the corresponding frame by calculating the moment of the annotation map;
    根据所述焦点以及指定宽高比来生成剪切窗口。A clipping window is generated based on the focus and the specified aspect ratio.
  8. 一种视频转换装置,其中,所述视频转换装置包括:A video conversion device, wherein the video conversion device comprises:
    接口模块,被配置为接收第一取向的第一视频;an interface module configured to receive a first video in a first orientation;
    分析模块,被配置为获取用于将第一视频转换为第二取向的第二视频的剪切信息,并且基于剪切信息生成并显示用于调整剪切信息的用户界面;an analysis module configured to obtain cut information for converting the first video into the second video in the second orientation, and to generate and display a user interface for adjusting the cut information based on the cut information;
    显示模块,被配置为显示所述用户界面,其中,用于调整剪切信息的用户输入经由所述用户界面被接收;a display module configured to display the user interface, wherein user input for adjusting clipping information is received via the user interface;
    编辑模块,被配置为根据调整后的剪切信息来生成第二视频。The editing module is configured to generate the second video according to the adjusted cut information.
  9. 根据权利要求8所述的视频转换装置,其中,剪切信息包括用于将第一视频剪切为第二视频的剪切窗口。The video conversion apparatus of claim 8, wherein the cut information includes a cut window for cutting the first video into the second video.
  10. 根据权利要求8所述的视频转换装置,其中,分析模块被配置为针对第一视频的一帧,在该帧上设置用于将该帧剪切为第二视频的对应帧的剪切窗口。9. The video conversion apparatus of claim 8, wherein the analysis module is configured for a frame of the first video, and a cutting window for cutting the frame into a corresponding frame of the second video is set on the frame.
  11. 根据权利要求8所述的视频转换装置,其中,分析模块,被配置为:The video conversion device of claim 8, wherein the analysis module is configured to:
    确定第一视频的至少一个关键帧,determining at least one key frame of the first video,
    生成并显示用于调整所述至少一个关键帧中的每个关键帧的剪切信息的用户界面。A user interface for adjusting cut information for each of the at least one keyframe is generated and displayed.
  12. 根据权利要求8所述的视频转换装置,其中,剪辑模块被配置为:The video conversion device according to claim 8, wherein the editing module is configured to:
    根据调整后的剪切信息自适应地对第一视频的剪切窗口进行调整;adaptively adjust the cut window of the first video according to the adjusted cut information;
    利用自适应调整后的剪切窗口对第一视频进行剪切以获得第二视频。Cut the first video by using the adaptively adjusted cut window to obtain the second video.
  13. 根据权利要求8所述的视频转换装置,其中,分析模块被配置为:The video conversion apparatus of claim 8, wherein the analysis module is configured to:
    对第一视频的每个帧进行分析以确定每个帧的至少一种信息;analyzing each frame of the first video to determine at least one information for each frame;
    基于分析结果生成并显示针对每个帧的用于调整所述至少一种信息在视频取向转换的情况下的权重的另一用户界面;generating and displaying, for each frame, another user interface for adjusting the weight of the at least one information in the case of a video orientation transition based on the analysis results;
    通过所述另一用户界面来接收用于调整所述至少一种信息的权重的用户输入;receiving, through the another user interface, user input for adjusting the weight of the at least one information;
    基于权重被调整的所述至少一种信息来生成剪切信息。Clipping information is generated based on the at least one kind of information whose weights are adjusted.
  14. 根据权利要求13所述的视频转换装置,其中,分析模块被配置为:The video conversion apparatus of claim 13, wherein the analysis module is configured to:
    基于权重被调整的所述至少一种信息来生成相应帧的标注图;generating an annotation map of the corresponding frame based on the at least one kind of information whose weights are adjusted;
    通过计算标注图的矩来获得相应帧的焦点;Obtain the focus of the corresponding frame by calculating the moment of the annotation map;
    根据所述焦点以及指定宽高比来生成剪切窗口。A clipping window is generated based on the focus and the specified aspect ratio.
  15. 一种视频转换设备,其中,所述视频转换设备包括:A video conversion device, wherein the video conversion device comprises:
    显示器;monitor;
    收发器,用于接收第一取向的第一视频;以及a transceiver for receiving a first video in a first orientation; and
    处理器,用于:processor for:
    获取用于将第一视频转换为第二取向的第二视频的剪切信息;obtaining cut information for converting the first video into a second video in a second orientation;
    基于剪切信息生成并显示用于调整剪切信息的用户界面;Generate and display a user interface for adjusting the clipping information based on the clipping information;
    控制显示器显示所述用户界面;controlling the display to display the user interface;
    控制收发器经由用户界面接收用于调整剪切信息的用户输入;以及The control transceiver receives, via the user interface, user input for adjusting the clipping information; and
    根据调整后的剪切信息来生成第二视频。A second video is generated according to the adjusted cut information.
  16. 根据权利要求15所述的视频转换设备,其中,剪切信息包括用于将第一视频剪切为第二视频的剪切窗口。16. The video conversion apparatus of claim 15, wherein the cut information includes a cut window for cutting the first video into the second video.
  17. 根据权利要求15所述的视频转换设备,其中,处理器用于:The video conversion apparatus of claim 15, wherein the processor is configured to:
    针对第一视频的一帧,在该帧上设置用于将该帧剪切为第二视频的对应帧的剪切窗口。For a frame of the first video, a cutting window for cutting the frame into a corresponding frame of the second video is set on the frame.
  18. 根据权利要求15所述的视频转换设备,其中,处理器用于:The video conversion apparatus of claim 15, wherein the processor is configured to:
    确定第一视频的至少一个关键帧,determining at least one key frame of the first video,
    生成并显示用于调整所述至少一个关键帧中的每个关键帧的剪切信息的用户界面。A user interface for adjusting cut information for each of the at least one keyframe is generated and displayed.
  19. 根据权利要求15所述的视频转换设备,其中,处理器用于:The video conversion apparatus of claim 15, wherein the processor is configured to:
    根据调整后的剪切信息自适应地对第一视频的剪切窗口进行调整;adaptively adjust the cut window of the first video according to the adjusted cut information;
    利用自适应调整后的剪切窗口对第一视频进行剪切以获得第二视频。Cut the first video by using the adaptively adjusted cut window to obtain the second video.
  20. 根据权利要求15所述的视频转换设备,其中,处理器用于:The video conversion apparatus of claim 15, wherein the processor is configured to:
    对第一视频的每个帧进行分析以确定每个帧的至少一种信息;analyzing each frame of the first video to determine at least one information for each frame;
    基于分析结果生成并显示针对每个帧的用于调整所述至少一种信息在视频取向转换的情况下的权重的另一用户界面;generating and displaying, for each frame, another user interface for adjusting the weight of the at least one information in the case of a video orientation transition based on the analysis results;
    通过所述另一用户界面来接收用于调整所述至少一种信息的权重的用户输入;receiving, through the another user interface, user input for adjusting the weight of the at least one information;
    基于权重被调整的所述至少一种信息来生成剪切信息。Clipping information is generated based on the at least one kind of information whose weights are adjusted.
  21. 根据权利要求20所述的视频转换设备,其中,处理器用于:The video conversion device of claim 20, wherein the processor is configured to:
    基于权重被调整的所述至少一种信息来生成相应帧的标注图;generating an annotation map of the corresponding frame based on the at least one kind of information whose weights are adjusted;
    通过计算标注图的矩来获得相应帧的焦点;Obtain the focus of the corresponding frame by calculating the moment of the annotation map;
    根据所述焦点以及指定宽高比来生成剪切窗口。A clipping window is generated based on the focus and the specified aspect ratio.
  22. 一种电子设备,其中,包括:An electronic device comprising:
    处理器;processor;
    存储器,用于存储由所述处理器执行的指令,memory for storing instructions to be executed by the processor,
    其中,所述指令的执行促使所述处理器执行以下步骤:wherein execution of the instructions causes the processor to perform the following steps:
    获取第一取向的第一视频以及用于将第一视频转换为第二取向的第二视频的剪切信息;obtaining a first video in a first orientation and clipping information for converting the first video into a second video in a second orientation;
    基于剪切信息生成并显示用于调整剪切信息的用户界面;Generate and display a user interface for adjusting the clipping information based on the clipping information;
    经由用户界面接收用于调整剪切信息的用户输入;以及receiving, via the user interface, user input for adjusting the clipping information; and
    根据调整后的剪切信息来生成第二视频。A second video is generated according to the adjusted cut information.
  23. 一种非易失性计算机可读存储介质,其上存储由处理器执行的指令,其中,所述指令的执行促使所述处理器执行以下步骤:A non-volatile computer-readable storage medium having stored thereon instructions for execution by a processor, wherein execution of the instructions causes the processor to perform the following steps:
    获取第一取向的第一视频以及用于将第一视频转换为第二取向的第二视频的剪切信息;obtaining a first video in a first orientation and clipping information for converting the first video into a second video in a second orientation;
    基于剪切信息生成并显示用于调整剪切信息的用户界面;Generate and display a user interface for adjusting the clipping information based on the clipping information;
    经由用户界面接收用于调整剪切信息的用户输入;以及receiving, via the user interface, user input for adjusting the clipping information; and
    根据调整后的剪切信息来生成第二视频。A second video is generated according to the adjusted cut information.
PCT/CN2021/106338 2020-10-12 2021-07-14 Video conversion method and video conversion apparatus WO2022077977A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011086867.5 2020-10-12
CN202011086867.5A CN112165635A (en) 2020-10-12 2020-10-12 Video conversion method, device, system and storage medium

Publications (1)

Publication Number Publication Date
WO2022077977A1 true WO2022077977A1 (en) 2022-04-21

Family

ID=73868175

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/106338 WO2022077977A1 (en) 2020-10-12 2021-07-14 Video conversion method and video conversion apparatus

Country Status (2)

Country Link
CN (1) CN112165635A (en)
WO (1) WO2022077977A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112165635A (en) * 2020-10-12 2021-01-01 北京达佳互联信息技术有限公司 Video conversion method, device, system and storage medium
CN112218160A (en) * 2020-10-12 2021-01-12 北京达佳互联信息技术有限公司 Video conversion method and device, video conversion equipment and storage medium
CN115291779A (en) * 2021-04-19 2022-11-04 华为技术有限公司 Window control method and device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120291083A1 (en) * 2011-05-12 2012-11-15 Cable Television Laboratories, Inc. Media Files Delivery System And Method
CN106454407A (en) * 2016-10-25 2017-02-22 广州华多网络科技有限公司 Video live broadcast method and device
CN110298380A (en) * 2019-05-22 2019-10-01 北京达佳互联信息技术有限公司 Image processing method, device and electronic equipment
CN112165635A (en) * 2020-10-12 2021-01-01 北京达佳互联信息技术有限公司 Video conversion method, device, system and storage medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104020928A (en) * 2014-06-09 2014-09-03 联想(北京)有限公司 Image display method and device
US10506198B2 (en) * 2015-12-04 2019-12-10 Livestream LLC Video stream encoding system with live crop editing and recording
CN105898566A (en) * 2016-04-29 2016-08-24 乐视控股(北京)有限公司 Video content presenting switching method and device, and mobile play terminal
CN107197372B (en) * 2017-06-30 2019-12-27 北京金山安全软件有限公司 Method and device for shearing batch vertical screen videos and electronic equipment
CN109089157B (en) * 2018-06-15 2021-12-07 广州华多网络科技有限公司 Video picture cutting method, display device and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120291083A1 (en) * 2011-05-12 2012-11-15 Cable Television Laboratories, Inc. Media Files Delivery System And Method
CN106454407A (en) * 2016-10-25 2017-02-22 广州华多网络科技有限公司 Video live broadcast method and device
CN110298380A (en) * 2019-05-22 2019-10-01 北京达佳互联信息技术有限公司 Image processing method, device and electronic equipment
CN112165635A (en) * 2020-10-12 2021-01-01 北京达佳互联信息技术有限公司 Video conversion method, device, system and storage medium

Also Published As

Publication number Publication date
CN112165635A (en) 2021-01-01

Similar Documents

Publication Publication Date Title
WO2022077977A1 (en) Video conversion method and video conversion apparatus
US11422671B2 (en) Defining, displaying and interacting with tags in a three-dimensional model
US10762382B2 (en) Image recognition based on augmented reality
US20170285922A1 (en) Systems and methods for creation and sharing of selectively animated digital photos
US20190340817A1 (en) Learning opportunity based display generation and presentation
US10289390B2 (en) Interactive multimodal display platform
US11321946B2 (en) Content entity recognition within digital video data for dynamic content generation
US11715223B2 (en) Active image depth prediction
JP2018538736A (en) Dynamic color determination for video player user interface components
MX2010012826A (en) 3d content aggregation built into devices.
WO2022077995A1 (en) Video conversion method and video conversion device
US11468786B2 (en) Generating tool-based smart-tutorials
US20190155465A1 (en) Augmented media
US20160275723A1 (en) System and method for generating three dimensional representation using contextual information
US11003467B2 (en) Visual history for content state changes
US20230169625A1 (en) Computing device displaying image conversion possibility information
CN112399265B (en) Method and system for adding content to image based on negative space recognition
US11790653B2 (en) Computer-generated reality recorder
US11288311B2 (en) Interactive image cloud visualization of videos
US20240062490A1 (en) System and method for contextualized selection of objects for placement in mixed reality
US11012662B1 (en) Multimedia content adjustment using comparable content

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21879025

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 03/08/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21879025

Country of ref document: EP

Kind code of ref document: A1