WO2023284469A1 - Video capture information acquisition method, and video capture and processing instruction method - Google Patents

Video capture information acquisition method, and video capture and processing instruction method

Info

Publication number
WO2023284469A1
Authority
WO
WIPO (PCT)
Prior art keywords
field
shooting
mirror
information
video
Prior art date
Application number
PCT/CN2022/098711
Other languages
French (fr)
Chinese (zh)
Inventor
申子宜
Original Assignee
上海幻电信息科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海幻电信息科技有限公司
Publication of WO2023284469A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/44008 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00 Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02 Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031 Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/85 Assembly of content; Generation of multimedia applications
    • H04N21/854 Content authoring
    • H04N21/8549 Creating video summaries, e.g. movie trailer

Definitions

  • the embodiments of the present application relate to the field of computer technology, and in particular, to a video shooting information acquisition method, system, computer equipment, and computer-readable storage medium, and to a video shooting and processing instruction method.
  • The purpose of the embodiments of the present application is to provide a video shooting information acquisition method, system, computer equipment and computer-readable storage medium, as well as a video shooting and processing instruction method, to solve the following problem: users who have not learned professional shooting techniques find it difficult to apply shooting methods correctly, so they cannot shoot satisfactory videos and work inefficiently.
  • An aspect of the embodiments of the present application provides a method for acquiring video shooting information, the method including:
  • the plurality of shooting information is marked on the time axis, and each shooting information is respectively distributed at a corresponding position of the time axis.
  • the multiple shooting information includes multiple shooting parameters
  • the target video is analyzed to obtain multiple shooting information, including:
  • The field information of each field includes the shooting parameters of each mirror in the field, and the position distribution of the shooting parameters of each mirror in the field on the time axis corresponds to the position distribution of each mirror within the field.
  • the multiple shooting information includes multiple shooting parameters
  • the target video is analyzed to obtain multiple shooting information, including:
  • The field information of each field includes the theme of the field and the shooting parameters of each mirror in the field, and the position distribution of the shooting parameters of each mirror in the field on the time axis corresponds to the position distribution of each mirror within the field.
  • the shooting parameters of each mirror include one or more of the following: scene classification, shooting angle, character information, mirror type, mirror movement operation, and scene.
  • The method also includes: generating a plurality of storyboard scripts for each field according to the shooting parameters of each mirror in each field;
  • Each mirror in each field corresponds to one or more storyboard scripts, and the position distribution of the storyboard scripts of each field on the time axis corresponds to the position distribution of the mirrors within the field.
  • An aspect of the embodiments of the present application further provides a system for obtaining video shooting information, the system comprising:
  • a determination module is used to determine a target video for analysis, and the target video corresponds to a time axis representing video progress;
  • An analysis module configured to analyze the target video to obtain a plurality of shooting information
  • a labeling module configured to label the plurality of shooting information on the time axis, and each shooting information is respectively distributed at a corresponding position on the time axis.
  • An aspect of the embodiments of the present application further provides a computer device, the computer device including a memory, a processor, and computer-readable instructions stored in the memory and operable on the processor, wherein the following steps are implemented when the computer-readable instructions are executed by the processor:
  • the plurality of shooting information is marked on the time axis, and each shooting information is respectively distributed at a corresponding position of the time axis.
  • An aspect of the embodiments of the present application further provides a computer-readable storage medium, where computer-readable instructions are stored in the computer-readable storage medium, and the computer-readable instructions can be executed by at least one processor, so that the at least one processor performs the following steps:
  • the plurality of shooting information is marked on the time axis, and each shooting information is respectively distributed at a corresponding position of the time axis.
  • An aspect of the embodiments of the present application further provides a video shooting and processing instruction method, the method including:
  • the target field information includes a plurality of shooting information marked on the same time axis, and the position of each shooting information on the time axis represents the time sequence of each shooting information;
  • the target field information is field information of a target field in multiple fields; the method further includes pre-acquiring the target field information:
  • The target field information includes the shooting parameters of each mirror, and the position distribution of the shooting parameters of each mirror on the time axis corresponds to the position distribution of each mirror in the target field.
  • the target field information is field information of a target field in multiple fields; the method further includes pre-acquiring the target field information:
  • The target field information includes the theme and the shooting parameters of each mirror, and the position distribution of the shooting parameters of each mirror on the time axis corresponds to the position distribution of each mirror in the target field.
  • An aspect of the embodiment of the present application provides a video shooting and processing instruction system, the system includes:
  • the receiving module is used to receive the request information of the client
  • An acquisition module configured to acquire target field information according to the request information, where the target field information includes a plurality of shooting information marked on the same time axis, and the position of each shooting information on the time axis indicates the chronological order of that shooting information;
  • a return module configured to return the target field information to the client, to instruct the client to perform video shooting or video processing.
  • An aspect of the embodiments of the present application provides a computer device, the computer device including a memory, a processor, and a computer program stored in the memory and operable on the processor, wherein the processor implements the steps of the above-mentioned video shooting and processing instruction method when executing the computer program.
  • An aspect of the embodiments of the present application further provides a computer-readable storage medium, where a computer program is stored in the computer-readable storage medium, and the computer program can be executed by at least one processor, so that the at least one processor executes the steps of the above-mentioned video shooting and processing instruction method.
  • The video shooting information acquisition method, system, computer equipment, and computer-readable storage medium deconstruct the shooting- and editing-related shooting information of the target video (a high-quality video), and distribute the deconstructed shooting information on the time axis according to its position in the target video.
  • the client can be instructed to use various shooting information as time progresses when shooting or editing a video, such as scene layout, character layout, shooting means, etc.
  • FIG. 1 schematically shows an application environment diagram of a method for acquiring video shooting information according to an embodiment of the present application
  • FIG. 2 schematically shows a flowchart of a method for acquiring video shooting information according to Embodiment 1 of the present application
  • Fig. 3 is the sub-step flowchart of step S202 in Fig. 2;
  • Fig. 4 is the sub-step flowchart of step S202 in Fig. 2;
  • FIG. 5 schematically shows a flow chart of newly added steps of the method for acquiring video shooting information according to Embodiment 1 of the present application;
  • FIG. 6 schematically shows a specific operation example diagram of the video shooting information acquisition method based on Embodiment 1 of the present application
  • FIG. 7 schematically shows a flow chart of a video shooting and processing instruction method according to Embodiment 2 of the present application.
  • FIG. 8 schematically shows a specific operation example diagram based on Embodiment 2 of the present application.
  • FIG. 9 schematically shows a block diagram of a system for acquiring video shooting information according to Embodiment 3 of the present application.
  • FIG. 10 schematically shows a block diagram of a system for acquiring video shooting information according to Embodiment 4 of the present application.
  • FIG. 11 schematically shows a schematic diagram of a hardware architecture of a computer device according to Embodiment 5 of the present application.
  • High-quality video includes the following aspects:
  • Video quality based on the visual sense, by using professional shooting equipment to record the video in a professional shooting scene
  • This application aims to use video understanding technology to guide users to shoot or edit high-quality videos during the video creation process, for example by screening high-quality videos through intelligent means and using artificial intelligence related algorithms to build a quantitative analysis of the shooting skills associated with "high-quality" video.
  • Users can obtain shooting information of similar types of high-quality videos by inputting relevant demand keywords, such as the distribution on the time axis of necessary elements such as scene switching, camera operation, shooting angle, and camera switching under the story line. By providing shooting information, users can learn professional shooting techniques for video creation, so as to create videos quickly and in a targeted way, lowering the threshold for video creation. Details are as follows:
  • The high-quality video can include: high-definition video with excellent picture quality shot with professional video shooting hardware; and video whose quality is improved through professional video shooting means (layout design: field design, mirror distribution, distribution of various scene types, mirror movement operation), an image stabilizer, or post-production image stabilization technology.
  • users can determine the personalized shooting theme, and find high-quality videos that match it according to the shooting theme.
  • the user shoots or records a video guided by each shooting element of the matched high-quality video, so as to shoot a similar type of high-quality video.
  • the user can also make fine-tuning on the basis of each shooting element, and shoot or record a video based on the fine-tuning.
  • After shooting a video, the user can also search for matching (similar) high-quality video shooting techniques (such as scene selection and camera movement mode) and edit the video according to them, so that the shooting skills of the high-quality video are applied to and reflected in the video, efficiently improving video quality and lowering the threshold of creation.
  • the present application provides a plurality of embodiments to introduce the video shooting information acquisition solution and the video shooting or processing instruction solution, and refer to the following for details.
  • Instruction: includes control, guidance and/or prompts.
  • AI: Artificial Intelligence.
  • Pull film: using artificial intelligence to detect video content and shooting methods. Through it, the shooting parameters of the video are obtained, such as shooting skills and related statistical distribution information, for example the content of each shot, scene scheduling, camera movement, scene, editing, sound, picture, rhythm, performance, camera position, etc.
  • Mirror: a shot, that is, a video segment with a temporal start and a temporal end. A feature film generally consists of 400-600 shots.
  • Field: corresponds to a video bridge segment, and can be composed of one or more mirrors.
  • Scene difference: the difference in the range that the subject occupies in the camera picture, caused by different distances between the camera and the subject. Scenes from near to far include: close-up (the human body above the shoulders), close shot (the human body above the chest), medium shot (the human body above the knees), panorama (the entire human body and the surrounding environment), and long-range view (the environment of the subject).
  • Mirror movement: the movement mode of the shooting device during shooting, such as pushing, pulling, panning, moving, and holding the lens still.
  • Storyboard script: used for various video media such as movies, animations, TV dramas, advertisements, MTV, etc. Before shooting, the composition of the image is explained in the form of story grids: the continuous picture is decomposed with the mirror as the unit, and each unit is marked with the camera movement method, duration, dialogue, special effects, etc. In this way, the required shooting content is briefly recorded in the early stage of shooting, and serves as a reminder for each storyboard during the shooting process.
  • FIG. 1 schematically shows a schematic diagram of an environment application according to an embodiment of the present application. As shown in Figure 1:
  • The computer device 10000 can connect to the client 30000 via the network 20000.
  • the computer device 10000 may provide services, such as providing shooting information, to control or prompt the client 30000 to take a shooting action.
  • Computer equipment 10000 may be located in a data center, such as a single site, or distributed among different geographical locations (e.g., across multiple sites).
  • The computer device 10000 may provide services via one or more networks 20000.
  • Network 20000 includes various network devices such as routers, switches, multiplexers, hubs, modems, bridges, repeaters, firewalls, proxy devices and/or the like.
  • Network 20000 may include physical links, such as coaxial cable links, twisted pair cable links, fiber optic links, combinations thereof, and the like.
  • Network 20000 may include wireless links, such as cellular links, satellite links, Wi-Fi links, and the like.
  • Computer device 10000 may be implemented by one or more computing nodes.
  • One or more compute nodes may include virtualized compute instances.
  • Virtualized computing instances may include virtual machines, such as emulations of computer systems, operating systems, servers, and the like.
  • A compute node may load a virtual machine based on a virtual image and/or other data defining the specific software (e.g., operating system, dedicated application, server) used for emulation. As the demand for different types of processing services changes, different virtual machines can be loaded and/or terminated on one or more computing nodes.
  • a hypervisor can be implemented to manage the use of different virtual machines on the same compute node.
  • Client 30000 may be configured to access computer device 10000 content and services.
  • the client 30000 may include any type of electronic device supporting a photography function, such as a mobile device, a tablet device, a video camera, and the like.
  • the client 30000 can output shooting information (such as technique quantification information) to the user.
  • This scheme can be implemented by the computer device 10000 .
  • Fig. 2 schematically shows a flowchart of a method for acquiring video shooting information according to Embodiment 1 of the present application.
  • the method for obtaining video shooting information may include steps S200-S204, wherein:
  • Step S200, determining a target video for analysis, the target video corresponding to a time axis representing video progress.
  • The target video may be a video in any of various video formats, for example: AVI (Audio Video Interleave), H.264/AVC (Advanced Video Coding), H.265/HEVC (High Efficiency Video Coding), etc.
  • The target video is preferably a high-quality video, such as a high-definition video with excellent image quality shot with professional video shooting hardware, or a video whose quality is improved through professional video shooting means (layout design: field design, mirror distribution, distribution of different scenes, mirror movement operation) or through image stabilization technology in post-production.
  • Step S202 analyzing the target video to obtain a plurality of shooting information.
  • The multiple shooting information may correspond to various shooting elements involved in video shooting or editing. That is, according to the distribution of the various shooting elements in the target video (such as the total time of appearance and duration), shooting information such as the layout of the shooting means (skills) within the target video can be obtained by analysis.
  • the target video can be analyzed in units of "field” and "mirror” to obtain the plurality of shooting information.
  • Taking "mirror" as a unit, obtain the scene type, shooting angle, character information, mirror type, mirror movement, and scene of each mirror.
  • Scene types can include: long shot, panoramic shot, medium shot, close shot, and close-up.
  • Scenes can include: indoors and outdoors. The scene can be further refined into offices, squares, coffee shops, etc.
  • Character information may include: character position, posture (orientation, etc.), identity (man, woman, old man, policeman, lawyer, etc.).
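  • As a minimal sketch of how the per-mirror parameters listed above could be recorded, the following Python record keeps every parameter optional, since a mirror may carry only some of them. The names `MirrorParams`, `Character`, and all field names are illustrative assumptions and do not come from the application.

```python
from dataclasses import dataclass, asdict
from typing import List, Optional


@dataclass
class Character:
    """Character information for one person in a mirror (hypothetical layout)."""
    position: str   # e.g. "left", "center"
    posture: str    # e.g. "facing camera"
    identity: str   # e.g. "lawyer", "old man"


@dataclass
class MirrorParams:
    """Shooting parameters of one mirror; any subset may be filled in."""
    scene_type: Optional[str] = None        # e.g. "medium shot"
    shooting_angle: Optional[str] = None    # e.g. "low angle"
    characters: Optional[List[Character]] = None
    mirror_type: Optional[str] = None
    mirror_movement: Optional[str] = None   # e.g. "push", "pull"
    scene: Optional[str] = None             # e.g. "office", "square"

    def present(self) -> dict:
        """Return only the parameters that were actually detected."""
        return {k: v for k, v in asdict(self).items() if v is not None}


params = MirrorParams(
    scene_type="medium shot",
    scene="office",
    characters=[Character("left", "facing camera", "lawyer")],
)
```

  • Calling `params.present()` yields only the three populated entries, which keeps stored records compact when a detector produces no result for some parameter.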
  • The above shooting information is calculated, estimated and counted through artificial intelligence related algorithms, such as:
  • The scene type and movement of each mirror can be identified using "A Unified Framework for Shot Type Classification Based on Subject Centric Lens".
  • The shooting angle of each mirror can be identified by performing shooting angle detection using "Back to the Feature: Learning Robust Camera Localization from Pixels to Pose".
  • Each person in each mirror can be detected through face recognition and person search models ("Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation"; "Person Search in Videos with One Portrait Through Visual and Temporal Links").
  • the quantity and distribution of each shooting information in the target video can be known.
  • Step S204 marking the plurality of shooting information on the time axis, and each shooting information is respectively distributed at a corresponding position on the time axis.
  • each shooting information is displayed in a distributed manner according to the direction of the time axis.
  • The video shooting information acquisition method provided in this embodiment deconstructs the shooting- and editing-related shooting information of the target video (a high-quality video), and distributes the deconstructed shooting information on the time axis according to its position in the target video. Based on the shooting information distributed on the time axis, the client can be instructed to use various shooting information as time progresses when shooting or editing a video, such as scene layout, character layout, shooting means, etc., so as to shoot or edit videos similar to high-quality videos and improve shooting or editing efficiency.
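  • The marking of step S204 can be pictured with a small data structure. The sketch below is only an illustration under assumptions: the class names `ShootingInfo` and `Timeline`, the use of seconds as the time unit, and the half-open interval convention are not specified by the application.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class ShootingInfo:
    """One piece of shooting information tied to a span of the time axis."""
    kind: str       # e.g. "scene_type", "mirror_movement"
    value: str      # e.g. "close-up", "push"
    start_s: float  # start position on the time axis, in seconds
    end_s: float    # end position on the time axis, in seconds (exclusive)


@dataclass
class Timeline:
    """Time axis of a target video with shooting information marked on it."""
    duration_s: float
    marks: List[ShootingInfo] = field(default_factory=list)

    def mark(self, info: ShootingInfo) -> None:
        """Place the information at its corresponding position (cf. S204)."""
        if not (0.0 <= info.start_s < info.end_s <= self.duration_s):
            raise ValueError("shooting information lies outside the time axis")
        self.marks.append(info)
        # Keep marks ordered along the direction of the time axis.
        self.marks.sort(key=lambda m: (m.start_s, m.kind))

    def at(self, t_s: float) -> List[ShootingInfo]:
        """Return every piece of shooting information active at time t_s."""
        return [m for m in self.marks if m.start_s <= t_s < m.end_s]


timeline = Timeline(duration_s=12.0)
timeline.mark(ShootingInfo("scene_type", "close-up", 4.0, 9.0))
timeline.mark(ShootingInfo("scene_type", "panorama", 0.0, 4.0))
timeline.mark(ShootingInfo("mirror_movement", "push", 0.0, 4.0))
```

  • A client that follows the time axis could call `timeline.at(t)` as shooting progresses to learn which scene type and mirror movement apply at that moment.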
  • the multiple shooting information includes multiple shooting parameters.
  • Step S202 may include: step S300, performing field segmentation on the target video to obtain multiple fields; step S302, performing mirror segmentation on each of the multiple fields to obtain multiple mirrors, where each field includes one or more mirrors; step S304, analyzing each of the multiple mirrors to obtain the shooting parameters of each mirror; and step S306, obtaining the field information of each field according to the shooting parameters of each mirror in that field. The field information of each field includes the shooting parameters of each mirror in the field, and the position distribution of those shooting parameters on the time axis corresponds to the position distribution of the mirrors within the field.
  • the target video is analyzed in units of "field" and "mirror", so as to obtain the shooting parameters of each mirror in different video bridge segments in the target video, which is convenient for storage, classification and user query.
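  • The flow of steps S300 to S306 can be sketched as follows. This is an illustration only: the segmentation results are passed in as precomputed time spans, and the per-mirror analysis is a pluggable callable, because the application does not fix a concrete segmentation or detection algorithm here. The function names and the `fake_analyzer` rule are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Span = Tuple[float, float]  # (start_s, end_s) on the time axis


def group_mirrors_by_field(fields: List[Span], mirrors: List[Span]) -> List[List[Span]]:
    """Assign each mirror to the field whose span contains it (S300/S302 results).

    A mirror that fits inside no field is left out of every group.
    """
    grouped: List[List[Span]] = [[] for _ in fields]
    for m_start, m_end in mirrors:
        for i, (f_start, f_end) in enumerate(fields):
            if f_start <= m_start and m_end <= f_end:
                grouped[i].append((m_start, m_end))
                break
    return grouped


def build_field_info(
    fields: List[Span],
    mirrors: List[Span],
    analyze_mirror: Callable[[Span], Dict[str, str]],
) -> List[dict]:
    """Analyze every mirror (S304) and collect the results per field (S306)."""
    field_infos = []
    for span, field_mirrors in zip(fields, group_mirrors_by_field(fields, mirrors)):
        field_infos.append({
            "span": span,
            # Sorting by span keeps the parameters in time-axis order.
            "mirrors": [
                {"span": m, "params": analyze_mirror(m)}
                for m in sorted(field_mirrors)
            ],
        })
    return field_infos


def fake_analyzer(span: Span) -> Dict[str, str]:
    """Stand-in for the AI detectors: short mirrors count as close-ups."""
    return {"scene_type": "close-up" if span[1] - span[0] < 3 else "panorama"}


fields = [(0.0, 10.0), (10.0, 18.0)]
mirrors = [(0.0, 4.0), (4.0, 6.0), (6.0, 10.0), (10.0, 18.0)]
field_infos = build_field_info(fields, mirrors, fake_analyzer)
```

  • Because each mirror keeps its own span, the shooting parameters inherit the position of the mirror within its field, which is the correspondence the step describes.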
  • the multiple shooting information includes multiple shooting parameters.
  • Step S202 may include: step S400, performing field segmentation on the target video to obtain multiple fields; step S402, performing mirror segmentation on each of the multiple fields to obtain multiple mirrors, where each field includes one or more mirrors; step S404, analyzing the multiple fields to obtain the theme of each field; step S406, analyzing each of the multiple mirrors to obtain the shooting parameters of each mirror; and step S408, obtaining the field information of each field according to the theme of each field and the shooting parameters of each mirror in that field. The field information of each field includes the theme of the field and the shooting parameters of each mirror in the field, and the position distribution of those shooting parameters on the time axis corresponds to the position distribution of the mirrors within the field.
  • The target video is analyzed in units of "field" and "mirror", so as to obtain the themes of different video bridge segments in the target video and the shooting parameters of each mirror in those segments, which is convenient for storage, classification, and user queries based on the themes of different bridge segments.
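  • One way to picture theme-based storage and query is a small table keyed by theme. The sketch below uses Python's standard `sqlite3` module with an in-memory database; the table layout, column names, and the exact-match query are assumptions made for illustration, not the storage scheme of the application.

```python
import json
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE field_info ("
    "id INTEGER PRIMARY KEY, theme TEXT, start_s REAL, end_s REAL, mirrors_json TEXT)"
)


def store_field(conn, theme, start_s, end_s, mirrors):
    """Store one field: its theme, its span, and its per-mirror parameters."""
    conn.execute(
        "INSERT INTO field_info (theme, start_s, end_s, mirrors_json) VALUES (?, ?, ?, ?)",
        (theme, start_s, end_s, json.dumps(mirrors)),
    )


def query_by_theme(conn, theme):
    """Return all stored fields whose theme matches exactly."""
    rows = conn.execute(
        "SELECT theme, start_s, end_s, mirrors_json FROM field_info "
        "WHERE theme = ? ORDER BY id",
        (theme,),
    ).fetchall()
    return [
        {"theme": t, "span": (s, e), "mirrors": json.loads(m)}
        for t, s, e, m in rows
    ]


store_field(conn, "coffee shop conversation", 0.0, 10.0,
            [{"span": [0.0, 4.0], "scene_type": "panorama"},
             {"span": [4.0, 10.0], "scene_type": "close-up"}])
store_field(conn, "street chase", 10.0, 18.0,
            [{"span": [10.0, 18.0], "mirror_movement": "follow"}])

results = query_by_theme(conn, "street chase")
```

  • Serializing the mirrors as JSON keeps the sketch short; a real store might instead use one row per mirror so that individual shooting parameters can be filtered directly.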
  • the shooting parameters of each mirror include one or more of the following: scene classification, shooting angle, character information, mirror type, mirror movement operation, and scene.
  • a graphical storyboard script can be generated for each scene to better prompt the user.
  • The method for obtaining video shooting information may also include: step S500, generating a plurality of storyboard scripts for each field according to the shooting parameters of each mirror in each field; wherein each mirror in each field corresponds to one or more storyboard scripts, and the position distribution of the storyboard scripts of each field on the time axis corresponds to the position distribution of the mirrors within the field. Details are as follows:
  • Corresponding elements and element vector information can be generated according to character information, etc., and the element vector information includes the size, posture, and relative position of the element in the key frame in the corresponding mirror.
  • the above character information can be divided by character identity, such as children, old people, etc., and can be divided by character occupation, such as policeman and lawyer.
  • the vector element associated with the element category may be obtained from a vector element material library.
  • Each vector element is set on a designated canvas to generate a storyboard. For example: determine the size of the vector element in the designated canvas according to the size of the element; determine the posture of the vector element in the designated canvas according to the posture of the element; and determine the relative position of the vector element in the designated canvas according to the relative position of the element in the reference image.
  • In this way, a storyboard script in the form of vector graphics is obtained, which makes producing storyboard scripts more efficient and easier, and effectively improves the user experience.
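  • The mapping from element vector information to the designated canvas can be sketched as a simple scaling. The sketch assumes that size and relative position are stored as fractions of the key frame (values between 0 and 1); that normalization, and the names `ElementInfo`, `PlacedElement`, and `place_on_canvas`, are assumptions for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ElementInfo:
    """Element vector information taken from a key frame (assumed normalized)."""
    category: str      # e.g. "policeman", used to pick a vector element
    rel_x: float       # horizontal center, as a fraction of frame width
    rel_y: float       # vertical center, as a fraction of frame height
    rel_height: float  # element height, as a fraction of frame height
    posture: str       # e.g. "facing_left"


@dataclass
class PlacedElement:
    """A vector element positioned on the designated canvas, in pixels."""
    category: str
    x: int
    y: int
    height: int
    posture: str


def place_on_canvas(elements: List[ElementInfo], canvas_w: int, canvas_h: int) -> List[PlacedElement]:
    """Scale size and relative position to the canvas; carry posture over."""
    return [
        PlacedElement(
            category=e.category,
            x=round(e.rel_x * canvas_w),
            y=round(e.rel_y * canvas_h),
            height=round(e.rel_height * canvas_h),
            posture=e.posture,
        )
        for e in elements
    ]


storyboard = place_on_canvas(
    [ElementInfo("policeman", 0.25, 0.5, 0.5, "facing_right"),
     ElementInfo("lawyer", 0.75, 0.5, 0.25, "facing_left")],
    canvas_w=1280,
    canvas_h=720,
)
```

  • Storing the element information in normalized form means the same storyboard can be rendered on canvases of different sizes without re-analyzing the key frame.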
  • the scene type, shooting angle, character information, mirror type, and mirror operation can be added to the specified canvas in text form.
  • the storyboard script can be an editable vector diagram, which can be modified by vectorization according to user needs (habits).
  • The vectorization modification includes at least one of the following: modifying the size of a vector element, modifying the posture of a vector element, modifying the relative position of a vector element in the designated canvas, deleting a vector element, or adding a new vector element.
  • In this way, the storyboard can be customized to the needs of the individual user.
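  • The modification operations listed above can be sketched as small functions over a list of vector elements. The sketch is hypothetical: `VectorElement` and the helper functions are illustrative names, and each operation returns a new list so that the original storyboard is left untouched.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass(frozen=True)
class VectorElement:
    name: str
    x: int
    y: int
    height: int
    posture: str


def modify(elements: List[VectorElement], name: str, **changes) -> List[VectorElement]:
    """Return a copy in which the named element has the given fields changed."""
    return [replace(e, **changes) if e.name == name else e for e in elements]


def delete(elements: List[VectorElement], name: str) -> List[VectorElement]:
    """Return a copy without the named element."""
    return [e for e in elements if e.name != name]


def add(elements: List[VectorElement], element: VectorElement) -> List[VectorElement]:
    """Return a copy with a new element appended."""
    return elements + [element]


board = [
    VectorElement("policeman", 320, 360, 360, "facing_right"),
    VectorElement("lawyer", 960, 360, 180, "facing_left"),
]
edited = modify(board, "lawyer", height=240)              # modify size
edited = modify(edited, "lawyer", x=900, y=380)           # modify relative position
edited = modify(edited, "lawyer", posture="facing_right") # modify posture
edited = delete(edited, "policeman")                      # delete an element
edited = add(edited, VectorElement("child", 400, 420, 120, "facing_right"))  # add one
```

  • Returning new lists rather than mutating in place makes it easy to keep the generated storyboard as a baseline while the user experiments with edits.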
  • S602 Divide the target video to obtain multiple fields. As shown in FIG. 6, the target video can be divided into multiple fields.
  • S604 Divide each field to obtain multiple mirrors. As shown in FIG. 6, one field is composed of 5 mirrors.
  • S606 Perform various detections on each mirror to deconstruct and obtain various shooting information, that is, information corresponding to various shooting/editing elements.
  • the field information of each field includes the detection information of each mirror in this field, such as the movement of the mirror, scene, characters, shooting angle, etc.
  • the single video can be edited based on the above field information to obtain multiple videos switched between different scenes.
  • This embodiment provides a video shooting and processing instruction method, and some technical details and effects can be referred to above.
  • Fig. 7 schematically shows a flow chart of a video shooting and processing instruction method according to Embodiment 2 of the present application.
  • the video shooting and processing instruction method may include steps S700-S704, wherein:
  • Step S700 receiving request information from the client.
  • the request information may include the following:
  • Text information including shooting scene, theme, shooting location, scene, etc.
  • Step S702, acquiring target field information according to the request information, where the target field information includes a plurality of shooting information marked on the same time axis, and the position of each shooting information on the time axis represents the chronological order of that shooting information.
  • the most matching video bridge segment (target field) is searched from the database.
  • The target field carries the distribution on the time axis of the shooting information (mirrors, scenes, characters, etc.) corresponding to the video bridge segment.
  • Step S704, returning the target field information to the client to instruct the client to perform video shooting or video processing according to the respective shooting information and the marked positions of the respective shooting information on the time axis.
  • the client can guide the user to shoot and edit or automatically shoot and edit according to the field information of the target field.
  • automatic editing as an example: the video of the same scene (panorama) is captured, and the shooting information searched is referred to later, and it is edited into a diversified video with multiple scene switching, so as to obtain an effect similar to that of a high-quality video bridge.
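  • The automatic-editing example can be illustrated by deriving a crop plan from a panorama recording. Everything concrete in this sketch is an assumption: the application does not say that scene switching is produced by cropping, and the crop fractions in `CROP_FRACTION`, the function name `crop_plan`, and the subject-center parameter are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumed fraction of the full frame kept for each scene type.
CROP_FRACTION: Dict[str, float] = {
    "panorama": 1.0,
    "medium shot": 0.6,
    "close-up": 0.3,
}


def crop_plan(
    frame_w: int,
    frame_h: int,
    timeline: List[Tuple[float, float, str]],
    subject_center: Tuple[float, float] = (0.5, 0.5),
) -> List[dict]:
    """For each (start_s, end_s, scene_type) entry, compute a crop rectangle.

    The rectangle is centered on the subject and clamped to the frame.
    """
    plan = []
    for start_s, end_s, scene_type in timeline:
        frac = CROP_FRACTION[scene_type]
        w, h = round(frame_w * frac), round(frame_h * frac)
        cx, cy = subject_center[0] * frame_w, subject_center[1] * frame_h
        x = min(max(round(cx - w / 2), 0), frame_w - w)
        y = min(max(round(cy - h / 2), 0), frame_h - h)
        plan.append({"start_s": start_s, "end_s": end_s, "crop": (x, y, w, h)})
    return plan


plan = crop_plan(
    1920, 1080,
    [(0.0, 4.0, "panorama"), (4.0, 9.0, "close-up"), (9.0, 12.0, "medium shot")],
)
```

  • The resulting rectangles could be handed to any video editing tool that supports time-ranged cropping; which tool the client uses is outside the scope of this sketch.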
  • the video shooting and processing instruction method provided by this embodiment finds the high-quality video bridge section that meets the user's shooting period according to the theme and content of the user's input of wanting to shoot or edit, and deconstructs the shooting information (field information) obtained by deconstructing the high-quality video bridge section. ) is returned to the client, and the shooting information obtained by the deconstruction is distributed on the time axis. Based on the shooting information distributed on the time axis, the client can be instructed to use various shooting information as time progresses when shooting or editing a video, such as scene layout, character layout, shooting means, editing, etc.
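Steps S700 to S704 can be illustrated with a minimal Python sketch. The text does not say how the "most matching" segment is found, so the keyword-overlap score used here is purely an assumption, as are the names `ShootingInfo`, `FieldInfo`, `find_target_field` and `handle_request`.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class ShootingInfo:
    start: float  # position on the time axis, in seconds
    end: float
    kind: str     # e.g. "scene classification" or "shooting angle"
    value: str


@dataclass
class FieldInfo:
    theme: str
    keywords: Set[str]
    timeline: List[ShootingInfo] = field(default_factory=list)


def find_target_field(request_text: str, database: List[FieldInfo]) -> Optional[FieldInfo]:
    """Assumed matching rule: pick the field sharing the most keywords with the request."""
    words = set(request_text.lower().split())
    best, best_score = None, 0
    for candidate in database:
        score = len(words & candidate.keywords)
        if score > best_score:
            best, best_score = candidate, score
    return best


def handle_request(request_text: str, database: List[FieldInfo]) -> Optional[dict]:
    """Receive the request (S700), look up a target field (S702), return its timeline (S704)."""
    target = find_target_field(request_text, database)
    if target is None:
        return None
    # Order the shooting information by its position on the time axis.
    ordered = sorted(target.timeline, key=lambda info: info.start)
    return {
        "theme": target.theme,
        "timeline": [
            {"start": i.start, "end": i.end, "kind": i.kind, "value": i.value}
            for i in ordered
        ],
    }
```

A request that matches no stored keyword yields `None`; a real system would presumably fall back to a broader search, which the text does not describe.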
  • the target field information is field information of a target field in multiple fields; the method further includes acquiring the target field information in advance:
  • the target field information includes the shooting parameters of each mirror, and the position distribution of the shooting parameters of each mirror on the time axis corresponds to the position distribution of each mirror in the target field.
  • the target field information is field information of a target field in multiple fields; the method further includes acquiring the target field information in advance:
  • the target field information includes the theme and the shooting parameters of each mirror, and the position distribution of the shooting parameters of each mirror on the time axis corresponds to the position distribution of each mirror in the target field.
  • the client 30000 receives search content input by a user, and initiates a search request based on the search content.
  • the search content may include video topics, shooting locations, and the like.
  • S802 The computer device 10000 searches the database according to the search request.
  • the computer device 10000 returns to the client 30000 the retrieved field information of the video segment most relevant to the search content; the field information may include a plurality of shooting information, each distributed on the time axis.
  • S806 The client 30000 performs shooting or editing according to each shooting information and the position of each shooting information on the time axis.
  • the client 30000 generates a video with high-quality shooting means and content according to the shooting or editing results.
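On the client side, the returned field information has to be turned into concrete shooting or editing steps. The sketch below is one possible reading, not something the text specifies: it assumes the timeline arrives as a list of dictionaries with `start`, `end`, `kind` and `value` keys, and it stretches the reference time axis linearly onto the user's own footage. The function name `build_edit_plan` and the linear scaling are both assumptions.

```python
def build_edit_plan(timeline, source_duration):
    """Map reference time-axis positions onto footage of a different length.

    timeline: list of dicts with "start", "end", "kind", "value" (assumed shape).
    source_duration: length of the user's own footage, in seconds.
    """
    if not timeline:
        return []
    ordered = sorted(timeline, key=lambda entry: entry["start"])
    reference_duration = max(entry["end"] for entry in ordered)
    if reference_duration <= 0:
        raise ValueError("reference time axis has no positive length")
    scale = source_duration / reference_duration  # assumed: simple linear stretch
    return [
        {
            "cut_in": round(entry["start"] * scale, 3),
            "cut_out": round(entry["end"] * scale, 3),
            "instruction": f'{entry["kind"]}: {entry["value"]}',
        }
        for entry in ordered
    ]
```

The resulting list can be shown to the user as step-by-step guidance or passed to an automatic editor; either way the order of the entries follows the time axis.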
  • FIG. 9 schematically shows a block diagram of a video shooting information acquisition system according to Embodiment 3 of the present application.
  • the video shooting information acquisition system can be divided into one or more program modules, and the one or more program modules are stored in a storage medium and executed by one or more processors to complete the embodiments of the present application.
  • the program modules referred to in the embodiments of the present application refer to a series of computer-readable instruction segments capable of accomplishing specific functions. The following description will specifically introduce the functions of the program modules in the embodiments of the present application.
  • the video shooting information acquisition system 900 may include a determination module 910, an analysis module 920, and an annotation module 930, wherein:
  • a determination module 910, configured to determine a target video for analysis, the target video corresponding to a time axis representing video progress;
  • an analysis module 920, configured to analyze the target video to obtain a plurality of shooting information; and
  • an annotation module 930, configured to mark the plurality of shooting information on the time axis, each shooting information being distributed at a corresponding position on the time axis.
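The division of labour among modules 910, 920 and 930 can be sketched as three small Python functions. This is only an illustration: the text does not say how the analysis is performed, so the analyzer is passed in by the caller, and the names `TargetVideo`, `ShootingInfo`, `determine`, `analyze` and `annotate` are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class ShootingInfo:
    position: float  # seconds from the start of the target video
    kind: str
    value: str


@dataclass
class TargetVideo:
    video_id: str
    duration: float  # length of the time axis, in seconds
    time_axis: List[ShootingInfo] = field(default_factory=list)


def determine(video_id: str, duration: float) -> TargetVideo:
    """Module 910: fix the target video and the time axis that tracks its progress."""
    return TargetVideo(video_id, duration)


def analyze(target: TargetVideo,
            analyzer: Callable[[TargetVideo], List[ShootingInfo]]) -> List[ShootingInfo]:
    """Module 920: delegate to a caller-supplied analyzer (the method is left open)."""
    return analyzer(target)


def annotate(target: TargetVideo, infos: List[ShootingInfo]) -> TargetVideo:
    """Module 930: place each item at its position on the time axis."""
    for info in infos:
        if not 0 <= info.position <= target.duration:
            raise ValueError(f"position {info.position} is outside the time axis")
    target.time_axis = sorted(infos, key=lambda info: info.position)
    return target
```

Keeping the analyzer as a parameter means the same pipeline works whether the shooting information comes from manual labelling or from an automated model.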
  • the multiple shooting information includes multiple shooting parameters; the analyzing module 920 is further configured to:
  • the field information of each field includes the shooting parameters of each mirror in the field, and the position distribution of the shooting parameters of each mirror in the field on the time axis corresponds to the position distribution of each mirror within the field.
  • the multiple shooting information includes multiple shooting parameters; the analyzing module 920 is further configured to:
  • the field information of each field includes the theme of the field and the shooting parameters of each mirror in the field, and the position distribution of the shooting parameters of each mirror in the field on the time axis corresponds to the position distribution of each mirror within the field.
  • the shooting parameters of each mirror include one or more of the following: scene classification, shooting angle, character information, mirror type, mirror movement operation, and scene.
  • the system also includes a script generation module for:
  • each mirror in each field corresponds to one or more storyboard scripts, and the position distribution of the storyboard scripts of each field on the time axis corresponds to the position distribution of each mirror within the field.
  • FIG. 10 schematically shows a block diagram of a video shooting and processing instruction system according to Embodiment 4 of the present application.
  • the video shooting and processing instruction system can be divided into one or more program modules, and the one or more program modules are stored in a storage medium and executed by one or more processors to complete the embodiments of the present application.
  • the program modules referred to in the embodiments of the present application refer to a series of computer-readable instruction segments capable of accomplishing specific functions. The following description will specifically introduce the functions of the program modules in the embodiments of the present application.
  • the video shooting and processing instruction system 1000 may include a receiving module 1010, an obtaining module 1020 and a returning module 1030, wherein:
  • a receiving module 1010, configured to receive request information from the client;
  • an obtaining module 1020, configured to obtain target field information according to the request information, the target field information including a plurality of shooting information marked on the same time axis, the position of each shooting information on the time axis representing the chronological order of the shooting information; and
  • a returning module 1030, configured to return the target field information to the client, to instruct the client to perform video shooting or video processing according to the respective shooting information and the marked positions of the respective shooting information on the time axis.
  • the target field information is field information of a target field in multiple fields; the system further includes a preset acquisition module, configured to pre-acquire the target field information:
  • the target field information includes the shooting parameters of each mirror, and the position distribution of the shooting parameters of each mirror on the time axis corresponds to the position distribution of each mirror in the target field.
  • the target field information is field information of a target field in multiple fields; the system further includes a preset acquisition module, configured to pre-acquire the target field information:
  • the target field information includes the theme and the shooting parameters of each mirror, and the position distribution of the shooting parameters of each mirror on the time axis corresponds to the position distribution of each mirror in the target field.
  • FIG. 11 schematically shows a schematic diagram of a hardware architecture of a computer device 10000 according to Embodiment 5 of the present application.
  • the computer device 10000 is a device capable of automatically performing numerical calculation and/or information processing according to preset or stored instructions.
  • it may be a rack server, a blade server, a tower server, or a cabinet server (including an independent server, or a server cluster composed of multiple servers) and the like.
  • the computer device 10000 includes at least, but is not limited to, a memory 10010, a processor 10020, and a network interface 10030 that can communicate with each other through a system bus, wherein:
  • the memory 10010 includes at least one type of computer-readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, magnetic disk, optical disk, etc.
  • the memory 10010 may be an internal storage module of the computer device 10000 , such as a hard disk or memory of the computer device 10000 .
  • the memory 10010 can also be an external storage device of the computer device 10000, such as a plug-in hard disk, a smart memory card (Smart Media Card, SMC for short), a Secure Digital (SD) card, or a flash memory card (Flash Card) equipped on the computer device 10000.
  • the memory 10010 may also include both an internal storage module of the computer device 10000 and an external storage device thereof.
  • the memory 10010 is generally used to store the operating system and various application software installed in the computer device 10000, such as program codes for methods of acquiring video shooting information, video shooting and processing instruction methods, and the like.
  • the memory 10010 can also be used to temporarily store various types of data that have been output or will be output.
  • the processor 10020 may be a central processing unit (Central Processing Unit, CPU for short), a controller, a microcontroller, a microprocessor, or other data processing chips in some embodiments.
  • the processor 10020 is generally used to control the overall operation of the computer device 10000 , such as performing control and processing related to data interaction or communication with the computer device 10000 .
  • the processor 10020 is configured to run program codes stored in the memory 10010 or process data.
  • the network interface 10030 may include a wireless network interface or a wired network interface, and the network interface 10030 is generally used to establish a communication link between the computer device 10000 and other computer devices.
  • the network interface 10030 is used to connect the computer device 10000 with an external terminal through a network, and establish a data transmission channel and a communication link between the computer device 10000 and an external terminal.
  • the network can be a wireless or wired network such as an intranet, the Internet, the Global System for Mobile Communications (GSM), Wideband Code Division Multiple Access (WCDMA), a 4G network, a 5G network, Bluetooth, or Wi-Fi.
  • FIG. 11 only shows a computer device having components 10010-10030, but it should be understood that implementing all of the illustrated components is not a requirement and that more or fewer components may instead be implemented.
  • the video shooting information acquisition method and the video shooting and processing instruction method stored in the memory 10010 can also be divided into one or more program modules and executed by one or more processors (the processor 10020 in this embodiment) to complete the embodiments of the present application.
  • the embodiment of the present application also provides a computer-readable storage medium, on which computer-readable instructions are stored, and when the computer-readable instructions are executed by a processor, the following steps are implemented:
  • each shooting information is respectively distributed at a corresponding position on the time axis;
  • the computer readable instructions implement the following steps when executed by the processor:
  • the target field information includes a plurality of shooting information marked on the same time axis, and the position of each shooting information on the time axis represents the time sequence of each shooting information;
  • the computer-readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), Programmable Read-Only Memory (PROM), Magnetic Memory, Magnetic Disk, Optical Disk, etc.
  • the computer-readable storage medium may be an internal storage unit of a computer device, such as a hard disk or a memory of the computer device.
  • the computer-readable storage medium can also be an external storage device of the computer device, such as a plug-in hard disk, a smart memory card (Smart Media Card, SMC for short), a Secure Digital (SD) card, or a flash memory card (Flash Card) equipped on the computer device.
  • the computer-readable storage medium may also include both the internal storage unit of the computer device and its external storage device.
  • the computer-readable storage medium is usually used to store the operating system and various application software installed on the computer device, such as the program codes of the video shooting information acquisition method, video shooting and processing instruction method in the embodiment.
  • the computer-readable storage medium can also be used to temporarily store various types of data that have been output or will be output.
  • modules or steps of the above-mentioned embodiments of the present application can be implemented by general-purpose computing devices, and they can be concentrated on a single computing device, or distributed among multiple computing devices.
  • they may be implemented in program code executable by a computing device, so that they can be stored in a storage device and executed by a computing device; in some cases, the steps shown or described may be performed in an order different from that described herein, or they may be fabricated into individual integrated circuit modules, or multiple modules or steps among them may be fabricated into a single integrated circuit module.
  • embodiments of the present application are not limited to any specific combination of hardware and software.

Abstract

Embodiments of the present application provide a video capture information acquisition method. The method comprises: determining a target video for analysis, the target video corresponding to a time axis representing video progress; analyzing the target video to obtain a plurality of pieces of capture information; and marking the plurality of pieces of capture information on the time axis, each piece of capture information being distributed at a corresponding position on the time axis. According to the video capture information acquisition method provided in the embodiments of the present application, the capture information of the target video (a high-quality video) related to capture and editing is deconstructed, and the capture information obtained by deconstruction is distributed on the time axis according to its position in the target video. On the basis of the capture information distributed on the time axis, the various capture information that a client needs to use over time when capturing or editing a video, such as scene layout, character arrangement and capture means, can be indicated.

Description

Method for acquiring video shooting information, and video shooting and processing instruction method
This application claims priority to Chinese patent application No. 202110801309.0, filed on July 15, 2021 and entitled "Method for Acquiring Video Shooting Information, and Video Shooting and Processing Instruction Method", the entire content of which is incorporated herein by reference.
Technical Field
The embodiments of the present application relate to the field of computer technology, and in particular to a video shooting information acquisition method, system, computer device, and computer-readable storage medium, and to a video shooting and processing instruction method.
Background
As the threshold for video shooting falls, more and more users are becoming creators who shoot and produce videos. A high-quality video requires many professional and complex shooting techniques. However, the inventor realized that users who have not learned professional shooting techniques find it difficult to apply those techniques correctly, so they cannot shoot satisfactory videos and work inefficiently.
Summary
The purpose of the embodiments of the present application is to provide a video shooting information acquisition method, system, computer device, and computer-readable storage medium, as well as a video shooting and processing instruction method, to solve the following problem: users who have not learned professional shooting techniques find it difficult to apply shooting techniques correctly, so they cannot shoot satisfactory videos and work inefficiently.
One aspect of the embodiments of the present application provides a video shooting information acquisition method, the method including:
determining a target video for analysis, the target video corresponding to a time axis representing video progress;
analyzing the target video to obtain a plurality of shooting information; and
marking the plurality of shooting information on the time axis, each shooting information being distributed at a corresponding position on the time axis.
Optionally, the plurality of shooting information includes a plurality of shooting parameters;
analyzing the target video to obtain the plurality of shooting information includes:
performing field segmentation on the target video to obtain a plurality of fields (scenes);
performing mirror segmentation on each of the plurality of fields to obtain a plurality of mirrors (shots), each field including one or more mirrors;
analyzing each of the plurality of mirrors to obtain the shooting parameters of each mirror; and
obtaining the field information of each field according to the shooting parameters of the mirrors in that field;
wherein the field information of each field includes the shooting parameters of each mirror in the field, and the position distribution of the shooting parameters of each mirror in the field on the time axis corresponds to the position distribution of each mirror within the field.
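The field and mirror hierarchy described above can be pictured with a small Python sketch. It assumes that the segmentation has already produced time boundaries for each field and each mirror; how those boundaries are detected is not covered here, and the names `Mirror`, `FieldInfo` and `build_field_info` are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Mirror:
    start: float            # seconds on the time axis
    end: float
    params: Dict[str, str]  # shooting parameters obtained by analysis


@dataclass
class FieldInfo:
    start: float
    end: float
    mirrors: List[Mirror] = field(default_factory=list)


def build_field_info(field_bounds: List[Tuple[float, float]],
                     mirrors: List[Mirror]) -> List[FieldInfo]:
    """Group mirrors into the field that contains them, keeping time-axis order."""
    fields = [FieldInfo(start, end) for start, end in field_bounds]
    for mirror in sorted(mirrors, key=lambda m: m.start):
        for candidate in fields:
            if candidate.start <= mirror.start and mirror.end <= candidate.end:
                candidate.mirrors.append(mirror)
                break
        else:
            raise ValueError(f"mirror at {mirror.start}s falls outside every field")
    return fields
```

Because the mirrors are sorted before grouping, the order of the shooting parameters inside each `FieldInfo` mirrors the order of the mirrors within that field, which is the correspondence the method requires.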
Optionally, the plurality of shooting information includes a plurality of shooting parameters;
analyzing the target video to obtain the plurality of shooting information includes:
performing field segmentation on the target video to obtain a plurality of fields;
performing mirror segmentation on each of the plurality of fields to obtain a plurality of mirrors, each field including one or more mirrors;
analyzing the plurality of fields to obtain the theme of each field;
analyzing each of the plurality of mirrors to obtain the shooting parameters of each mirror; and
obtaining the field information of each field according to the theme of the field and the shooting parameters of the mirrors in that field;
wherein the field information of each field includes the theme of the field and the shooting parameters of each mirror in the field, and the position distribution of the shooting parameters of each mirror in the field on the time axis corresponds to the position distribution of each mirror within the field.
Optionally, the shooting parameters of each mirror include one or more of the following: scene classification (shot scale), shooting angle, character information, mirror type, mirror movement operation (camera movement), and scene.
Optionally, the method further includes: generating a plurality of storyboard scripts for each field according to the shooting parameters of the mirrors in that field;
wherein each mirror in each field corresponds to one or more storyboard scripts, and the position distribution of the storyboard scripts of each field on the time axis corresponds to the position distribution of each mirror within the field.
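As an illustration of the script generation step, the sketch below produces one script entry per mirror and keeps the entries in time-axis order. The input shape (dictionaries with `start`, `end` and `params`), the one-script-per-mirror simplification, the wording of the script text and the function name `storyboard_scripts` are all assumptions made for the example.

```python
def storyboard_scripts(mirrors):
    """Return one script entry per mirror, ordered by position on the time axis.

    mirrors: list of dicts with "start", "end" and "params" (assumed shape),
    where "params" maps a parameter name to its value.
    """
    scripts = []
    ordered = sorted(mirrors, key=lambda mirror: mirror["start"])
    for number, mirror in enumerate(ordered, start=1):
        details = ", ".join(f"{name}: {value}" for name, value in mirror["params"].items())
        scripts.append({
            "start": mirror["start"],
            "end": mirror["end"],
            "text": f"Mirror {number}: {details}",
        })
    return scripts
```

Each entry keeps the start and end of its mirror, so the scripts can be laid out on the same time axis as the shooting parameters they were derived from.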
Another aspect of the embodiments of the present application provides a video shooting information acquisition system, the system including:
a determination module, configured to determine a target video for analysis, the target video corresponding to a time axis representing video progress;
an analysis module, configured to analyze the target video to obtain a plurality of shooting information; and
an annotation module, configured to mark the plurality of shooting information on the time axis, each shooting information being distributed at a corresponding position on the time axis.
Another aspect of the embodiments of the present application provides a computer device, the computer device including a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, the computer-readable instructions implementing the following steps when executed by the processor:
determining a target video for analysis, the target video corresponding to a time axis representing video progress;
analyzing the target video to obtain a plurality of shooting information; and
marking the plurality of shooting information on the time axis, each shooting information being distributed at a corresponding position on the time axis.
Another aspect of the embodiments of the present application provides a computer-readable storage medium storing computer-readable instructions, the computer-readable instructions being executable by at least one processor to cause the at least one processor to perform the following steps:
determining a target video for analysis, the target video corresponding to a time axis representing video progress;
analyzing the target video to obtain a plurality of shooting information; and
marking the plurality of shooting information on the time axis, each shooting information being distributed at a corresponding position on the time axis.
Another aspect of the embodiments of the present application provides a video shooting and processing instruction method, the method including:
receiving request information from a client;
acquiring target field information according to the request information, the target field information including a plurality of shooting information marked on the same time axis, the position of each shooting information on the time axis representing the chronological order of the shooting information; and
returning the target field information to the client, to instruct the client to perform video shooting or video processing according to the respective shooting information and the marked positions of the respective shooting information on the time axis.
Optionally, the target field information is field information of a target field among a plurality of fields; the method further includes acquiring the target field information in advance:
performing field segmentation on the target video to obtain the target field;
performing mirror segmentation on the target field to obtain one or more mirrors;
analyzing each mirror to obtain the shooting parameters of each mirror; and
obtaining the target field information according to the shooting parameters of each mirror;
wherein the target field information includes the shooting parameters of each mirror, and the position distribution of the shooting parameters of each mirror on the time axis corresponds to the position distribution of each mirror in the target field.
Optionally, the target field information is field information of a target field among a plurality of fields; the method further includes acquiring the target field information in advance:
performing field segmentation on the target video to obtain the target field;
performing mirror segmentation on the target field to obtain one or more mirrors;
analyzing the target field to obtain the theme of the target field;
analyzing each mirror to obtain the shooting parameters of each mirror; and
obtaining the target field information according to the theme and the shooting parameters of each mirror;
wherein the target field information includes the theme and the shooting parameters of each mirror, and the position distribution of the shooting parameters of each mirror on the time axis corresponds to the position distribution of each mirror in the target field.
Another aspect of the embodiments of the present application provides a video shooting and processing instruction system, the system including:
a receiving module, configured to receive request information from a client;
an obtaining module, configured to obtain target field information according to the request information, the target field information including a plurality of shooting information marked on the same time axis, the position of each shooting information on the time axis representing the chronological order of the shooting information; and
a returning module, configured to return the target field information to the client, to instruct the client to perform video shooting or video processing according to the respective shooting information and the marked positions of the respective shooting information on the time axis.
Another aspect of the embodiments of the present application provides a computer device, the computer device including a memory, a processor, and a computer program stored in the memory and executable on the processor, the processor implementing the steps of the video shooting and processing instruction method described above when executing the computer program.
Another aspect of the embodiments of the present application provides a computer-readable storage medium storing a computer program, the computer program being executable by at least one processor to cause the at least one processor to perform the steps of the video shooting and processing instruction method described above.
The video shooting information acquisition method, system, computer device, and computer-readable storage medium provided in the embodiments of the present application deconstruct the shooting information of the target video (a high-quality video) related to shooting and editing, and distribute the shooting information obtained by the deconstruction on the time axis according to its position in the target video. Based on the shooting information distributed on the time axis, the client can be instructed which shooting information to use as time progresses when shooting or editing a video, such as scene layout, character layout, and shooting means. In this application, the client can not only be instructed to shoot or edit a video similar to the high-quality video, but the efficiency of shooting or editing is also improved.
Brief Description of the Drawings
FIG. 1 schematically shows an application environment diagram of a video shooting information acquisition method according to an embodiment of the present application;
FIG. 2 schematically shows a flowchart of a video shooting information acquisition method according to Embodiment 1 of the present application;
FIG. 3 is a flowchart of sub-steps of step S202 in FIG. 2;
FIG. 4 is a flowchart of sub-steps of step S202 in FIG. 2;
FIG. 5 schematically shows a flowchart of additional steps of the video shooting information acquisition method according to Embodiment 1 of the present application;
FIG. 6 schematically shows a specific operation example of the video shooting information acquisition method based on Embodiment 1 of the present application;
FIG. 7 schematically shows a flowchart of a video shooting and processing instruction method according to Embodiment 2 of the present application;
FIG. 8 schematically shows a specific operation example based on Embodiment 2 of the present application;
FIG. 9 schematically shows a block diagram of a video shooting information acquisition system according to Embodiment 3 of the present application;
FIG. 10 schematically shows a block diagram of a video shooting and processing instruction system according to Embodiment 4 of the present application;
FIG. 11 schematically shows a hardware architecture of a computer device according to Embodiment 5 of the present application.
具体实施方式detailed description
To make the objectives, technical solutions, and advantages of the present application clearer, the application is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the present application and do not limit it. All other embodiments obtained by a person of ordinary skill in the art on the basis of the embodiments of the present application, without creative effort, fall within the scope of protection of the present application.
It should be noted that descriptions involving "first", "second", and the like in the embodiments of the present application are for descriptive purposes only and should not be understood as indicating or implying relative importance or as implicitly specifying the number of the technical features referred to. A feature qualified by "first" or "second" may therefore explicitly or implicitly include at least one such feature. In addition, the technical solutions of the various embodiments may be combined with one another, provided that the combination can be implemented by a person of ordinary skill in the art. Where a combination of technical solutions is contradictory or cannot be implemented, the combination should be regarded as nonexistent and as falling outside the scope of protection claimed by the present application.
The inventor has recognized that high-quality video shooting involves information such as scene layout, shot distribution, and camera-movement distribution, and that creating such videos requires professional training in film and television production as well as extensive shooting experience. Specifically:
A high-quality video involves the following aspects:
(1) Picture quality: judged visually, the video is recorded with professional shooting equipment in a professional shooting environment;
(2) Content: the video is shot from a professionally designed script;
(3) Shooting technique: the video is shot using professional shooting skills, picture layout, composition, scene distribution, and so on.
In summary, shooting a high-quality video requires several elements: not only demanding software and hardware, but also professional shooting skills on the part of the video creator. This effectively raises the threshold for creating high-quality videos.
In view of this, the present application aims to use video understanding technology during video creation to guide users in shooting or editing high-quality videos. For example, high-quality videos are screened by intelligent means, and AI algorithms are used to quantitatively analyze the shooting techniques that make a video "high quality". By entering relevant keywords, a user can obtain shooting information for high-quality videos of a similar type, for example the necessary distribution, under the given storyline, of elements such as shot-scale changes, camera movements, shooting angles, and shot transitions along the timeline. Providing this shooting information lets users learn professional shooting techniques, so that they can create videos quickly and in a targeted way, which lowers the threshold for video creation. Specifically:
First, high-quality videos are analyzed.
The high-quality videos may include: high-definition videos with excellent picture quality shot with professional video hardware; and videos whose quality is improved through professional shooting methods (layout design: the design of scenes, the distribution of shots, the distribution of varied shot scales, and the execution of camera movements), an image stabilizer, or image stabilization applied in post-production.
Second, AI algorithms detect the settings, shots, shot scales, camera movements, and characters within each scene and, along the time direction, compile statistics on the temporal distribution of each shooting element at points (for settings, characters, and shot scales) and over segments (for camera movements).
Third, a user can choose a personalized shooting theme and search for high-quality videos that match it. The user then shoots or records a video guided by the shooting elements of the matched high-quality video, so as to produce a high-quality video of a similar type. The user may also fine-tune those shooting elements and shoot or record on the basis of the adjusted elements.
Fourth, after shooting a video, a user can also look up the shooting techniques (for example, shot scale and camera movement) of a matching (similar) high-quality video and edit the video according to those techniques, so that the techniques of the high-quality video are applied and reflected in the user's video. This efficiently improves video quality and lowers the threshold for creation.
The present application provides several embodiments that describe the video shooting information acquisition scheme and the video shooting or processing instruction scheme; see below for details.
In the description of the present application, it should be understood that the numerals preceding the steps do not indicate the order in which the steps are executed. They serve only to facilitate description and to distinguish the steps from one another, and therefore should not be construed as limiting the present application.
The terms used in the present application are explained below:
Instruct: includes controlling, guiding, and/or prompting.
AI (Artificial Intelligence) film breakdown: using artificial intelligence to examine a video's content and shooting methods, and thereby obtain the video's shooting parameters, such as shooting techniques and related statistical distribution information, for example each shot's content, staging, camera movement, shot scale, editing, sound, picture, rhythm, performance, and camera position.
Shot (镜): a video clip with a start time and an end time. A feature film generally consists of 400 to 600 shots.
Scene (场): corresponds to one video segment, which may consist of one or more shots.
Shot scale (景别): the difference in how much of the subject appears in the camera frame, caused by differing distances between the camera and the subject. From nearest to farthest, the shot scales are: close-up (above the shoulders), close shot (above the chest), medium shot (above the knees), full shot (the whole body and part of the surroundings), and long shot (the environment in which the subject is located).
Camera movement (运镜): the way the shooting device moves during a shot, such as pushing in, pulling out, panning, tracking, or remaining static.
Storyboard script: for visual media such as films, animation, TV series, advertisements, and music videos, a description of the composition of the images, prepared before actual shooting in the form of story panels. Continuous footage is broken down into units of one camera movement each, annotated with the camera movement, duration, dialogue, special effects, and so on. In this way, the content to be shot is briefly recorded during pre-production and serves as a reminder for each shot during shooting.
FIG. 1 schematically shows an application environment according to an embodiment of the present application. As shown in FIG. 1:
The computer device 10000 may connect to the client 30000 through the network 20000.
The computer device 10000 may provide services, such as providing shooting information, to control or prompt the shooting actions of the client 30000.
The computer device 10000 may be located in a data center at a single site or be distributed across different geographic locations (for example, at multiple sites). The computer device 10000 may provide services via one or more networks 20000. The network 20000 includes various network devices, such as routers, switches, multiplexers, hubs, modems, bridges, repeaters, firewalls, proxy devices, and/or the like. The network 20000 may include physical links, such as coaxial cable links, twisted-pair cable links, optical fiber links, and combinations thereof. The network 20000 may also include wireless links, such as cellular links, satellite links, and Wi-Fi links.
The computer device 10000 may be implemented by one or more computing nodes, which may include virtualized computing instances. A virtualized computing instance may include a virtual machine, such as an emulation of a computer system, operating system, or server. A computing node may load a virtual machine based on a virtual image and/or other data defining the specific software (for example, an operating system, a dedicated application, or a server) to be emulated. As demand for different types of processing services changes, different virtual machines may be loaded and/or terminated on one or more computing nodes. A hypervisor may be used to manage the use of different virtual machines on the same computing node.
The client 30000 may be configured to access the content and services of the computer device 10000. The client 30000 may be any type of electronic device that supports a camera function, such as a mobile device, a tablet device, or a video camera.
The client 30000 may output shooting information (for example, quantified technique information) to the user.
Specific technical solutions are described below through several embodiments. The solutions may be implemented by the computer device 10000.
Embodiment 1
FIG. 2 schematically shows a flowchart of the video shooting information acquisition method according to Embodiment 1 of the present application.
As shown in FIG. 2, the video shooting information acquisition method may include steps S200 to S204, in which:
Step S200: determine a target video to be analyzed, the target video having a corresponding timeline that represents video progress.
The target video may be a video submission based on any of various video formats or coding standards, for example AVI (Audio Video Interleaved), H.264/AVC (Advanced Video Coding), or H.265/HEVC (High Efficiency Video Coding).
The target video is preferably a high-quality video, such as a high-definition video with excellent picture quality shot with professional video hardware, or a video whose quality has been improved through professional shooting methods (layout design: the design of scenes, the distribution of shots, the distribution of varied shot scales, and the execution of camera movements) or through image stabilization in post-production.
Step S202: analyze the target video to obtain a plurality of shooting information.
The plurality of shooting information may correspond to the various shooting elements involved in shooting or editing a video. That is, from the distribution of the shooting elements in the target video (for example, total time of appearance and duration), shooting information such as how the target video's shooting methods (techniques) are laid out within it can be derived.
The target video may be analyzed in units of scenes and shots to obtain the plurality of shooting information.
(1) Taking the scene as the unit, obtain the theme of each scene. As an example, the theme of each scene may be identified with ECO (Efficient Convolutional Network for Online Video Understanding) or End-to-End Dense Video Captioning with Masked Transformer.
(2) Taking the shot as the unit, obtain each shot's shot scale, shooting angle, character information, shot type, camera movement, and setting.
Shot scale may include: long shot, full shot, medium shot, close shot, close-up, and extreme close-up.
Setting may include: indoor and outdoor. The setting may be further refined into office, plaza, café, and so on.
Character information may include: character position, pose (orientation and so on), and identity (man, woman, elderly person, police officer, lawyer, and so on).
As an example, AI algorithms may be used to compute, estimate, and compile statistics on the above shooting information. For instance:
The shot scale and camera movement of each shot may be identified with A Unified Framework for Shot Type Classification Based on Subject Centric Lens.
The shooting angle of each shot may be identified by shooting-angle detection using Back to the Feature: Learning Robust Camera Localization from Pixels to Pose.
The orientation of each character in each shot may be detected with head pose estimation methods (Towards Fast, Accurate and Stable 3D Dense Face Alignment; Fine-Grained Head Pose Estimation Without Keypoints).
The identity of each character in each shot may be detected with face recognition models (Caption-Supervised Face Recognition: Training a State-of-the-Art Face Model without Manual Annotation; Person Search in Videos with One Portrait Through Visual and Temporal Links).
Based on the above exemplary content, the quantity and distribution of each kind of shooting information in the target video can be determined.
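The "quantity and distribution" mentioned above can be illustrated with a minimal sketch. It assumes the detectors have already produced one shot scale per shot (镜); the tuple layout and the sample values are hypothetical and are not taken from the application.

```python
from collections import Counter, defaultdict

# Hypothetical per-shot detection results: (start_s, end_s, shot_scale).
shots = [
    (0.0, 4.0, "long shot"),
    (4.0, 6.5, "medium shot"),
    (6.5, 8.0, "close-up"),
    (8.0, 12.0, "medium shot"),
]

def summarize(shots):
    """Return (number of shots per shot scale, total seconds per shot scale)."""
    counts = Counter(scale for _, _, scale in shots)
    durations = defaultdict(float)
    for start, end, scale in shots:
        durations[scale] += end - start
    return counts, dict(durations)

counts, durations = summarize(shots)
```

The same counting could be applied to camera movements, shooting angles, or settings by swapping the third tuple field.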
Step S204: annotate the plurality of shooting information on the timeline, each item of shooting information being placed at its corresponding position on the timeline.
That is, each item of shooting information is laid out along the direction of the timeline.
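One way to picture step S204 is as a list of time-stamped annotations sorted along the timeline. The sketch below is only an illustration under assumptions: the `Annotation` type, its field names, and the half-open interval convention are hypothetical choices, not something the application specifies.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Annotation:
    start: float  # seconds from the start of the target video
    end: float    # equal to start for point-like information
    kind: str     # e.g. "shot_scale" or "camera_movement"
    value: str

def build_timeline(annotations):
    """Order the annotations by their position on the timeline."""
    return sorted(annotations, key=lambda a: (a.start, a.end))

def active_at(timeline, t):
    """Return annotations covering time t (half-open [start, end), or a point at t)."""
    return [a for a in timeline
            if a.start <= t < a.end or a.start == a.end == t]

timeline = build_timeline([
    Annotation(4.0, 9.0, "camera_movement", "push in"),
    Annotation(0.0, 4.0, "shot_scale", "long shot"),
    Annotation(4.0, 9.0, "shot_scale", "close-up"),
])
```

Querying `active_at(timeline, t)` as playback or recording time advances is one way a client could look up which shooting information applies at that moment.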
The video shooting information acquisition method provided in this embodiment deconstructs the shooting and editing information of the target video (a high-quality video) and distributes the resulting shooting information along the timeline according to where it occurs in the target video. Based on the shooting information distributed along the timeline, a client can be instructed as to which shooting information (for example, setting arrangement, character arrangement, and shooting techniques) to apply as time progresses while shooting or editing a video, so as to shoot or edit a video similar to the high-quality video and to improve shooting and editing efficiency.
As an example, the plurality of shooting information includes a plurality of shooting parameters. As shown in FIG. 3, step S202 may include: step S300, performing scene segmentation on the target video to obtain a plurality of scenes; step S302, performing shot segmentation on each of the scenes to obtain a plurality of shots, each scene including one or more shots; step S304, analyzing each of the shots to obtain its shooting parameters; and step S306, obtaining the scene information of each scene from the shooting parameters of the shots in that scene. The scene information of each scene includes the shooting parameters of the shots in that scene, and the positions of those shooting parameters on the timeline correspond to the positions of the shots within the scene. In this embodiment, the target video is analyzed in units of scenes and shots to obtain the shooting parameters of each shot in the different segments of the target video, which facilitates storage, classification, and user queries.
As an example, the plurality of shooting information includes a plurality of shooting parameters. As shown in FIG. 4, step S202 may include: step S400, performing scene segmentation on the target video to obtain a plurality of scenes; step S402, performing shot segmentation on each of the scenes to obtain a plurality of shots, each scene including one or more shots; step S404, analyzing the scenes to obtain the theme of each scene; step S406, analyzing each of the shots to obtain its shooting parameters; and step S408, obtaining the scene information of each scene from the theme of that scene and the shooting parameters of the shots in that scene. The scene information of each scene includes the theme of the scene and the shooting parameters of the shots in that scene, and the positions of those shooting parameters on the timeline correspond to the positions of the shots within the scene. In this embodiment, the target video is analyzed in units of scenes and shots to obtain the themes of the different segments of the target video and the shooting parameters of each shot within them, which facilitates storage, classification, and user queries based on information such as segment themes.
As an example, the shooting parameters of each shot include one or more of the following: shot scale, shooting angle, character information, shot type, camera movement, and setting.
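The relationship between scenes (场), shots (镜), and scene information can be sketched as a small data model. This is a minimal sketch under assumptions: segmentation and per-shot analysis are taken as already done, and the `Shot`, `SceneInfo`, and `build_scene_info` names and fields are hypothetical.

```python
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class Shot:
    start: float  # seconds on the target video's timeline
    end: float
    params: dict = field(default_factory=dict)  # shot scale, angle, movement, ...

@dataclass
class SceneInfo:
    start: float
    end: float
    theme: Optional[str]  # None when no theme analysis is performed
    shots: list

def build_scene_info(scene_bounds, shots, theme=None):
    """Collect the shots lying inside a scene, keeping their timeline order."""
    start, end = scene_bounds
    inside = [s for s in shots if start <= s.start and s.end <= end]
    inside.sort(key=lambda s: s.start)
    return SceneInfo(start, end, theme, inside)

shots = [
    Shot(3.0, 7.5, {"shot_scale": "close-up", "camera_movement": "push in"}),
    Shot(0.0, 3.0, {"shot_scale": "long shot", "camera_movement": "static"}),
    Shot(12.0, 15.0, {"shot_scale": "medium shot", "camera_movement": "pan"}),
]
info = build_scene_info((0.0, 10.0), shots, theme="cafe conversation")
```

Because each shot keeps its own start and end times, the shooting parameters inherit their timeline positions from the shots, which matches the correspondence described above.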
As an example, a graphical storyboard script may be generated for each shot to give the user clearer prompts.
As shown in FIG. 5, the video shooting information acquisition method may further include: step S500, generating a plurality of storyboard scripts for each scene according to the shooting parameters of the shots in that scene. Each shot in a scene corresponds to one or more storyboard scripts, and the positions of a scene's storyboard scripts on the timeline correspond to the positions of the shots within that scene. Specifically:
(1) Elements and element vector information may be generated from character information and the like. The element vector information includes the size and pose of the element and its relative position in a keyframe of the corresponding shot. The character information may be divided by identity, such as child or elderly person, or by occupation, such as police officer or lawyer.
(2) According to the element category of each element, the vector element associated with that category may be obtained from a vector element library.
(3) According to the setting of the corresponding shot, a designated canvas matching the setting is obtained from a canvas library.
(4) According to the element vector information, each vector element is placed on the designated canvas to generate the storyboard script. For example: the size of the vector element on the designated canvas is determined from the size of the element; the pose of the vector element on the designated canvas is determined from the pose of the element; and the relative position of the vector element on the designated canvas is determined from the relative position of the element in the reference image (for example, the keyframe).
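Item (4) above amounts to mapping relative sizes and positions measured in the keyframe onto canvas coordinates. The following is a minimal sketch of that mapping; the `ElementVector` fields (sizes and center positions expressed as fractions of the keyframe) and the returned dictionary layout are assumptions made for illustration.

```python
from dataclasses import dataclass

@dataclass
class ElementVector:
    category: str  # e.g. "police officer"; used to look up a vector element
    rel_x: float   # element center x as a fraction of keyframe width
    rel_y: float   # element center y as a fraction of keyframe height
    rel_w: float   # element width as a fraction of keyframe width
    rel_h: float   # element height as a fraction of keyframe height
    pose: str      # e.g. "facing left"

def place_on_canvas(elem, canvas_w, canvas_h):
    """Scale an element's relative geometry to a canvas of the given size."""
    width = elem.rel_w * canvas_w
    height = elem.rel_h * canvas_h
    center_x = elem.rel_x * canvas_w
    center_y = elem.rel_y * canvas_h
    return {
        "category": elem.category,
        "x": center_x - width / 2,   # top-left corner on the canvas
        "y": center_y - height / 2,
        "width": width,
        "height": height,
        "pose": elem.pose,
    }

placed = place_on_canvas(
    ElementVector("police officer", 0.5, 0.5, 0.25, 0.5, "facing left"), 800, 600
)
```

Storing geometry as fractions keeps the layout independent of canvas resolution, so the same element vector information can be drawn on canvases of different sizes.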
By identifying the elements in a keyframe of the corresponding shot and placing the matching vector elements on the designated canvas, a storyboard script in vector-graphic form is obtained. This makes producing storyboard scripts more efficient and easier, and improves the user experience.
For example, the shot scale, shooting angle, character information, shot type, and camera movement may be added to the designated canvas as text.
For example, the storyboard script may be an editable vector graphic that can be modified to suit the user's needs (habits). The modification includes at least one of the following: changing the size of a vector element, changing the pose of a vector element, changing the relative position of a vector element on the designated canvas, deleting a vector element, or adding a new vector element. In this way, storyboard scripts can be personalized for the user.
For example, the arrangement of vector elements on the designated canvas may be adjusted according to user habits, user profiles, and the like, so as to generate storyboard scripts that more accurately match the user's drawing habits, further improving script creation efficiency and user retention.
For ease of understanding, an operation example is provided below with reference to FIG. 6:
S600: find a high-quality video to serve as the target video.
S602: segment the target video to obtain a plurality of scenes. As shown in FIG. 6, the target video may be divided into a plurality of scenes.
S604: segment each scene to obtain a plurality of shots. As shown in FIG. 6, one of the scenes consists of five shots.
S606: run various detections on each shot to deconstruct it into various shooting information, that is, information corresponding to the various shooting and editing elements.
Examples include camera-movement detection, shot-scale detection, character analysis (character identity analysis and character orientation analysis), and shooting-angle analysis.
S608: obtain the scene information of each scene, taking the scene as the unit.
The scene information of each scene includes the detection information of the shots in that scene, such as camera movement, shot scale, characters, and shooting angle.
All of the above information is annotated on the timeline.
When several people need to shoot or edit the same video at the same time, the information annotated on the timeline can be delivered to the individual clients in segments, instructing different clients to perform different operations and thereby enabling collaboration.
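The segment-wise delivery described above could, for instance, clip the timeline annotations to a time window per client. The sketch below assumes each client is responsible for one contiguous window; the window assignment, the tuple layout, and the function name are hypothetical, and the application does not say how segments are allocated.

```python
def split_for_clients(annotations, windows):
    """Clip timeline annotations to each client's time window.

    annotations: list of (start_s, end_s, label) tuples on the shared timeline.
    windows: dict mapping a client id to its (start_s, end_s) window.
    """
    plan = {}
    for client_id, (w_start, w_end) in windows.items():
        plan[client_id] = [
            (max(start, w_start), min(end, w_end), label)
            for start, end, label in annotations
            if start < w_end and end > w_start  # keep only overlapping items
        ]
    return plan

annotations = [(0, 6, "long shot"), (6, 10, "close-up")]
plan = split_for_clients(annotations, {"client_a": (0, 5), "client_b": (5, 10)})
```

An annotation that spans a window boundary is delivered to both clients, each receiving only the part that falls inside its own window.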
When multi-person, repeated, or multi-angle shooting is not possible (for example, there is only one creator or one shooting device, or the event, such as a concert recording, cannot be re-shot), a single video can be captured first and then edited on the basis of the above scene information to obtain a video that switches among multiple different shot scales.
Embodiment 2
This embodiment provides a video shooting and processing instruction method. For some of the technical details and effects, refer to the description above.
FIG. 7 schematically shows a flowchart of the video shooting and processing instruction method according to Embodiment 2 of the present application.
As shown in FIG. 7, the video shooting and processing instruction method may include steps S700 to S704, in which:
Step S700: receive request information from a client.
The request information may include the following:
(1) Text information, including the shooting scenario, theme, shooting location, setting, and so on;
(2) Information about an already-shot video, such as video tags.
Step S702: obtain target scene information according to the request information. The target scene information includes a plurality of shooting information annotated on the same timeline, and the position of each item of shooting information on the timeline represents its order in time.
According to the request information, the best-matching video segment (the target scene) is retrieved from a database.
The target scene carries the distribution along the timeline of the shooting information (shots, shot scales, camera movements, characters, and so on) of the corresponding video segment.
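The application does not state how the best-matching segment is found. As one possible illustration, the sketch below ranks stored scenes (场) by Jaccard similarity between request keywords and scene keywords. The similarity measure, the record layout, and the sample database are all assumptions, not the method of the application.

```python
def jaccard(a, b):
    """Jaccard similarity of two keyword collections (0.0 when both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def find_target_scene(request_keywords, scene_db):
    """Return the stored scene whose keywords best match the request."""
    return max(scene_db, key=lambda s: jaccard(request_keywords, s["keywords"]))

# Hypothetical database records: an id, theme keywords, and timeline annotations.
scene_db = [
    {"id": "scene-1", "keywords": {"plaza", "chase", "outdoor"},
     "timeline": [(0, 3, "long shot"), (3, 5, "close-up")]},
    {"id": "scene-2", "keywords": {"cafe", "conversation", "indoor"},
     "timeline": [(0, 4, "medium shot"), (4, 6, "close shot")]},
]
target = find_target_scene({"cafe", "conversation"}, scene_db)
```

`max` raises `ValueError` on an empty database, so a real service would need to handle the case where nothing is stored or nothing matches.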
Step S704: return the target scene information to the client to instruct the client to perform video shooting or video processing according to each item of shooting information and its annotated position on the timeline.
The client may, according to the scene information of the target scene, guide the user in shooting or editing, or shoot or edit automatically. Taking automatic editing as an example: a video is shot at a single shot scale (full shot) and is later edited, with reference to the retrieved shooting information, into a varied video that switches among multiple shot scales, achieving an effect similar to that of the high-quality video segment.
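The automatic-editing example can be pictured as cropping a full-shot frame more tightly for nearer shot scales. The sketch below computes a crop rectangle per shot scale; the zoom factors in `ZOOM` and the centering logic are purely illustrative assumptions, since the application does not define how a shot scale maps to a crop.

```python
# Assumed zoom factor per shot scale, relative to the full-shot source frame.
ZOOM = {"full shot": 1.0, "medium shot": 1.5, "close shot": 2.0, "close-up": 3.0}

def crop_rect(frame_w, frame_h, scale, center=(0.5, 0.5)):
    """Return (x, y, w, h) of a crop emulating the given shot scale.

    center is the subject position as fractions of the frame size. The crop is
    shifted where necessary so that it stays inside the frame.
    """
    zoom = ZOOM[scale]
    w, h = frame_w / zoom, frame_h / zoom
    cx, cy = center[0] * frame_w, center[1] * frame_h
    x = min(max(cx - w / 2, 0), frame_w - w)
    y = min(max(cy - h / 2, 0), frame_h - h)
    return (x, y, w, h)

close = crop_rect(1920, 1080, "close shot")
full = crop_rect(1920, 1080, "full shot")
```

Applying the rectangle for each annotated interval of the retrieved timeline would yield a plan for which region of the source video to show at each moment.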
The video shooting and processing instruction method provided in this embodiment uses the theme, content, and so on that the user enters for the video to be shot or edited to find a high-quality video segment that meets the user's expectations, and returns to the client the shooting information (scene information) obtained by deconstructing that segment. This shooting information is distributed along the timeline. Based on it, the client can be instructed as to which shooting information (for example, setting arrangement, character arrangement, shooting techniques, and editing) to apply as time progresses while shooting or editing a video.
As an example, the target scene information is the scene information of a target scene among multiple scenes; the method further includes acquiring the target scene information in advance:
performing scene segmentation on the target video to obtain the target scene;
performing shot segmentation on the target scene to obtain one or more shots;
analyzing each shot to obtain the shooting parameters of each shot; and
obtaining the target scene information according to the shooting parameters of each shot;
wherein the target scene information includes the shooting parameters of each shot, and the distribution of the positions of the shooting parameters of each shot on the time axis corresponds to the distribution of the positions of the shots within the target scene.
As an example, the target scene information is the scene information of a target scene among multiple scenes; the method further includes acquiring the target scene information in advance:
performing scene segmentation on the target video to obtain the target scene;
performing shot segmentation on the target scene to obtain one or more shots;
analyzing the target scene to obtain the theme of the target scene;
analyzing each shot to obtain the shooting parameters of each shot; and
obtaining the target scene information according to the theme and the shooting parameters of each shot;
wherein the target scene information includes the theme and the shooting parameters of each shot, and the distribution of the positions of the shooting parameters of each shot on the time axis corresponds to the distribution of the positions of the shots within the target scene.
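One possible in-memory shape for the target scene (场) information is sketched below in Python. It assumes that segmentation and per-shot (镜) analysis have already run upstream and produced start and end times plus a parameter dictionary for each shot; the `Shot` type, the field names, and `build_scene_info` are illustrative assumptions rather than structures defined by the application.

```python
from dataclasses import dataclass, field


@dataclass
class Shot:
    start: float                                  # seconds in the target video
    end: float                                    # seconds in the target video
    params: dict = field(default_factory=dict)    # e.g. shot scale, camera move


def build_scene_info(theme, shots):
    """Assemble scene information whose timeline mirrors the order of the shots.

    Positions are re-based so the scene starts at 0 on its own time axis.
    """
    ordered = sorted(shots, key=lambda s: s.start)
    origin = ordered[0].start if ordered else 0.0
    timeline = [
        {"start": s.start - origin, "end": s.end - origin, **s.params}
        for s in ordered
    ]
    return {"theme": theme, "timeline": timeline}
```

Because each timeline entry keeps the interval of the shot it came from, the position of the shooting parameters on the time axis follows the position of the shots within the scene, which is the correspondence the example above requires.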
For ease of understanding, an operation example is provided below with reference to FIG. 8:
S800: The client 30000 receives search content entered by the user and initiates a search request based on the search content.
The search content may include a video theme, a shooting location, and the like.
S802: The computer device 10000 searches the database according to the search request.
S804: The computer device 10000 returns to the client 30000 the scene information of the retrieved video segment most relevant to the search content (the scene information may include multiple pieces of shooting information, each distributed on the time axis).
S806: The client 30000 shoots or edits according to each piece of shooting information and its position on the time axis.
S808: The client 30000 generates, from the shooting or editing result, a video with high-quality shooting techniques and content.
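Steps S802 to S806 can be sketched as follows in Python. The application does not say how relevance is computed, so ranking by keyword overlap is purely an assumption here, as are the record layout (`tags`, `scene_info`) and the function names.

```python
def search_scene_info(database, query):
    """S802/S804: return the scene info whose tags overlap the query most.

    Relevance by word overlap is an assumed stand-in for the real ranking.
    """
    words = set(query.lower().split())
    best = max(database, key=lambda rec: len(words & set(rec["tags"])), default=None)
    if best is None or not words & set(best["tags"]):
        return None  # nothing in the database matches the search content
    return best["scene_info"]


def client_steps(scene_info):
    """S806: turn the returned timeline into ordered shooting instructions."""
    return [
        f'{e["start"]:.1f}-{e["end"]:.1f}s: {e["shot_scale"]}'
        for e in scene_info["timeline"]
    ]
```

The client-side function only formats instructions; actual shooting or editing (S806 to S808) would consume the same timeline entries in order.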
Embodiment 3
FIG. 9 schematically shows a block diagram of a video shooting information acquisition system according to Embodiment 3 of the present application. The system may be divided into one or more program modules, which are stored in a storage medium and executed by one or more processors to carry out the embodiments of the present application. A program module, as the term is used in the embodiments of the present application, is a series of computer-readable instruction segments capable of performing a specific function. The following description details the function of each program module in this embodiment.
As shown in FIG. 9, the video shooting information acquisition system 900 may include a determination module 910, an analysis module 920, and an annotation module 930, wherein:
the determination module 910 is configured to determine a target video for analysis, the target video having a corresponding time axis that represents video progress;
the analysis module 920 is configured to analyze the target video to obtain multiple pieces of shooting information; and
the annotation module 930 is configured to annotate the multiple pieces of shooting information onto the time axis, each piece of shooting information being located at its corresponding position on the time axis.
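The role of the annotation module 930 can be illustrated with a small Python sketch of an annotated time axis. The `Timeline` class and its methods are hypothetical; the sketch assumes each piece of shooting information applies from its annotated position until the next annotation.

```python
import bisect


class Timeline:
    """A time axis on which shooting information is annotated by start time."""

    def __init__(self, duration):
        self.duration = duration
        self._starts = []   # sorted start positions, in seconds
        self._items = []    # shooting information, parallel to _starts

    def annotate(self, start, info):
        """Place `info` at position `start`, keeping the axis sorted."""
        if not 0 <= start <= self.duration:
            raise ValueError("position outside the time axis")
        i = bisect.bisect_right(self._starts, start)
        self._starts.insert(i, start)
        self._items.insert(i, info)

    def at(self, t):
        """Return the shooting information in effect at time `t`, if any."""
        i = bisect.bisect_right(self._starts, t) - 1
        return self._items[i] if i >= 0 else None
```

Keeping the positions sorted lets a client look up, for any playback time, the shooting information that applies at that moment.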
As an example, the multiple pieces of shooting information include multiple shooting parameters; the analysis module 920 is further configured to:
perform scene segmentation on the target video to obtain multiple scenes;
perform shot segmentation on each of the multiple scenes to obtain multiple shots, each scene including one or more shots;
analyze each of the multiple shots to obtain the shooting parameters of each shot; and
obtain the scene information of each scene according to the shooting parameters of the shots within that scene;
wherein the scene information of each scene includes the shooting parameters of the shots within that scene, and the distribution of the positions of those shooting parameters on the time axis corresponds to the distribution of the positions of the shots within that scene.
As an example, the multiple pieces of shooting information include multiple shooting parameters; the analysis module 920 is further configured to:
perform scene segmentation on the target video to obtain multiple scenes;
perform shot segmentation on each of the multiple scenes to obtain multiple shots, each scene including one or more shots;
analyze the multiple scenes to obtain the theme of each scene;
analyze each of the multiple shots to obtain the shooting parameters of each shot; and
obtain the scene information of each scene according to the theme of that scene and the shooting parameters of the shots within that scene;
wherein the scene information of each scene includes the theme of that scene and the shooting parameters of the shots within that scene, and the distribution of the positions of those shooting parameters on the time axis corresponds to the distribution of the positions of the shots within that scene.
As an example, the shooting parameters of each shot include one or more of the following: shot scale, shooting angle, character information, shot type, camera movement, and setting.
As an example, the system further includes a script generation module configured to:
generate multiple storyboard scripts for each scene according to the shooting parameters of the shots within that scene;
wherein each shot within each scene corresponds to one or more storyboard scripts, and the distribution of the positions of the storyboard scripts of each scene on the time axis corresponds to the distribution of the positions of the shots within that scene.
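A minimal Python sketch of storyboard script (分镜脚本) generation follows. It emits exactly one script per shot, although the text allows one or more; the parameter keys, the default values, and the wording of the script line are all assumptions made for illustration.

```python
def storyboard(scene_timeline):
    """Generate one storyboard script per shot, kept at the shot's position."""
    scripts = []
    for i, shot in enumerate(scene_timeline, start=1):
        text = (
            f'Shot {i}: {shot.get("shot_scale", "unspecified scale")}, '
            f'{shot.get("camera_move", "static")}, '
            f'{shot.get("angle", "eye-level")} angle'
        )
        # Each script inherits the time-axis interval of its source shot.
        scripts.append({"start": shot["start"], "end": shot["end"], "script": text})
    return scripts
```

Since every script inherits the interval of its source shot, the scripts line up on the time axis in the same order as the shots within the scene.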
Embodiment 4
FIG. 10 schematically shows a block diagram of a video shooting and processing instruction system according to Embodiment 4 of the present application. The system may be divided into one or more program modules, which are stored in a storage medium and executed by one or more processors to carry out the embodiments of the present application. A program module, as the term is used in the embodiments of the present application, is a series of computer-readable instruction segments capable of performing a specific function. The following description details the function of each program module in this embodiment.
As shown in FIG. 10, the video shooting and processing instruction system 1000 may include a receiving module 1010, an acquisition module 1020, and a returning module 1030, wherein:
the receiving module 1010 is configured to receive request information from a client;
the acquisition module 1020 is configured to acquire target scene information according to the request information, the target scene information including multiple pieces of shooting information annotated on a single time axis, the position of each piece of shooting information on the time axis indicating its chronological order; and
the returning module 1030 is configured to return the target scene information to the client, to instruct the client to perform video shooting or video processing according to each piece of shooting information and the annotated position of each piece of shooting information on the time axis.
Optionally, the target scene information is the scene information of a target scene among multiple scenes; the system further includes a preset acquisition module configured to acquire the target scene information in advance by:
performing scene segmentation on the target video to obtain the target scene;
performing shot segmentation on the target scene to obtain one or more shots;
analyzing each shot to obtain the shooting parameters of each shot; and
obtaining the target scene information according to the shooting parameters of each shot;
wherein the target scene information includes the shooting parameters of each shot, and the distribution of the positions of the shooting parameters of each shot on the time axis corresponds to the distribution of the positions of the shots within the target scene.
Optionally, the target scene information is the scene information of a target scene among multiple scenes; the system further includes a preset acquisition module configured to acquire the target scene information in advance by:
performing scene segmentation on the target video to obtain the target scene;
performing shot segmentation on the target scene to obtain one or more shots;
analyzing the target scene to obtain the theme of the target scene;
analyzing each shot to obtain the shooting parameters of each shot; and
obtaining the target scene information according to the theme and the shooting parameters of each shot;
wherein the target scene information includes the theme and the shooting parameters of each shot, and the distribution of the positions of the shooting parameters of each shot on the time axis corresponds to the distribution of the positions of the shots within the target scene.
Embodiment 5
FIG. 11 schematically shows the hardware architecture of a computer device 10000 according to Embodiment 5 of the present application. In this embodiment, the computer device 10000 is a device capable of automatically performing numerical computation and/or information processing according to preset or stored instructions. For example, it may be a rack server, a blade server, a tower server, or a cabinet server (including a standalone server or a server cluster composed of multiple servers). As shown in FIG. 11, the computer device 10000 includes at least, but is not limited to, a memory 10010, a processor 10020, and a network interface 10030, which can be communicatively linked to one another through a system bus, wherein:
The memory 10010 includes at least one type of computer-readable storage medium. Readable storage media include flash memory, hard disks, multimedia cards, card-type memory (for example, SD or DX memory), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, magnetic disks, optical discs, and the like. In some embodiments, the memory 10010 may be an internal storage module of the computer device 10000, such as its hard disk or main memory. In other embodiments, the memory 10010 may be an external storage device of the computer device 10000, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card, or a flash card fitted to the computer device 10000. The memory 10010 may also include both an internal storage module of the computer device 10000 and an external storage device. In this embodiment, the memory 10010 is generally used to store the operating system and the various application software installed on the computer device 10000, such as the program code of the video shooting information acquisition method and of the video shooting and processing instruction method. In addition, the memory 10010 may be used to temporarily store various types of data that have been output or are to be output.
In some embodiments, the processor 10020 may be a central processing unit (CPU), a controller, a microcontroller, a microprocessor, or another data processing chip. The processor 10020 is generally used to control the overall operation of the computer device 10000, for example performing control and processing related to data exchange or communication with the computer device 10000. In this embodiment, the processor 10020 is used to run the program code stored in the memory 10010 or to process data.
The network interface 10030 may include a wireless network interface or a wired network interface, and is generally used to establish communication links between the computer device 10000 and other computer devices. For example, the network interface 10030 is used to connect the computer device 10000 to an external terminal through a network, and to establish a data transmission channel and a communication link between the computer device 10000 and the external terminal. The network may be a wireless or wired network such as an intranet, the Internet, the Global System for Mobile Communications (GSM), Wideband Code Division Multiple Access (WCDMA), a 4G network, a 5G network, Bluetooth, or Wi-Fi.
It should be noted that FIG. 11 shows only a computer device with components 10010-10030; it should be understood that not all of the illustrated components are required, and more or fewer components may be implemented instead.
In this embodiment, the video shooting information acquisition method and the video shooting and processing instruction method stored in the memory 10010 may also be divided into one or more program modules and executed by one or more processors (the processor 10020 in this embodiment) to carry out the embodiments of the present application.
Embodiment 6
An embodiment of the present application further provides a computer-readable storage medium on which computer-readable instructions are stored. When executed by a processor, the computer-readable instructions implement the following steps:
determining a target video for analysis, the target video having a corresponding time axis that represents video progress;
analyzing the target video to obtain multiple pieces of shooting information; and
annotating the multiple pieces of shooting information onto the time axis, each piece of shooting information being located at its corresponding position on the time axis;
or, when executed by a processor, the computer-readable instructions implement the following steps:
receiving request information from a client;
acquiring target scene information according to the request information, the target scene information including multiple pieces of shooting information annotated on a single time axis, the position of each piece of shooting information on the time axis indicating its chronological order; and
returning the target scene information to the client, to instruct the client to perform video shooting or video processing according to each piece of shooting information and the annotated position of each piece of shooting information on the time axis.
In this embodiment, the computer-readable storage medium includes flash memory, hard disks, multimedia cards, card-type memory (for example, SD or DX memory), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, magnetic disks, optical discs, and the like. In some embodiments, the computer-readable storage medium may be an internal storage unit of a computer device, such as its hard disk or main memory. In other embodiments, the computer-readable storage medium may be an external storage device of the computer device, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) card, or a flash card fitted to the computer device. The computer-readable storage medium may also include both an internal storage unit of the computer device and an external storage device. In this embodiment, the computer-readable storage medium is generally used to store the operating system and the various application software installed on the computer device, such as the program code of the video shooting information acquisition method and of the video shooting and processing instruction method in the embodiments. In addition, the computer-readable storage medium may be used to temporarily store various types of data that have been output or are to be output.
Those skilled in the art will appreciate that the modules or steps of the embodiments of the present application described above may be implemented with general-purpose computing devices. They may be concentrated on a single computing device or distributed over a network of multiple computing devices. Optionally, they may be implemented as program code executable by a computing device, so that they can be stored in a storage device and executed by a computing device. In some cases, the steps shown or described may be performed in an order different from the one given here, or the modules or steps may each be fabricated as a separate integrated circuit module, or several of them may be fabricated as a single integrated circuit module. The embodiments of the present application are therefore not limited to any specific combination of hardware and software.
The above are only preferred embodiments of the present application and do not limit its patent scope. Any equivalent structure or equivalent process transformation made using the contents of the description and drawings of the present application, or any direct or indirect application in other related technical fields, likewise falls within the scope of patent protection of the present application.

Claims (20)

  1. A video shooting information acquisition method, wherein the method comprises:
    determining a target video for analysis, the target video having a corresponding time axis that represents video progress;
    analyzing the target video to obtain multiple pieces of shooting information; and
    annotating the multiple pieces of shooting information onto the time axis, each piece of shooting information being located at its corresponding position on the time axis.
  2. The video shooting information acquisition method according to claim 1, wherein the multiple pieces of shooting information comprise multiple shooting parameters;
    the analyzing the target video to obtain multiple pieces of shooting information comprises:
    performing scene segmentation on the target video to obtain multiple scenes;
    performing shot segmentation on each of the multiple scenes to obtain multiple shots, each scene comprising one or more shots;
    analyzing each of the multiple shots to obtain the shooting parameters of each shot; and
    obtaining the scene information of each scene according to the shooting parameters of the shots within that scene;
    wherein the scene information of each scene comprises the shooting parameters of the shots within that scene, and the distribution of the positions of those shooting parameters on the time axis corresponds to the distribution of the positions of the shots within that scene.
  3. The video shooting information acquisition method according to claim 1 or 2, wherein the multiple pieces of shooting information comprise multiple shooting parameters;
    the analyzing the target video to obtain multiple pieces of shooting information comprises:
    performing scene segmentation on the target video to obtain multiple scenes;
    performing shot segmentation on each of the multiple scenes to obtain multiple shots, each scene comprising one or more shots;
    analyzing the multiple scenes to obtain the theme of each scene;
    analyzing each of the multiple shots to obtain the shooting parameters of each shot; and
    obtaining the scene information of each scene according to the theme of that scene and the shooting parameters of the shots within that scene;
    wherein the scene information of each scene comprises the theme of that scene and the shooting parameters of the shots within that scene, and the distribution of the positions of those shooting parameters on the time axis corresponds to the distribution of the positions of the shots within that scene.
  4. The video shooting information acquisition method according to claim 2 or 3, wherein the shooting parameters of each shot comprise one or more of the following: shot scale, shooting angle, character information, shot type, camera movement, and setting.
  5. The video shooting information acquisition method according to any one of claims 2-4, further comprising:
    generating multiple storyboard scripts for each scene according to the shooting parameters of the shots within that scene;
    wherein each shot within each scene corresponds to one or more storyboard scripts, and the distribution of the positions of the storyboard scripts of each scene on the time axis corresponds to the distribution of the positions of the shots within that scene.
  6. A video shooting information acquisition system, wherein the system comprises:
    a determination module configured to determine a target video for analysis, the target video having a corresponding time axis that represents video progress;
    an analysis module configured to analyze the target video to obtain multiple pieces of shooting information; and
    an annotation module configured to annotate the multiple pieces of shooting information onto the time axis, each piece of shooting information being located at its corresponding position on the time axis.
  7. A computer device, comprising a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, wherein the computer-readable instructions, when executed by the processor, implement the following steps:
    determining a target video for analysis, the target video having a corresponding time axis that represents video progress;
    analyzing the target video to obtain multiple pieces of shooting information; and
    annotating the multiple pieces of shooting information onto the time axis, each piece of shooting information being located at its corresponding position on the time axis.
  8. The computer device according to claim 7, wherein the multiple pieces of shooting information comprise multiple shooting parameters;
    the analyzing the target video to obtain multiple pieces of shooting information comprises:
    performing scene segmentation on the target video to obtain multiple scenes;
    performing shot segmentation on each of the multiple scenes to obtain multiple shots, each scene comprising one or more shots;
    analyzing each of the multiple shots to obtain the shooting parameters of each shot; and
    obtaining the scene information of each scene according to the shooting parameters of the shots within that scene;
    wherein the scene information of each scene comprises the shooting parameters of the shots within that scene, and the distribution of the positions of those shooting parameters on the time axis corresponds to the distribution of the positions of the shots within that scene.
  9. The computer device according to claim 7 or 8, wherein the multiple pieces of shooting information comprise multiple shooting parameters;
    the analyzing the target video to obtain multiple pieces of shooting information comprises:
    performing scene segmentation on the target video to obtain multiple scenes;
    performing shot segmentation on each of the multiple scenes to obtain multiple shots, each scene comprising one or more shots;
    analyzing the multiple scenes to obtain the theme of each scene;
    analyzing each of the multiple shots to obtain the shooting parameters of each shot; and
    obtaining the scene information of each scene according to the theme of that scene and the shooting parameters of the shots within that scene;
    wherein the scene information of each scene comprises the theme of that scene and the shooting parameters of the shots within that scene, and the distribution of the positions of those shooting parameters on the time axis corresponds to the distribution of the positions of the shots within that scene.
  10. The computer device according to claim 7 or 8, wherein the shooting parameters of each mirror include one or more of the following: shot scale, shooting angle, character information, mirror type, camera movement operation, and scene.
  11. The computer device according to any one of claims 8 to 10, wherein the computer-readable instructions, when executed by the processor, further implement the following steps:
    generating a plurality of shot scripts for each field according to the shooting parameters of the mirrors in that field;
    wherein each mirror in each field corresponds to one or more shot scripts, and the position distribution of the shot scripts of each field on the time axis corresponds to the position distribution of the mirrors within that field.
  12. A computer-readable storage medium, wherein computer-readable instructions are stored in the computer-readable storage medium, and the computer-readable instructions are executable by at least one processor to cause the at least one processor to perform the following steps:
    determining a target video to be analyzed, the target video having a corresponding time axis that represents video progress;
    analyzing the target video to obtain a plurality of pieces of shooting information; and
    marking the plurality of pieces of shooting information on the time axis, each piece of shooting information being located at a corresponding position on the time axis.
  13. The computer-readable storage medium according to claim 12, wherein the plurality of pieces of shooting information includes a plurality of shooting parameters;
    the analyzing of the target video to obtain the plurality of pieces of shooting information includes:
    performing field segmentation on the target video to obtain a plurality of fields;
    performing mirror segmentation on each of the plurality of fields to obtain a plurality of mirrors, each field including one or more mirrors;
    analyzing each of the plurality of mirrors to obtain the shooting parameters of each mirror; and
    obtaining the field information of each field according to the shooting parameters of the mirrors in that field;
    wherein the field information of each field includes the shooting parameters of the mirrors in that field, and the position distribution of those shooting parameters on the time axis corresponds to the position distribution of the mirrors within that field.
  14. The computer-readable storage medium according to claim 12 or 13, wherein the plurality of pieces of shooting information includes a plurality of shooting parameters;
    the analyzing of the target video to obtain the plurality of pieces of shooting information includes:
    performing field segmentation on the target video to obtain a plurality of fields;
    performing mirror segmentation on each of the plurality of fields to obtain a plurality of mirrors, each field including one or more mirrors;
    analyzing the plurality of fields to obtain a theme of each field;
    analyzing each of the plurality of mirrors to obtain the shooting parameters of each mirror; and
    obtaining the field information of each field according to the theme of that field and the shooting parameters of the mirrors in that field;
    wherein the field information of each field includes the theme of that field and the shooting parameters of the mirrors in that field, and the position distribution of those shooting parameters on the time axis corresponds to the position distribution of the mirrors within that field.
  15. A video shooting and processing instruction method, wherein the method includes:
    receiving request information from a client;
    acquiring target field information according to the request information, the target field information including a plurality of pieces of shooting information marked on the same time axis, the position of each piece of shooting information on the time axis representing the time order of the pieces of shooting information; and
    returning the target field information to the client, to instruct the client to perform video shooting or video processing according to each piece of shooting information and its marked position on the time axis.
  16. The video shooting and processing instruction method according to claim 15, wherein the target field information is the field information of a target field among a plurality of fields; the method further includes acquiring the target field information in advance by:
    performing field segmentation on the target video to obtain the target field;
    performing mirror segmentation on the target field to obtain one or more mirrors;
    analyzing each mirror to obtain the shooting parameters of each mirror; and
    obtaining the target field information according to the shooting parameters of each mirror;
    wherein the target field information includes the shooting parameters of each mirror, and the position distribution of the shooting parameters of the mirrors on the time axis corresponds to the position distribution of the mirrors within the target field.
  17. The video shooting and processing instruction method according to claim 15 or 16, wherein the target field information is the field information of a target field among a plurality of fields; the method further includes acquiring the target field information in advance by:
    performing field segmentation on the target video to obtain the target field;
    performing mirror segmentation on the target field to obtain one or more mirrors;
    analyzing the target field to obtain a theme of the target field;
    analyzing each mirror to obtain the shooting parameters of each mirror; and
    obtaining the target field information according to the theme and the shooting parameters of each mirror;
    wherein the target field information includes the theme and the shooting parameters of each mirror, and the position distribution of the shooting parameters of the mirrors on the time axis corresponds to the position distribution of the mirrors within the target field.
  18. A video shooting and processing instruction system, wherein the system includes:
    a receiving module, configured to receive request information from a client;
    an acquisition module, configured to acquire target field information according to the request information, the target field information including a plurality of pieces of shooting information marked on the same time axis, the position of each piece of shooting information on the time axis representing the time order of the pieces of shooting information; and
    a return module, configured to return the target field information to the client, to instruct the client to perform video shooting or video processing according to each piece of shooting information and its marked position on the time axis.
  19. A computer device, comprising a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor, when executing the computer program, implements the steps of the video shooting and processing instruction method according to any one of claims 15 to 17.
  20. A computer-readable storage medium storing a computer program, wherein the computer program is executable by at least one processor to cause the at least one processor to perform the steps of the video shooting and processing instruction method according to any one of claims 15 to 17.
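
The data flow recited in claims 12 to 17 (fields made of mirrors, per-mirror shooting parameters marked on one time axis, and target field information looked up for a client) can be illustrated with a minimal sketch. It is not the applicant's implementation: the class names, the function names `annotate_time_axis` and `get_target_field_info`, the parameter keys, and the choice to mark each mirror at its start time are all assumptions made for illustration. Field segmentation, mirror segmentation, and parameter analysis are assumed to have been done already, since the claims do not say how they are performed.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Mirror:
    """One mirror: a time span plus its shooting parameters (hypothetical keys)."""
    start: float  # seconds from the start of the target video
    end: float
    params: Dict[str, str]


@dataclass
class Field:
    """One field: consecutive mirrors, optionally with a theme (claims 9, 14, 17)."""
    mirrors: List[Mirror]
    theme: str = ""


@dataclass
class Annotation:
    """A piece of shooting information pinned to a position on the time axis."""
    position: float
    field_index: int
    info: Dict[str, str]


def annotate_time_axis(fields: List[Field]) -> List[Annotation]:
    """Mark each mirror's shooting parameters at that mirror's start time.

    Assumption: the mirror's start time is used as its marked position.
    """
    axis: List[Annotation] = []
    for index, fld in enumerate(fields):
        for mirror in fld.mirrors:
            info = dict(mirror.params)
            if fld.theme:
                info["theme"] = fld.theme
            axis.append(Annotation(mirror.start, index, info))
    # Positions on the axis encode the time order of the shooting information.
    axis.sort(key=lambda a: a.position)
    return axis


def get_target_field_info(axis: List[Annotation], field_index: int) -> List[Annotation]:
    """Return one field's annotations in time order (the lookup step of claim 15)."""
    return [a for a in axis if a.field_index == field_index]


# Example input: two fields that an upstream analysis step is assumed to have produced.
fields = [
    Field(theme="opening", mirrors=[
        Mirror(0.0, 4.0, {"shot_scale": "long shot", "camera_movement": "pan"}),
        Mirror(4.0, 9.5, {"shot_scale": "close-up", "shooting_angle": "eye level"}),
    ]),
    Field(mirrors=[Mirror(9.5, 15.0, {"shot_scale": "medium shot"})]),
]
axis = annotate_time_axis(fields)
target = get_target_field_info(axis, 0)
```

Because every annotation keeps both its position and its field index, the order of the shooting parameters on the axis mirrors the order of the mirrors within each field, which is the correspondence the "wherein" clauses require.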
PCT/CN2022/098711 2021-07-15 2022-06-14 Video capture information acquisition method, and video capture and processing instruction method WO2023284469A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110801309.0 2021-07-15
CN202110801309.0A CN115701093A (en) 2021-07-15 2021-07-15 Video shooting information acquisition method and video shooting and processing indication method

Publications (1)

Publication Number Publication Date
WO2023284469A1 (en) 2023-01-19

Family

ID=84919029

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/098711 WO2023284469A1 (en) 2021-07-15 2022-06-14 Video capture information acquisition method, and video capture and processing instruction method

Country Status (2)

Country Link
CN (1) CN115701093A (en)
WO (1) WO2023284469A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050084232A1 (en) * 2003-10-16 2005-04-21 Magix Ag System and method for improved video editing
US10057537B1 (en) * 2017-08-18 2018-08-21 Prime Focus Technologies, Inc. System and method for source script and video synchronization interface
CN110012237A (en) * 2019-04-08 2019-07-12 厦门大学 Video generation method and system based on interaction guidance and cloud enhancing rendering
CN110139159A (en) * 2019-06-21 2019-08-16 上海摩象网络科技有限公司 Processing method, device and the storage medium of video material
CN110855893A (en) * 2019-11-28 2020-02-28 维沃移动通信有限公司 Video shooting method and electronic equipment
CN111601039A (en) * 2020-05-28 2020-08-28 维沃移动通信有限公司 Video shooting method and device and electronic equipment

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2012004739A (en) * 2010-06-15 2012-01-05 Sony Corp Information processor, information processing method and program
CN107613235B (en) * 2017-09-25 2019-12-27 北京达佳互联信息技术有限公司 Video recording method and device
CN108702464B (en) * 2017-10-16 2021-03-26 深圳市大疆创新科技有限公司 Video processing method, control terminal and mobile device
CN111147779B (en) * 2019-12-31 2022-07-29 维沃移动通信有限公司 Video production method, electronic device, and medium
CN112422831A (en) * 2020-11-20 2021-02-26 广州太平洋电脑信息咨询有限公司 Video generation method and device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN115701093A (en) 2023-02-07

Similar Documents

Publication Publication Date Title
WO2022001593A1 (en) Video generation method and apparatus, storage medium and computer device
CN109803180B (en) Video preview generation method and device, computer equipment and storage medium
CN111464834B (en) Video frame processing method and device, computing equipment and storage medium
US10657379B2 (en) Method and system for using semantic-segmentation for automatically generating effects and transitions in video productions
CN112954450B (en) Video processing method and device, electronic equipment and storage medium
TW202123178A (en) Method for realizing lens splitting effect, device and related products thereof
CN105959814B (en) Video barrage display methods based on scene Recognition and its display device
CN111667557B (en) Animation production method and device, storage medium and terminal
KR20160098949A (en) Apparatus and method for generating a video, and computer program for executing the method
CN111371993A (en) Image shooting method and device, computer equipment and storage medium
CN114390193B (en) Image processing method, device, electronic equipment and storage medium
CN113163230A (en) Video message generation method and device, electronic equipment and storage medium
CN112543344B (en) Live broadcast control method and device, computer readable medium and electronic equipment
CN112509148A (en) Interaction method and device based on multi-feature recognition and computer equipment
CN113515997A (en) Video data processing method and device and readable storage medium
CN113727039B (en) Video generation method and device, electronic equipment and storage medium
US10924637B2 (en) Playback method, playback device and computer-readable storage medium
WO2023284469A1 (en) Video capture information acquisition method, and video capture and processing instruction method
CN110415318B (en) Image processing method and device
CN114143429B (en) Image shooting method, device, electronic equipment and computer readable storage medium
JP4395082B2 (en) Video generation apparatus and program
US20230419997A1 (en) Automatic Non-Linear Editing Style Transfer
US20220068313A1 (en) Systems and methods for mixing different videos
CN114125552A (en) Video data generation method and device, storage medium and electronic device
CN109523941B (en) Indoor accompanying tour guide method and device based on cloud identification technology

Legal Events

Date Code Title Description
NENP Non-entry into the national phase

Ref country code: DE