WO2021196281A1 - Multimedia file generation method and device, storage medium, and electronic device - Google Patents

Multimedia file generation method and device, storage medium, and electronic device

Info

Publication number
WO2021196281A1
WO2021196281A1 PCT/CN2020/084738 CN2020084738W WO2021196281A1 WO 2021196281 A1 WO2021196281 A1 WO 2021196281A1 CN 2020084738 W CN2020084738 W CN 2020084738W WO 2021196281 A1 WO2021196281 A1 WO 2021196281A1
Authority
WO
WIPO (PCT)
Prior art keywords
multimedia
template
main body
preset
file
Prior art date
Application number
PCT/CN2020/084738
Other languages
English (en)
French (fr)
Inventor
陈超
黄文瀚
刘琨
柳超
Original Assignee
北京金堤科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京金堤科技有限公司
Publication of WO2021196281A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/44 Browsing; Visualisation therefor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/41 Indexing; Data structures therefor; Storage structures
    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11B INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00 Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02 Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031 Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80 Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81 Monomedia components thereof

Definitions

  • the present disclosure relates to multimedia file generation technology, in particular to a multimedia file generation method and device, storage medium, and electronic equipment.
  • the embodiments of the present disclosure provide a method and device for generating a multimedia file, a storage medium, and an electronic device.
  • a multimedia file generation method including:
  • the multimedia template includes at least one editable module
  • a multimedia display file corresponding to the preset main body is generated.
  • the obtaining the multimedia template corresponding to the preset subject includes:
  • the dimensional attribute set includes at least one dimensional attribute
  • the multimedia template corresponding to the preset body is determined according to the dimensional attribute set of the preset body.
  • before acquiring the multimedia template corresponding to the preset subject, the method further includes:
  • the multiple dimensional attributes are classified into at least one preset dimensional attribute set, and the multimedia template corresponding to the preset dimensional attribute set is determined.
  • classifying the multiple dimensional attributes into at least one preset dimensional attribute set includes:
  • the multiple dimensional attributes are classified into at least one preset dimensional attribute set, and the multimedia template corresponding to the preset dimensional attribute set is determined.
  • the obtaining the main body display data of the preset main body corresponding to the editable module includes:
  • search for subject display data matching the dimensional attribute corresponding to the editable module in the database.
  • the multimedia template includes multiple forms of multimedia templates; wherein, the multiple multimedia templates correspond to different display modes;
  • Obtaining the main body display data of the preset main body corresponding to the editable module, and filling the main body display data into the corresponding editable module in the multimedia template includes:
  • the generating a multimedia display file corresponding to the preset main body according to the multimedia template filled with the main body display data includes:
  • the multimedia template includes a video template
  • the filling of the main body display data into the corresponding editable module in the multimedia template includes:
  • the position of the editable module is determined in at least one frame of video image included in the video template according to the set coordinate range.
  • the main body display data includes: text and/or pictures;
  • the editable module that fills the main body display data to the determined position includes:
  • the text and/or picture whose adjusted size matches the editable module is filled into the editable module at the determined position.
  • before filling the text and/or picture whose adjusted size matches the editable module into the editable module at the determined position, the method further includes:
  • the text is line-wrapped or keywords are extracted, and/or the picture is zoomed.
  • the main body display data includes: video;
  • the editable module that fills the main body display data to the determined position includes:
  • the adjusted video that matches the editable module is filled into the editable module at the determined position, and the start time and end time corresponding to the video are written into the multimedia display file.
  • the multimedia template includes a text template
  • the filling of the main body display data into the corresponding editable module in the multimedia template includes:
  • the generating a multimedia display file corresponding to the preset main body according to the multimedia template filled with the main body display data includes:
  • the at least one text template filled with main body display data is connected according to the sorting order to obtain a multimedia display file corresponding to the preset main body.
  • the multimedia display file further includes an audio display file
  • the generating of the multimedia display file corresponding to the preset body includes:
  • the multimedia template includes an audio template
  • the filling of the main body display data into the corresponding editable module in the multimedia template includes:
  • determining a timbre from a plurality of preset timbres, and performing audio conversion on the attribute value based on the timbre to obtain at least one piece of audio;
  • the multimedia display files include: audio display files and video display files;
  • the method also includes:
  • the adjusted video display file and the audio display file are synthesized to obtain a display file corresponding to the preset body.
  • the multimedia display file further includes: a text display file
  • the method further includes:
  • the subtitle file is embedded in the synthesized file to obtain a display file corresponding to the preset body.
  • the method further includes:
  • a watermark picture with a set transparency is generated from the preset identification information, and the watermark picture with a set transparency is embedded in the multimedia display file.
  • the multimedia template includes at least one display style multimedia template; acquiring the multimedia template corresponding to the preset subject includes:
  • the multimedia templates of two or more display styles are obtained from the multimedia templates corresponding to the preset body.
  • a multimedia file generating device including:
  • a template acquisition module for acquiring a multimedia template corresponding to a preset subject; wherein the multimedia template includes at least one editable module;
  • An information filling module configured to obtain the main body display data of the preset main body corresponding to the editable module, and fill the main body display data into the corresponding editable module in the multimedia template;
  • the information display module is used to generate a multimedia display file corresponding to the preset main body according to the multimedia template filled with the main body display data.
  • the template obtaining module is configured to obtain a set of dimensional attributes of the preset subject, wherein the set of dimensional attributes includes at least one dimensional attribute; and to determine the multimedia template corresponding to the preset subject according to the set of dimensional attributes of the preset subject.
  • the device further includes:
  • the attribute statistics module is used to count multiple dimensional attributes corresponding to multiple subjects included in the database
  • the attribute allocation module is configured to classify the multiple dimensional attributes into at least one preset dimensional attribute set, and determine the multimedia template corresponding to the preset dimensional attribute set.
  • the attribute allocation module is configured to preset the priority corresponding to each dimensional attribute among the multiple dimensional attributes; sort the multiple dimensional attributes in descending order of priority to obtain a dimensional attribute sequence; and, according to the dimensional attribute sequence, classify the multiple dimensional attributes into at least one preset dimensional attribute set and determine the multimedia template corresponding to the preset dimensional attribute set.
  • the information filling module is configured to determine at least one dimensional attribute corresponding to the editable module; according to the preset body, search the database for subject display data that matches the dimensional attribute corresponding to the editable module.
  • the multimedia template includes multiple forms of multimedia templates; wherein, the multiple multimedia templates correspond to different display modes;
  • the information filling module is configured to obtain, according to one form of multimedia template, the main body display data of the preset main body corresponding to the editable module, and fill the main body display data into the corresponding editable module in the one form of multimedia template; and to obtain the main body display data of the editable module in the one form of multimedia template filled with the main body display data, and fill the obtained main body display data into the corresponding editable modules in the other forms of multimedia templates among the multiple multimedia templates.
  • when the information filling module generates a multimedia display file corresponding to the preset body according to the multimedia template filled with the main body display data, it is used to merge the multiple forms of multimedia templates filled with the main body display data to generate the multimedia display file corresponding to the preset body.
  • the multimedia template includes a video template
  • the information filling module includes:
  • the position determining unit is configured to determine the position of the editable module based on the preset pixels or coordinates in the video template;
  • the module filling unit is used to fill the main body display data to the editable module in the determined position.
  • the position determining unit is configured to search, according to a set pixel value, for at least one coordinate point corresponding to the set pixel value in at least one frame of video image included in the video template, and determine the position of the editable module based on the at least one coordinate point; and/or,
  • the position of the editable module is determined in at least one frame of video image included in the video template according to the set coordinate range.
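  • The pixel-value search described above can be sketched as follows. This is a minimal illustration, not the patented implementation: it assumes a frame is a plain list of rows of RGB tuples, that the editable module is marked with a single set color, and that the module position is the bounding box of the matching coordinate points. The names `find_module_box`, `frame`, and `GREEN` are hypothetical.

```python
from typing import List, Optional, Tuple

Pixel = Tuple[int, int, int]
Box = Tuple[int, int, int, int]  # (left, top, right, bottom), inclusive


def find_module_box(frame: List[List[Pixel]], marker: Pixel) -> Optional[Box]:
    """Return the bounding box of all pixels equal to `marker`, or None."""
    # Collect every coordinate point whose pixel matches the set pixel value.
    coords = [
        (x, y)
        for y, row in enumerate(frame)
        for x, pixel in enumerate(row)
        if pixel == marker
    ]
    if not coords:
        return None
    xs = [x for x, _ in coords]
    ys = [y for _, y in coords]
    return min(xs), min(ys), max(xs), max(ys)


# Hypothetical frame: 4 rows by 5 columns, with a 2x3 marked region.
BLACK, GREEN = (0, 0, 0), (0, 255, 0)
frame = [[BLACK] * 5 for _ in range(4)]
for y in (1, 2):
    for x in (1, 2, 3):
        frame[y][x] = GREEN

box = find_module_box(frame, GREEN)
print(box)
```

  • When the position is given by a set coordinate range instead, the search is unnecessary and the range can be used as the box directly.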
  • the main body display data includes: text and/or pictures;
  • the module filling unit is configured to determine the size of the editable module according to the position of the editable module; adjust the size of the text and/or picture based on the size of the editable module; and fill the text and/or picture whose adjusted size matches the editable module into the editable module at the determined position.
  • the module filling unit is further configured to, in response to the size of the adjusted text and/or picture not matching the size of the editable module, perform line-break processing on the text or extract keywords, and/or perform zoom processing on the picture.
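  • A minimal sketch of the fitting step, under stated assumptions: the module size is expressed as a hypothetical capacity of `max_chars` characters per line and `max_lines` lines, keyword extraction is stood in for by a caller-supplied keyword list (the source does not say how keywords are extracted), and zooming is a uniform scale that preserves aspect ratio. The function names are illustrative only.

```python
import textwrap
from typing import List, Tuple


def fit_text(text: str, max_chars: int, max_lines: int,
             keywords: List[str]) -> List[str]:
    """Wrap text to the module; fall back to keywords if it still overflows."""
    lines = textwrap.wrap(text, width=max_chars)
    if len(lines) <= max_lines:
        return lines
    # Assumption: keep only the supplied keywords that occur in the text.
    present = [k for k in keywords if k in text]
    return textwrap.wrap(" ".join(present), width=max_chars)[:max_lines]


def scale_to_fit(width: int, height: int,
                 box_width: int, box_height: int) -> Tuple[int, int]:
    """Zoom a picture uniformly so that it fits inside the module."""
    scale = min(box_width / width, box_height / height)
    return int(width * scale), int(height * scale)


wrapped = fit_text("registered capital five million yuan", 20, 2, ["capital"])
shortened = fit_text("registered capital five million yuan", 20, 1,
                     ["capital", "yuan", "address"])
picture_size = scale_to_fit(400, 300, 200, 200)
print(wrapped, shortened, picture_size)
```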
  • the main body display data includes: video;
  • the module filling unit is configured to determine the size of the editable module according to the position of the editable module; adjust the display size of the video based on the size of the editable module; fill the adjusted video that matches the editable module into the editable module at the determined position; and write the start time and end time corresponding to the video into the multimedia display file.
  • the multimedia template includes a text template
  • the information filling module is configured to extract the attribute value corresponding to at least one dimensional attribute in the main body display data; fill the attribute value into the corresponding editable module in at least one text template according to the corresponding dimensional attribute;
  • the information display module is configured to determine the priority corresponding to each text template in the at least one text template, and sort the at least one text template from high to low according to the priority; and connect the at least one text template filled with the main body display data according to the sorting order to obtain the multimedia display file corresponding to the preset main body.
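  • The text-template flow (fill attribute values, sort by priority, connect) can be sketched as below. The sketch assumes that a larger number means a higher priority and that templates are joined with a single space; the template wording, attribute names, and sample values are all hypothetical.

```python
from string import Template

# Hypothetical text templates; each placeholder names a dimensional attribute.
text_templates = [
    {"priority": 1, "body": Template("Registered capital: $registered_capital.")},
    {"priority": 2, "body": Template("$name is registered at $registered_address.")},
]

# Attribute values extracted from the main body display data (sample values).
values = {
    "name": "Example Co.",
    "registered_address": "1 Example Road",
    "registered_capital": "5 million yuan",
}

# Sort from high to low priority, fill each template, then connect them.
ordered = sorted(text_templates, key=lambda t: t["priority"], reverse=True)
display_text = " ".join(t["body"].substitute(values) for t in ordered)
print(display_text)
```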
  • the multimedia display file further includes an audio display file
  • the information display module is configured to segment the multimedia display file determined by the text template according to dimensional attributes to obtain a plurality of paragraph texts in a certain order; determine a timbre from a plurality of preset timbres, and perform audio conversion processing on the multiple paragraph texts based on the timbre to obtain multiple paragraph audios; and connect the multiple paragraph audios according to the sequence of the multiple paragraph texts to obtain the audio editing file.
  • the multimedia template includes an audio template
  • the information filling module is used to extract the attribute value corresponding to at least one dimensional attribute in the main body display data; determine a timbre from a plurality of preset timbres, and perform audio conversion on the attribute value based on the timbre to obtain at least one piece of audio; and fill the at least one piece of audio into the corresponding editable module in the audio template.
  • the multimedia display files include: audio display files and video display files;
  • the device also includes:
  • the adjusted video display file and the audio display file are synthesized to obtain a display file corresponding to the preset body.
  • the multimedia display file further includes: a text display file
  • the device also includes:
  • the subtitle file is embedded in the synthesized file to obtain a display file corresponding to the preset body.
  • the device further includes:
  • a watermark picture with a set transparency is generated from the preset identification information, and the watermark picture with a set transparency is embedded in the multimedia display file.
  • the multimedia template includes a multimedia template of at least one display style; the template acquisition module is used to, when the preset bodies of the multimedia display files to be generated amount to a preset number, acquire multimedia templates of two or more display styles from the multimedia templates corresponding to the preset body.
  • a computer-readable storage medium stores a computer program, and the computer program is used to execute the multimedia file generation method described in any of the above embodiments.
  • an electronic device including:
  • a memory for storing executable instructions of the processor
  • the processor is configured to read the executable instruction from the memory, and execute the instruction to implement the multimedia file generation method described in any of the foregoing embodiments.
  • a multimedia template corresponding to a preset body is acquired, wherein the multimedia template includes at least one editable module; the main body display data of the preset main body corresponding to the editable module is obtained, and the main body display data is filled into the corresponding editable module in the multimedia template; and a multimedia display file corresponding to the preset body is generated according to the multimedia template filled with the main body display data. By filling the corresponding main body display data into the multimedia template, the embodiments of the present disclosure realize the multimedia display of the preset body, make the information display more intuitive, and improve the efficiency of information display and the user experience. The process of generating multimedia display files requires no manual participation, which saves labor costs and realizes the automatic generation of multimedia display files of the preset subject; especially when multimedia files need to be generated in batches, this significantly improves the generation efficiency of multimedia display files.
  • Fig. 1 is a schematic flowchart of a method for generating a multimedia file according to an exemplary embodiment of the present disclosure.
  • FIG. 2 is a schematic flowchart of step 102 in the embodiment shown in FIG. 1 of the present disclosure.
  • Fig. 3 is a schematic flowchart of a method for generating a multimedia file provided by another exemplary embodiment of the present disclosure.
  • FIG. 4 is a schematic flowchart of step 104 in the embodiment shown in FIG. 1 of the present disclosure.
  • FIG. 5 is another schematic flowchart of step 104 in the embodiment shown in FIG. 1 of the present disclosure.
  • FIG. 6 is another schematic flowchart of step 104 in the embodiment shown in FIG. 1 of the present disclosure.
  • FIG. 7 is a schematic flowchart of a method for generating a multimedia file according to another exemplary embodiment of the present disclosure.
  • FIG. 8 is another schematic flowchart of step 104 in the embodiment shown in FIG. 1 of the present disclosure.
  • FIG. 9 is a schematic flowchart of a method for generating a multimedia file according to another exemplary embodiment of the present disclosure.
  • Fig. 10 is a schematic structural diagram of a multimedia file generating device provided by an exemplary embodiment of the present disclosure.
  • Fig. 11 is a structural diagram of an electronic device provided by an exemplary embodiment of the present disclosure.
  • plural may refer to two or more than two, and “at least one” may refer to one, two, or more than two.
  • the term "and/or" in the present disclosure merely describes an association relationship between associated objects and indicates that three relationships are possible; for example, A and/or B can mean three cases: A exists alone, both A and B exist, or B exists alone.
  • the character "/" in the present disclosure generally indicates that the associated objects before and after are in an "or" relationship.
  • the embodiments of the present disclosure can be applied to computer systems/servers, which can operate with numerous other general-purpose or special-purpose computing system environments or configurations.
  • Examples of well-known computing systems, environments, and/or configurations suitable for use with computer systems/servers include, but are not limited to: personal computer systems, server computer systems, thin clients, thick clients, handheld or laptop devices, microprocessor-based systems, set-top boxes, programmable consumer electronics, network personal computers, small computer systems, large computer systems, and distributed cloud computing technology environments including any of the above systems, etc.
  • the computer system/server may be described in the general context of computer system executable instructions (such as program modules) executed by the computer system.
  • program modules may include routines, programs, object programs, components, logic, data structures, etc., which perform specific tasks or implement specific abstract data types.
  • the computer system/server can be implemented in a distributed cloud computing environment. In the distributed cloud computing environment, tasks are executed by remote processing equipment linked through a communication network. In a distributed cloud computing environment, program modules may be located on a storage medium of a local or remote computing system including a storage device.
  • Fig. 1 is a schematic flowchart of a method for generating a multimedia file according to an exemplary embodiment of the present disclosure. This embodiment can be applied to an electronic device, as shown in FIG. 1, and includes the following steps:
  • Step 102 Acquire a multimedia template corresponding to a preset subject.
  • the multimedia template includes at least one editable module, and any content can be filled in the editable module.
  • the preset subject may include, but is not limited to, companies, groups, individuals, etc., or certain virtual characters, certain types of items, and the like.
  • Different preset bodies can correspond to the same or different multimedia templates. Multimedia technology uses a computer to store and manage various kinds of information, such as language, data, audio, and video, so that users can exchange information with the computer in real time through multiple senses.
  • the multimedia template can include templates in multiple manifestations, such as language, text, data, audio, video, etc.
  • this step 102 may be executed by the processor calling a corresponding instruction stored in the memory, or may be executed by the template obtaining module 11 run by the processor.
  • Step 104 Obtain the main body display data of the preset main body corresponding to the editable module, and fill the main body display data into the corresponding editable module in the multimedia template.
  • the subject display data in this embodiment can be obtained from the public data of a preset subject. Taking a company as the preset subject, for example, this embodiment can obtain the subject display data from the company's public data, including the company's registered address, administrative division data, registered capital data, investment and financing data, etc.
  • the acquired subject display data can correspond to different dimensional attributes, and the subject display data is filled into the corresponding editable modules according to those dimensional attributes, so that the multimedia template carries specific information about the preset subject, which provides the basis for displaying the preset subject based on the multimedia template.
  • the different dimension attributes can be the company's registered address, administrative division, registered capital, investment and financing and other attributes.
  • this step 104 may be executed by the processor calling a corresponding instruction stored in the memory, or may be executed by the information filling module 12 run by the processor.
  • Step 106 According to the multimedia template filled with the main body display data, a multimedia display file corresponding to the preset main body is generated.
  • the obtained multimedia display file can also include multiple display modes. In the process of generating the multimedia display file based on the multimedia template filled with the main body display data, a multimedia display file in a new display mode can be generated by synthesizing multiple multimedia templates. For example, a multimedia template in an audio display mode and a multimedia template in a video display mode are synthesized to generate a multimedia display file that combines audio and video.
  • this step 106 may be executed by the processor calling a corresponding instruction stored in the memory, or may be executed by the information display module 13 run by the processor.
  • the foregoing embodiment of the present disclosure provides a method for generating a multimedia file, which obtains a multimedia template corresponding to a preset subject, wherein the multimedia template includes at least one editable module; obtains the main body display data of the preset main body corresponding to the editable module, and fills the main body display data into the corresponding editable module in the multimedia template; and generates the multimedia display file corresponding to the preset main body according to the multimedia template filled with the main body display data.
  • by filling the corresponding main body display data into the multimedia template, the embodiment of the present disclosure realizes the multimedia display of the preset main body, makes the information display more intuitive, and improves the efficiency and user experience of the information display; the process of generating the multimedia display file requires no manual participation, which saves labor costs and automatically generates multimedia display files of the preset subject; especially when multimedia files need to be generated in batches, the generation efficiency of multimedia display files is significantly improved.
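  • Steps 102, 104, and 106 can be sketched end to end as follows. This is an illustrative outline only: the in-memory dictionaries stand in for the template store and the subject database, the "display file" is reduced to a text string, and every name (`MultimediaTemplate`, `acquire_template`, `fill_template`, `generate_display_file`) is hypothetical rather than taken from the disclosure.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class MultimediaTemplate:
    name: str
    # Editable module id -> dimensional attribute that the module displays.
    modules: Dict[str, str]
    filled: Dict[str, str] = field(default_factory=dict)


# Hypothetical stores standing in for the template library and the database.
TEMPLATES = {"company": {"title": "name", "address_box": "registered_address"}}
SUBJECT_DATA = {
    "Example Co.": {"name": "Example Co.", "registered_address": "1 Example Road"}
}


def acquire_template(subject_type: str) -> MultimediaTemplate:
    """Step 102: acquire the template corresponding to the preset subject."""
    return MultimediaTemplate(subject_type, dict(TEMPLATES[subject_type]))


def fill_template(template: MultimediaTemplate, subject: str) -> MultimediaTemplate:
    """Step 104: fill each editable module with the matching display data."""
    data = SUBJECT_DATA[subject]
    for module_id, attribute in template.modules.items():
        template.filled[module_id] = data[attribute]
    return template


def generate_display_file(template: MultimediaTemplate) -> str:
    """Step 106: render the filled template (here, as plain text)."""
    return "\n".join(f"{m}: {v}" for m, v in template.filled.items())


display_file = generate_display_file(
    fill_template(acquire_template("company"), "Example Co.")
)
print(display_file)
```

  • Because no step needs manual input, the same three calls can be looped over many subjects for batch generation.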
  • step 102 may include the following steps:
  • Step 1021 Obtain the dimensional attribute set of the preset subject.
  • the dimensional attribute set includes at least one dimensional attribute.
  • Step 1022 Determine a multimedia template corresponding to the preset body according to the dimensional attribute set of the preset body.
  • the preset body has multiple dimensional attributes, and the number and type of dimensional attributes corresponding to different preset bodies may be different.
  • the dimensional attribute set represents at least one dimensional attribute corresponding to the preset body.
  • multiple picture frames or text modules can be created for multiple dimensional attributes (for example, a picture frame or text module for each dimensional attribute).
  • the modules corresponding to these dimensional attributes can be spliced to obtain the multimedia template; or, multimedia templates are established for multiple different dimensional attribute sets.
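  • Both ways of determining a template from a dimensional attribute set can be sketched briefly. The sketch assumes modules are represented as placeholder strings and that preset templates are keyed by an exact attribute set; the names `MODULES`, `PRESET_TEMPLATES`, `splice_template`, and `lookup_template` are hypothetical.

```python
from typing import Dict, FrozenSet, Iterable, List, Optional

# Option 1: one picture frame or text module per dimensional attribute.
MODULES: Dict[str, str] = {
    "registered_address": "[text box: registered_address]",
    "registered_capital": "[text box: registered_capital]",
    "investment_and_financing": "[picture frame: investment_and_financing]",
}

# Option 2: templates established in advance for whole attribute sets.
PRESET_TEMPLATES: Dict[FrozenSet[str], str] = {
    frozenset({"registered_address", "registered_capital"}): "basic_company_template",
}


def splice_template(attributes: Iterable[str]) -> List[str]:
    """Splice the modules of the subject's attributes into one template."""
    return [MODULES[a] for a in attributes if a in MODULES]


def lookup_template(attributes: Iterable[str]) -> Optional[str]:
    """Return the preset template for exactly this attribute set, if any."""
    return PRESET_TEMPLATES.get(frozenset(attributes))


spliced = splice_template(["registered_capital", "registered_address", "unknown"])
preset = lookup_template(["registered_address", "registered_capital"])
print(spliced, preset)
```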
  • Fig. 3 is a schematic flowchart of a method for generating a multimedia file provided by another exemplary embodiment of the present disclosure. As shown in Figure 3, the method provided in this embodiment includes:
  • Step 301 Count multiple dimensional attributes corresponding to multiple subjects included in the database.
  • multiple subjects can belong to the same category.
  • multiple subjects are companies.
  • the statistics show that the database corresponds to 5 dimensional attributes, namely dimension attribute 1, dimension attribute 2, dimension attribute 3, dimension attribute 4, and dimension attribute 5.
  • Step 302 Classify the multiple dimensional attributes into at least one preset dimensional attribute set, and determine a multimedia template corresponding to the preset dimensional attribute set.
  • Step 102 Acquire a multimedia template corresponding to a preset subject.
  • the multimedia template includes at least one editable module.
  • the editable module can be a blank area or a blank text box with a preset size, and it can be filled with any form of data (for example, text, audio, picture, video, etc.); its shape can be a rectangle, circle, triangle, or another irregular shape.
  • Step 104 Obtain the main body display data of the preset main body corresponding to the editable module, and fill the main body display data into the corresponding editable module in the multimedia template.
  • Step 106 According to the multimedia template filled with the main body display data, a multimedia display file corresponding to the preset main body is generated.
  • after at least one preset dimensional attribute set is determined, a corresponding multimedia template is established for the at least one dimensional attribute set (for example, a corresponding multimedia template is created for each dimensional attribute set); it can be considered that each preset dimensional attribute set corresponds to at least one subject.
  • because corresponding multimedia templates are established for different dimensional attribute sets, each subject corresponds to at least one multimedia template, which provides the basis for acquiring the multimedia template.
  • step 302 may include:
  • the multiple dimensional attributes are classified into at least one preset dimensional attribute set, and the multimedia template corresponding to the preset dimensional attribute set is determined.
  • multiple dimensional attributes can be sorted according to priority, and multiple multimedia templates can be determined from the dimensional attribute sets classified on the basis of the dimensional attribute sequence.
  • for example, the multimedia template includes a basic dimension template and/or a specific dimension template; a preset number of dimensional attributes are classified into a first dimension set, and the basic dimension template is determined based on the preset number of dimensional attributes included in the first dimension set; the remaining dimensional attributes are classified into at least one second dimension set, and a specific dimension template is determined based on the dimensional attributes included in each second dimension set to obtain at least one specific dimension template; wherein the preset number of dimensional attributes can be taken from the dimensional attribute sequence in order and classified into the first dimension set.
  • the priority of the determined basic dimension template is greater than that of the specific dimension template; that is, the basic dimension template can usually represent the dimensional attributes that most subjects have.
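  • The priority-based classification can be sketched as follows, assuming that a larger number means a higher priority and that the remaining attributes are grouped into second dimension sets of the same preset size (the source does not fix how the remainder is grouped). The function name `classify` and the sample priorities are hypothetical.

```python
from typing import Dict, List, Tuple


def classify(priorities: Dict[str, int],
             preset_number: int) -> Tuple[List[str], List[List[str]]]:
    """Split attributes into a first dimension set and second dimension sets."""
    # Dimensional attribute sequence: highest priority first.
    sequence = sorted(priorities, key=lambda a: priorities[a], reverse=True)
    first = sequence[:preset_number]
    rest = sequence[preset_number:]
    # Assumption: chunk the remainder into sets of at most `preset_number`.
    second = [rest[i:i + preset_number] for i in range(0, len(rest), preset_number)]
    return first, second


priorities = {
    "dimension attribute 1": 5,
    "dimension attribute 2": 4,
    "dimension attribute 3": 3,
    "dimension attribute 4": 2,
    "dimension attribute 5": 1,
}
first, second = classify(priorities, preset_number=2)
print(first)   # basis for the basic dimension template
print(second)  # one specific dimension template per inner list
```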
  • step 104 may include the following steps:
  • Step 1041 Determine at least one dimension attribute corresponding to the editable module.
  • Step 1042 Search the database for subject display data matching the dimensional attribute corresponding to the editable module according to the preset subject.
  • since each multimedia template includes multiple editable modules and corresponds to multiple dimensional attributes, each editable module in the multimedia template may correspond to one dimensional attribute, or multiple editable modules may correspond to the same dimensional attribute.
  • this embodiment searches the database that stores the information of the preset main body for the main body display data corresponding to the dimensional attributes of the multiple editable modules; after obtaining the main body display data, it fills the data into the corresponding editable modules according to the dimensional attributes, thereby realizing the automatic filling of the main body display data.
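  • Steps 1041 and 1042 can be sketched with an in-memory SQLite database. The table layout (`subject_data` with `subject`, `attribute`, and `value` columns), the module ids, and the sample rows are assumptions made for illustration; the disclosure does not specify a schema.

```python
import sqlite3
from typing import Dict

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE subject_data (subject TEXT, attribute TEXT, value TEXT)")
conn.executemany(
    "INSERT INTO subject_data VALUES (?, ?, ?)",
    [
        ("Example Co.", "registered_address", "1 Example Road"),
        ("Example Co.", "registered_capital", "5 million yuan"),
    ],
)


def fill_modules(db: sqlite3.Connection, subject: str,
                 module_attributes: Dict[str, str]) -> Dict[str, str]:
    """Look up the display data for each module's dimensional attribute."""
    filled = {}
    for module_id, attribute in module_attributes.items():
        row = db.execute(
            "SELECT value FROM subject_data WHERE subject = ? AND attribute = ?",
            (subject, attribute),
        ).fetchone()
        if row is not None:  # modules without matching data stay unfilled
            filled[module_id] = row[0]
    return filled


# Step 1041: module -> attribute mapping (two modules share one attribute).
module_attributes = {
    "address_box": "registered_address",
    "capital_box": "registered_capital",
    "footer_capital": "registered_capital",
    "financing_box": "investment_and_financing",
}
# Step 1042: search the database and fill the modules.
filled = fill_modules(conn, "Example Co.", module_attributes)
print(filled)
```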
  • step 104 may further include the following steps:
  • Step 1043 Obtain, according to a multimedia template of one form, the main body display data of the preset subject corresponding to the editable module, and fill the main body display data into the corresponding editable module in the multimedia template of that form.
  • Step 1044 Obtain the main body display data from the editable modules of the filled multimedia template of that form, and fill it into the corresponding editable modules in the multimedia templates of other forms among the multiple multimedia templates.
  • the multimedia template includes multiple forms of multimedia templates, such as a video format template, a text format template, an audio format template, and so on.
  • multiple multimedia templates of different forms are obtained for the preset subject, where the multiple multimedia templates correspond to different display modes; the corresponding main body display data can first be obtained through the multimedia template of one form (display mode).
  • the main body display data can then be read from the multimedia template that has already been filled and filled into the multimedia templates of other forms, so the other forms do not need to repeat the database lookup, which improves the efficiency of obtaining and filling the main body display data.
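A minimal sketch of steps 1043 and 1044, assuming the same hypothetical template layout (module id mapped to dimension attribute): data already placed in one form of template is re-keyed by dimension attribute and copied into another form, with no second database query. The function and module names are illustrative only.

```python
def reuse_filled_data(first_template, first_filled, other_template):
    """Fill `other_template` from data already placed in `first_template`."""
    # Re-key the filled data by dimension attribute instead of module id.
    by_attribute = {
        first_template["modules"][module_id]: data
        for module_id, data in first_filled.items()
    }
    return {
        module_id: by_attribute[attribute]
        for module_id, attribute in other_template["modules"].items()
        if attribute in by_attribute
    }


# Hypothetical templates: module id -> dimension attribute.
video_template = {"modules": {"title_box": "name", "year_box": "founded"}}
text_template = {"modules": {"sentence_name": "name", "sentence_year": "founded"}}

video_filled = {"title_box": "Example Co.", "year_box": "2014"}  # fetched once
text_filled = reuse_filled_data(video_template, video_filled, text_template)
```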
  • step 106 may include:
  • the multimedia templates obtained for the preset subject include multiple forms, for example a multimedia template in audio form (audio template), a multimedia template in video form (video template), and a multimedia template in text form (text template); after the main body display data is filled into at least two of these forms, the at least two filled multimedia templates are merged.
  • a multimedia display file that displays the preset subject in several ways is thereby obtained, which improves the efficiency of information display and lets users view the main body display data in multiple forms at the same time.
  • step 104 may further include the following steps:
  • Step 1045 Determine the position of the editable module based on the preset pixels or coordinates in the video template.
  • step 1045 may include: searching for at least one coordinate point corresponding to the set pixel value in at least one frame of video image included in the video template according to the set pixel value, and determining the position of the editable module based on the at least one coordinate point; and /or,
  • according to the range corresponding to the set pixel value, searching at least one frame of video image included in the video template for the maximum coordinate point and the minimum coordinate point within that range, and determining the position of the editable module based on the maximum coordinate point and the minimum coordinate point; wherein the pixel range includes at least two pixel values of adjacent values; and/or,
  • determining the position of the editable module in at least one frame of video image included in the video template according to a set coordinate range.
  • Step 1046 Fill the main body display data into the editable module at the determined position.
  • in step 1045, when at least one coordinate point corresponding to the set pixel value is searched for in at least one frame of video image included in the video template and the position of the editable module is determined based on that coordinate point, the coordinate point is determined by the pixel value: a pixel value can be preset, and the at least one coordinate point found by searching the image for that pixel value is the position of the editable module. For example, the positions in the video image whose pixel value equals the reserved pixel value [0,0,0] are the position of the editable module, which appears in the video template as a black area in the video image.
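The exact-match search can be sketched with plain nested lists standing in for a decoded video frame. The frame layout (rows of [R, G, B] pixels) and the function name are assumptions made for illustration; a real pipeline would read frames with a video library.

```python
def find_pixel_coordinates(frame, target):
    """Return (x, y) for every pixel in `frame` equal to `target`.

    `frame` is a list of rows; each pixel is an [R, G, B] list.
    """
    return [
        (x, y)
        for y, row in enumerate(frame)
        for x, pixel in enumerate(row)
        if pixel == target
    ]


WHITE, BLACK = [255, 255, 255], [0, 0, 0]
frame = [
    [WHITE, WHITE, WHITE, WHITE],
    [WHITE, BLACK, BLACK, WHITE],
    [WHITE, BLACK, BLACK, WHITE],
]
# The reserved value [0, 0, 0] marks the editable module.
coords = find_pixel_coordinates(frame, BLACK)
```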
  • in step 1045, when the maximum coordinate point and the minimum coordinate point are searched for according to the range corresponding to the set pixel value, optionally: a range corresponding to the set pixel value can be preset, and the coordinate points in the image whose pixel values fall within that range are found; the obtained coordinate points give the position of the editable module. In other words, the position of the editable module is determined by finding the coordinates of the pixels whose values lie in the range corresponding to the set pixel value.
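A sketch of the range-based variant, again using nested lists as a stand-in frame. Treating the minimum and maximum coordinate points as the two opposite corners of a bounding box, and applying the range to every color channel, are both assumptions; the source does not spell out either detail.

```python
def bounding_box_for_range(frame, low, high):
    """Return ((min_x, min_y), (max_x, max_y)) over all pixels whose
    channels all lie in [low, high], or None if no pixel matches."""
    xs, ys = [], []
    for y, row in enumerate(frame):
        for x, pixel in enumerate(row):
            if all(low <= channel <= high for channel in pixel):
                xs.append(x)
                ys.append(y)
    if not xs:
        return None
    return (min(xs), min(ys)), (max(xs), max(ys))


WHITE = [255, 255, 255]
# Near-black values such as [1, 1, 1] can appear after lossy compression.
range_frame = [
    [WHITE, WHITE, WHITE, WHITE],
    [WHITE, [0, 0, 0], [1, 1, 1], WHITE],
    [WHITE, [1, 0, 1], [0, 0, 0], WHITE],
]
box = bounding_box_for_range(range_frame, 0, 1)
```

A range is more tolerant than an exact match when the template video has been re-encoded and the reserved color has drifted slightly.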
  • the position of the editable module can also be determined by presetting the coordinate range of the editable module. Under the premise that the coordinate range is known, the coordinates can be searched to determine the location of the area corresponding to the editable module.
  • when the position of the editable module is determined in at least one frame of video image included in the video template according to the set coordinate range, an area of the video image can be determined according to the coordinate range, and that area is the position of the editable module.
  • to determine the area, the multiple vertices of the editable module can first be determined according to the coordinate range, and the position of the editable module is then determined from those vertices. For example, if the coordinate range is set to [0,0] to [5,5], the four vertices are [0,0], [0,5], [5,0] and [5,5]; these four vertices determine the area in the video image, that is, the position of the editable module.
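The vertex example above translates directly into code. The function names are hypothetical, and deriving a width and height from the range is an added convenience rather than something the source states.

```python
def vertices_from_range(start, end):
    """Expand a coordinate range into the four vertices of a rectangle."""
    (x0, y0), (x1, y1) = start, end
    return [(x0, y0), (x0, y1), (x1, y0), (x1, y1)]


def size_from_range(start, end):
    """Width and height of the rectangle spanned by the range."""
    (x0, y0), (x1, y1) = start, end
    return abs(x1 - x0), abs(y1 - y0)


vertices = vertices_from_range((0, 0), (5, 5))
width, height = size_from_range((0, 0), (5, 5))
```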
  • step 1046 may include:
  • the text and/or picture is resized (e.g., by adjusting pixel values) to obtain text and/or a picture whose size fits the determined position, which is then embedded in the editable module of the video template; optionally, the embedding may be dynamic or static, and different embedding methods achieve different video display effects.
  • if the text and/or picture is still larger than the editable module after this adjustment, the text is wrapped or its keywords are extracted, and/or the picture is zoomed; processing the text and/or picture in this way ensures that the main content is displayed in the video template.
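One way to sketch the fitting step, under stated assumptions: pictures are scaled uniformly so the aspect ratio is preserved, and text capacity is measured in characters per line and number of lines. Neither convention comes from the source, and the function names are illustrative.

```python
import textwrap


def fit_picture(width, height, box_width, box_height):
    """Scale a picture uniformly so it fits inside the editable module."""
    scale = min(box_width / width, box_height / height)
    return max(1, int(width * scale)), max(1, int(height * scale))


def fit_text(text, chars_per_line, max_lines):
    """Wrap text to the module width; report whether all lines fit."""
    lines = textwrap.wrap(text, width=chars_per_line)
    return lines[:max_lines], len(lines) <= max_lines


picture_size = fit_picture(400, 200, 100, 100)
lines, fits = fit_text("founded in 2014 in Beijing", 10, 3)
```

When `fits` is false, a caller could fall back to keyword extraction, as the text above suggests.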
  • step 1046 may include:
  • the playback time of the video (including but not limited to the start time and end time) is written into the multimedia display file, so that when the multimedia display file in video form is played, the embedded video starts playing at the set start time and stops at the end time, realizing a picture-in-picture display.
  • FIG. 7 is a schematic flowchart of a method for generating a multimedia file according to another exemplary embodiment of the present disclosure.
  • the multimedia template in this embodiment includes a text template
  • the method provided in this embodiment includes the following steps:
  • Step 102 Acquire a multimedia template corresponding to a preset subject.
  • the multimedia template includes at least one editable module.
  • Step 703 Extract an attribute value corresponding to at least one dimensional attribute in the main body display data.
  • the acquired subject display data may include a lot of redundant information.
  • this embodiment therefore extracts multiple attribute values (i.e., keywords corresponding to the relevant dimension attributes) from each piece of subject display data.
  • Step 704 Fill the attribute value into the corresponding editable module in the at least one text template according to the corresponding dimensional attribute.
  • Step 705 Determine the priority corresponding to each text template in the at least one text template, and sort the at least one text template from high to low according to the priority.
  • Step 706 Connect, in the sorted order, the at least one text template filled with main body display data, to obtain the multimedia display file corresponding to the preset subject.
  • the priority of each text template can be determined according to the priority of the corresponding dimension attribute: the more important the dimension attribute, the higher its priority and the higher the priority of the corresponding text template.
  • the text templates are sorted by priority and connected in that order to obtain standardized main body display data of the preset subject in text form, which makes the preset subject easier for others to understand.
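Steps 704 to 706 can be sketched as: fill each text template with attribute values, sort by priority from high to low, and join. The template sentences, priority numbers, and attribute values below are invented for illustration.

```python
def render_text_file(templates, values):
    """Fill each text template, then join them from high to low priority."""
    ordered = sorted(templates, key=lambda t: t["priority"], reverse=True)
    return " ".join(t["text"].format(**values) for t in ordered)


# Hypothetical text templates; {placeholders} are the editable modules.
templates = [
    {"priority": 1, "text": "It holds {patents} patents."},
    {"priority": 3, "text": "{name} was founded in {founded}."},
    {"priority": 2, "text": "Its registered capital is {capital}."},
]
values = {"name": "Example Co.", "founded": "2014",
          "capital": "10 million yuan", "patents": "12"}
text_file = render_text_file(templates, values)
```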
  • the multimedia display file also includes an audio display file
  • generating the multimedia display file corresponding to the preset subject includes:
  • the multimedia display file determined by the text template is segmented according to dimensional attributes to obtain multiple paragraph texts in a certain order;
  • the corresponding audio editing file is generated based on the multimedia display file that has been filled with the text representation of the main body display data.
  • segmentation improves the match between audio and text and reduces errors.
  • presetting multiple timbres increases the diversity of the generated audio and avoids the poor user experience caused by using the same timbre for every subject.
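The paragraph-by-paragraph audio conversion can be outlined as below. The text-to-speech engine is deliberately left as a caller-supplied function, because the source names no specific engine; `fake_synthesize`, the timbre names, and choosing the timbre at random are placeholders and assumptions for illustration only.

```python
import random


def build_audio_file(paragraphs, timbres, synthesize, rng=None):
    """Convert ordered paragraphs to clips with one timbre, keeping order."""
    rng = rng or random.Random()
    timbre = rng.choice(timbres)  # one timbre per file keeps the voice consistent
    clips = [synthesize(text, timbre) for text in paragraphs]
    return timbre, clips


def fake_synthesize(text, timbre):
    # Placeholder: a real system would call a text-to-speech engine here.
    return f"<{timbre}:{text}>"


paragraphs = ["Example Co. was founded in 2014.", "It holds 12 patents."]
timbre, clips = build_audio_file(
    paragraphs, ["warm", "bright"], fake_synthesize, random.Random(0)
)
```

Because the clips keep the order of the paragraph texts, they can be concatenated afterwards without any re-alignment.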
  • step 104 may further include the following steps:
  • Step 1047 Extract an attribute value corresponding to at least one dimension attribute in the main body display data.
  • Step 1048 Determine a timbre from a plurality of preset timbres, and perform audio conversion on the attribute value based on the timbre to obtain at least one piece of audio.
  • Step 1049 Fill at least one piece of audio into the corresponding editable module in the audio template.
  • a timbre is determined from a plurality of preset timbres.
  • optionally, the timbre is determined based on the timbre in the audio template and needs to be consistent with it.
  • existing audio conversion technology is used to convert the attribute value into audio, and the at least one piece of audio obtained is filled into the corresponding editable module in the audio template to generate a complete audio file that presents the preset subject.
  • FIG. 9 is a schematic flowchart of a method for generating a multimedia file according to another exemplary embodiment of the present disclosure.
  • the multimedia display files include: audio display files and video display files; including the following steps:
  • Step 102 Acquire a multimedia template corresponding to a preset subject.
  • the multimedia template includes at least one editable module.
  • Step 104 Obtain the main body display data of the preset main body corresponding to the editable module, and fill the main body display data into the corresponding editable module in the multimedia template.
  • Step 106 According to the multimedia template filled with the main body display data, a multimedia display file corresponding to the preset main body is generated.
  • Step 908 Adjust the playback speed of the video display file according to the duration of the audio display file.
  • Step 910 Synthesize the adjusted video display file and audio display file to obtain a display file corresponding to the preset subject.
  • the synthesis of audio and video can use any achievable technology in the prior art.
  • This embodiment does not limit the specific synthesis technology.
  • the playback speed of the video display file is adjusted while the audio duration is preserved (so the audio is not distorted), which unifies the durations of the audio and video display files; optionally, to ensure that the audio display file matches the video display file, synthesis can be done segment by segment: the video display file is segmented by attribute category to obtain multiple video paragraphs, each segment of the audio display file is synthesized with the video paragraph of its attribute category to obtain multiple introduction video paragraphs, and the introduction video paragraphs are connected to obtain the introduction video.
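The speed adjustment in step 908 reduces to one ratio: to make a video of duration $T_v$ last as long as audio of duration $T_a$, play it at $T_v / T_a$ times normal speed. The sketch below computes that factor for a whole file and per attribute category; the category names and durations are made up, and applying the factor is left to whatever video tool is in use.

```python
def playback_speed(video_seconds, audio_seconds):
    """Speed multiplier that makes the video last as long as the audio."""
    if video_seconds <= 0 or audio_seconds <= 0:
        raise ValueError("durations must be positive")
    return video_seconds / audio_seconds


def segment_speeds(video_segments, audio_segments):
    """Per-category speeds; both arguments map category -> seconds."""
    return {
        category: playback_speed(video_segments[category], audio_segments[category])
        for category in audio_segments
        if category in video_segments
    }


whole_file_speed = playback_speed(30, 20)
per_segment = segment_speeds({"intro": 10, "finance": 8},
                             {"intro": 5, "finance": 16})
```

A factor above 1 speeds the video up and a factor below 1 slows it down; the audio itself is left untouched.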
  • the multimedia display file further includes: a text display file; this embodiment, on the basis of the embodiment shown in FIG. 9, further includes:
  • the subtitle file is embedded in the synthesized file to obtain the display file corresponding to the preset body.
  • a subtitle file in an external format (for example, the srt format) can be generated automatically from the text display file, and the external subtitles are embedded in the video to generate an introduction video with subtitles, improving the user experience.
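A minimal sketch of generating srt subtitles from timed text. The cue timings are assumed to be known already (for example, from the durations of the audio paragraphs); the source does not say how they are obtained, and the function names are illustrative.

```python
def srt_timestamp(seconds):
    """Format seconds as the HH:MM:SS,mmm timestamp used by srt files."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def build_srt(cues):
    """Build srt text from a list of (start_seconds, end_seconds, text)."""
    blocks = [
        f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        for index, (start, end, text) in enumerate(cues, start=1)
    ]
    return "\n".join(blocks)


srt_text = build_srt([
    (0, 2.5, "Example Co. was founded in 2014."),
    (2.5, 4, "It holds 12 patents."),
])
```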
  • the method provided in this embodiment further includes:
  • the preset identification information is generated into a watermark picture with a set transparency, and the watermark picture with a set transparency is embedded in a multimedia display file.
  • the security of the multimedia display file can also be improved by adding a watermark.
  • any existing watermarking method can be used; this embodiment does not restrict the way the watermark is added, and the set transparency can be adjusted according to actual needs.
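Since the source leaves the watermarking method open, the sketch below shows one common choice, alpha blending, on nested-list frames. Here `alpha` is the watermark's opacity in [0, 1] (so the transparency is 1 - alpha); the function names and the frame layout are assumptions for illustration.

```python
def blend_pixel(base, mark, alpha):
    """Blend one RGB watermark pixel over a base pixel."""
    return tuple(round(alpha * m + (1 - alpha) * b) for b, m in zip(base, mark))


def apply_watermark(frame, watermark, alpha, left=0, top=0):
    """Return a copy of `frame` with `watermark` blended in at (left, top)."""
    result = [list(row) for row in frame]
    for dy, mark_row in enumerate(watermark):
        for dx, mark in enumerate(mark_row):
            y, x = top + dy, left + dx
            # Ignore watermark pixels that fall outside the frame.
            if 0 <= y < len(result) and 0 <= x < len(result[y]):
                result[y][x] = blend_pixel(result[y][x], mark, alpha)
    return result


black_frame = [[(0, 0, 0), (0, 0, 0)], [(0, 0, 0), (0, 0, 0)]]
white_mark = [[(255, 255, 255)]]
marked = apply_watermark(black_frame, white_mark, alpha=0.2, left=1, top=1)
```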
  • the multimedia template includes at least one display style multimedia template; acquiring the multimedia template corresponding to the preset subject includes:
  • the multimedia templates of two or more display styles are obtained from the multimedia templates corresponding to the preset body.
  • a preset subject is provided with multimedia templates of several display styles; for text display files, different display styles can be implemented with different fonts and/or font sizes, or with different text colors, and a text template is obtained at random; for video display files, the video templates can include a variety of content or presentation (such as colors and styles), and a video template is obtained at random so that introduction videos differ from one another; for audio display files, variety can be achieved through multiple optional timbres. Moreover, different combinations (such as synthesis) of these different forms of multimedia display files further differentiate the multimedia display files.
  • Any method for generating a multimedia file provided in the embodiments of the present disclosure can be executed by any suitable device with data processing capabilities, including but not limited to: terminal devices and servers.
  • any multimedia file generation method provided by the embodiments of the present disclosure may be executed by a processor; for example, the processor executes any multimedia file generation method mentioned in the embodiments of the present disclosure by calling corresponding instructions stored in a memory. Details are not repeated below.
  • Fig. 10 is a schematic structural diagram of a multimedia file generating device provided by an exemplary embodiment of the present disclosure. The device is applied to a server; as shown in Fig. 10, the device of this embodiment includes:
  • the template obtaining module 11 is used to obtain a multimedia template corresponding to a preset subject.
  • the multimedia template includes at least one editable module.
  • the information filling module 12 is used to obtain the main body display data of the preset main body corresponding to the editable module, and fill the main body display data into the corresponding editable module in the multimedia template.
  • the information display module 13 is used to generate a multimedia display file corresponding to the preset main body according to the multimedia template filled with main body display data.
  • the above-mentioned embodiment of the present disclosure provides a multimedia file generating device that realizes multimedia display of the preset subject by filling the corresponding main body display data into the multimedia template, which makes the information display more intuitive and improves both the efficiency of information display and the user experience; in addition, generating the multimedia display file requires no manual participation, which saves labor costs, automates the generation of the preset subject's multimedia display file, and improves generation efficiency.
  • the template obtaining module 11 is configured to obtain a set of dimensional attributes of a preset subject; wherein the set of dimensional attributes includes at least one dimensional attribute; and a multimedia template corresponding to the preset subject is determined according to the set of dimensional attributes of the preset subject.
  • the device provided in this implementation further includes:
  • the attribute statistics module is used to count multiple dimensional attributes corresponding to multiple subjects included in the database
  • the attribute assignment module is used to classify multiple dimensional attributes into at least one preset dimensional attribute set, and determine the multimedia template corresponding to the preset dimensional attribute set.
  • the attribute assignment module is used to preset the priority corresponding to each dimension attribute in the multiple pieces of dimension attribute information; sort the multiple dimension attributes in descending order to obtain the dimension attribute sequence; and, according to the dimension attribute sequence, classify the multiple dimension attributes into at least one preset dimension attribute set and determine the multimedia template corresponding to the preset dimension attribute set.
  • the multimedia template includes multiple forms of multimedia templates; among them, the multiple multimedia templates correspond to different display modes;
  • the information filling module 12 is used to obtain, according to a multimedia template of one form, the main body display data of the preset subject corresponding to the editable module, and fill the main body display data into the corresponding editable module in the multimedia template of that form;
  • and to obtain the main body display data from the editable modules of the filled multimedia template of that form, and fill it into the corresponding editable modules in the multimedia templates of other forms among the multiple multimedia templates.
  • when generating a multimedia display file corresponding to the preset subject according to the multimedia template filled with the main body display data, the information filling module 12 is used to merge the multiple forms of multimedia templates filled with the main body display data to generate the multimedia display file corresponding to the preset subject.
  • the multimedia template includes a video template
  • the information filling module 12 includes:
  • the position determining unit is used to determine the position of the editable module based on the preset pixels or coordinates in the video template;
  • the module filling unit is used to fill the main body display data to the editable module in the determined position.
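  • the position determining unit is configured to search, according to the set pixel value, at least one frame of video image included in the video template for at least one coordinate point corresponding to the set pixel value, and determine the position of the editable module based on the at least one coordinate point; and/or to search, according to the range corresponding to the set pixel value, at least one frame of video image included in the video template for the maximum coordinate point and the minimum coordinate point within that range, and determine the position of the editable module based on them; and/or to determine the position of the editable module in at least one frame of video image included in the video template according to the set coordinate range.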
  • the main body display data includes: text and/or pictures;
  • the module filling unit is used to determine the size of the editable module according to the position of the editable module; adjust the size of the text and/or picture based on the size of the editable module; and fill the text and/or picture whose adjusted size matches the editable module into the editable module at the determined position.
  • the module filling unit is also used to, in response to the size of the adjusted text and/or picture not matching the size of the editable module, wrap the text or extract keywords, and/or zoom the picture.
  • the main body display data includes: video;
  • the module filling unit is used to determine the size of the editable module according to the position of the editable module; adjust the display size of the video based on the size of the editable module; fill the adjusted video that matches the editable module into the editable module at the determined position; and write the start time and end time of the video into the multimedia display file.
  • the multimedia template includes a text template
  • the information filling module 12 is used for extracting the attribute value corresponding to at least one dimensional attribute in the main display data; filling the attribute value into the corresponding editable module in the at least one text template according to the corresponding dimensional attribute;
  • the information display module 13 is used to determine the priority corresponding to each text template in the at least one text template, and sort the at least one text template from high to low according to the priority; and to connect, in the sorted order, the at least one text template filled with the main body display data, to obtain the multimedia display file corresponding to the preset subject.
  • the multimedia display file also includes an audio display file
  • the information display module 13 is used to segment the multimedia display file determined by the text template according to the dimension attributes to obtain a plurality of paragraph texts in a certain order; determine a timbre from a plurality of preset timbres, and perform audio conversion on each of the paragraph texts based on the timbre to obtain multiple paragraphs of audio; and connect the multiple paragraphs of audio according to the order of the paragraph texts to obtain an audio editing file.
  • the multimedia template includes an audio template
  • the information filling module 12 is used to extract the attribute value corresponding to at least one dimension attribute in the main body display data; determine a timbre from a plurality of preset timbres, and perform audio conversion on the attribute value based on the timbre to obtain at least one piece of audio; and fill the at least one piece of audio into the corresponding editable module in the audio template.
  • the multimedia display files include: audio display files and video display files;
  • the adjusted video display file and audio display file are synthesized to obtain the display file corresponding to the preset subject.
  • the multimedia display file further includes: a text display file
  • the subtitle file is embedded in the synthesized file to obtain the display file corresponding to the preset body.
  • the preset identification information is generated into a watermark picture with a set transparency, and the watermark picture with a set transparency is embedded in a multimedia display file.
  • the multimedia template includes a multimedia template of at least one display style; the template obtaining module is used to obtain multimedia templates of two or more display styles from the multimedia templates corresponding to the preset subject when the number of preset subjects for which multimedia display files are to be generated reaches a preset number.
  • the electronic device may be either or both of the first device 100 and the second device 200, or a stand-alone device independent of them.
  • the stand-alone device may communicate with the first device and the second device to receive all information from them.
  • FIG. 11 illustrates a block diagram of an electronic device according to an embodiment of the present disclosure.
  • the electronic device 110 includes one or more processors 111 and a memory 112.
  • the processor 111 may be a central processing unit (CPU) or other form of processing unit with data processing capability and/or instruction execution capability, and may control other components in the electronic device 110 to perform desired functions.
  • the memory 112 may include one or more computer program products, and the computer program products may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory.
  • the volatile memory may include random access memory (RAM) and/or cache memory (cache), for example.
  • the non-volatile memory may include, for example, read-only memory (ROM), hard disk, flash memory, and the like.
  • One or more computer program instructions may be stored on the computer-readable storage medium, and the processor 111 may run the program instructions to implement the multimedia file generation method of the various embodiments of the present disclosure described above and/or other desired functions.
  • Various contents such as input signals, signal components, noise components, etc. can also be stored in the computer-readable storage medium.
  • the electronic device 110 may further include: an input device 113 and an output device 114, and these components are interconnected by a bus system and/or other forms of connection mechanisms (not shown).
  • the input device 113 may be the aforementioned microphone or microphone array for capturing the input signal of the sound source.
  • the input device 113 may be a communication network connector for receiving collected input signals from the first device 100 and the second device 200.
  • the input device 113 may also include, for example, a keyboard, a mouse, and so on.
  • the output device 114 can output various information to the outside, including determined distance information, direction information, and so on.
  • the output device 114 may include, for example, a display, a speaker, a printer, a communication network and a remote output device connected to it, and so on.
  • the electronic device 110 may also include any other appropriate components according to specific application conditions.
  • the embodiments of the present disclosure may also be computer program products, which include computer program instructions that, when run by a processor, cause the processor to execute the steps of the multimedia file generation method according to the various embodiments of the present disclosure described in the "exemplary method" section above in this specification.
  • the computer program product may use any combination of one or more programming languages to write program codes for performing the operations of the embodiments of the present disclosure.
  • the programming languages include object-oriented programming languages, such as Java and C++, and also conventional procedural programming languages, such as the "C" language or similar programming languages.
  • the program code can be executed entirely on the user's computing device, partly on the user's device, as an independent software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server.
  • embodiments of the present disclosure may also be a computer-readable storage medium on which computer program instructions are stored; when the computer program instructions are executed by a processor, the processor executes the steps of the multimedia file generation method according to the various embodiments of the present disclosure described in the "exemplary method" section of this specification.
  • the computer-readable storage medium may adopt any combination of one or more readable media.
  • the readable medium may be a readable signal medium or a readable storage medium.
  • the readable storage medium may include, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples (a non-exhaustive list) of readable storage media include: electrical connections with one or more wires, portable disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the above.
  • the method and apparatus of the present disclosure may be implemented in many ways.
  • the method and apparatus of the present disclosure can be implemented by software, hardware, firmware or any combination of software, hardware, and firmware.
  • the above-mentioned order of the steps for the method is for illustration only, and the steps of the method of the present disclosure are not limited to the order specifically described above, unless specifically stated otherwise.
  • the present disclosure can also be implemented as programs recorded in a recording medium, and these programs include machine-readable instructions for implementing the method according to the present disclosure.
  • the present disclosure also covers a recording medium storing a program for executing the method according to the present disclosure.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Software Systems (AREA)
  • User Interface Of Digital Computer (AREA)
  • Processing Or Creating Images (AREA)

Abstract

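The embodiments of the present disclosure disclose a multimedia file generation method and apparatus, a storage medium, and an electronic device. The method includes: acquiring a multimedia template corresponding to a preset subject, the multimedia template including at least one editable module; acquiring main body display data of the preset subject corresponding to the editable module, and filling the main body display data into the corresponding editable module in the multimedia template; and generating, according to the multimedia template filled with the main body display data, a multimedia display file corresponding to the preset subject.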

Description

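Multimedia file generation method and apparatus, storage medium, and electronic device
The present disclosure claims priority to Chinese patent application No. CN 202010238531.X, filed with the China Patent Office on March 30, 2020 and entitled "Multimedia file generation method and apparatus, storage medium, and electronic device", the entire contents of which are incorporated herein by reference.
Technical Field
The present disclosure relates to multimedia file generation technology, and in particular to a multimedia file generation method and apparatus, a storage medium, and an electronic device.
Background
With the rapid development of Internet technology, the pace of life keeps accelerating, and people increasingly value efficiency and quality when acquiring information. Traditional data such as text and numbers, however, conveys information in a way that costs time and effort: the reader has to concentrate closely to read and understand it.
Summary
The present disclosure is proposed to solve the above technical problem. Embodiments of the present disclosure provide a multimedia file generation method and apparatus, a storage medium, and an electronic device.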
根据本公开实施例的一个方面,提供了一种多媒体文件生成方法,包括:
获取与预设主体对应的多媒体模板;其中,所述多媒体模板中包括至少一个可编辑模块;
获取与所述可编辑模块对应的所述预设主体的主体展示数据,并将所述主体展示数据填充至所述多媒体模板中对应的可编辑模块;
根据填充了所述主体展示数据的多媒体模板,生成与所述预设主体对应的多媒体展示文件。
可选地,所述获取预设主体对应的多媒体模板,包括:
获取所述预设主体的维度属性集合;其中,所述维度属性集合包括至少一个维度属性;
根据所述预设主体的维度属性集合确定所述预设主体对应的多媒体模板。
可选地,所述获取预设主体对应的多媒体模板之前,还包括:
统计数据库中包括的多个主体对应的多个维度属性;
将所述多个维度属性分类到至少一个预设的维度属性集合中,并确定所述预设的维度属性集合对应的多媒体模板。
可选地,将所述多个维度属性分类到至少一个预设的维度属性集合中,包括:
预设所述多个维度属性信息中每个维度属性对应的优先级;
将所述多个维度属性按照降序排序,得到维度属性序列;
根据所述维度属性序列,将所述多个维度属性分类到至少一个预设的维度属性集合中,并确定所述预设的维度属性集合对应的多媒体模板。
可选地,所述获取与所述可编辑模块对应的所述预设主体的主体展示数据,包括:
确定所述可编辑模块对应的至少一个维度属性;
根据所述预设主体在数据库中查找与可编辑模块对应的维度属性匹配的主体展示数据。
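Optionally, the multimedia template includes multimedia templates of multiple forms, where the multiple multimedia templates correspond to different presentation modes;
acquiring the subject display data of the preset subject corresponding to the editable module, and filling the subject display data into the corresponding editable module in the multimedia template, includes:
acquiring, according to a multimedia template of one form, the subject display data of the preset subject corresponding to the editable module, and filling the subject display data into the corresponding editable module in the multimedia template of that form;
acquiring the subject display data from the editable modules of the multimedia template of that form that has been filled with the subject display data, and filling the acquired subject display data into the corresponding editable modules of the multimedia templates of the other forms among the multiple multimedia templates.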
可选地,所述根据填充了所述主体展示数据的多媒体模板,生成与所述预设主体对应的多媒体展示文件,包括:
合并所述填充了所述主体展示数据的多种形式的多媒体模板,生成与所述预设主体对应的多媒体展示文件。
可选地,所述多媒体模板包括视频模板;
所述将所述主体展示数据填充至所述多媒体模板中对应的可编辑模块,包括:
基于所述视频模板中预设的像素或坐标确定可编辑模块的位置;
将所述主体展示数据填充至确定位置的可编辑模块。
可选地,所述基于所述视频模板中像素或坐标确定可编辑模块的位置,包括:
根据设定像素值在所述视频模板包括的至少一帧视频图像中查找所述设定像素值对应的至少一个坐标点,基于所述至少一个坐标点确定所述可编辑模块的位置;和/或,
根据设定像素值对应的范围在视频模板包括的至少一帧视频图像中查找设定像素值对应的范围内的最大坐标点及最小坐标点,基于最大坐标点及最小坐标点确定可编辑模块的位置;和/或,
根据设定坐标范围在所述视频模板包括的至少一帧视频图像中确定所述可编辑模块的位置。
可选地,所述主体展示数据包括:文本和/或图片;
所述将所述主体展示数据填充至所述确定位置的可编辑模块,包括:
根据所述可编辑模块的位置确定所述可编辑模块的大小;
基于所述可编辑模块的大小对所述文本和/或图片的大小进行调整;
将所述调整后大小与所述可编辑模块相匹配的文本和/或图片填充到所述确定位置的可编辑模块。
可选地,在将所述调整后大小与所述可编辑模块相匹配的文本和/或图片填充到所述确定位置的可编辑模块之前,还包括:
响应于所述调整后的文本和/或图片的大小与所述可编辑模块的大小不匹配,对所述文本进行换行处理或提取关键词,和/或,对所述图片进行缩放处理。
可选地,所述主体展示数据包括:视频;
所述将所述主体展示数据填充至所述确定位置的可编辑模块,包括:
根据所述可编辑模块的位置确定所述可编辑模块的大小;
基于所述可编辑模块的大小对所述视频的显示大小进行调整;
将所述调整后与所述可编辑模块相匹配的视频填充到所述确定位置的可编辑模块,并将所述视频对应的开始时间和结束时间写入所述多媒体展示文件。
可选地,所述多媒体模板包括文本模板;
所述将所述主体展示数据填充至所述多媒体模板中对应的可编辑模块,包括:
提取所述主体展示数据中对应至少一个维度属性的属性值;
将所述属性值按照对应的维度属性填充至至少一个文本模板中对应的可编辑模块;
所述根据填充了所述主体展示数据的多媒体模板,生成与所述预设主体对应的多媒体展示文件,包括:
确定所述至少一个文本模板中每个所述文本模板对应的优先级,根据所述优先级从高到低对所述至少一个文本模板进行排序;
按照所述排序连接所述填充了主体展示数据的至少一个文本模板,得到所述预设主体对应的多媒体展示文件。
可选地,所述多媒体展示文件还包括音频展示文件;
所述生成与所述预设主体对应的多媒体展示文件,包括:
对所述文本模板确定的所述多媒体展示文件按照维度属性进行分段,得到多个具有一定顺序的段落文本;
从多个预设音色中确定一个音色,基于所述音色分别对所述多个段落文本进行音频转换处理,得到多个段落音频;
按照所述多个段落文本之间的顺序连接所述多个段落音频,得到所述音频编辑文件。
可选地,所述多媒体模板包括音频模板;
所述将所述主体展示数据填充至所述多媒体模板中对应的可编辑模块,包括:
提取所述主体展示数据中对应至少一个维度属性的属性值;
从多个预设音色中确定一个音色,基于所述音色将所述属性值进行音频转换,得到至少一段音频;
将所述至少一段音频填充至所述音频模板中对应的可编辑模块。
可选地,所述多媒体展示文件包括:音频展示文件和视频展示文件;
所述方法还包括:
按照所述音频展示文件的时长对应所述视频展示文件的播放速度进行调整;
合成调整后的所述视频展示文件与所述音频展示文件,得到所述预设主体对应的展示文件。
可选地,所述多媒体展示文件还包括:文本展示文件;
所述合成调整后的所述视频展示文件与所述音频展示文件之后,还包括:
基于所述文本展示文件生成字幕文件;
将所述字幕文件嵌入所述合成后的文件中,得到所述预设主体对应的展示文件。
可选地,所述方法还包括:
将预设标识信息生成设定透明度的水印图片,将所述设定透明度的水印图片嵌入所述多媒体展示文件。
可选地,所述多媒体模板包括至少一种展示风格的多媒体模板;获取与预设主体对应的多媒体模板包括:
当待生成多媒体展示文件的预设主体为预设数量时,从与预设主体对应的多媒体模板中获取两种或两种以上展示风格的多媒体模板。
根据本公开实施例的另一方面,提供了一种多媒体文件生成装置,包括:
模板获取模块,用于获取与预设主体对应的多媒体模板;其中,所述多媒体模板中包括至少一个可编辑模块;
信息填充模块,用于获取与所述可编辑模块对应的所述预设主体的主体展示数据,并将所述主体展示数据填充至所述多媒体模板中对应的可编辑模块;
信息展示模块,用于根据填充了所述主体展示数据的多媒体模板,生成与所述预设主体对应的多媒体展示文件。
可选地,所述模板获取模块,用于获取所述预设主体的维度属性集合;其中,所述维度属性集合包括至少一个维度属性;根据所述预设主体的维度属性集合确定所述预设主体对应的多媒体模板。
可选地,所述装置还包括:
属性统计模块,用于统计数据库中包括的多个主体对应的多个维度属性;
属性分配模块,用于将所述多个维度属性分类到至少一个预设的维度属性集合中,并确定所述预设的维度属性集合对应的多媒体模板。
可选地,所述属性分配模块,用于预设所述多个维度属性信息中每个维度属性对应的优先级;将所述多个维度属性按照降序排序,得到维度属性序列;根据所述维度属性序列,将所述多个维度属性分类到至少一个预设的维度属性集合中,并确定所述预设的维度属性集合对应的多媒体模板。
可选地,所述信息填充模块,用于确定所述可编辑模块对应的至少一个维度属性;根据所述预设主体在数据库中查找与可编辑模块对应的维度属性匹配的主体展示数据。
可选地,所述多媒体模板包括多种形式的多媒体模板;其中,所述多种多媒体模板对应不同的展示方式;
所述信息填充模块,用于根据一种形式的多媒体模板获取与所述可编辑模块对应的所述预设主体的主体展示数据,并将所述主体展示数据填充至所述一种形式的多媒体模板中对应的可编辑模块;获取所述填充了所述主体展示数据的一种形式的多媒体模板中可编辑模块的主体展示数据,并将所述获取的主体展示数据填充至所述多种多媒体模板中的其他形式的多媒体模板中对应的可编辑模块。
可选地,所述信息填充模块在根据填充了所述主体展示数据的多媒体模板,生成与所述预设主体对应的多媒体展示文件时,用于合并所述填充了所述主体展示数据的多种形式的多媒体模板,生成与所述预设主体对应的多媒体展示文件。
可选地,所述多媒体模板包括视频模板;
所述信息填充模块,包括:
位置确定单元,用于基于所述视频模板中预设的像素或坐标确定可编辑模块的位置;
模块填充单元,用于将所述主体展示数据填充至确定位置的可编辑模块。
可选地,所述位置确定单元,用于根据设定像素值在所述视频模板包括的至少一帧视频图像中查找所述设定像素值对应的至少一个坐标点,基于所述至少一个坐标点确定所述可编辑模块的位置;和/或,
根据设定像素值对应的范围在视频模板包括的至少一帧视频图像中查找设定像素值对应的范围内的最大坐标点及最小坐标点,基于最大坐标点及最小坐标点确定可编辑模块的位置;和/或,
根据设定坐标范围在所述视频模板包括的至少一帧视频图像中确定所述可编辑模块的位置。
可选地,所述主体展示数据包括:文本和/或图片;
所述模块填充单元,用于根据所述可编辑模块的位置确定所述可编辑模块的大小;基于所述可编辑模块的大小对所述文本和/或图片的大小进行调整;将所述调整后大小与所述可编辑模块相匹配的文本和/或图片填充到所述确定位置的可编辑模块。
可选地,所述模块填充单元,还用于响应于所述调整后的文本和/或图片的大小与所述可编辑模块的大小不匹配,对所述文本进行换行处理或提取关键词,和/或,对所述图片进行缩放处理。
可选地,所述主体展示数据包括:视频;
所述模块填充单元,用于根据所述可编辑模块的位置确定所述可编辑模块的大小;基于所述可编辑模块的大小对所述视频的显示大小进行调整;将所述调整后与所述可编辑模块相匹配的视频填充到所述确定位置的可编辑模块,并将所述视频对应的开始时间和结束时间写入所述多媒体展示文件。
可选地,所述多媒体模板包括文本模板;
所述信息填充模块,用于提取所述主体展示数据中对应至少一个维度属性的属性值;将所述属性值按照对应的维度属性填充至至少一个文本模板中对应的可编辑模块;
所述信息展示模块,用于确定所述至少一个文本模板中每个所述文本模板对应的优先级,根据所述优先级从高到低对所述至少一个文本模板进行排序;按照所述排序连接所述填充了主体展示数据的至少一个文本模板,得到所述预设主体对应的多媒体展示文件。
可选地,所述多媒体展示文件还包括音频展示文件;
所述信息展示模块,用于对所述文本模板确定的所述多媒体展示文件按照维度属性进行分段,得到多个具有一定顺序的段落文本;从多个预设音色中确定一个音色,基于所述音色分别对所述多个段落文本进行音频转换处理,得到多个段落音频;按照所述多个段落文本之间的顺序连接所述多个段落音频,得到所述音频编辑文件。
可选地,所述多媒体模板包括音频模板;
所述信息填充模块,用于提取所述主体展示数据中对应至少一个维度属性的属性值;从多个预设音色中确定一个音色,基于所述音色将所述属性值进行音频转换,得到至少一段音频;将所述至少一段音频填充至所述音频模板中对应的可编辑模块。
可选地,所述多媒体展示文件包括:音频展示文件和视频展示文件;
所述装置还包括:
按照所述音频展示文件的时长对应所述视频展示文件的播放速度进行调整;
合成调整后的所述视频展示文件与所述音频展示文件,得到所述预设主体对应的展示文件。
可选地,所述多媒体展示文件还包括:文本展示文件;
所述装置还包括:
基于所述文本展示文件生成字幕文件;
将所述字幕文件嵌入所述合成后的文件中,得到所述预设主体对应的展示文件。
可选地,所述装置还包括:
将预设标识信息生成设定透明度的水印图片,将所述设定透明度的水印图片嵌入所述多媒体展示文件。
可选地,所述多媒体模板包括至少一种展示风格的多媒体模板;所述模板获取模块,用于当待生成多媒体展示文件的预设主体为预设数量时,从与预设主体对应的多媒体模板中获取两种或两种以上展示风格的多媒体模板。
根据本公开实施例的又一个方面,提供了一种计算机可读存储介质,所述存储介质存储有计算机程序,所述计算机程序用于执行上述任一实施例所述的多媒体文件生成方法。
根据本公开实施例的还一方面,提供了一种电子设备,所述电子设备包括:
处理器;
用于存储所述处理器可执行指令的存储器;
所述处理器,用于从所述存储器中读取所述可执行指令,并执行所述指令以实现上述任一实施例所述的多媒体文件生成方法。
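Based on the multimedia file generation method and apparatus, storage medium, and electronic device provided by the above embodiments of the present disclosure, a multimedia template corresponding to a preset subject is acquired, the multimedia template including at least one editable module; subject display data of the preset subject corresponding to the editable module is acquired and filled into the corresponding editable module in the multimedia template; and a multimedia presentation file corresponding to the preset subject is generated according to the multimedia template filled with the subject display data. By filling the corresponding subject display data into the multimedia template, the embodiments of the present disclosure present the preset subject in multimedia form, which makes the information more intuitive and improves both the efficiency of information presentation and the user experience. In addition, the multimedia presentation file is generated without manual involvement, which saves labor cost and allows the multimedia presentation file of a preset subject to be generated automatically; in particular, when multimedia files need to be generated in batches, the generation efficiency is significantly improved.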
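The technical solutions of the present disclosure are described in further detail below with reference to the drawings and embodiments.
Brief Description of the Drawings
The drawings, which form a part of the specification, illustrate embodiments of the present disclosure and, together with the description, serve to explain the principles of the present disclosure.
The present disclosure can be understood more clearly from the following detailed description with reference to the drawings, in which:
Fig. 1 is a schematic flowchart of a multimedia file generation method provided by an exemplary embodiment of the present disclosure.
Fig. 2 is a schematic flowchart of step 102 in the embodiment shown in Fig. 1.
Fig. 3 is a schematic flowchart of a multimedia file generation method provided by another exemplary embodiment of the present disclosure.
Fig. 4 is a schematic flowchart of step 104 in the embodiment shown in Fig. 1.
Fig. 5 is another schematic flowchart of step 104 in the embodiment shown in Fig. 1.
Fig. 6 is yet another schematic flowchart of step 104 in the embodiment shown in Fig. 1.
Fig. 7 is a schematic flowchart of a multimedia file generation method provided by yet another exemplary embodiment of the present disclosure.
Fig. 8 is still another schematic flowchart of step 104 in the embodiment shown in Fig. 1.
Fig. 9 is a schematic flowchart of a multimedia file generation method provided by still another exemplary embodiment of the present disclosure.
Fig. 10 is a schematic structural diagram of a multimedia file generation apparatus provided by an exemplary embodiment of the present disclosure.
Fig. 11 is a structural diagram of an electronic device provided by an exemplary embodiment of the present disclosure.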
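Detailed Description
Various exemplary embodiments of the present disclosure will now be described in detail with reference to the drawings. Note that, unless specifically stated otherwise, the relative arrangement of components and steps, the numerical expressions, and the numerical values set forth in these embodiments do not limit the scope of the present disclosure.
Those skilled in the art will understand that terms such as "first" and "second" in the embodiments of the present disclosure are used only to distinguish different steps, devices, modules, and the like; they neither carry any specific technical meaning nor indicate a necessary logical order.
It should also be understood that, in the embodiments of the present disclosure, "a plurality of" may mean two or more, and "at least one" may mean one, two, or more.
It should also be understood that any component, data, or structure mentioned in the embodiments of the present disclosure may generally be understood as one or more, unless it is explicitly limited or the context indicates otherwise.
In addition, the term "and/or" in the present disclosure merely describes an association between associated objects and indicates that three relationships are possible. For example, "A and/or B" may mean: A alone, both A and B, or B alone. The character "/" in the present disclosure generally indicates an "or" relationship between the objects before and after it.
It should also be understood that the description of the embodiments emphasizes the differences between them; for their identical or similar parts, the embodiments may be referred to one another, and these parts are not repeated for brevity.
It should also be understood that, for ease of description, the dimensions of the parts shown in the drawings are not drawn to actual scale.
The following description of at least one exemplary embodiment is merely illustrative and in no way limits the present disclosure or its application or use.
Techniques, methods, and devices known to those of ordinary skill in the relevant art may not be discussed in detail, but where appropriate they should be regarded as part of the specification.
Note that similar reference numerals and letters denote similar items in the following drawings; once an item is defined in one drawing, it need not be discussed further in subsequent drawings.
Embodiments of the present disclosure may be applied to a computer system/server, which can operate with numerous other general-purpose or special-purpose computing system environments or configurations. Examples of well-known computing systems, environments, and/or configurations suitable for use with the computer system/server include, but are not limited to: personal computer systems, server computer systems, thin clients, thick clients, handheld or laptop devices, microprocessor-based systems, set-top boxes, programmable consumer electronics, networked personal computers, minicomputer systems, mainframe computer systems, and distributed cloud computing environments that include any of the above systems.
The computer system/server may be described in the general context of computer-system-executable instructions (such as program modules) executed by a computer system. Generally, program modules may include routines, programs, object programs, components, logic, data structures, and so on, which perform particular tasks or implement particular abstract data types. The computer system/server may be implemented in a distributed cloud computing environment, in which tasks are performed by remote processing devices linked through a communication network. In a distributed cloud computing environment, program modules may be located on local or remote computing system storage media that include storage devices.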
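Exemplary Method
Fig. 1 is a schematic flowchart of a multimedia file generation method provided by an exemplary embodiment of the present disclosure. This embodiment can be applied to an electronic device and, as shown in Fig. 1, includes the following steps:
Step 102: acquire a multimedia template corresponding to a preset subject.
The multimedia template includes at least one editable module, and any content can be filled into an editable module.
In some optional embodiments, the preset subject may include, but is not limited to, a company, a group, or an individual, or a virtual character or a category of articles. Different preset subjects may correspond to the same or different multimedia templates. Multimedia technology uses a computer to store and manage various kinds of information such as language and text, data, audio, and video, so that users can exchange information with the computer in real time through multiple senses. Optionally, the multimedia template may include templates in multiple forms of expression, for example language and text, data, audio, and video.
In an optional example, step 102 may be performed by a processor invoking corresponding instructions stored in a memory, or by the template acquisition module 11 run by the processor.
Step 104: acquire subject display data of the preset subject corresponding to the editable module, and fill the subject display data into the corresponding editable module in the multimedia template.
In this embodiment, the subject display data can be obtained from the public data of the preset subject. Taking a company as the preset subject, for example, the display-related data can be obtained from the company's public data and may include data on the administrative division of the company's registered address, registered capital data, investment and financing data, and so on. Optionally, the acquired subject display data may correspond to different dimension attributes. The subject display data is filled into the corresponding editable modules according to these dimension attributes, so that the multimedia template carries information specific to the preset subject, which provides the basis for presenting the preset subject with the multimedia template. The different dimension attributes may be attributes such as the administrative division of the company's registered address, registered capital, and investment and financing.
In an optional example, step 104 may be performed by a processor invoking corresponding instructions stored in a memory, or by the information filling module 12 run by the processor.
Step 106: generate, according to the multimedia template filled with the subject display data, a multimedia presentation file corresponding to the preset subject.
In this embodiment, because multimedia templates can have multiple presentation modes, the resulting multimedia presentation file can also include multiple presentation modes. Moreover, when the multimedia presentation file is generated from the filled multimedia templates, several multimedia templates can be combined to produce a multimedia presentation file with a new presentation mode. For example, a multimedia template in audio form and a multimedia template in video form can be combined into a multimedia presentation file that has both audio and video.
In an optional example, step 106 may be performed by a processor invoking corresponding instructions stored in a memory, or by the information presentation module 13 run by the processor.
In the multimedia file generation method provided by the above embodiment of the present disclosure, a multimedia template corresponding to a preset subject is acquired, the multimedia template including at least one editable module; subject display data of the preset subject corresponding to the editable module is acquired and filled into the corresponding editable module in the multimedia template; and a multimedia presentation file corresponding to the preset subject is generated according to the multimedia template filled with the subject display data. By filling the corresponding subject display data into the multimedia template, the embodiment presents the preset subject in multimedia form, which makes the information more intuitive and improves both the efficiency of information presentation and the user experience. In addition, the multimedia presentation file is generated without manual involvement, which saves labor cost and allows the multimedia presentation file of a preset subject to be generated automatically; in particular, when multimedia files need to be generated in batches, the generation efficiency is significantly improved.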
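As a rough illustration of steps 102, 104, and 106, the following minimal Python sketch fills a text-only template and renders it to a string. The class names, field names, the hard-coded template, and the sample company data are all hypothetical; the disclosure does not prescribe any particular data structure.

```python
from dataclasses import dataclass, field


@dataclass
class EditableModule:
    name: str          # hypothetical identifier of the slot
    dimension: str     # dimension attribute the slot displays
    content: str = ""  # filled in during step 104


@dataclass
class MultimediaTemplate:
    kind: str                                   # e.g. "text", "audio", "video"
    modules: list = field(default_factory=list)


def get_template(subject_type: str) -> MultimediaTemplate:
    """Step 102: return a template for the preset subject (hard-coded here)."""
    return MultimediaTemplate(
        kind="text",
        modules=[
            EditableModule("region", "registered_region"),
            EditableModule("capital", "registered_capital"),
        ],
    )


def fill_template(template: MultimediaTemplate, subject_data: dict) -> MultimediaTemplate:
    """Step 104: fill each editable module with the matching display data."""
    for module in template.modules:
        module.content = subject_data.get(module.dimension, "")
    return template


def generate_file(template: MultimediaTemplate) -> str:
    """Step 106: render the filled template (a plain string stands in for the file)."""
    return "\n".join(f"{m.name}: {m.content}" for m in template.modules)


# Assumed sample data for a company used as the preset subject.
company = {"registered_region": "Beijing", "registered_capital": "10 million CNY"}
presentation = generate_file(fill_template(get_template("company"), company))
print(presentation)
```

A real implementation would render audio or video rather than a string; the sketch only shows how the three steps hand data to one another.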
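As shown in Fig. 2, on the basis of the embodiment shown in Fig. 1, step 102 may include the following steps:
Step 1021: acquire the dimension attribute set of the preset subject.
The dimension attribute set includes at least one dimension attribute.
Step 1022: determine the multimedia template corresponding to the preset subject according to the dimension attribute set of the preset subject.
In this embodiment, a preset subject has multiple dimension attributes, and different preset subjects may differ in the number and category of their dimension attributes. This embodiment uses a dimension attribute set to represent the at least one dimension attribute of a preset subject. When multimedia templates are set up, multiple picture frames or text modules can be created for multiple dimension attributes (for example, one picture frame or text module per dimension attribute); when a multimedia template covers several dimension attributes, the modules corresponding to those dimension attributes are spliced together to obtain the multimedia template. Alternatively, a multimedia template can be created separately for each of several different dimension attribute sets.
Fig. 3 is a schematic flowchart of a multimedia file generation method provided by another exemplary embodiment of the present disclosure. As shown in Fig. 3, the method provided by this embodiment includes:
Step 301: collect the multiple dimension attributes corresponding to the multiple subjects included in a database.
In this embodiment, the multiple subjects may belong to the same category, for example they may all be companies. Although the subjects differ in their dimension attributes, a definite number of dimension attributes can be determined by taking the union of the dimension attributes of the subjects. For example, if one subject has dimension attribute 1, dimension attribute 2, and dimension attribute 3, and another subject has dimension attribute 1, dimension attribute 4, and dimension attribute 5, then a database containing these two subjects corresponds to five dimension attributes: dimension attributes 1, 2, 3, 4, and 5.
Step 302: classify the multiple dimension attributes into at least one preset dimension attribute set, and determine the multimedia template corresponding to each preset dimension attribute set.
Step 102: acquire a multimedia template corresponding to a preset subject.
The multimedia template includes at least one editable module.
Optionally, an editable module may be a blank area or blank text box of a preset size. Data of any form (for example, text, audio, pictures, or video) can be filled into the editable module, and the module may be rectangular, circular, triangular, or of another, irregular shape.
Step 104: acquire subject display data of the preset subject corresponding to the editable module, and fill the subject display data into the corresponding editable module in the multimedia template.
Step 106: generate, according to the multimedia template filled with the subject display data, a multimedia presentation file corresponding to the preset subject.
In this embodiment, at least one preset dimension attribute set is determined from the different combinations of the multiple dimension attributes, and a corresponding multimedia template is created for at least one dimension attribute set (for example, one for each dimension attribute set). Each preset dimension attribute set can be regarded as corresponding to at least one subject. Creating a multimedia template for each dimension attribute set ensures that every subject corresponds to at least one multimedia template, which provides the basis for subsequently acquiring the multimedia template of a preset subject.
Optionally, on the basis of the embodiment shown in Fig. 3, step 302 may include:
presetting a priority for each of the multiple dimension attributes;
sorting the multiple dimension attributes in descending order of priority to obtain a dimension attribute sequence;
classifying, according to the dimension attribute sequence, the multiple dimension attributes into at least one preset dimension attribute set, and determining the multimedia template corresponding to each preset dimension attribute set.
In this embodiment, a priority is preset for each dimension attribute, so the dimension attributes can be sorted by priority, and the dimension attribute sets obtained by classifying the sequence determine several kinds of multimedia templates. For example, the multimedia templates include a basic dimension template and/or specific dimension templates. A preset number of dimension attributes are classified into a first dimension set, and the basic dimension template is determined from the preset number of dimension attributes in the first dimension set. The remaining dimension attributes in the sequence are classified into at least one second dimension set, and one specific dimension template is determined from the preset number of dimension attributes in each second dimension set, giving at least one specific dimension template. The preset number of dimension attributes for the first dimension set can be taken from the dimension attribute sequence in order; in that case the basic dimension template has a higher priority than the specific dimension templates, that is, the basic dimension template usually represents the dimension attributes that most subjects have.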
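The union in step 301 and the priority-based classification in step 302 can be sketched as follows. This is only one possible reading: the attribute names, the priority values, and the set sizes (`base_size`, `specific_size`) are assumptions made for illustration.

```python
def collect_attributes(subjects: dict) -> set:
    """Step 301: union of the dimension attributes of all subjects."""
    return set().union(*subjects.values())


def classify_attributes(priorities: dict, base_size: int, specific_size: int):
    """Step 302: sort by priority (highest first), then split the sequence.

    The first `base_size` attributes form the first (basic) dimension set;
    the remainder is cut into second (specific) sets of `specific_size`.
    """
    ordered = sorted(priorities, key=lambda name: priorities[name], reverse=True)
    base_set = ordered[:base_size]
    rest = ordered[base_size:]
    specific_sets = [rest[i:i + specific_size] for i in range(0, len(rest), specific_size)]
    return base_set, specific_sets


# Two subjects with partly different attributes, as in the example above.
subjects = {
    "subject_a": {"attr1", "attr2", "attr3"},
    "subject_b": {"attr1", "attr4", "attr5"},
}
all_attributes = collect_attributes(subjects)

# Hypothetical priorities: a larger number means a more important attribute.
priorities = {"attr1": 5, "attr2": 4, "attr3": 3, "attr4": 2, "attr5": 1}
base_set, specific_sets = classify_attributes(priorities, base_size=3, specific_size=2)
print(base_set, specific_sets)
```

Each resulting set would then be mapped to its own template, with the basic set feeding the basic dimension template.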
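As shown in Fig. 4, on the basis of the embodiment shown in Fig. 1, step 104 may include the following steps:
Step 1041: determine at least one dimension attribute corresponding to the editable module.
Step 1042: search the database, according to the preset subject, for the subject display data that matches the dimension attribute corresponding to the editable module.
In this embodiment, each multimedia template includes multiple editable modules and corresponds to multiple dimension attributes, so each editable module in the multimedia template corresponds to one dimension attribute, and several editable modules may correspond to the same dimension attribute. To fill the editable modules of the multimedia template, this embodiment searches the information about the preset subject stored in the database for the subject display data that corresponds to the dimension attributes of the editable modules. Once obtained, the subject display data is placed into the corresponding editable modules according to dimension attribute, so that the subject display data is filled in automatically.
As shown in Fig. 5, on the basis of the embodiment shown in Fig. 1, step 104 may further include the following steps:
Step 1043: acquire, according to a multimedia template of one form, the subject display data of the preset subject corresponding to the editable module, and fill the subject display data into the corresponding editable module in the multimedia template of that form.
Step 1044: acquire the subject display data from the editable modules of the multimedia template of that form that has been filled with the subject display data, and fill the acquired subject display data into the corresponding editable modules of the multimedia templates of the other forms among the multiple multimedia templates.
Optionally, the multimedia template includes multimedia templates of multiple forms, such as a video-form template, a text-form template, and an audio-form template. In this embodiment, several multimedia templates of different forms are obtained for the preset subject, and these templates correspond to different presentation modes. The subject display data can first be acquired and filled in through a multimedia template of one form (presentation mode). The multimedia templates of the other forms can then obtain the corresponding subject display data from the template that has already been filled and fill it into their own editable modules, without repeating the step of acquiring the subject display data from the database. This improves the efficiency of both acquiring and filling the subject display data.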
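The lookup in steps 1041 and 1042 and the reuse in steps 1043 and 1044 might look like the sketch below. The `CountingDatabase` class, the module names, and the sample record are hypothetical; the query counter exists only to show that the second template is filled without touching the database again.

```python
class CountingDatabase:
    """Toy in-memory database that counts how often it is queried."""

    def __init__(self, records: dict):
        self.records = records
        self.queries = 0

    def fetch(self, subject: str, dimension: str):
        self.queries += 1
        return self.records[subject].get(dimension)


def fill_from_database(module_dimensions: dict, subject: str, db: CountingDatabase) -> dict:
    """Steps 1041-1043: look up the display data for each module's dimension."""
    return {module: db.fetch(subject, dim) for module, dim in module_dimensions.items()}


def fill_from_filled(module_dimensions: dict, source_dimensions: dict, source_filled: dict) -> dict:
    """Step 1044: reuse values from an already filled template, matched by dimension."""
    by_dimension = {source_dimensions[m]: value for m, value in source_filled.items()}
    return {module: by_dimension.get(dim) for module, dim in module_dimensions.items()}


db = CountingDatabase({"Example Co.": {"region": "Beijing", "capital": "10 million CNY"}})

# Module name -> dimension attribute, for a text-form and a video-form template.
text_modules = {"t_region": "region", "t_capital": "capital"}
video_modules = {"v_capital": "capital", "v_region": "region"}

text_filled = fill_from_database(text_modules, "Example Co.", db)
video_filled = fill_from_filled(video_modules, text_modules, text_filled)
print(video_filled, db.queries)
```

Matching by dimension attribute rather than by module name lets templates of different forms use different layouts and slot names.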
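Optionally, on the basis of the embodiment shown in Fig. 5, step 106 may include:
merging the multimedia templates of multiple forms that have been filled with the subject display data to generate the multimedia presentation file corresponding to the preset subject.
In the above embodiment, the multimedia templates obtained for the preset subject include multimedia templates of multiple forms, for example a multimedia template in audio form (audio-form template), a multimedia template in video form (video-form template), and a multimedia template in text form (text-form template) generated for the preset subject. After the subject display data has been filled into multimedia templates of at least two of these forms, the filled templates of at least two forms are merged to obtain a multimedia presentation file that presents the preset subject in several ways at once. This improves the efficiency of information presentation, because the user can view the subject display data in several forms of expression at the same time.
As shown in Fig. 6, on the basis of the embodiment shown in Fig. 1, when the multimedia template includes a video template, step 104 may also include the following steps:
Step 1045: determine the position of the editable module based on preset pixels or coordinates in the video template.
Optionally, step 1045 may include: searching at least one video frame of the video template for at least one coordinate point corresponding to a set pixel value, and determining the position of the editable module based on the at least one coordinate point; and/or,
searching at least one video frame of the video template, according to a range of set pixel values, for the maximum coordinate point and the minimum coordinate point within that range, and determining the position of the editable module based on the maximum and minimum coordinate points, where the pixel range includes at least two pixel values with adjacent values; and/or,
determining the position of the editable module in at least one video frame of the video template according to a set coordinate range.
Step 1046: fill the subject display data into the editable module at the determined position.
For the first approach in step 1045, in which the coordinate points are found from a set pixel value, a pixel value can be set in advance and the image searched for the coordinate points that hold this pixel value; the at least one coordinate point obtained is the position of the editable module. For example, the positions in the video image whose pixel value is [0,0,0] are reserved as the position of the editable module, which appears in the video template as a black area in the video image.
For the second approach in step 1045, in which the coordinate points are found from a range of set pixel values, a range can be set in advance and the image searched for the coordinate points whose pixel values fall within that range; the at least one coordinate point obtained is the position of the editable module. For example, the range of set pixel values in the video image is [0,0,0] to [0,0,5], which appears in the image as an area of similar colors, and the position of the editable module is determined by finding the coordinates whose pixel values lie within this range.
The position of the editable module can also be determined by presetting its coordinate range; with the coordinate range known, the coordinates can be looked up to determine the region that corresponds to the editable module.
For the third approach in step 1045, in which a coordinate range is set, the coordinate range determines a region in the video image, and this region is the position of the editable module. To determine the region, the vertices of the editable module can first be determined from the coordinate range, and the position of the editable module then determined from the vertices. For example, if the coordinate range is set to [0,0] to [5,5], the four vertices [0,0], [0,5], [5,0], and [5,5] can be determined from this range, and these four vertices determine the region in the video image, that is, the position of the editable module.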
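The three ways of locating an editable module can be sketched on a tiny frame stored as nested lists of RGB tuples. The frame contents and the function names are made up for illustration, and a rectangular module is assumed; a production system would more likely operate on decoded video frames with an image library.

```python
def find_box_by_pixel_range(frame, low, high):
    """Return (min_x, min_y, max_x, max_y) of all pixels whose channels lie in [low, high].

    With low == high this is the single-pixel-value search; otherwise it is
    the pixel-range search based on the minimum and maximum coordinate points.
    """
    matches = [
        (x, y)
        for y, row in enumerate(frame)
        for x, pixel in enumerate(row)
        if all(lo <= c <= hi for c, lo, hi in zip(pixel, low, high))
    ]
    if not matches:
        return None
    xs = [x for x, _ in matches]
    ys = [y for _, y in matches]
    return min(xs), min(ys), max(xs), max(ys)


def box_from_coordinate_range(top_left, bottom_right):
    """Return the four vertices of the region spanned by a set coordinate range."""
    (x0, y0), (x1, y1) = top_left, bottom_right
    return [(x0, y0), (x0, y1), (x1, y0), (x1, y1)]


WHITE, BLACK, NEAR_BLACK = (255, 255, 255), (0, 0, 0), (0, 0, 5)

# A 5x4 frame: white background, a black placeholder block, one near-black pixel.
frame = [[WHITE] * 5 for _ in range(4)]
for y in (1, 2):
    for x in (1, 2, 3):
        frame[y][x] = BLACK
frame[3][3] = NEAR_BLACK

exact_box = find_box_by_pixel_range(frame, (0, 0, 0), (0, 0, 0))
ranged_box = find_box_by_pixel_range(frame, (0, 0, 0), (0, 0, 5))
vertices = box_from_coordinate_range((0, 0), (5, 5))
print(exact_box, ranged_box, vertices)
```

The range search also picks up the near-black pixel, so its bounding box extends one row further than the exact-value search.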
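Optionally, on the basis of the embodiment shown in Fig. 6, when the subject display data includes text and/or pictures, step 1046 may include:
determining the size of the editable module according to its position;
adjusting the size of the text and/or picture based on the size of the editable module;
filling the text and/or picture, whose adjusted size matches the editable module, into the editable module at the determined position.
In this embodiment, the position reserved in the image of the video template is fixed. Therefore, before the text and/or picture of the subject is embedded into the video template, its size is adjusted (for example, by adjusting the number of pixels) to obtain text and/or a picture whose size fits the preset position, which is then embedded into the editable module in the video template. Optionally, the embedding into the video template may be dynamic or static, and different embedding modes produce different video display effects. In special cases, in response to the size of the adjusted text and/or picture not matching the size of the editable module, line wrapping or keyword extraction is applied to the text, and/or scaling is applied to the picture. This step handles the case in which the text and/or picture is still larger than the editable module after compression: so that the text and/or picture can still be embedded into the video template, it is processed as described above to ensure that the main content is shown in the video template.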
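One way to fit content into an editable module is sketched below, under several assumptions that are not in the source: the module is rectangular, characters have a fixed width and height, pictures keep their aspect ratio, and the keywords are supplied by the caller rather than extracted automatically.

```python
import textwrap


def fit_image(image_size, box_size):
    """Scale an image to fit inside the box while preserving its aspect ratio."""
    image_w, image_h = image_size
    box_w, box_h = box_size
    scale = min(box_w / image_w, box_h / image_h)
    return max(1, int(image_w * scale)), max(1, int(image_h * scale))


def fit_text(text, box_size, char_size, keywords=()):
    """Wrap text to the box; if it still overflows, fall back to the keywords."""
    box_w, box_h = box_size
    char_w, char_h = char_size
    chars_per_line = max(1, box_w // char_w)
    max_lines = max(1, box_h // char_h)

    lines = textwrap.wrap(text, width=chars_per_line)
    if len(lines) <= max_lines:
        return lines

    # The wrapped text is too tall: keep only the keywords found in the text.
    shortened = " ".join(k for k in keywords if k in text)
    return textwrap.wrap(shortened, width=chars_per_line)[:max_lines]


picture_size = fit_image((800, 600), (200, 200))
short_lines = fit_text("Beijing", (100, 40), (10, 20))
long_lines = fit_text(
    "Registered capital 10 million CNY",
    (100, 40),
    (10, 20),
    keywords=("capital", "10 million"),
)
print(picture_size, short_lines, long_lines)
```

Wrapping is tried first because it preserves all content; keyword extraction is the lossy fallback for the case where wrapped text still exceeds the module.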
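Optionally, on the basis of the embodiment shown in Fig. 6, when the subject display data further includes video, step 1046 may include:
determining the size of the editable module according to its position;
adjusting the display size of the video based on the size of the editable module;
filling the adjusted video, which matches the editable module, into the editable module at the determined position, and writing the start time and end time of the video into the multimedia presentation file.
Embedding a video into the video template is similar to embedding a picture: the display size of the video must likewise be adjusted in advance so that the video can be shown in the editable module. The difference from a picture is that, when the video is embedded, its playback times (including but not limited to the start time and the end time) must also be written into the multimedia presentation file. When the multimedia presentation file in video form is played, the embedded video then starts playing at the set start time and stops at the end time, which provides a picture-in-picture display.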
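The picture-in-picture behaviour can be modelled as a small record that stores the inset's display size together with its start and end times. The `VideoInset` structure and the choice of a half-open time interval are assumptions for illustration; the source only states that the times are written into the presentation file.

```python
from dataclasses import dataclass


@dataclass
class VideoInset:
    box: tuple           # (x, y, width, height) of the editable module
    display_size: tuple  # scaled (width, height) of the embedded video
    start: float         # seconds into the presentation when playback starts
    end: float           # seconds into the presentation when playback stops

    def is_active(self, t: float) -> bool:
        """True while the embedded video should be visible at time t."""
        return self.start <= t < self.end


def place_video(video_size, box, start, end) -> VideoInset:
    """Scale the video to the module (aspect ratio kept) and record its timing."""
    _, _, box_w, box_h = box
    video_w, video_h = video_size
    scale = min(box_w / video_w, box_h / video_h)
    return VideoInset(box, (int(video_w * scale), int(video_h * scale)), start, end)


inset = place_video((1920, 1080), (100, 50, 480, 360), start=2.0, end=8.0)
print(inset.display_size, inset.is_active(5.0), inset.is_active(8.0))
```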
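Fig. 7 is a schematic flowchart of a multimedia file generation method provided by yet another exemplary embodiment of the present disclosure. As shown in Fig. 7, the multimedia template in this embodiment includes a text template, and the method provided by this embodiment includes the following steps:
Step 102: acquire a multimedia template corresponding to a preset subject.
The multimedia template includes at least one editable module.
Step 703: extract, from the subject display data, the attribute values corresponding to at least one dimension attribute.
Optionally, the acquired subject display data may contain a lot of redundant information. To increase the value of the resulting video, this embodiment extracts multiple attribute values (that is, keywords corresponding to the relevant dimension attributes) from each piece of subject display data.
Step 704: fill the attribute values, according to their dimension attributes, into the corresponding editable modules in at least one text template.
Step 705: determine the priority of each of the at least one text template, and sort the at least one text template by priority from high to low.
Step 706: concatenate the at least one text template filled with the subject display data in the sorted order to obtain the multimedia presentation file corresponding to the preset subject.
Optionally, the priority of each text template can be determined from the priority of its dimension attribute: the more important the dimension attribute, the higher its priority and the higher the priority of the corresponding text template. By sorting the text templates by priority and concatenating them in that order, a standardized textual presentation of the subject display data of the preset subject is obtained, which makes it easier for others to learn about the preset subject.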
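Steps 704 to 706 amount to filling placeholders, sorting by priority, and joining. In the sketch below, Python format strings stand in for text templates with editable modules; the template sentences, priorities, and attribute values are invented for the example.

```python
# Hypothetical text templates; the braces mark the editable modules.
templates = [
    {"priority": 1, "text": "It has raised {financing}."},
    {"priority": 3, "text": "{name} is registered in {region}."},
    {"priority": 2, "text": "Its registered capital is {capital}."},
]

# Attribute values extracted from the subject display data (step 703).
values = {
    "name": "Example Co.",
    "region": "Beijing",
    "capital": "10 million CNY",
    "financing": "Series A funding",
}

# Step 705: highest priority first.
ordered = sorted(templates, key=lambda t: t["priority"], reverse=True)

# Steps 704 and 706: fill each template, then concatenate in sorted order.
document = " ".join(t["text"].format(**values) for t in ordered)
print(document)
```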
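On the basis of the embodiment shown in Fig. 7, the multimedia presentation file further includes an audio presentation file;
in this case, generating the multimedia presentation file corresponding to the preset subject includes:
dividing the multimedia presentation file determined from the text templates into segments by dimension attribute to obtain multiple paragraph texts in a definite order;
determining one timbre from multiple preset timbres, and converting each of the paragraph texts into audio based on that timbre to obtain multiple paragraph audios;
concatenating the paragraph audios in the order of the paragraph texts to obtain the audio presentation file.
In this embodiment, the corresponding audio presentation file is generated from the text-form multimedia presentation file that has already been filled with the subject display data. Segmenting improves how well the audio matches the text and reduces errors. Presetting multiple timbres makes the resulting audio more varied and avoids the poor user experience caused by using the same timbre for every subject.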
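As shown in Fig. 8, on the basis of the embodiment shown in Fig. 1, when the multimedia template includes an audio template, step 104 may further include the following steps:
Step 1047: extract, from the subject display data, the attribute values corresponding to at least one dimension attribute.
Step 1048: determine one timbre from multiple preset timbres, and convert the attribute values into audio based on that timbre to obtain at least one audio segment.
Step 1049: fill the at least one audio segment into the corresponding editable module in the audio template.
In this embodiment, one timbre is determined from multiple preset timbres for the attribute values of the dimension attributes. Optionally, the timbre is determined based on the timbre in the audio template and needs to be consistent with it. The attribute values are converted into audio with existing audio conversion technology, and the resulting audio segments are filled into the corresponding editable modules in the audio template to produce a complete piece of audio, so that the preset subject is presented in audio form.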
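Fig. 9 is a schematic flowchart of a multimedia file generation method provided by still another exemplary embodiment of the present disclosure. As shown in Fig. 9, the multimedia presentation file includes an audio presentation file and a video presentation file, and the method includes the following steps:
Step 102: acquire a multimedia template corresponding to a preset subject.
The multimedia template includes at least one editable module.
Step 104: acquire subject display data of the preset subject corresponding to the editable module, and fill the subject display data into the corresponding editable module in the multimedia template.
Step 106: generate, according to the multimedia template filled with the subject display data, a multimedia presentation file corresponding to the preset subject.
Step 908: adjust the playback speed of the video presentation file according to the duration of the audio presentation file.
Step 910: combine the adjusted video presentation file with the audio presentation file to obtain the presentation file corresponding to the preset subject.
In this embodiment, the audio and video can be combined with any feasible technique in the prior art; this embodiment does not limit the specific technique. When the audio presentation file and the video presentation file differ in length, the duration of the audio presentation file prevails: the audio duration is preserved (so that the audio is not distorted) and the playback speed of the video presentation file is adjusted so that the two files have the same duration. Optionally, to keep the audio presentation file and the video presentation file matched, they can be combined segment by segment following the segments of the audio presentation file. That is, the video presentation file is divided by attribute category into multiple video segments, each paragraph audio in the audio presentation file is combined with the video segment of the same attribute category to obtain multiple introduction video segments, and these segments are concatenated to obtain the introduction video.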
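The speed adjustment in step 908 reduces to one ratio per segment: playing a video segment at `video_duration / audio_duration` times its normal speed makes it last exactly as long as the audio. The segment names and durations below are hypothetical, and the actual re-timing and muxing are left to whatever media tool is used.

```python
def speed_factor(video_seconds: float, audio_seconds: float) -> float:
    """Playback speed multiplier that makes the video last as long as the audio."""
    if video_seconds <= 0 or audio_seconds <= 0:
        raise ValueError("durations must be positive")
    return video_seconds / audio_seconds


# (attribute category, video segment duration, paragraph audio duration) in seconds.
segments = [
    ("region", 6.0, 4.0),   # video too long: speed up
    ("capital", 3.0, 6.0),  # video too short: slow down
]

plan = [(name, speed_factor(video, audio)) for name, video, audio in segments]
adjusted_total = sum(video / factor for (_, video, _), (_, factor) in zip(segments, plan))
print(plan, adjusted_total)
```

Computing the factor per segment, rather than once for the whole file, keeps each attribute's picture aligned with the audio paragraph that describes it.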
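Optionally, the multimedia presentation file further includes a text presentation file; on the basis of the embodiment shown in Fig. 9, this embodiment further includes:
generating a subtitle file based on the text presentation file;
embedding the subtitle file into the combined file to obtain the presentation file corresponding to the preset subject.
In this embodiment, after the video presentation file is generated, an external subtitle file in a set format (for example, the srt format) can also be generated automatically from the text presentation file and embedded into the video, producing an introduction video with subtitles and improving the user experience.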
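A subtitle file in srt format can be built from the paragraph texts if the duration of each paragraph is known. The sketch below assumes that each paragraph's duration equals the length of its paragraph audio and that paragraphs play back to back; both are assumptions, as are the sample sentences.

```python
def srt_timestamp(seconds: float) -> str:
    """Format seconds as the HH:MM:SS,mmm timestamp used by srt files."""
    millis = int(round(seconds * 1000))
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def build_srt(paragraphs) -> str:
    """Build srt text from (text, duration_seconds) pairs in playback order."""
    blocks = []
    start = 0.0
    for index, (text, duration) in enumerate(paragraphs, start=1):
        end = start + duration
        blocks.append(f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n")
        start = end
    # A blank line separates consecutive subtitle blocks.
    return "\n".join(blocks)


subtitles = build_srt([
    ("Example Co. is registered in Beijing.", 4.0),
    ("Its registered capital is 10 million CNY.", 6.5),
])
print(subtitles)
```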
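Optionally, the method provided by this embodiment further includes:
generating a watermark picture with a set transparency from preset identification information, and embedding the watermark picture with the set transparency into the multimedia presentation file.
In this embodiment, after the multimedia presentation file is generated, a watermark can also be added to improve the security of the multimedia presentation file. Optionally, the watermark can be added in any way known in the prior art; this embodiment does not limit how the watermark is added, and the set transparency can be set or adjusted as needed.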
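Since the source leaves the watermarking technique open, the sketch below uses plain alpha blending as one common option. Here `alpha` is the opacity of the watermark (0 is invisible, 1 is fully opaque), which is an assumed convention for expressing the "set transparency"; frames are nested lists of RGB tuples.

```python
def blend_pixel(base, mark, alpha):
    """Blend one watermark pixel over one frame pixel with the given opacity."""
    return tuple(round(alpha * m + (1 - alpha) * b) for b, m in zip(base, mark))


def embed_watermark(frame, watermark, top_left, alpha):
    """Return a copy of the frame with the watermark blended in at top_left."""
    x0, y0 = top_left
    out = [list(row) for row in frame]  # copy so the input frame is not modified
    for dy, mark_row in enumerate(watermark):
        for dx, mark_pixel in enumerate(mark_row):
            y, x = y0 + dy, x0 + dx
            if 0 <= y < len(out) and 0 <= x < len(out[y]):
                out[y][x] = blend_pixel(out[y][x], mark_pixel, alpha)
    return out


gray_frame = [[(100, 100, 100)] * 2 for _ in range(2)]
white_mark = [[(255, 255, 255)]]
marked = embed_watermark(gray_frame, white_mark, top_left=(1, 0), alpha=0.2)
print(marked)
```

For a video, the same blend would be applied to every frame in which the watermark should appear.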
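In some optional embodiments, the multimedia template includes multimedia templates of at least one presentation style; acquiring the multimedia template corresponding to the preset subject includes:
when the preset subjects for which multimedia presentation files are to be generated are of a preset number, acquiring multimedia templates of two or more presentation styles from the multimedia templates corresponding to the preset subjects.
Optionally, to provide differentiated multimedia presentation files, multimedia templates of multiple presentation styles are provided for the preset subject. For text presentation files, different presentation styles can be realized through different fonts and/or font sizes, or through different text colors, and a text template is selected at random when different subjects are described. For video presentation files, the corresponding video templates can include several video templates that differ in content or appearance (for example, color or style), and a video template is selected at random when different subjects are described, which differentiates the introduction videos. For audio presentation files, variety can be achieved through several selectable timbres. In addition, different combinations (for example, by merging) of the above forms of multimedia presentation files also differentiate the multimedia presentation files.
Any multimedia file generation method provided by the embodiments of the present disclosure can be performed by any suitable device with data processing capability, including but not limited to a terminal device and a server. Alternatively, any multimedia file generation method provided by the embodiments of the present disclosure can be performed by a processor, for example by the processor invoking corresponding instructions stored in a memory to perform any multimedia file generation method mentioned in the embodiments of the present disclosure. This is not repeated below.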
示例性装置
图10是本公开一示例性实施例提供的多媒体文件生成装置的结构示意图。应用于服务端,如图10所示,本实施例包括:
模板获取模块11,用于获取与预设主体对应的多媒体模板。
其中,多媒体模板中包括至少一个可编辑模块。
信息填充模块12,用于获取与可编辑模块对应的预设主体的主体展示数据,并将主体展示数据填充至多媒体模板中对应的可编辑模块。
信息展示模块13,用于根据填充了主体展示数据的多媒体模板,生成与预设主体对应的多媒体展示文件。
本公开上述实施例提供的一种多媒体文件生成装置,通过在多媒体模板中填充对应的主体展示数据,实现对于预设主体的多媒体展示,使信息展示更直观,提升了信息展示的效率和用户体验;并且生成多媒体展示文件的过程无需人工参与,节省了人力成本,实现了自动生成预设主体的多媒体展示文件,提高了多媒体展示文件的生成效率。
可选地,模板获取模块11,用于获取预设主体的维度属性集合;其中,维度属性集合包括至少一个维度属性;根据预设主体的维度属性集合确定预设主体对应的多媒体模板。
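Optionally, the apparatus provided by this embodiment further includes:
an attribute statistics module, configured to collect the multiple dimension attributes corresponding to the multiple subjects included in a database;
an attribute allocation module, configured to classify the multiple dimension attributes into at least one preset dimension attribute set and to determine the multimedia template corresponding to each preset dimension attribute set.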
可选地,属性分配模块,用于预设多个维度属性信息中每个维度属性对应的优先级;将多个维度属性按照降序排序,得到维度属性序列;根据维度属性序列,将多个维度属性分类到至少一个预设的维度属性集合中,并确定预设的维度属性集合对应的多媒体模板。
可选地,信息填充模块12,用于确定可编辑模块对应的至少一个维度属性;根据预设主体在数据库中查找与可编辑模块对应的维度属性匹配的主体展示数据。
可选地,多媒体模板包括多种形式的多媒体模板;其中,多种多媒体模板对应不同的展示方式;
信息填充模块12,用于根据一种形式的多媒体模板获取与可编辑模块对应的预设主体的主体展示数据,并将主体展示数据填充至一种形式的多媒体模板中对应的可编辑模块;获取填充了所述主体展示数据的一种形式的多媒体模板中可编辑模块的主体展示数据,并将获取的主体展示数据填充至多种多媒体模板中的其他形式的多媒体模板中对应的可编辑模块。
可选地,信息填充模块12在根据填充了主体展示数据的多媒体模板,生成与预设主体对应的多媒体展示文件时,用于合并填充了主体展示数据的多种形式的多媒体模板,生成与预设主体对应的多媒体展示文件。
可选地,多媒体模板包括视频模板;
信息填充模块12,包括:
位置确定单元,用于基于视频模板中预设的像素或坐标确定可编辑模块的位置;
模块填充单元,用于将主体展示数据填充至确定位置的可编辑模块。
可选地,位置确定单元,用于根据设定像素值在视频模板包括的至少一帧视频图像中查找设定像素值对应的至少一个坐标点,基于至少一个坐标点确定可编辑模块的位置;和/或,
根据设定像素值对应的范围在视频模板包括的至少一帧视频图像中查找设定像素值对应的范围内的最大坐标点及最小坐标点,基于最大坐标点及最小坐标点确定可编辑模块的位置;和/或,
根据设定坐标范围在视频模板包括的至少一帧视频图像中确定可编辑模块的位置。
可选地,主体展示数据包括:文本和/或图片;
模块填充单元,用于根据可编辑模块的位置确定可编辑模块的大小;基于可编辑模块的大小对文本和/或图片的大小进行调整;将调整后大小与可编辑模块相匹配的文本和/或图片填充到确定位置的可编辑模块。
可选地,模块填充单元,还用于响应于调整后的文本和/或图片的大小与可编辑模块的大小不匹配,对文本进行换行处理或提取关键词,和/或,对图片进行缩放处理。
可选地,主体展示数据包括:视频;
模块填充单元,用于根据可编辑模块的位置确定可编辑模块的大小;基于可编辑模块的大小对所述视频的显示大小进行调整;将调整后与可编辑模块相匹配的视频填充到确定位置的可编辑模块,并将视频对应的开始时间和结束时间写入多媒体展示文件。
可选地,多媒体模板包括文本模板;
信息填充模块12,用于提取主体展示数据中对应至少一个维度属性的属性值;将属性值按照对应的维度属性填充至至少一个文本模板中对应的可编辑模块;
信息展示模块13,用于确定至少一个文本模板中每个文本模板对应的优先级,根据优先级从高到低对至少一个文本模板进行排序;按照排序连接填充了主体展示数据的至少一个文本模板,得到预设主体对应的多媒体展示文件。
可选地,多媒体展示文件还包括音频展示文件;
信息展示模块13,用于对文本模板确定的多媒体展示文件按照维度属性进行分段,得到多个具有一定顺序的段落文本;从多个预设音色中确定一个音色,基于音色分别对多个段落文本进行音频转换处理,得到多个段落音频;按照多个段落文本之间的顺序连接多个段落音频,得到音频编辑文件。
可选地,多媒体模板包括音频模板;
信息填充模块12,用于提取主体展示数据中对应至少一个维度属性的属性值;从多个预设音色中确定一个音色,基于音色将属性值进行音频转换,得到至少一段音频;将至少一段音频填充至音频模板中对应的可编辑模块。
可选地,多媒体展示文件包括:音频展示文件和视频展示文件;
本实施例提供的装置还包括:
按照音频展示文件的时长对应视频展示文件的播放速度进行调整;
合成调整后的视频展示文件与音频展示文件,得到预设主体对应的展示文件。
可选地,多媒体展示文件还包括:文本展示文件;
本实施例提供的装置还包括:
基于文本展示文件生成字幕文件;
将字幕文件嵌入合成后的文件中,得到预设主体对应的展示文件。
可选地,本实施例提供的装置还包括:
将预设标识信息生成设定透明度的水印图片,将设定透明度的水印图片嵌入多媒体展示文件。
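Optionally, the multimedia template includes multimedia templates of at least one presentation style; the template acquisition module is configured to acquire, when the preset subjects for which multimedia presentation files are to be generated are of a preset number, multimedia templates of two or more presentation styles from the multimedia templates corresponding to the preset subjects.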
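Exemplary Electronic Device
An electronic device according to an embodiment of the present disclosure is described below with reference to Fig. 11. The electronic device may be either or both of a first device 100 and a second device 200, or a stand-alone device independent of them; the stand-alone device can communicate with the first device and the second device to receive the collected input signals from them.
Fig. 11 illustrates a block diagram of an electronic device according to an embodiment of the present disclosure.
As shown in Fig. 11, the electronic device 110 includes one or more processors 111 and a memory 112.
The processor 111 may be a central processing unit (CPU) or another form of processing unit with data processing capability and/or instruction execution capability, and can control other components in the electronic device 110 to perform desired functions.
The memory 112 may include one or more computer program products, which may include various forms of computer-readable storage media, such as volatile memory and/or non-volatile memory. The volatile memory may include, for example, random access memory (RAM) and/or cache memory. The non-volatile memory may include, for example, read-only memory (ROM), a hard disk, and flash memory. One or more computer program instructions can be stored on the computer-readable storage medium, and the processor 111 can run the program instructions to implement the multimedia file generation method of the embodiments of the present disclosure described above and/or other desired functions. Various contents such as input signals, signal components, and noise components can also be stored in the computer-readable storage medium.
In one example, the electronic device 110 may further include an input device 113 and an output device 114, and these components are interconnected through a bus system and/or another form of connection mechanism (not shown).
For example, when the electronic device is the first device 100 or the second device 200, the input device 113 may be the microphone or microphone array mentioned above, used to capture the input signal of a sound source. When the electronic device is a stand-alone device, the input device 113 may be a communication network connector, used to receive the collected input signals from the first device 100 and the second device 200.
In addition, the input device 113 may also include, for example, a keyboard and a mouse.
The output device 114 can output various information to the outside, including the determined distance information, direction information, and so on. The output device 114 may include, for example, a display, a speaker, a printer, and a communication network and the remote output devices connected to it.
Of course, for simplicity, Fig. 11 shows only some of the components of the electronic device 110 that are relevant to the present disclosure and omits components such as buses and input/output interfaces. In addition, the electronic device 110 may include any other appropriate components depending on the specific application.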
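Exemplary Computer Program Product and Computer-Readable Storage Medium
In addition to the above methods and devices, an embodiment of the present disclosure may also be a computer program product that includes computer program instructions which, when run by a processor, cause the processor to perform the steps of the multimedia file generation method according to the various embodiments of the present disclosure described in the "Exemplary Method" section of this specification.
The computer program product may contain program code for performing the operations of the embodiments of the present disclosure, written in any combination of one or more programming languages, including object-oriented programming languages such as Java and C++ as well as conventional procedural programming languages such as the "C" language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on a remote computing device or server.
In addition, an embodiment of the present disclosure may also be a computer-readable storage medium storing computer program instructions which, when run by a processor, cause the processor to perform the steps of the multimedia file generation method according to the various embodiments of the present disclosure described in the "Exemplary Method" section of this specification.
The computer-readable storage medium may be any combination of one or more readable media. A readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may include, for example, but is not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples (a non-exhaustive list) of readable storage media include: an electrical connection with one or more wires, a portable disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above.
The basic principles of the present disclosure have been described above in connection with specific embodiments. It should be pointed out, however, that the advantages, benefits, and effects mentioned in the present disclosure are only examples and not limitations, and they should not be regarded as required by every embodiment of the present disclosure. In addition, the specific details disclosed above are only for the purposes of illustration and ease of understanding, not limitation; they do not restrict the present disclosure to being implemented with those specific details.
The embodiments in this specification are described progressively. Each embodiment focuses on its differences from the other embodiments, and for the identical or similar parts the embodiments may be referred to one another. Because the system embodiments basically correspond to the method embodiments, they are described relatively briefly; for the relevant details, refer to the description of the method embodiments.
The block diagrams of the components, apparatuses, devices, and systems involved in the present disclosure are only illustrative examples and are not intended to require or imply that they must be connected, arranged, or configured in the manner shown. As those skilled in the art will recognize, these components, apparatuses, devices, and systems may be connected, arranged, and configured in any manner. Words such as "include", "comprise", and "have" are open-ended terms that mean "including but not limited to" and can be used interchangeably with it. The words "or" and "and" as used here mean "and/or" and can be used interchangeably with it, unless the context clearly indicates otherwise. The phrase "such as" as used here means "such as but not limited to" and can be used interchangeably with it.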
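The methods and apparatuses of the present disclosure may be implemented in many ways, for example through software, hardware, firmware, or any combination of software, hardware, and firmware. The above order of the steps of the method is for illustration only, and the steps of the method of the present disclosure are not limited to the order specifically described above unless otherwise stated. Furthermore, in some embodiments, the present disclosure can also be implemented as programs recorded on a recording medium, these programs including machine-readable instructions for implementing the method according to the present disclosure. The present disclosure thus also covers a recording medium storing a program for executing the method according to the present disclosure.
The description of the present disclosure is given for the purposes of illustration and description and is not intended to be exhaustive or to limit the present disclosure to the form disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art. The embodiments were chosen and described to better explain the principles and practical application of the present disclosure, and to enable those of ordinary skill in the art to understand the present disclosure and design various embodiments with various modifications suited to particular uses.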

Claims (22)

  1. 一种多媒体文件生成方法,其特征在于,包括:
    获取与预设主体对应的多媒体模板;其中,所述多媒体模板中包括至少一个可编辑模块;
    获取与所述可编辑模块对应的所述预设主体的主体展示数据,并将所述主体展示数据填充至所述多媒体模板中对应的可编辑模块;
    根据填充了所述主体展示数据的多媒体模板,生成与所述预设主体对应的多媒体展示文件。
  2. 根据权利要求1所述的方法,其特征在于,所述获取预设主体对应的多媒体模板,包括:
    获取所述预设主体的维度属性集合;其中,所述维度属性集合包括至少一个维度属性;
    根据所述预设主体的维度属性集合确定所述预设主体对应的多媒体模板。
  3. 根据权利要求1或2所述的方法,其特征在于,所述获取预设主体对应的多媒体模板之前,还包括:
    统计数据库中包括的多个主体对应的多个维度属性;
    将所述多个维度属性分类到至少一个预设的维度属性集合中,并确定所述预设的维度属性集合对应的多媒体模板。
  4. 根据权利要求3所述的方法,其特征在于,将所述多个维度属性分类到至少一个预设的维度属性集合中,包括:
    预设所述多个维度属性信息中每个维度属性对应的优先级;
    将所述多个维度属性按照降序排序,得到维度属性序列;
    根据所述维度属性序列,将所述多个维度属性分类到至少一个预设的维度属性集合中,并确定所述预设的维度属性集合对应的多媒体模板。
  5. 根据权利要求1-4任一所述的方法,其特征在于,所述获取与所述可编辑模块对应的所述预设主体的主体展示数据,包括:
    确定所述可编辑模块对应的至少一个维度属性;
    根据所述预设主体在数据库中查找与可编辑模块对应的维度属性匹配的主体展示数据。
  6. 根据权利要求1-4任一所述的方法,其特征在于,所述多媒体模板包括多种形式的多媒体模板,所述多种多媒体模板对应不同的展示方式;
    获取与所述可编辑模块对应的所述预设主体的主体展示数据,并将所述主体展示数据填充至所述多媒体模板中对应的可编辑模块,包括:
    根据一种形式的多媒体模板获取与所述可编辑模块对应的所述预设主体的主体展示数据,并将所述主体展示数据填充至所述一种形式的多媒体模板中对应的可编辑模块;
    获取填充了所述主体展示数据的一种形式的多媒体模板中可编辑模块的主体展示数据,并将所述获取的主体展示数据填充至所述多种多媒体模板中的其他形式的多媒体模板中对应的可编辑模块。
  7. 根据权利要求6所述的方法,其特征在于,所述根据填充了所述主体展示数据的多媒体模板,生成与所述预设主体对应的多媒体展示文件,包括:
    合并所述填充了所述主体展示数据的多种形式的多媒体模板,生成与所述预设主体对应的多媒体展示文件。
  8. 根据权利要求1-7任一所述的方法,其特征在于,所述多媒体模板包括视频模板;
    所述将所述主体展示数据填充至所述多媒体模板中对应的可编辑模块,包括:
    基于所述视频模板中预设的像素或坐标确定可编辑模块的位置;
    将所述主体展示数据填充至确定位置的可编辑模块。
  9. 根据权利要求8所述的方法,其特征在于,所述基于所述视频模板中像素或坐标确定可编辑模块的位置,包括:
    根据设定像素值在所述视频模板包括的至少一帧视频图像中查找所述设定像素值对应的至少一个坐标点,基于所述至少一个坐标点确定所述可编辑模块的位置;和/或,
    根据设定像素值对应的范围在视频模板包括的至少一帧视频图像中查找设定像素值对应的范围内的最大坐标点及最小坐标点,基于最大坐标点及最小坐标点确定可编辑模块的位置;和/或,
    根据设定坐标范围在所述视频模板包括的至少一帧视频图像中确定所述可编辑模块的位置。
  10. 根据权利要求8或9所述的方法,其特征在于,所述主体展示数据包括:文本和/或图片;
    所述将所述主体展示数据填充至所述确定位置的可编辑模块,包括:
    根据所述可编辑模块的位置确定所述可编辑模块的大小;
    基于所述可编辑模块的大小对所述文本和/或图片的大小进行调整;
    将所述调整后大小与所述可编辑模块相匹配的文本和/或图片填充到所述确定位置的可编辑模块。
  11. 根据权利要求10所述的方法,其特征在于,在将所述调整后大小与所述可编辑模块相匹配的文本和/或图片填充到所述确定位置的可编辑模块之前,还包括:
    响应于所述调整后的文本和/或图片的大小与所述可编辑模块的大小不匹配,对所述文本进行换行处理或提取关键词,和/或,对所述图片进行缩放处理。
  12. 根据权利要求8-11任一所述的方法,其特征在于,所述主体展示数据包括:视频;
    所述将所述主体展示数据填充至所述确定位置的可编辑模块,包括:
    根据所述可编辑模块的位置确定所述可编辑模块的大小;
    基于所述可编辑模块的大小对所述视频的显示大小进行调整;
    将所述调整后与所述可编辑模块相匹配的视频填充到所述确定位置的可编辑模块,并将所述视频对应的开始时间和结束时间写入所述多媒体展示文件。
  13. 根据权利要求1-12任一所述的方法,其特征在于,所述多媒体模板包括文本模板;
    所述将所述主体展示数据填充至所述多媒体模板中对应的可编辑模块,包括:
    提取所述主体展示数据中对应至少一个维度属性的属性值;
    将所述属性值按照对应的维度属性填充至至少一个文本模板中对应的可编辑模块;
    所述根据填充了所述主体展示数据的多媒体模板,生成与所述预设主体对应的多媒体展示文件,包括:
    确定所述至少一个文本模板中每个所述文本模板对应的优先级,根据所述优先级从高到低对所述至少一个文本模板进行排序;
    按照所述排序连接所述填充了主体展示数据的至少一个文本模板,得到所述预设主体对应的多媒体展示文件。
  14. 根据权利要求13所述的方法,其特征在于,所述多媒体展示文件还包括音频展示文件;
    所述生成与所述预设主体对应的多媒体展示文件,包括:
    对所述文本模板确定的所述多媒体展示文件按照维度属性进行分段,得到多个具有一定顺序的段落文本;
    从多个预设音色中确定一个音色,基于所述音色分别对所述多个段落文本进行音频转换处理,得到多个段落音频;
    按照所述多个段落文本之间的顺序连接所述多个段落音频,得到所述音频编辑文件。
  15. 根据权利要求1-13任一所述的方法,其特征在于,所述多媒体模板包括音频模板;
    所述将所述主体展示数据填充至所述多媒体模板中对应的可编辑模块,包括:
    提取所述主体展示数据中对应至少一个维度属性的属性值;
    从多个预设音色中确定一个音色,基于所述音色将所述属性值进行音频转换,得到至少一段音频;
    将所述至少一段音频填充至所述音频模板中对应的可编辑模块。
  16. 根据权利要求1-15任一所述的方法,其特征在于,所述多媒体展示文件包括:音频展示文件和视频展示文件;
    所述方法还包括:
    按照所述音频展示文件的时长对应所述视频展示文件的播放速度进行调整;
    合成调整后的所述视频展示文件与所述音频展示文件,得到所述预设主体对应的展示文件。
  17. 根据权利要求16所述的方法,其特征在于,所述多媒体展示文件还包括:文本展示文件;
    所述合成调整后的所述视频展示文件与所述音频展示文件之后,还包括:
    基于所述文本展示文件生成字幕文件;
    将所述字幕文件嵌入所述合成后的文件中,得到所述预设主体对应的展示文件。
  18. 根据权利要求1-17任一所述的方法,其特征在于,所述方法还包括:
    将预设标识信息生成设定透明度的水印图片,将所述设定透明度的水印图片嵌入所述多媒体展示文件。
  19. 根据权利要求1-18任一所述的方法,其特征在于,所述多媒体模板包括至少一种展示风格的多媒体模板;获取与预设主体对应的多媒体模板包括:
    当待生成多媒体展示文件的预设主体为预设数量时,从与预设主体对应的多媒体模板中获取两种或两种以上展示风格的多媒体模板。
  20. 一种多媒体文件生成装置,其特征在于,包括:
    模板获取模块,用于获取与预设主体对应的多媒体模板;其中,所述多媒体模板中包括至少一个可编辑模块;
    信息填充模块,用于获取与所述可编辑模块对应的所述预设主体的主体展示数据,并将所述主体展示数据填充至所述多媒体模板中对应的可编辑模块;
    信息展示模块,用于根据填充了所述主体展示数据的多媒体模板,生成与所述预设主体对应的多媒体展示文件。
  21. 一种计算机可读存储介质,其特征在于,所述存储介质存储有计算机程序,所述计算机程序用于执行上述权利要求1-19任一所述的多媒体文件生成方法。
  22. 一种电子设备,其特征在于,所述电子设备包括:
    处理器;
    用于存储所述处理器可执行指令的存储器;
    所述处理器,用于从所述存储器中读取所述可执行指令,并执行所述指令以实现上述权利要求1-19任一所述的多媒体文件生成方法。
PCT/CN2020/084738 2020-03-30 2020-04-14 多媒体文件生成方法和装置、存储介质、电子设备 WO2021196281A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010238531.X 2020-03-30
CN202010238531.XA CN111460183B (zh) 2020-03-30 2020-03-30 多媒体文件生成方法和装置、存储介质、电子设备

Publications (1)

Publication Number Publication Date
WO2021196281A1 true WO2021196281A1 (zh) 2021-10-07

Family

ID=71681760

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/084738 WO2021196281A1 (zh) 2020-03-30 2020-04-14 多媒体文件生成方法和装置、存储介质、电子设备

Country Status (2)

Country Link
CN (1) CN111460183B (zh)
WO (1) WO2021196281A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114286181A (zh) * 2021-10-25 2022-04-05 腾讯科技(深圳)有限公司 一种视频优化方法、装置、电子设备和存储介质
CN117370584A (zh) * 2023-12-08 2024-01-09 中国信息通信研究院 多媒体数据深度合成方法和系统

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112073649B (zh) * 2020-09-04 2022-12-13 北京字节跳动网络技术有限公司 多媒体数据的处理方法、生成方法及相关设备
CN112291635B (zh) * 2020-10-28 2022-07-15 北京金堤科技有限公司 用于生成多媒体文件的方法、装置、电子设备及存储介质
CN112616085B (zh) * 2020-12-09 2023-05-26 四川金熊猫新媒体有限公司 基于iptv动态模板组合的epg呈现解决方法和装置
CN112634426B (zh) * 2020-12-17 2023-09-29 深圳万兴软件有限公司 多媒体数据显示的方法、电子设备及计算机存储介质
CN112561988A (zh) * 2020-12-22 2021-03-26 咪咕文化科技有限公司 多媒体资源的定位方法、电子设备及可读存储介质
CN112584061B (zh) * 2020-12-24 2023-08-01 咪咕文化科技有限公司 多媒体通用模板生成方法、电子设备及存储介质
CN113065007A (zh) * 2021-03-22 2021-07-02 平安银行股份有限公司 多媒体文件生成方法、装置、设备及存储介质
CN115269889A (zh) * 2021-04-30 2022-11-01 北京字跳网络技术有限公司 剪辑模板搜索方法及装置
CN113204657B (zh) * 2021-05-19 2024-02-06 广州九舞数字科技有限公司 一种集成组合式智能展示系统
CN113626632B (zh) * 2021-07-30 2023-10-31 北京达佳互联信息技术有限公司 影集素材的显示方法、装置及电子设备
CN113778419B (zh) * 2021-08-09 2023-06-02 北京有竹居网络技术有限公司 多媒体数据的生成方法、装置、可读介质及电子设备
CN114238689A (zh) 2021-12-17 2022-03-25 北京百度网讯科技有限公司 视频生成方法、装置、电子设备、存储介质和程序产品
CN117082292A (zh) * 2022-05-10 2023-11-17 北京字跳网络技术有限公司 视频生成方法、装置、设备、存储介质和程序产品
CN116150413A (zh) * 2023-02-07 2023-05-23 北京达佳互联信息技术有限公司 多媒体资源的展示方法及装置

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004097599A2 (en) * 2003-04-28 2004-11-11 Sony Pictures Entertainment Inc. Rich media publishing
CN108965737A (zh) * 2017-05-22 2018-12-07 腾讯科技(深圳)有限公司 媒体数据处理方法、装置及存储介质
CN109684565A (zh) * 2018-12-11 2019-04-26 北京字节跳动网络技术有限公司 网页关联视频的生成及展示方法、装置、系统及电子设备
CN110781418A (zh) * 2018-07-30 2020-02-11 上海哔哩哔哩科技有限公司 基于url识别的网页文本编辑方法、装置和存储介质

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100340169B1 (ko) * 1999-09-27 2002-06-10 서성철 자바를 이용한 동적 멀티미디어 웹 카타로깅 시스템 및 그 방법
TWI282926B (en) * 2005-10-06 2007-06-21 Fashionow Co Ltd Template-based multimedia editor and editing method thereof
US7813724B2 (en) * 2006-03-17 2010-10-12 Comverse Ltd. System and method for multimedia-to-video conversion to enhance real-time mobile video services
CN103986980B (zh) * 2014-05-30 2017-06-13 中国传媒大学 一种超媒体编辑制作方法及系统
CN105447016B (zh) * 2014-08-18 2018-09-14 北大方正集团有限公司 一种组件的快速搜索及重用的办法
US10446188B2 (en) * 2015-12-10 2019-10-15 Cine Design Group Llc Method and apparatus for low latency non-linear media editing using file-based inserts into finalized digital multimedia files
CN107241646B (zh) * 2017-07-12 2020-08-14 北京奇虎科技有限公司 多媒体视频的编辑方法及装置
CN110475157A (zh) * 2019-07-19 2019-11-19 平安科技(深圳)有限公司 多媒体信息展示方法、装置、计算机设备及存储介质
CN110826080B (zh) * 2019-09-18 2024-03-08 平安科技(深圳)有限公司 多媒体文件生成方法、装置、设备及计算机可读存储介质

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114286181A (zh) * 2021-10-25 2022-04-05 腾讯科技(深圳)有限公司 一种视频优化方法、装置、电子设备和存储介质
CN114286181B (zh) * 2021-10-25 2023-08-15 腾讯科技(深圳)有限公司 一种视频优化方法、装置、电子设备和存储介质
CN117370584A (zh) * 2023-12-08 2024-01-09 中国信息通信研究院 多媒体数据深度合成方法和系统

Also Published As

Publication number Publication date
CN111460183B (zh) 2024-02-13
CN111460183A (zh) 2020-07-28

Similar Documents

Publication Publication Date Title
WO2021196281A1 (zh) 多媒体文件生成方法和装置、存储介质、电子设备
US10621988B2 (en) System and method for speech to text translation using cores of a natural liquid architecture system
CN111415399B (zh) 图像处理方法、装置、电子设备及计算机可读存储介质
CN104679902B (zh) 一种结合跨媒体融合的信息摘要提取方法
US8719029B2 (en) File format, server, viewer device for digital comic, digital comic generation device
EP1980960A2 (en) Methods and apparatuses for converting electronic content descriptions
KR20200109239A (ko) 이미지를 처리하는 방법, 장치, 서버 및 저장 매체
WO2022089170A1 (zh) 字幕区域识别方法、装置、设备及存储介质
CN106161873A (zh) 一种视频信息提取推送方法及系统
US20060090123A1 (en) System and method for acquisition and storage of presentations
CN109558513A (zh) 一种内容推荐方法、装置、终端及存储介质
JP2020005309A (ja) 動画編集サーバおよびプログラム
US20180151178A1 (en) Interactive question-answering apparatus and method thereof
US9940326B2 (en) System and method for speech to speech translation using cores of a natural liquid architecture system
US20140161423A1 (en) Message composition of media portions in association with image content
JP6730757B2 (ja) サーバおよびプログラム、動画配信システム
WO2019245033A1 (ja) 動画編集サーバおよびプログラム
JP6730760B2 (ja) サーバおよびプログラム、動画配信システム
US20040205655A1 (en) Method and system for producing a book from a video source
CN106162328A (zh) 一种视频同步信息展示方法及系统
JP6603929B1 (ja) 動画編集サーバおよびプログラム
US20220301285A1 (en) Processing picture-text data
KR102281298B1 (ko) 인공지능 기반 동영상 합성을 위한 시스템 및 방법
US20140297678A1 (en) Method for searching and sorting digital data
JP2020108162A (ja) サーバおよびプログラム

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20928789

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20928789

Country of ref document: EP

Kind code of ref document: A1