CN114285817A

CN114285817A - Method, device, medium and program product for generating video

Info

Publication number: CN114285817A
Application number: CN202111678905.0A
Authority: CN
Inventors: 黄冬冬; 陈大年
Original assignee: Shanghai Zhangmen Science and Technology Co Ltd
Current assignee: Shanghai Zhangmen Science and Technology Co Ltd
Priority date: 2021-12-31
Filing date: 2021-12-31
Publication date: 2022-04-05

Abstract

An object of the present application is to provide a method, apparatus, medium, and program product for generating a video, the method including: responding to a video generation triggering operation executed by a first user for a first mail to be sent, and generating video information corresponding to the first mail according to the text information and first attachment information of the first mail; and adding the video information into the text information of the first mail. According to the method and the device, the sender can synthesize the text and the attachment in the mail into the video and then send the mail comprising the video, so that the receiver can know the mail content more intuitively and comprehensively by watching the video without browsing the text and the attachment of the mail respectively, the time cost for checking the mail of the receiver can be saved, and great convenience is provided for the receiver.

Description

Method, device, medium and program product for generating video

Technical Field

The present application relates to the field of communications, and more particularly, to a technique for generating video.

Background

With the rapid development of the internet and the application of office automation in daily work, electronic mails have become essential tools for people to work daily, and carrying attachments and writing mail texts are one of the important functions of electronic mails. In the prior art, a sender needs to input a mail text and select an attachment on a mail editing interface and then click to send a mail, and a receiver needs to browse the mail text and the attachment respectively.

Disclosure of Invention

It is an object of the present application to provide a method, apparatus, medium, and program product for generating a video.

According to an aspect of the present application, there is provided a method for generating a video, the method comprising:

responding to a video generation triggering operation executed by a first user for a first mail to be sent, and generating video information corresponding to the first mail according to the text information and first attachment information of the first mail;

adding the video information to the first mail.

According to an aspect of the present application, there is provided a first user equipment for generating a video, the apparatus comprising:

the system comprises a one-to-one module, a first mail sending module and a second mail sending module, wherein the one-to-one module is used for responding to a video generation triggering operation executed by a first user aiming at a first mail to be sent, and generating video information corresponding to the first mail according to the text information and first attachment information of the first mail;

a second module for adding the video information to the first email.

According to an aspect of the present application, there is provided a computer device for generating video, comprising a memory, a processor and a computer program stored on the memory, wherein the processor executes the computer program to implement the operations of any of the methods as described above.

According to an aspect of the application, there is provided a computer-readable storage medium having a computer program stored thereon, wherein the computer program, when executed by a processor, performs the operations of any of the methods described above.

According to an aspect of the application, a computer program product is provided, comprising a computer program which, when executed by a processor, carries out the steps of any of the methods as described above.

Compared with the prior art, the method and the device have the advantages that the video generation triggering operation executed by the first user for the first mail to be sent can be responded, the video information corresponding to the first mail is generated according to the text information and the first attachment information of the first mail, and the video information is added into the first mail, so that the sending party can synthesize the text and the attachment in the mail into the video and then send the mail comprising the video, the receiving party can know the mail content more intuitively and comprehensively by watching the video without browsing the text and the attachment of the mail respectively, the time cost for viewing the mail of the receiving party can be saved, and great convenience is provided for the receiving party.

Drawings

Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:

FIG. 1 shows a flow diagram of a method for generating video according to one embodiment of the present application;

FIG. 2 illustrates a first user equipment structure diagram for generating a video according to one embodiment of the present application;

FIG. 3 shows a presentation schematic for generating a video according to an embodiment of the present application;

FIG. 4 illustrates an exemplary system that can be used to implement the various embodiments described in this application.

The same or similar reference numbers in the drawings identify the same or similar elements.

Detailed Description

The present application is described in further detail below with reference to the attached figures.

In a typical configuration of the present application, the terminal, the device serving the network, and the trusted party each include one or more processors (e.g., Central Processing Units (CPUs)), input/output interfaces, network interfaces, and memory.

The Memory may include forms of volatile Memory, Random Access Memory (RAM), and/or non-volatile Memory in a computer-readable medium, such as Read Only Memory (ROM) or Flash Memory. Memory is an example of a computer-readable medium.

Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, Phase-Change Memory (PCM), Programmable Random Access Memory (PRAM), Static Random-Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), electrically Erasable Programmable Read-Only Memory (EEPROM), flash Memory or other Memory technology, Compact Disc Read-Only Memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other non-transmission medium, which can be used to store information that can be accessed by a computing device.

The device referred to in the present application includes, but is not limited to, a terminal, a network device, or a device formed by integrating a terminal and a network device through a network. The terminal includes, but is not limited to, any mobile electronic product, such as a smart phone, a tablet computer, etc., capable of performing human-computer interaction with a user (e.g., human-computer interaction through a touch panel), and the mobile electronic product may employ any operating system, such as an Android operating system, an iOS operating system, etc. The network Device includes an electronic Device capable of automatically performing numerical calculation and information processing according to a preset or stored instruction, and the hardware includes, but is not limited to, a microprocessor, an Application Specific Integrated Circuit (ASIC), a Programmable Logic Device (PLD), a Field Programmable Gate Array (FPGA), a Digital Signal Processor (DSP), an embedded Device, and the like. The network device includes but is not limited to a computer, a network host, a single network server, a plurality of network server sets or a cloud of a plurality of servers; here, the Cloud is composed of a large number of computers or web servers based on Cloud Computing (Cloud Computing), which is a kind of distributed Computing, one virtual supercomputer consisting of a collection of loosely coupled computers. Including, but not limited to, the internet, a wide area network, a metropolitan area network, a local area network, a VPN network, a wireless Ad Hoc network (Ad Hoc network), etc. Preferably, the device may also be a program running on the terminal, the network device, or a device formed by integrating the terminal and the network device, the touch terminal, or the network device and the touch terminal through a network.

Of course, those skilled in the art will appreciate that the foregoing is by way of example only, and that other existing or future devices, which may be suitable for use in the present application, are also encompassed within the scope of the present application and are hereby incorporated by reference.

In the description of the present application, "a plurality" means two or more unless specifically limited otherwise.

Fig. 1 shows a flowchart of a method for generating a video according to an embodiment of the present application, the method comprising step S11 and step S12. In step S11, in response to a video generation triggering operation executed by a first user for a first mail to be sent, a first user device generates video information corresponding to the first mail according to text information and first attachment information of the first mail; in step S12, the first user device adds the video information to the first mail.

In step S11, in response to a video generation trigger operation performed by the first user for the first mail to be sent, the first user equipment generates video information corresponding to the first mail according to the body information and the first attachment information of the first mail. In some embodiments, the first user performs a video generation triggering operation on a mail editing page in a mail application, including but not limited to any application for sending and receiving mails, on the first mail to be sent, where the first application may be a separately installed application program, or may also be a web page, or may also be an applet. In some embodiments, the video generation trigger operation may be a mail preview operation, e.g., the first user clicks a "preview" button on a mail edit page, or the video generation trigger operation may also be a mail send operation, e.g., the first user clicks a "send" button on a mail edit page. In some embodiments, the first email may have one or more first attachment information, the type of the first attachment information including, but not limited to, document type, picture type, audio type, video type, and the like. In some embodiments, the body information of the first mail needs to be preprocessed, for example, the signature information and the reference information in the body information are removed. In some embodiments, if the first attachment information is a video, the text information of the first email may be used as the audio information or the subtitle information of the video to synthesize a new video, and the new video may be used as the video information corresponding to the first email. In some embodiments, if the first attachment information is an audio, the text information of the first email may be split into one or more ordered text information, each text information is captured to generate one or more ordered video frame information, then a video is generated according to the one or more video frame information and the audio, and the video is used as the video information corresponding to the first email. In some embodiments, if the first attachment information is a document or a picture, one or more pieces of video frame information may be generated in order according to the first attachment information, where each piece of video frame information corresponds to one picture, or each piece of video frame information corresponds to one page in one document, then audio information or subtitle information may be generated according to the text information of the first email, then a video may be generated according to the one or more pieces of video frame information and the audio information or the subtitle information, and the video may be used as the video information corresponding to the first email. As an example, as shown in fig. 3, when a first user clicks a button at the upper right corner of a mail editing page, a menu pops up on the mail editing page, and three buttons of "composite video send", "discard", and "cancel" are presented in the menu, when the first user clicks the "composite video send" button, video information corresponding to a first mail is generated according to text information and first attachment information of the first mail, and the video information is used as second attachment information of the first mail, or the video information is added to text information of the first mail and the first mail is directly sent, when the first user clicks the "discard" button, the video information corresponding to the previously generated first mail is deleted, and when the first user clicks the "cancel" button, the presentation of the menu on the mail editing page is cancelled.

In step S12, the first user device adds the video information to the first mail. In some embodiments, the video information may be added to the first mail as second attachment information of the first mail. In some embodiments, the video information may also be added to the body information of the first mail. According to the method and the device, the sender can synthesize the text and the attachment in the mail into the video and then send the mail comprising the video, so that the receiver can know the mail content more intuitively and comprehensively by watching the video without browsing the text and the attachment of the mail respectively, the time cost for checking the mail of the receiver can be saved, and great convenience is provided for the receiver.

In some embodiments, the step S12 includes any one of: adding the video information to the first mail as second attachment information of the first mail; and adding the video information into the text information of the first mail. In some embodiments, if the first email is sent by the first user to the second user, the presentation mode of the second attachment information on the email browsing page of the second user is different from the presentation mode of the first attachment information, so that the second user can distinguish the special second attachment information from the common first attachment information. In some embodiments, the video information may also be added directly to the text information of the first email, for example, to a front or top of the original text information of the first email, so that the second user may notice the video information more easily.

In some embodiments, the video generation triggering operation comprises any one of: e, mail preview operation; and E, mail sending operation. In some embodiments, the video generation triggering operation may be a mail preview operation, for example, when the first user clicks a "preview" button on the mail editing page, video information corresponding to the first mail is generated and added to the first mail, or the video generation triggering operation may also be a mail sending operation, for example, when the first user clicks a "send" button on the mail editing page, when the video information corresponding to the first mail is generated and added to the first mail, the first mail is directly sent.

In some embodiments, the generating video information corresponding to the first email according to the body information and the first attachment information of the first email includes: generating video frame sequence information according to the first attachment information of the first mail, wherein the video frame sequence information comprises one or more ordered video frame information; and generating video information corresponding to the first mail according to the video frame sequence information and the text information of the first mail. In some embodiments, if the first attached information is document information or picture information, video frame sequence information is generated according to the first attached information, and the video frame sequence information includes one or more pieces of video frame information in order, where each piece of video frame information corresponds to one picture, or each piece of video frame information corresponds to one page in one document. In some embodiments, audio information or subtitle information is generated according to the text information of the first email, and then a video is generated according to the video frame sequence information and the audio information or the subtitle information, and the video is used as the video information corresponding to the first email.

In some embodiments, the first attachment information is first document information; wherein the generating of the video frame sequence information according to the first attachment information of the first mail comprises: and generating video frame sequence information by capturing one or more pages in the first document information. In some embodiments, if the first attachment information is first document information (e.g., a word file, a ppt file, a txt file, etc.), the ordered one or more video frame information is generated by respectively capturing each page in the first document information, or by capturing partial pages in the first document information, instead of capturing all pages, wherein each video frame information corresponds to one captured image.

In some embodiments, the first document information includes at least one page; wherein the generating of the video frame sequence information by capturing one or more pages of the first document information further comprises: one or more pages are determined from the at least one page. In some embodiments, the screenshot is performed on a part of pages in the first document information, but not all pages, and one or more pages to be screenshot need to be determined from at least one page included in the first document information, where the one or more pages to be screenshot may be extracted from the at least one page at a predetermined extraction interval, or may be determined from the at least one page by performing content recognition on the first document information.

In some embodiments, said determining one or more pages from said at least one page comprises: one or more pages are extracted from the at least one page at predetermined extraction intervals. In some embodiments, the extraction interval may be set by the first user equipment by default, or may be specified by a server corresponding to the mail application, or may be determined by the first user equipment according to the page number of the first document information, and if the first mail includes a plurality of first attached information, that is, the first mail includes a plurality of first document information, the first user equipment determines the extraction interval according to the total page number of the plurality of first document information, for example, the extraction interval is proportional to the page number of the first document information, and the extraction interval is larger as the page number of the first document information is larger, for example, the extraction interval is equal to the page number/predetermined coefficient of the first document information.

In some embodiments, said determining one or more pages from said at least one page comprises: determining one or more pages from the at least one page by content recognition of the first document information. In some embodiments, one or more pages in the first document information whose importance or criticality of the corresponding content satisfies a predetermined condition are identified by performing content identification on the first document information, or one or more pages in the first document information whose matching degree between the corresponding content and the file name of the first document information is greater than or equal to a predetermined matching degree threshold value are identified.

In some embodiments, said generating a sequence of video frames by screenshot one or more pages in said first document information comprises: and opening the first document information in the background of the first user equipment, and generating video frame sequence information by screenshot on one or more pages in the opened first document information. In some embodiments, the first document information is opened in the background of the first user device, but not in the foreground of the first user device, that is, the opening process of the first document information is invisible to the first user, one or more corresponding screenshots are obtained by screenshot one or more pages in the opened first document information, and the video frame sequence information is generated.

In some embodiments, said generating a sequence of video frames by screenshot one or more pages in said first document information comprises: calling an interface provided by a third-party document application on the first user equipment, wherein the input of the interface is the first document information, and the output of the interface is screenshot information corresponding to one or more pages in the first document information; and generating video frame sequence information according to the screenshot information. In some embodiments, a third-party document application is already present on the first user equipment, and the mail application transfers the first document information into an interface provided by the third-party document application by calling the interface, obtains one or more screenshots corresponding to one or more pages in the first document information returned by the interface, and generates video frame sequence information.

In some embodiments, the generating video information corresponding to the first email according to the video frame sequence information and the body information of the first email includes: splitting text information of the first mail to generate at least one subtitle information; and generating video information corresponding to the first mail according to the video frame sequence information and the at least one piece of subtitle information, wherein each piece of subtitle information corresponds to at least one piece of video frame information. In some embodiments, if the body information of the first e-mail is short, the entire body information may be displayed as subtitle information on each video frame information, or on some or some of the video frame information, for example, on the first video frame information or on the last video frame information. In some embodiments, if the text information of the first mail is long, the text information needs to be split into at least one ordered text information, and at least one ordered subtitle information is generated, where each subtitle information corresponds to one text information. In some embodiments, the amount of the subtitle information is the same as the amount of the video frame information, and each video frame information corresponds to one subtitle information in the respective order, i.e., display the subtitle information corresponding to each video frame information, on each video frame information, in some embodiments, the amount of the caption information is smaller than the amount of the video frame information, each of the caption information corresponds to at least one of the video frame information in the respective order, i.e. the subtitle information may be present in each of the at least one video frame information, alternatively, the subtitle information may be presented on one or some of the at least one video frame information, and the subtitle information is not presented over the other remaining video frame information of the at least one video frame information, e.g., the subtitle information is presented on the foremost or rearmost video frame information among the at least one video frame information. In some embodiments, the body information of the first mail may be split according to a predetermined interval symbol (e.g., punctuation), for example, one subtitle information per sentence of body, or one subtitle information per paragraph of body, or the body information of the first mail may be split according to a predetermined line number interval or word number interval, for example, one subtitle information per M lines of body, or one subtitle information per N words of body.

In some embodiments, the method further comprises: and the first user equipment determines the number of video frames corresponding to each subtitle information by performing content identification on each subtitle information. In some embodiments, each subtitle information corresponds to at least one piece of video frame information, and the number of video frames corresponding to each subtitle information may be the same or different. In some embodiments, content identification is performed on each subtitle information, and the number of video frames corresponding to the subtitle information may be determined according to the number of characters in the identified subtitle information, for example, the number of video frames is proportional to the number of characters, and the greater the number of characters, the greater the number of corresponding video frames. In some embodiments, for content identification of each subtitle information, the number of video frames corresponding to the subtitle information may be determined according to the content importance level or the content criticality of the identified subtitle information, for example, the number of video frames is proportional to the content importance level or the content criticality, and the higher the content importance level or the content criticality is, the greater the number of corresponding video frames is.

In some embodiments, the generating video information corresponding to the first email according to the video frame sequence information and the body information of the first email includes: generating audio information according to the text information of the first mail; and generating video information corresponding to the first mail according to the video frame sequence information and the audio information. In some embodiments, audio information is generated according to the body information of the first mail, a video is generated according to the video frame sequence information and the audio information, and the video is taken as the video information corresponding to the first mail. In some embodiments, the audio information may be played starting from the first video frame information in the sequence of video frames. In some embodiments, the audio information may be played only once, or may be played in a loop. In some embodiments, the speed of the audio information may be set by default, or may be determined according to the text information of the first mail, for example, the speed of the audio information is proportional to the number of words of the text information, and the higher the number of words of the text information is, the faster the corresponding speed of the audio information is.

In some embodiments, the generating audio information according to the body information of the first mail includes: splitting the text message of the first mail to generate at least one text message; generating at least one audio message according to the at least one text message; wherein, the generating the video information corresponding to the first mail according to the video frame sequence information and the audio information includes: and generating video information corresponding to the first mail according to the video frame sequence information and the at least one piece of audio information, wherein each piece of audio information corresponds to at least one piece of video frame information. In some embodiments, the text information may be split into at least one ordered text information, and according to the at least one text information, at least one ordered audio information is generated, and then according to the video frame sequence information and the at least one audio information, a video is generated, and the video is used as the video information corresponding to the first mail. In some embodiments, one or more video information and the at least one audio information in the video frame sequence information each correspond to at least one video frame information in a respective order, and for a certain audio information, the audio information starts playing or starts playing in a loop on a first video frame information of the at least one video frame information corresponding to the audio information until the audio information ends playing when a last video frame information of the at least one video frame information ends presenting. In some embodiments, the number of video frames corresponding to each piece of audio information may be the same or different, and the number of video frames corresponding to each piece of audio information may be determined according to the duration information of each piece of audio information, for example, the number of video frames is proportional to the duration of the piece of audio information, and the longer the duration of the piece of audio information is, the more the number of video frames is.

In some embodiments, the method further comprises: the first user equipment determines the playing duration information of each piece of the one or more pieces of video frame information. In some embodiments, the playing time duration of each video frame information in the video frame sequence information may be the same or different. In some embodiments, if the playing duration of each video frame information is the same as a default duration, the default duration may be set by the network device, or may also be set by the first user equipment. In some embodiments, if the playing time duration of each piece of video frame information is different, the playing time duration of each piece of video frame information may be determined by the first user equipment.

In some embodiments, the determining the play-out duration information for each of the one or more video frame information comprises at least one of: determining the playing time length information of each video frame information according to the number information of the first accessory information; and if the first attachment information is first document information, determining the playing duration information of each piece of video frame information according to the page number information of the first document information. In some embodiments, if the playing duration of each piece of video frame information is the same, the playing duration of each piece of video frame information may be determined according to the number information of the first accessory information, for example, the playing duration of each piece of video frame information is inversely proportional to the number of the first accessory information, and the greater the number of the first accessory information is, the shorter the playing duration of each piece of video frame information is. In some embodiments, if the playing duration of each piece of video frame information is the same and the first attachment information is the first document information, the playing duration of each piece of video frame information may be determined according to the page number information of the first document information, or if the first mail includes a plurality of pieces of first attachment information, that is, the first mail includes a plurality of pieces of first document information, the playing duration of each piece of video frame information may be determined according to the total page number information of the plurality of pieces of first document information, for example, the playing duration of each piece of video frame information is inversely proportional to the page number of the first document information, and the more the page number of the first document information, the shorter the playing duration of each piece of video frame information.

In some embodiments, the first attachment information is first document information; wherein the determining of the playing duration information of each of the one or more pieces of video frame information includes at least one of: determining the playing duration information of each video frame information according to the text information length of the video frame information; and determining the playing duration information of each video frame information by identifying the content of the text information of the video frame information. In some embodiments, if the first attachment information is the first document information, the playing duration of each video frame information in the video frame sequence information may be determined according to the length of the text information in the video frame information, for example, the playing duration of the video frame information is proportional to the length of the text information in the video frame information, and the longer the length of the text information is, the longer the playing duration of the video frame information is. In some embodiments, content identification may be performed on the text information of each video frame information, and the playing duration information of the video frame information may be determined according to the content importance degree or the content criticality degree of the identified text information, for example, the playing duration of the video frame information is proportional to the content importance degree or the content criticality degree of the text information in the video frame information, and the higher the content importance degree or the content criticality degree, the longer the playing duration of the video frame information.

In some embodiments, the first attachment information is first picture information; wherein the determining the playing duration information of each of the one or more pieces of video frame information includes: and determining the playing time length information of the video frame information according to the image identification result by carrying out image identification on the picture information of each video frame information. In some embodiments, if the first e-mail includes a plurality of first attached information, each of which is first picture information, that is, the first e-mail includes a plurality of first picture information, performs image recognition on the picture information in each of the video frame information, and determines the playing duration of the video frame information according to the result of the image recognition, for example, determines the playing duration of the video frame information according to the identified image complexity of the picture information in the video frame information, the playing duration of the video frame information is proportional to the image complexity of the picture information in the video frame information, the playing duration of the video frame information is longer the higher the image complexity is, and for example, determines the playing duration of the video frame information according to the identified number of words in the picture information in the video frame information, the playing duration of the video frame information is proportional to the number of words in the picture information in the video frame information, the more the number of the character words is, the longer the playing time of the video frame information is.

Fig. 2 shows a block diagram of a first user equipment for generating a video according to an embodiment of the present application, the first user equipment comprising a one-module 11 and a two-module 12. A one-to-one module 11, configured to respond to a video generation trigger operation executed by a first user for a first mail to be sent, and generate video information corresponding to the first mail according to text information and first attachment information of the first mail; a second module 12 for adding said video information to said first mail.

A module 11, configured to respond to a video generation trigger operation executed by a first user for a first email to be sent, and generate video information corresponding to the first email according to text information and first attachment information of the first email. In some embodiments, the first user performs a video generation triggering operation on a mail editing page in a mail application, including but not limited to any application for sending and receiving mails, on the first mail to be sent, where the first application may be a separately installed application program, or may also be a web page, or may also be an applet. In some embodiments, the video generation trigger operation may be a mail preview operation, e.g., the first user clicks a "preview" button on a mail edit page, or the video generation trigger operation may also be a mail send operation, e.g., the first user clicks a "send" button on a mail edit page. In some embodiments, the first email may have one or more first attachment information, the type of the first attachment information including, but not limited to, document type, picture type, audio type, video type, and the like. In some embodiments, the body information of the first mail needs to be preprocessed, for example, the signature information and the reference information in the body information are removed. In some embodiments, if the first attachment information is a video, the text information of the first email may be used as the audio information or the subtitle information of the video to synthesize a new video, and the new video may be used as the video information corresponding to the first email. In some embodiments, if the first attachment information is an audio, the text information of the first email may be split into one or more ordered text information, each text information is captured to generate one or more ordered video frame information, then a video is generated according to the one or more video frame information and the audio, and the video is used as the video information corresponding to the first email. In some embodiments, if the first attachment information is a document or a picture, one or more pieces of video frame information may be generated in order according to the first attachment information, where each piece of video frame information corresponds to one picture, or each piece of video frame information corresponds to one page in one document, then audio information or subtitle information may be generated according to the text information of the first email, then a video may be generated according to the one or more pieces of video frame information and the audio information or the subtitle information, and the video may be used as the video information corresponding to the first email.

A second module 12 for adding said video information to said first mail. In some embodiments, the video information may be added to the first mail as second attachment information of the first mail. In some embodiments, the video information may also be added to the body information of the first mail. According to the method and the device, the sender can synthesize the text and the attachment in the mail into the video and then send the mail comprising the video, so that the receiver can know the mail content more intuitively and comprehensively by watching the video without browsing the text and the attachment of the mail respectively, the time cost for checking the mail of the receiver can be saved, and great convenience is provided for the receiver.

In some embodiments, the secondary module 12 is used for any of: adding the video information to the first mail as second attachment information of the first mail; and adding the video information into the text information of the first mail. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the video generation triggering operation comprises any one of: e, mail preview operation; and E, mail sending operation. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the generating video information corresponding to the first email according to the body information and the first attachment information of the first email includes: generating video frame sequence information according to the first attachment information of the first mail, wherein the video frame sequence information comprises one or more ordered video frame information; and generating video information corresponding to the first mail according to the video frame sequence information and the text information of the first mail. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the first attachment information is first document information; wherein the generating of the video frame sequence information according to the first attachment information of the first mail comprises: and generating video frame sequence information by capturing one or more pages in the first document information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the first document information includes at least one page; wherein the generating of the video frame sequence information by screenshot one or more pages in the first document information is further configured to: one or more pages are determined from the at least one page. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, said determining one or more pages from said at least one page comprises: one or more pages are extracted from the at least one page at predetermined extraction intervals. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, said determining one or more pages from said at least one page comprises: determining one or more pages from the at least one page by content recognition of the first document information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, said generating a sequence of video frames by screenshot one or more pages in said first document information comprises: and opening the first document information in the background of the first user equipment, and generating video frame sequence information by screenshot on one or more pages in the opened first document information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, said generating a sequence of video frames by screenshot one or more pages in said first document information comprises: calling an interface provided by a third-party document application on the first user equipment, wherein the input of the interface is the first document information, and the output of the interface is screenshot information corresponding to one or more pages in the first document information; and generating video frame sequence information according to the screenshot information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the generating video information corresponding to the first email according to the video frame sequence information and the body information of the first email includes: splitting text information of the first mail to generate at least one subtitle information; and generating video information corresponding to the first mail according to the video frame sequence information and the at least one piece of subtitle information, wherein each piece of subtitle information corresponds to at least one piece of video frame information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the apparatus is further configured to: and determining the number of video frames corresponding to each subtitle information by performing content identification on each subtitle information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the generating video information corresponding to the first email according to the video frame sequence information and the body information of the first email includes: generating audio information according to the text information of the first mail; and generating video information corresponding to the first mail according to the video frame sequence information and the audio information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the generating audio information according to the body information of the first mail includes: splitting the text message of the first mail to generate at least one text message; generating at least one audio message according to the at least one text message; wherein, the generating the video information corresponding to the first mail according to the video frame sequence information and the audio information includes: and generating video information corresponding to the first mail according to the video frame sequence information and the at least one piece of audio information, wherein each piece of audio information corresponds to at least one piece of video frame information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the apparatus is further configured to: determining the playing time length information of each video frame information in the one or more video frame information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the determining the play-out duration information for each of the one or more video frame information comprises at least one of: determining the playing time length information of each video frame information according to the number information of the first accessory information; and if the first attachment information is first document information, determining the playing duration information of each piece of video frame information according to the page number information of the first document information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the first attachment information is first document information; wherein the determining of the playing duration information of each of the one or more pieces of video frame information includes at least one of: determining the playing duration information of each video frame information according to the text information length of the video frame information; and determining the playing duration information of each video frame information by identifying the content of the text information of the video frame information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In some embodiments, the first attachment information is first picture information; wherein the determining the playing duration information of each of the one or more pieces of video frame information includes: and determining the playing time length information of the video frame information according to the image identification result by carrying out image identification on the picture information of each video frame information. Here, the related operations are the same as or similar to those of the embodiment shown in fig. 1, and therefore are not described again, and are included herein by reference.

In addition to the methods and apparatus described in the embodiments above, the present application also provides a computer readable storage medium storing computer code that, when executed, performs the method as described in any of the preceding claims.

The present application also provides a computer program product, which when executed by a computer device, performs the method of any of the preceding claims.

The present application further provides a computer device, comprising:

one or more processors;

a memory for storing one or more computer programs;

the one or more computer programs, when executed by the one or more processors, cause the one or more processors to implement the method of any preceding claim.

FIG. 4 illustrates an exemplary system that can be used to implement the various embodiments described herein;

in some embodiments, as shown in FIG. 4, the system 300 can be implemented as any of the devices in the various embodiments described. In some embodiments, system 300 may include one or more computer-readable media (e.g., system memory or NVM/storage 320) having instructions and one or more processors (e.g., processor(s) 305) coupled with the one or more computer-readable media and configured to execute the instructions to implement modules to perform the actions described herein.

For one embodiment, system control module 310 may include any suitable interface controllers to provide any suitable interface to at least one of processor(s) 305 and/or any suitable device or component in communication with system control module 310.

The system control module 310 may include a memory controller module 330 to provide an interface to the system memory 315. Memory controller module 330 may be a hardware module, a software module, and/or a firmware module.

System memory 315 may be used, for example, to load and store data and/or instructions for system 300. For one embodiment, system memory 315 may include any suitable volatile memory, such as suitable DRAM. In some embodiments, the system memory 315 may include a double data rate type four synchronous dynamic random access memory (DDR4 SDRAM).

For one embodiment, system control module 310 may include one or more input/output (I/O) controllers to provide an interface to NVM/storage 320 and communication interface(s) 325.

For example, NVM/storage 320 may be used to store data and/or instructions. NVM/storage 320 may include any suitable non-volatile memory (e.g., flash memory) and/or may include any suitable non-volatile storage device(s) (e.g., one or more Hard Disk Drives (HDDs), one or more Compact Disc (CD) drives, and/or one or more Digital Versatile Disc (DVD) drives).

NVM/storage 320 may include storage resources that are physically part of the device on which system 300 is installed or may be accessed by the device and not necessarily part of the device. For example, NVM/storage 320 may be accessible over a network via communication interface(s) 325.

Communication interface(s) 325 may provide an interface for system 300 to communicate over one or more networks and/or with any other suitable device. System 300 may wirelessly communicate with one or more components of a wireless network according to any of one or more wireless network standards and/or protocols.

For one embodiment, at least one of the processor(s) 305 may be packaged together with logic for one or more controller(s) (e.g., memory controller module 330) of the system control module 310. For one embodiment, at least one of the processor(s) 305 may be packaged together with logic for one or more controller(s) of the system control module 310 to form a System In Package (SiP). For one embodiment, at least one of the processor(s) 305 may be integrated on the same die with logic for one or more controller(s) of the system control module 310. For one embodiment, at least one of the processor(s) 305 may be integrated on the same die with logic for one or more controller(s) of the system control module 310 to form a system on a chip (SoC).

In various embodiments, system 300 may be, but is not limited to being: a server, a workstation, a desktop computing device, or a mobile computing device (e.g., a laptop computing device, a handheld computing device, a tablet, a netbook, etc.). In various embodiments, system 300 may have more or fewer components and/or different architectures. For example, in some embodiments, system 300 includes one or more cameras, a keyboard, a Liquid Crystal Display (LCD) screen (including a touch screen display), a non-volatile memory port, multiple antennas, a graphics chip, an Application Specific Integrated Circuit (ASIC), and speakers.

It should be noted that the present application may be implemented in software and/or a combination of software and hardware, for example, implemented using Application Specific Integrated Circuits (ASICs), general purpose computers or any other similar hardware devices. In one embodiment, the software programs of the present application may be executed by a processor to implement the steps or functions described above. Likewise, the software programs (including associated data structures) of the present application may be stored in a computer readable recording medium, such as RAM memory, magnetic or optical drive or diskette and the like. Additionally, some of the steps or functions of the present application may be implemented in hardware, for example, as circuitry that cooperates with the processor to perform various steps or functions.

In addition, some of the present application may be implemented as a computer program product, such as computer program instructions, which when executed by a computer, may invoke or provide methods and/or techniques in accordance with the present application through the operation of the computer. Those skilled in the art will appreciate that the form in which the computer program instructions reside on a computer-readable medium includes, but is not limited to, source files, executable files, installation package files, and the like, and that the manner in which the computer program instructions are executed by a computer includes, but is not limited to: the computer directly executes the instruction, or the computer compiles the instruction and then executes the corresponding compiled program, or the computer reads and executes the instruction, or the computer reads and installs the instruction and then executes the corresponding installed program. Computer-readable media herein can be any available computer-readable storage media or communication media that can be accessed by a computer.

Communication media includes media by which communication signals, including, for example, computer readable instructions, data structures, program modules, or other data, are transmitted from one system to another. Communication media may include conductive transmission media such as cables and wires (e.g., fiber optics, coaxial, etc.) and wireless (non-conductive transmission) media capable of propagating energy waves such as acoustic, electromagnetic, RF, microwave, and infrared. Computer readable instructions, data structures, program modules, or other data may be embodied in a modulated data signal, for example, in a wireless medium such as a carrier wave or similar mechanism such as is embodied as part of spread spectrum techniques. The term "modulated data signal" means a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. The modulation may be analog, digital or hybrid modulation techniques.

By way of example, and not limitation, computer-readable storage media may include volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer-readable instructions, data structures, program modules or other data. For example, computer-readable storage media include, but are not limited to, volatile memory such as random access memory (RAM, DRAM, SRAM); and non-volatile memory such as flash memory, various read-only memories (ROM, PROM, EPROM, EEPROM), magnetic and ferromagnetic/ferroelectric memories (MRAM, FeRAM); and magnetic and optical storage devices (hard disk, tape, CD, DVD); or other now known media or later developed that can store computer-readable information/data for use by a computer system.

An embodiment according to the present application comprises an apparatus comprising a memory for storing computer program instructions and a processor for executing the program instructions, wherein the computer program instructions, when executed by the processor, trigger the apparatus to perform a method and/or a solution according to the aforementioned embodiments of the present application.

It will be evident to those skilled in the art that the present application is not limited to the details of the foregoing illustrative embodiments, and that the present application may be embodied in other specific forms without departing from the spirit or essential attributes thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the application being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Any reference sign in a claim should not be construed as limiting the claim concerned. Furthermore, it is obvious that the word "comprising" does not exclude other elements or steps, and the singular does not exclude the plural. A plurality of units or means recited in the apparatus claims may also be implemented by one unit or means in software or hardware. The terms first, second, etc. are used to denote names, but not any particular order.

Claims

1. A method for generating a video, applied to a first user equipment, wherein the method comprises:

adding the video information to the first mail.

2. The method of claim 1, wherein the adding the video information to the first email comprises any one of:

adding the video information to the first mail as second attachment information of the first mail;

and adding the video information into the text information of the first mail.

3. The method of claim 1, wherein the video generation trigger operation comprises any one of:

e, mail preview operation;

and E, mail sending operation.

4. The method of claim 1, wherein the generating video information corresponding to the first mail according to the body information and the first attachment information of the first mail comprises:

generating video frame sequence information according to the first attachment information of the first mail, wherein the video frame sequence information comprises one or more ordered video frame information;

and generating video information corresponding to the first mail according to the video frame sequence information and the text information of the first mail.

5. The method of claim 4, wherein the first attachment information is first document information;

wherein the generating of the video frame sequence information according to the first attachment information of the first mail, wherein the video frame sequence information includes one or more video frame information in order, comprises:

and generating video frame sequence information by capturing one or more pages in the first document information.

6. The method of claim 5, wherein the first document information includes at least one page;

wherein the generating of the video frame sequence information by capturing one or more pages of the first document information further comprises:

one or more pages are determined from the at least one page.

7. The method of claim 6, wherein said determining one or more pages from said at least one page comprises:

one or more pages are extracted from the at least one page at predetermined extraction intervals.

8. The method of claim 6, wherein said determining one or more pages from said at least one page comprises:

determining one or more pages from the at least one page by content recognition of the first document information.

9. The method of claim 5, wherein the generating video frame sequence information by screenshot one or more pages in the first document information comprises:

and opening the first document information in the background of the first user equipment, and generating video frame sequence information by screenshot on one or more pages in the opened first document information.

10. The method of claim 5, wherein the generating video frame sequence information by screenshot one or more pages in the first document information comprises:

calling an interface provided by a third-party document application on the first user equipment, wherein the input of the interface is the first document information, and the output of the interface is screenshot information corresponding to one or more pages in the first document information;

and generating video frame sequence information according to the screenshot information.

11. The method of claim 4, wherein the generating video information corresponding to the first mail according to the video frame sequence information and the body information of the first mail comprises:

splitting text information of the first mail to generate at least one subtitle information;

and generating video information corresponding to the first mail according to the video frame sequence information and the at least one piece of subtitle information, wherein each piece of subtitle information corresponds to at least one piece of video frame information.

12. The method of claim 11, wherein the method further comprises:

and determining the number of video frames corresponding to each subtitle information by performing content identification on each subtitle information.

13. The method of claim 4, wherein the generating video information corresponding to the first mail according to the video frame sequence information and the body information of the first mail comprises:

generating audio information according to the text information of the first mail;

and generating video information corresponding to the first mail according to the video frame sequence information and the audio information.

14. The method of claim 13, wherein generating audio information based on the body information of the first mail comprises:

splitting the text message of the first mail to generate at least one text message;

generating at least one audio message according to the at least one text message;

wherein, the generating the video information corresponding to the first mail according to the video frame sequence information and the audio information includes:

and generating video information corresponding to the first mail according to the video frame sequence information and the at least one piece of audio information, wherein each piece of audio information corresponds to at least one piece of video frame information.

15. The method of claim 4, wherein the method further comprises:

determining the playing time length information of each video frame information in the one or more video frame information.

16. The method of claim 15, wherein the determining the playout duration information for each of the one or more video frame information comprises at least one of:

determining the playing time length information of each video frame information according to the number information of the first accessory information;

and if the first attachment information is first document information, determining the playing duration information of each piece of video frame information according to the page number information of the first document information.

17. The method of claim 15, wherein the first attachment information is first document information;

wherein the determining of the playing duration information of each of the one or more pieces of video frame information includes at least one of:

determining the playing duration information of each video frame information according to the text information length of the video frame information;

and determining the playing duration information of each video frame information by identifying the content of the text information of the video frame information.

18. The method of claim 15, wherein the first attachment information is first picture information;

wherein the determining the playing duration information of each of the one or more pieces of video frame information includes:

and determining the playing time length information of the video frame information according to the image identification result by carrying out image identification on the picture information of each video frame information.

19. A computer device for generating video, comprising a memory, a processor and a computer program stored on the memory, characterized in that the processor executes the computer program to implement the steps of the method according to any of claims 1 to 18.

20. A computer-readable storage medium, on which a computer program/instructions are stored, which, when being executed by a processor, carry out the steps of the method according to any one of claims 1 to 18.

21. A computer program product comprising a computer program, characterized in that the computer program realizes the steps of the method according to any one of claims 1 to 18 when executed by a processor.