WO2020113733A1 - Animation generation method and apparatus, electronic device, and computer-readable storage medium - Google Patents

Animation generation method and apparatus, electronic device, and computer-readable storage medium Download PDF

Info

Publication number
WO2020113733A1
WO2020113733A1 PCT/CN2018/125392 CN2018125392W WO2020113733A1 WO 2020113733 A1 WO2020113733 A1 WO 2020113733A1 CN 2018125392 W CN2018125392 W CN 2018125392W WO 2020113733 A1 WO2020113733 A1 WO 2020113733A1
Authority
WO
WIPO (PCT)
Prior art keywords
music
animation
target
target music
picture
Prior art date
Application number
PCT/CN2018/125392
Other languages
French (fr)
Chinese (zh)
Inventor
都之夏
Original Assignee
北京微播视界科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京微播视界科技有限公司 filed Critical 北京微播视界科技有限公司
Publication of WO2020113733A1 publication Critical patent/WO2020113733A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel

Abstract

An animation generation method and apparatus, an electronic device, and a computer-readable storage medium. The method comprises: determining, by means of a predetermined speech recognition method, a music element feature of a target music (S101); determining a plurality of animation composition images, and determining, according to the music element feature of the target music, animation playback effects matching respective animation composition images (S102); generating, according to the animation playback effects and the plurality of animation composition images, a target animation (S103); and synthesizing the target music and the target animation, such that the target music and the target animation can be played correspondingly and synchronously (S104). The animation playback effect of each image is determined on the basis of the music element feature of the target music, thereby preventing a feeling of discomfort caused by incongruities between the playback effects of the images and the music element feature, and improving animation viewing experience for users.

Description

动画生成方法、装置、电子设备及计算机可读存储介质Animation generating method, device, electronic equipment and computer readable storage medium
相关申请的交叉引用Cross-reference of related applications
本公开要求于2018年12月7日在中国国家知识产权局提交的申请号为201811496521.5的中国专利申请的权益,其全部内容通过引用整体并入本文。This disclosure requires the rights and interests of the Chinese patent application with the application number 201811496521.5 filed at the State Intellectual Property Office of China on December 7, 2018, the entire contents of which are incorporated herein by reference.
技术领域Technical field
本公开涉及图片处理技术领域,具体而言,本公开涉及一种动画生成方法、装置、电子设备及计算机可读存储介质。The present disclosure relates to the technical field of image processing, and in particular, the present disclosure relates to an animation generation method, device, electronic device, and computer-readable storage medium.
背景技术Background technique
通过拍摄图片记录生活已经成为了人们的一种重要的生活方式。随着动画处理技术的发展,人们可以选择一定数量的图片及相应的音乐通过相应的动画处理技术直接生成相应的多媒体动画,然后可以通过社交网络平台分享生成的相应动画,向其他的社交网络用户展示自己的生活。Taking pictures to record life has become an important way of life for people. With the development of animation processing technology, people can select a certain number of pictures and corresponding music to directly generate corresponding multimedia animation through the corresponding animation processing technology, and then can share the corresponding animation generated through the social network platform to other social network users Show your life.
目前,根据所选择的图片及音乐生成的相应多媒体动画中的图片的播放方式是随机确定的,即生成的动画中的图片的播放方式与所选择的音乐没有关联关系。然而,在根据现有技术基于选择的图片及音乐生成的动画中,音乐的特征(如音乐的节奏、节拍及旋律等)并不影响生成的图片的播放方式,从而音乐的特征与图片的播放方式并不相关,这造成动画观看者的观看体验不高,如某一动画片段中音乐的节奏是较为舒缓的,而对应的图片的转场方式却非常快,这种突兀的图片播放方式可能给视频观看者带来不适感,从而降低了用户的观看体验。因此,现有技术存在基于图片及音乐生成的动画中图片的播放方式与音乐无关联,从而导致视频观看者体验差的问题。Currently, the playback mode of the pictures in the corresponding multimedia animation generated based on the selected pictures and music is randomly determined, that is, the playback mode of the pictures in the generated animation is not related to the selected music. However, in the animation generated based on the selected pictures and music according to the prior art, the characteristics of the music (such as the rhythm, beat, melody, etc.) of the music do not affect the playback mode of the generated pictures, and thus the characteristics of the music and the playback of the pictures The way is not related, which causes the viewing experience of the animation viewer to be low. For example, the rhythm of the music in a certain cartoon segment is relatively soothing, and the corresponding picture transition method is very fast. This abrupt picture playback method may be It brings discomfort to the video viewer, which reduces the user's viewing experience. Therefore, in the prior art, there is a problem that the playback method of the pictures in the animation generated based on the pictures and the music is not related to the music, resulting in a poor video viewer experience.
发明内容Summary of the invention
第一方面,本公开提供了一种动画生成方法,该方法包括:In a first aspect, the present disclosure provides an animation generation method, the method including:
通过预定的语音识别方法确定目标音乐的音乐要素特征;Determine the music element characteristics of the target music through a predetermined voice recognition method;
确定多张动画组成图片,并根据目标音乐的音乐要素特征确定与各张动画组成图片相匹配的动画播放效果;Determine multiple animation composition pictures, and determine the animation playback effect matching each animation composition picture according to the music element characteristics of the target music;
根据动画播放效果及多张动画组成图片生成目标动画;Generate target animation based on animation playback effect and multiple animation composition pictures;
将目标音乐与目标动画进行合成处理,以使得目标音乐与目标动画能相应地同步播放展示。The target music and the target animation are synthesized so that the target music and the target animation can be synchronously played and displayed accordingly.
第二方面,本公开提供了一种动画生成装置,该装置包括:In a second aspect, the present disclosure provides an animation generating device, which includes:
第一确定模块,用于通过预定的语音识别方法确定目标音乐的音乐要素特征;The first determining module is used to determine the music element characteristics of the target music through a predetermined voice recognition method;
第二确定模块,用于确定多张动画组成图片,并根据第一确定模块确定的目标音乐的音乐要素特征确定与各张动画组成图片相匹配的动画播放效果;The second determination module is used to determine a plurality of animation composition pictures, and determine the animation playback effect matching each animation composition picture according to the music element characteristics of the target music determined by the first determination module;
动画生成模块,用于根据第二确定模块确定的多张动画组成图片及与各张动画组成图片相匹配的动画播放效果生成目标动画;The animation generation module is used to generate a target animation according to the multiple animation composition pictures determined by the second determination module and the animation playback effects matching each animation composition picture;
合成处理模块,用于将目标音乐与动画生成模块生成的目标动画进行合成处理,以使得目标音乐与目标动画能相应地同步播放展示。The synthesis processing module is used for synthesizing the target music and the target animation generated by the animation generating module, so that the target music and the target animation can be synchronously played and displayed accordingly.
第三方面,本公开提供了一种电子设备,该电子设备包括:In a third aspect, the present disclosure provides an electronic device including:
处理器;processor;
存储器,所述存储器存储有至少一个应用程序,当该应用程序被该处理器执行时,使得该电子设备执行根据第一方面所示的动画生成方法。A memory that stores at least one application program, and when the application program is executed by the processor, causes the electronic device to execute the animation generation method according to the first aspect.
第四方面,本公开提供了一种计算机可读存储介质,计算机存储介质用于存储计算机指令,当该计算机指令在计算机上运行时,使得计算机执行根据第一方面所示的动画生成方法。According to a fourth aspect, the present disclosure provides a computer-readable storage medium for storing computer instructions, which when executed on a computer, causes the computer to execute the animation generation method according to the first aspect.
在本公开的方案中,根据目标音乐的音乐要素特征确定动画播放效果,各张动画组成图片都对应有与目标音乐的音乐要素特征相匹配的播放效 果,可以避免图片的播放效果与音乐要素特征不相关所带来的不适感(如相应的目标音乐的速度特征较为舒缓,而对应的图片的极快的转场方式所带来的与音乐特征不匹配的不适感),从而提升了用户的动画观看体验。In the solution of the present disclosure, the animation playback effect is determined according to the music element characteristics of the target music, and each animation composition picture corresponds to a playback effect matching the music element characteristics of the target music, which can avoid the picture playback effect and the music element characteristics Discomfort caused by irrelevance (for example, the speed characteristics of the corresponding target music are more comfortable, and the discomfort caused by the extremely fast transition of the corresponding picture does not match the music characteristics), thereby improving the user's Animation viewing experience.
本公开附加的方面和优点将在下面的描述中部分给出,这些将从下面的描述中变得明显,或通过本公开的实践了解到。Additional aspects and advantages of the present disclosure will be partially given in the following description, which will become apparent from the following description or be learned through the practice of the present disclosure.
附图说明BRIEF DESCRIPTION
本公开上述的和/或附加的方面和优点从下面结合附图对实施例的描述中将变得明显和容易理解,其中:The above and/or additional aspects and advantages of the present disclosure will become apparent and easy to understand from the following description of the embodiments in conjunction with the accompanying drawings, in which:
图1为本公开实施例的一种动画生成方法的流程示意图;FIG. 1 is a schematic flowchart of an animation generation method according to an embodiment of the present disclosure;
图2为本公开实施例的一种动画生成装置的结构示意图;2 is a schematic structural diagram of an animation generation device according to an embodiment of the present disclosure;
图3为本公开实施例的另一种动画生成装置的结构示意图;3 is a schematic structural diagram of another animation generating device according to an embodiment of the present disclosure;
图4为本公开实施例的一种电子设备的结构示意图。4 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure.
具体实施方式detailed description
下面详细描述本公开的实施例,各实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,仅用于解释本公开,而不能解释为对本公开的限制。The embodiments of the present disclosure are described in detail below. Examples of the embodiments are shown in the drawings, in which the same or similar reference numerals indicate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below with reference to the drawings are exemplary, and are only used to explain the present disclosure, and cannot be construed as limiting the present disclosure.
本技术领域技术人员可以理解,除非特意声明,这里使用的单数形式“一”、“一个”和“该”也可包括复数形式。应该进一步理解的是,本公开的说明书中使用的措辞“包括”是指存在特征、整数、步骤、操作、元件和/或组件,但是并不排除存在或添加一个或多个其他特征、整数、步骤、操作、元件、组件和/或它们的组。这里使用的措辞“和/或”包括一个或更多个相关联的列出项的全部或任一单元和全部组合。Those skilled in the art can understand that unless specifically stated, the singular forms "a", "an", and "the" used herein may also include the plural form. It should be further understood that the word "comprising" used in the specification of the present disclosure refers to the presence of features, integers, steps, operations, elements and/or components, but does not exclude the presence or addition of one or more other features, integers, Steps, operations, elements, components and/or their groups. The expression "and/or" as used herein includes all or any unit and all combinations of one or more associated listed items.
为使本公开的目的、技术方案和优点更加清楚,下面将结合附图对本公开实施方式作进一步地详细描述。To make the objectives, technical solutions, and advantages of the present disclosure more clear, the embodiments of the present disclosure will be further described in detail below in conjunction with the accompanying drawings.
下面以具体地实施例对本公开的技术方案以及本公开的技术方案如 何解决上述技术问题进行详细说明。下面这几个具体的实施例可以相互结合,对于相同或相似的概念或过程可能在某些实施例中不再赘述。下面将结合附图,对本公开的实施例进行描述。The technical solutions of the present disclosure and how the technical solutions of the present disclosure solve the above technical problems will be described in detail below with specific embodiments. The following specific embodiments may be combined with each other, and the same or similar concepts or processes may not be repeated in some embodiments. The embodiments of the present disclosure will be described below with reference to the drawings.
本公开的一个实施例中提供了一种动画生成方法,如图1所示,该方法可以包括步骤S101至步骤S104。An embodiment of the present disclosure provides an animation generation method. As shown in FIG. 1, the method may include steps S101 to S104.
步骤S101:通过预定的语音识别方法确定目标音乐的音乐要素特征。Step S101: Determine the music element characteristics of the target music through a predetermined voice recognition method.
对于本实施例,音乐识别是一个交叉型的研究领域,涉及到音乐知识和信号处理技术,音乐识别包括通过对音乐的分析,得到目标音乐的音乐要素特征。其中,目标音乐可以是WAV(Wave form audio format)格式的音乐文件,WAV文件是一种存储无损音乐的波形文件;目标音乐也可以是MIDI(Musical Instrument Digital Interface,乐器数字接口)格式的音乐文件,与波形文件不同,MIDI文件不对音乐进行抽样,而是对音乐的每个音符记录为一个数字,所以与波形文件相比文件要小得多。For this embodiment, music recognition is a cross-type research field, involving music knowledge and signal processing technology. Music recognition includes analyzing the music to obtain the music element characteristics of the target music. Among them, the target music can be a music file in WAV (Wave form audio format) format. The WAV file is a waveform file that stores lossless music; the target music can also be a music file in MIDI (Musical Instrument Digital Interface) format. Unlike the wave file, the MIDI file does not sample the music, but records each note of the music as a number, so the file is much smaller than the wave file.
对于本实施例,目标音乐可以是通过弹唱或哼唱等方式输入得到的,也可以是通过查找本地音乐库或通过网络下载得到的。其中,如果目标音乐为MP3、WMA等格式的文件,由于MP3、WMA等格式的音乐文件为压缩格式的音乐文件,可以对目标音乐的格式进行解码处理(即解压缩),解码为WAV等格式的文件。For this embodiment, the target music may be input by playing or humming, or it may be obtained by searching a local music library or downloading through the network. Among them, if the target music is a file in the format of MP3, WMA, etc., because the music file in the format of MP3, WMA, etc. is a music file in a compressed format, the format of the target music can be decoded (ie decompressed), and decoded into a format such as WAV document.
对于本实施例,通过预定的语音识别方法确定目标音乐的音乐要素特征,其中,该预定的语音识别方法可以是基于时频分析的方法,也可以是基于时域分析的方法、基于频域分析的方法,或者通过相应的其他方法,此处不做限定。For this embodiment, the music element characteristics of the target music are determined by a predetermined voice recognition method, wherein the predetermined voice recognition method may be a method based on time-frequency analysis, a method based on time-domain analysis, or a method based on frequency-domain analysis The method, or through other corresponding methods, is not limited here.
步骤S102:确定多张动画组成图片,并根据目标音乐的音乐要素特征确定与各张动画组成图片相匹配的动画播放效果。Step S102: Determine a plurality of animation composition pictures, and determine an animation playback effect matching each animation composition picture according to the music element characteristics of the target music.
对于本实施例,确定多张动画组成图片,其中,多张动画组成图片可以是用户从图片库中人工选择确定的,也可以通过相应的图片确定方法从图片库中自动确定的。For this embodiment, multiple animation composition pictures are determined, where the multiple animation composition pictures may be manually selected by the user from the picture library, or may be automatically determined from the picture library through a corresponding picture determination method.
对于本实施例,可以基于预定的音乐要素特征与动画组成图片播放效 果的匹配规则,根据目标音乐的音乐要素特征确定与各张动画组成图片相匹配的动画播放效果。For this embodiment, the animation playback effect matching each animation composition picture may be determined based on the predetermined music element characteristic and the animation composition picture playing effect matching rule, according to the music element characteristic of the target music.
步骤S103:根据多张动画组成图片及与各张动画组成图片相匹配的动画播放效果生成目标动画。Step S103: Generate a target animation according to multiple animation composition pictures and animation playback effects matching each animation composition picture.
具体地,可以基于各张动画组成图片分别对应的相匹配的动画播放效果,对各张动画组成图片进行处理,使得各张动画组成图片能按照确定的相匹配的播放效果进行播放,然后对处理后的各张动画组成图片进行融合处理生成目标动画。Specifically, each animation component picture can be processed based on the matching animation playback effect corresponding to each animation component picture, so that each animation component picture can be played according to the determined matching playback effect, and then processed The subsequent animations form a picture and undergo fusion processing to generate the target animation.
步骤S104:将目标音乐与目标动画进行合成处理,以使得目标音乐与目标动画能相应地同步播放展示。Step S104: Synthesizing the target music and the target animation so that the target music and the target animation can be synchronously played and displayed accordingly.
具体地,可以基于时间状态信息,将目标音乐与目标动画进行合成处理,以使得目标音乐与目标动画能相应地同步播放显示,其中,相应地同步播放显示可以是动画组成图片根据对应的时间点或时间段的目标音乐的音乐要素特征进行相应播放效果的播放显示。Specifically, the target music and the target animation can be synthesized based on the time state information, so that the target music and the target animation can be played and displayed synchronously accordingly, wherein the corresponding synchronized play and display can be an animation composed picture according to the corresponding time point Or, the music element characteristics of the target music in the time period are played and displayed according to the corresponding play effect.
在本公开的实施例中,根据目标音乐的音乐要素特征确定动画播放效果,各张动画组成图片都对应有与目标音乐的音乐要素特征相匹配的播放效果,可以避免图片的播放效果与音乐要素特征不相关所带来的不适感(如相应的目标音乐的速度特征较为舒缓,而对应的图片的极快的转场方式所带来的与音乐特征不匹配的不适感),从而提升了用户的动画观看体验。In the embodiment of the present disclosure, the animation playback effect is determined according to the music element characteristics of the target music, and each animation composition picture corresponds to a playback effect matching the music element characteristics of the target music, which can avoid the picture playback effect and the music element Discomfort caused by irrelevant features (for example, the speed characteristic of the corresponding target music is more comfortable, and the discomfort caused by the extremely fast transition of the corresponding picture does not match the music features), thereby improving the user Animation viewing experience.
本公开实施例提供了一种可能的实现方式,具体地,步骤S101可以包括步骤S1011至步骤S1013。An embodiment of the present disclosure provides a possible implementation manner. Specifically, step S101 may include step S1011 to step S1013.
步骤S1011(图中未示出):提取目标音乐的音频信息。Step S1011 (not shown in the figure): extract the audio information of the target music.
具体地,通过相应的音频信息提取技术提取得到目标音乐的音频信息,其中,在目标音乐的音频信息中可能夹杂着很多噪音信息,因为在录制音乐时可能受到来自电器设备的干扰,或者是其他物体的杂音,也有可能是工频信号的干扰,从而噪音信息是不可避免的,所以可以对提取得到音频信息进行预处理,去除相应噪音信息的影响,以及压缩音频信息的数据以 减少计算量。Specifically, the audio information of the target music is extracted through the corresponding audio information extraction technology, where the audio information of the target music may be mixed with a lot of noise information, because it may be interfered by electrical equipment when recording music, or other The noise of the object may also be the interference of the power frequency signal, so the noise information is inevitable, so you can pre-process the extracted audio information to remove the influence of the corresponding noise information, and compress the audio information data to reduce the amount of calculation.
步骤S1012(图中未示出):对提取到的音频信息进行声学特征提取,得到相应的声学特征信息。Step S1012 (not shown in the figure): perform acoustic feature extraction on the extracted audio information to obtain corresponding acoustic feature information.
具体地,可以通过相应的滤波器对音频信息进行处理,从而得到目标音乐的声学特征信息;其中,可以通过基于高斯低通的FIR滤波器对音频信息进行处理,得到PCM格式的目标音乐信号的包络线,然后通过频域分析与时域分析相结合的方法进行峰值检测,确定目标音乐的音符位置,继而得到各个音符的声学特征信息;其中,声学特征信息包括但不限于过零率、短时能量等特征信息。Specifically, the audio information can be processed through a corresponding filter to obtain the acoustic characteristic information of the target music; wherein, the audio information can be processed through a Gaussian low-pass FIR filter to obtain the target music signal in PCM format Envelope, and then peak detection through a combination of frequency domain analysis and time domain analysis to determine the position of the note of the target music, and then obtain the acoustic feature information of each note; Among them, the acoustic feature information includes but is not limited to zero-crossing rate, Characteristic information such as short-term energy.
步骤S1013(图中未示出):基于提取得到的声学特征信息确定目标音乐的音乐要素特征。Step S1013 (not shown in the figure): determine the music element characteristics of the target music based on the extracted acoustic characteristic information.
对于本实施例,通过对提取得到的声学特征信息进行相应的分析,得到目标音乐的音乐要素特征。For this embodiment, through corresponding analysis of the extracted acoustic feature information, the music element features of the target music are obtained.
进一步地,音乐要素特征包括但不限于以下至少一项:Further, the characteristics of music elements include but are not limited to at least one of the following:
音强、音高、音长、节拍、节奏、速度和旋律。Intensity, pitch, length, beat, rhythm, tempo and melody.
对于本实施例,音乐要素特征包括但不限于音强、音高、音长、节拍、节奏、速度、旋律等特征。For this embodiment, the characteristics of music elements include, but are not limited to, features such as pitch, pitch, pitch, beat, rhythm, tempo, melody and so on.
其中,音高由物体振动的频率决定,如果振动的频率越快,则音越高,反之,则音越低;音长由物体振动时间的长短决定,振动时间越长,音就越长;音强指的是音的大小,它是由物体的振动幅度决定的,如果振动幅度越大,那么音越强;节奏是指用强弱组织起来的音的长短关系,节奏不尽与音的长短有关系,而且与音的强弱有关系;节拍就是有强有弱的相同的时间片段,按照一定的顺序循环重复,就形成了节拍;速度是衡量音乐节拍快慢的一个量,例如144BMP,表示每分钟有144个音符;旋律通常指若干乐音经过艺术构思而形成的有组织、节奏的序列,旋律是由许多音乐基本要素,如节奏、节拍、音强、音长等,有机结合而形成的,其中旋律有三种形式,分别为:下降式、水平式、上升式。Among them, the pitch is determined by the frequency of the object's vibration. If the frequency of the vibration is faster, the sound is higher, otherwise, the sound is lower; the sound length is determined by the length of the object's vibration time, the longer the vibration time, the longer the sound; Sound intensity refers to the size of the sound, which is determined by the vibration amplitude of the object. If the vibration amplitude is larger, the sound is stronger; rhythm refers to the length of the sound organized by the strength and weakness. The length is related, and it is related to the strength of the sound; the beat is the same time segment with strong and weak, repeated in a certain order to form the beat; the speed is a measure of the speed of the music beat, such as 144BMP, It means that there are 144 notes per minute; melody usually refers to an organized and rhythmic sequence formed by several musical sounds through artistic conception. Melody is formed by the organic combination of many basic elements of music, such as rhythm, beat, intensity, and length. There are three forms of melody, namely: descending, horizontal and ascending.
其中,从音乐的组织结构来看,音乐由乐段组成,乐段有音乐小节构 成,音乐小节由音符构成,音符是音乐的最基本要素;对目标音乐的音乐要素特征的提取可以先确定目标音乐音符的特征,其中,音符特征包括音高、音强、音长等基本特征,然后通过对目标音乐的音符特征进行分析得到目标音乐更复杂的音乐要素特征,如节拍、节奏、速度、旋律等特征。Among them, from the point of view of the organizational structure of music, music is composed of music segments, music segments are composed of music bars, music bars are composed of musical notes, and musical notes are the most basic elements of music; the extraction of the characteristics of the music elements of the target music can first determine the target The characteristics of musical notes, where the characteristics of notes include pitch, intensity, length and other basic characteristics, and then through the analysis of the note characteristics of the target music to obtain more complex musical element characteristics of the target music, such as beat, rhythm, speed, melody And other characteristics.
对于本公开实施例,音乐要素特征包括音强、音高、音长、节拍、节奏、速度、旋律,这样多样化的音乐要素特征为提升动画组成图片的播放效果与目标音乐的关联性提供了基础。For the embodiments of the present disclosure, the music element features include pitch, pitch, length, beat, rhythm, speed, and melody. Such diverse music element features provide an improvement in the correlation between the playback effect of the animation composition picture and the target music basis.
本公开的实施例提供了一种可能的实现方式,具体地,步骤S102可以包括步骤S1021和步骤S1022。The embodiments of the present disclosure provide a possible implementation manner. Specifically, step S102 may include step S1021 and step S1022.
步骤S1021(图中未示出):根据目标音乐的音乐要素特征,确定目标音乐的音乐类型。Step S1021 (not shown in the figure): determine the music type of the target music according to the music element characteristics of the target music.
具体地,可以基于目标音乐的音乐要素特征得到目标音乐的特征向量,然后可以通过预训练的神经网络模型得到目标音乐的音乐类型;其中,目标音乐的音乐类型可以是目标音乐的某一乐段的音乐类型,也可以是目标音乐整体的音乐类型;其中,目标音乐的音乐类型包括但不限于轻缓、激烈、平静等。Specifically, the feature vector of the target music can be obtained based on the music element characteristics of the target music, and then the music type of the target music can be obtained through a pre-trained neural network model; wherein, the music type of the target music can be a certain segment of the target music The music type of may also be the music type of the target music as a whole; where the music type of the target music includes but is not limited to gentle, intense, calm, etc.
步骤S1022(图中未示出):基于目标音乐的音乐类型,确定与目标音乐匹配的多张动画组成图片。Step S1022 (not shown in the figure): based on the music type of the target music, determine a plurality of animation composition pictures matching the target music.
具体地,根据目标音乐的音乐类型不同,分别确定与目标音乐相匹配的多张动画组成图片。Specifically, according to different music types of the target music, a plurality of animation composition pictures matching the target music are determined respectively.
对于本公开实施例,根据目标音乐的音乐类型确定与目标音乐匹配的多张动画组成图片,从而提升了动画组成图片与目标音乐的关联性。For the embodiments of the present disclosure, multiple animation composition pictures matching the target music are determined according to the music type of the target music, thereby improving the relevance of the animation composition pictures and the target music.
本公开的再一实施例提供了一种可能的实现方式,具体地,步骤S1022,可以包括步骤S10221和步骤S10222。Yet another embodiment of the present disclosure provides a possible implementation manner. Specifically, step S1022 may include step S10221 and step S10222.
步骤S10221(图中未示出):确定与目标音乐的音乐类型相匹配的图片场景类型;Step S10221 (not shown in the figure): determine the picture scene type that matches the music type of the target music;
对于本实施例,确定与目标音乐的音乐类型相匹配的图片场景类型,例如,对于轻缓型的音乐可以用于旅游风景的场景,对于激烈型的音乐可 以用于运动竞技比赛、摇滚音乐会现场等场景。For this embodiment, determine the picture scene type that matches the music type of the target music, for example, for gentle music, it can be used in the scene of tourist scenery, for intense music, it can be used in sports competitions, rock concerts Scenes such as the scene.
步骤S10222(图中未示出):确定符合图片场景类型的多张动画组成图片。Step S10222 (not shown in the figure): it is determined that a plurality of animations conforming to the picture scene type constitute a picture.
具体地,可以通过预先已经进行场景分类处理的图片库,基于图片的场景类型标签确定符合场景类型的多张动画组成图片;也可以通过相应的图片识别方法对多个候选图片进行识别,从而确定符合图片场景类型的多张动画组成图片。Specifically, it is possible to determine, based on the scene type tag of the picture, a plurality of animation composition pictures matching the scene type through a picture library that has been subjected to scene classification processing in advance; or to identify multiple candidate pictures through corresponding picture recognition methods to determine Multiple animations that match the type of picture scene make up the picture.
对于本公开实施例,根据与目标音乐的音乐类型相匹配的图片场景类型确定多张动画组成图片,解决了如何根据目标音乐的类型确定动画组成图片的问题。For the embodiment of the present disclosure, multiple animation composition pictures are determined according to the picture scene type that matches the music type of the target music, which solves the problem of how to determine the animation composition picture according to the type of the target music.
本公开的实施例提供了一种可能的实现方式,步骤S102可以包括步骤S1023和步骤S1024。An embodiment of the present disclosure provides a possible implementation manner, and step S102 may include step S1023 and step S1024.
步骤S1023(图中未示出):依据目标音乐的音乐要素特征对目标音乐进行分段处理,得到多个音乐片段。Step S1023 (not shown in the figure): perform segmentation processing on the target music according to the music element characteristics of the target music to obtain multiple music fragments.
对于本实施例,根据目标音乐的音乐要素特征对目标音乐进行分段处理,得到多个音乐片段;其中,音乐片段对应的可以是目标音乐的一个或多个音乐小节,也可以是目标音乐的一个是乐段;其中,音乐小节的划分可以基于音符之间的音乐要素的强弱特征得到的,乐段的划分可以是基于划分的音乐小节之间的相似度得到的。For this embodiment, the target music is segmented according to the music element characteristics of the target music to obtain multiple music fragments; wherein, the music fragments may correspond to one or more music bars of the target music, or may be the target music One is the music segment; among them, the division of the music bars can be obtained based on the strong and weak characteristics of the musical elements between the notes, and the division of the music bars can be based on the similarity between the divided music bars.
步骤S1024(图中未示出):依据各个音乐片段分别对应的音乐要素特征确定各个音乐段内对应的各张动画组成图片的动画播放效果,该动画播放效果包括转场模式、动画特效以及滤镜模式中的至少一项。Step S1024 (not shown in the figure): according to the characteristics of the music elements corresponding to the respective music fragments, determine the animation playback effect of the corresponding animations in each music segment to constitute the picture, the animation playback effect includes the transition mode, animation special effects and filtering At least one item in mirror mode.
对于本实施例,依据各个音乐片段分别对应的音乐要素特征确定各个音乐段内对应的各张动画组成图片的动画播放效果,其中,可以根据某一特征或多个特征的组合确定各张动画组成图片的动画播放效果,如可以根据节奏、速度确定动画组成图片的转场方式,根据旋律确定相应的滤镜模式。For this embodiment, the animation playback effect of each animation component picture corresponding to each music segment is determined according to the music element characteristics corresponding to each music segment, wherein each animation component can be determined according to a certain feature or a combination of multiple features The animation playback effect of the picture, for example, the transition mode of the animation composition picture can be determined according to the rhythm and speed, and the corresponding filter mode can be determined according to the melody.
对于本公开实施例,根据目标音乐的音乐要素特征对目标音乐进行分 段处理得到多个音乐片段,然后根据各个音乐片段分别对应的音乐要素特征确定各个音乐段内对应的各张动画组成图片的动画播放效果,解决了如何根据音乐要素特征确定动画组成图片的动画播放效果的问题。For the embodiment of the present disclosure, the target music is segmented according to the music element characteristics of the target music to obtain a plurality of music fragments, and then the corresponding animation composition pictures in each music segment are determined according to the music element characteristics corresponding to the respective music fragments The animation playback effect solves the problem of how to determine the animation playback effect of an animation composition picture according to the characteristics of music elements.
图2为本公开的一个实施例中提供的一种动画生成装置,该装置20包括:第一确定模块201、第二确定模块202、动画生成模块203和合成处理模块204,其中:FIG. 2 is an animation generation device provided in an embodiment of the present disclosure. The device 20 includes: a first determination module 201, a second determination module 202, an animation generation module 203, and a synthesis processing module 204, where:
第一确定模块201用于通过预定的语音识别方法确定目标音乐的音乐要素特征;The first determination module 201 is used to determine the music element characteristics of the target music through a predetermined voice recognition method;
第二确定模块202用于确定多张动画组成图片,并根据第一确定模块201确定的目标音乐的音乐要素特征确定与各张动画组成图片相匹配的动画播放效果;The second determination module 202 is used to determine a plurality of animation composition pictures, and determine the animation playback effect matching each animation composition picture according to the music element characteristics of the target music determined by the first determination module 201;
动画生成模块203用于根据第二确定模块202确定的多张动画组成图片及与各张动画组成图片相匹配的动画播放效果生成目标动画;The animation generation module 203 is used to generate a target animation according to the plurality of animation composition pictures determined by the second determination module 202 and the animation playback effect matching each animation composition picture;
合成处理模块204用于将目标音乐与动画生成模块203生成的目标动画进行合成处理,以使得目标音乐与目标动画能相应地同步播放展示。The synthesis processing module 204 is used for synthesizing the target music and the target animation generated by the animation generating module 203, so that the target music and the target animation can be synchronously played and displayed accordingly.
本实施例的装置可执行本公开上述实施例中提供的一种动画生成方法,其实现原理相类似,此处不再赘述。The device of this embodiment can execute an animation generation method provided in the above embodiments of the present disclosure, and its implementation principles are similar, and will not be repeated here.
本公开实施例提供了一种可能的动画生成装置,如图3所示,本实施例的装置30可以包括:第一确定模块301、第二确定模块302、动画生成模块303和合成处理模块304。An embodiment of the present disclosure provides a possible animation generating apparatus. As shown in FIG. 3, the apparatus 30 of this embodiment may include: a first determining module 301, a second determining module 302, an animation generating module 303, and a synthesis processing module 304 .
第一确定模块301用于通过预定的语音识别方法确定目标音乐的音乐要素特征。The first determination module 301 is used to determine the music element characteristics of the target music through a predetermined voice recognition method.
其中,图3中的第一确定模块301与图2中的第一确定模块201的功能相同或者相似。The functions of the first determining module 301 in FIG. 3 and the first determining module 201 in FIG. 2 are the same or similar.
第二确定模块302用于确定多张动画组成图片,并根据第一确定模块301确定的目标音乐的音乐要素特征确定与各张动画组成图片相匹配的动画播放效果。The second determination module 302 is used to determine a plurality of animation composition pictures, and determine an animation playing effect matching each animation composition picture according to the music element characteristics of the target music determined by the first determination module 301.
其中,图3中的第二确定模块302与图2中的第二确定模块202的功 能相同或者相似。The functions of the second determination module 302 in FIG. 3 and the second determination module 202 in FIG. 2 are the same or similar.
动画生成模块303用于根据第二确定模块302确定的多张动画组成图片及与各张动画组成图片相匹配的动画播放效果生成目标动画。The animation generation module 303 is used to generate a target animation according to the plurality of animation component pictures determined by the second determination module 302 and the animation playback effect matching each animation component picture.
其中,图3中的动画生成模块303与图2中的动画生成模块203的功能相同或者相似。The functions of the animation generating module 303 in FIG. 3 and the animation generating module 203 in FIG. 2 are the same or similar.
合成处理模块304用于将目标音乐与动画生成模块303生成的目标动画进行合成处理,以使得目标音乐与目标动画能相应地同步播放展示。The synthesis processing module 304 is used for synthesizing the target music and the target animation generated by the animation generating module 303, so that the target music and the target animation can be synchronously played and displayed accordingly.
其中,图3中的合成处理模块304与图2中的合成处理模块204的功能相同或者相似。The functions of the synthesis processing module 304 in FIG. 3 and the synthesis processing module 204 in FIG. 2 are the same or similar.
根据本公开的实施例,第一确定模块301可以包括第一提取单元3011、第二提取单元3012及第一确定单元3013,其中:According to an embodiment of the present disclosure, the first determination module 301 may include a first extraction unit 3011, a second extraction unit 3012, and a first determination unit 3013, where:
第一提取单元3011用于提取目标音乐的音频信息;The first extraction unit 3011 is used to extract audio information of the target music;
第二提取单元3012用于对第一提取单元3011提取到的音频信息进行声学特征提取,得到相应的声学特征信息;The second extraction unit 3012 is used to perform acoustic feature extraction on the audio information extracted by the first extraction unit 3011 to obtain corresponding acoustic feature information;
第一确定单元3013用于基于第二提取单元3012提取得到的声学特征信息确定目标音乐的音乐要素特征。The first determination unit 3013 is used to determine the music element characteristics of the target music based on the acoustic characteristic information extracted by the second extraction unit 3012.
进一步地,音乐要素特征包括以下至少一项:Further, the music element characteristics include at least one of the following:
音高、音强、音长、节拍、节奏、速度和旋律。Pitch, pitch, pitch, beat, rhythm, tempo and melody.
根据本公开的实施例,第二确定模块302可以包括第二确定单元3021及第三确定单元3022,其中:According to an embodiment of the present disclosure, the second determination module 302 may include a second determination unit 3021 and a third determination unit 3022, where:
第二确定单元3021用于根据目标音乐的音乐要素特征,确定目标音乐的音乐类型;The second determining unit 3021 is used to determine the music type of the target music according to the music element characteristics of the target music;
第三确定单元3022用于基于第二确定单元3021确定的目标音乐的音乐类型,确定与目标音乐匹配的多张动画组成图片。The third determination unit 3022 is used to determine a plurality of animation composition pictures matching the target music based on the music type of the target music determined by the second determination unit 3021.
根据本公开的实施例,第三确定单元3022还可以被配置成确定与目标音乐的音乐类型相匹配的图片场景类型,并确定符合图片场景类型的多张动画组成图片。According to an embodiment of the present disclosure, the third determining unit 3022 may be further configured to determine a picture scene type that matches the music type of the target music, and determine a plurality of animation constituent pictures that conform to the picture scene type.
根据本公开的实施例,第二确定模块302还可以包括分段处理单元 3023以及第四确定单元3024,其中:According to an embodiment of the present disclosure, the second determination module 302 may further include a segment processing unit 3023 and a fourth determination unit 3024, where:
分段处理单元3023用于依据目标音乐的音乐要素特征对目标音乐进行分段处理,得到多个音乐片段;The segmentation processing unit 3023 is used to segment the target music according to the characteristics of the music elements of the target music to obtain multiple music segments;
第四确定单元3024用于依据分段处理单元3023分段处理得到的各个音乐片段分别对应的音乐要素特征确定各个音乐段内对应的各张动画组成图片的动画播放效果,该动画播放效果包括转场模式、动画特效以及滤镜模式中的至少一项。The fourth determining unit 3024 is used to determine the animation playing effect of each animation composition picture corresponding to each music segment according to the music element characteristics corresponding to each music segment obtained by the segment processing by the segment processing unit 3023. At least one of field mode, animation effects, and filter mode.
本公开实施例提供的动画生成装置,适用于上述实施例所示的方法,在此不再赘述。The animation generating device provided by the embodiment of the present disclosure is applicable to the method shown in the above embodiment, and will not be repeated here.
在一个可选的实施例中,提供了一种电子设备,如图4所示,其示出了适于用来实现本公开实施例的电子设备(例如终端设备或服务器)40的结构示意图。本公开实施例中的终端设备可以包括但不限于诸如移动电话机、笔记本电脑、数字广播接收器、PDA(个人数字助理)、PAD(平板电脑)、PMP(便携式多媒体播放器)、车载终端(例如车载导航终端)等的移动终端以及诸如数字TV、台式计算机等等的固定终端。图4示出的电子设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。In an alternative embodiment, an electronic device is provided, as shown in FIG. 4, which shows a schematic structural diagram of an electronic device (eg, terminal device or server) 40 suitable for implementing the embodiments of the present disclosure. The terminal devices in the embodiments of the present disclosure may include, but are not limited to, mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablet computers), PMPs (portable multimedia players), and in-vehicle terminals ( Mobile terminals such as car navigation terminals) and fixed terminals such as digital TVs, desktop computers, etc. The electronic device shown in FIG. 4 is only an example, and should not bring any limitation to the functions and use scope of the embodiments of the present disclosure.
如图4所示,电子设备40可以包括处理装置(例如中央处理器、图形处理器等)401,其可以根据存储在只读存储器(ROM)402中的程序或者从存储装置408加载到随机存取存储器(RAM)403中的程序而执行各种适当的动作和处理。在RAM 403中,还存储有电子设备40操作所需的各种程序和数据。处理装置401、ROM 402以及RAM 403通过总线404彼此相连。输入/输出(I/O)接口405也连接至总线404。As shown in FIG. 4, the electronic device 40 may include a processing device (for example, a central processing unit, a graphics processor, etc.) 401, which may be loaded into a random storage according to a program stored in a read-only memory (ROM) 402 or from the storage device 408 The program in the memory (RAM) 403 is fetched to perform various appropriate actions and processes. In the RAM 403, various programs and data necessary for the operation of the electronic device 40 are also stored. The processing device 401, ROM 402, and RAM 403 are connected to each other via a bus 404. An input/output (I/O) interface 405 is also connected to the bus 404.
通常,以下装置可以连接至I/O接口405:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置406;包括例如液晶显示器(LCD)、扬声器、振动器等的输出装置407;包括例如磁带、硬盘等的存储装置408;以及通信装置409。通信装置409可以允许电子设备40与其他设备进行无线或有线通信以交换数据。虽然图4示出了具有各种装置的电子设备40,但是应理解的是,并不要求实施或具 备所有示出的装置。可以替代地实施或具备更多或更少的装置。Generally, the following devices can be connected to the I/O interface 405: including input devices 406 such as touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; including, for example, liquid crystal display (LCD), speaker, vibration An output device 407 such as a storage device; a storage device 408 including, for example, a magnetic tape or a hard disk; and a communication device 409. The communication device 409 may allow the electronic device 40 to perform wireless or wired communication with other devices to exchange data. Although FIG. 4 shows an electronic device 40 having various devices, it should be understood that it is not required to implement or have all the devices shown. More or fewer devices may be implemented or provided instead.
本实施例提供了一种电子设备适用于上述方法实施例,在此不再赘述。This embodiment provides an electronic device applicable to the foregoing method embodiments, and details are not described herein again.
特别地,根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置409从网络上被下载和安装,或者从存储装置408被安装,或者从ROM 402被安装。在该计算机程序被处理装置401执行时,执行本公开实施例的方法中限定的上述功能。In particular, according to an embodiment of the present disclosure, the process described above with reference to the flowchart may be implemented as a computer software program. For example, embodiments of the present disclosure include a computer program product that includes a computer program carried on a computer-readable medium, the computer program containing program code for performing the method shown in the flowchart. In such an embodiment, the computer program may be downloaded and installed from the network through the communication device 409, or from the storage device 408, or from the ROM 402. When the computer program is executed by the processing device 401, the above-mentioned functions defined in the method of the embodiments of the present disclosure are executed.
需要说明的是,本公开上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、RF(射频)等等,或者上述的任意合适的组合。It should be noted that, the above-mentioned computer-readable medium in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium or any combination of the two. The computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or any combination of the above. More specific examples of computer-readable storage media may include, but are not limited to: electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable Programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the foregoing. In the present disclosure, the computer-readable storage medium may be any tangible medium containing or storing a program, and the program may be used by or in combination with an instruction execution system, apparatus, or device. In this disclosure, the computer-readable signal medium may include a data signal that is propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried. This propagated data signal can take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. The computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium, and the computer-readable signal medium may send, propagate, or transmit a program for use by or in combination with an instruction execution system, apparatus, or device . The program code contained on the computer-readable medium may be transmitted using any appropriate medium, including but not limited to: electric wires, optical cables, RF (radio frequency), etc., or any suitable combination of the foregoing.
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。The computer-readable medium may be included in the above-mentioned electronic device; or it may exist alone without being assembled into the electronic device.
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个 程序被该电子设备执行时,使得该电子设备:执行上述方法实施例所示的动画生成方法。The computer-readable medium carries one or more programs. When the one or more programs are executed by the electronic device, the electronic device is caused to: execute the animation generation method shown in the above method embodiment.
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的计算机程序代码,上述程序设计语言包括面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。The computer program code for performing the operations of the present disclosure can be written in one or more programming languages or a combination thereof. The above programming languages include object-oriented programming languages such as Java, Smalltalk, C++, as well as conventional Procedural programming language-such as "C" language or similar programming language. The program code may be executed entirely on the user's computer, partly on the user's computer, as an independent software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In situations involving remote computers, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through an Internet service provider Internet connection).
本实施例提供了一种计算机可读存储介质适用于上述方法实施例,在此不再赘述。This embodiment provides a computer-readable storage medium suitable for the foregoing method embodiments, and details are not described herein again.
附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowcharts and block diagrams in the drawings illustrate the possible implementation architecture, functions, and operations of systems, methods, and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagram may represent a module, program segment, or part of code that contains one or more logic functions Executable instructions. It should also be noted that in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks represented in succession may actually be executed in parallel, and they may sometimes be executed in reverse order, depending on the functions involved. It should also be noted that each block in the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts, can be implemented with dedicated hardware-based systems that perform specified functions or operations Or, it can be realized by a combination of dedicated hardware and computer instructions.
描述于本公开实施例中所涉及到的单元可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,单元的名称在某种情况下并不构成对该单元本身的限定。The units described in the embodiments of the present disclosure may be implemented in software or hardware. Among them, the name of the unit does not constitute a limitation on the unit itself under certain circumstances.
以上描述仅为本公开的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本公开中所涉及的公开范围,并不限于上述技术 特征的特定组合而成的技术方案,同时也应涵盖在不脱离上述公开构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above description is only the preferred embodiment of the present disclosure and the explanation of the applied technical principles. Those skilled in the art should understand that the scope of the disclosure in this disclosure is not limited to the technical solutions formed by the specific combination of the above technical features, but should also cover the above technical features or without departing from the above disclosed concepts. Other technical solutions formed by arbitrary combinations of equivalent features. For example, the above features and the technical features disclosed in this disclosure (but not limited to) having similar functions are replaced with each other to form a technical solution.

Claims (10)

  1. 一种动画生成方法,包括:An animation generation method, including:
    通过预定的语音识别方法确定目标音乐的音乐要素特征;Determine the music element characteristics of the target music through a predetermined voice recognition method;
    确定多张动画组成图片,并根据所述目标音乐的音乐要素特征确定与各张动画组成图片相匹配的动画播放效果;Determine multiple animation composition pictures, and determine the animation playback effect matching each animation composition picture according to the music element characteristics of the target music;
    根据所述多张动画组成图片及所述与各张动画组成图片相匹配的动画播放效果生成目标动画;Generating a target animation according to the multiple animation composition pictures and the animation playback effect matching each animation composition picture;
    将所述目标音乐与所述目标动画进行合成处理,以使得所述目标音乐与所述目标动画能相应地同步播放展示。Synthesizing the target music and the target animation, so that the target music and the target animation can be synchronously played and displayed accordingly.
  2. 根据权利要求1所述的方法,其中,确定目标音乐的音乐要素特征包括:The method according to claim 1, wherein determining the music element characteristics of the target music comprises:
    提取所述目标音乐的音频信息;Extract audio information of the target music;
    对提取到的所述音频信息进行声学特征提取,得到相应的声学特征信息;Acoustic feature extraction is performed on the extracted audio information to obtain corresponding acoustic feature information;
    基于得到的所述声学特征信息确定所述目标音乐的音乐要素特征。The music element characteristics of the target music are determined based on the obtained acoustic characteristic information.
  3. 根据权利要求1或2所述的方法,其中,所述音乐要素特征包括以下至少一项:The method according to claim 1 or 2, wherein the music element characteristics include at least one of the following:
    音高、音强、音长、节拍、节奏、速度和旋律。Pitch, pitch, pitch, beat, rhythm, tempo and melody.
  4. 根据权利要求1所述的方法,其中,确定多张动画组成图片包括:The method according to claim 1, wherein the determining of the plurality of animation composition pictures comprises:
    根据所述目标音乐的音乐要素特征,确定所述目标音乐的音乐类型;Determine the music type of the target music according to the music element characteristics of the target music;
    基于所述目标音乐的音乐类型,确定与所述目标音乐相匹配的多张动画组成图片。Based on the music type of the target music, a plurality of animation composition pictures matching the target music are determined.
  5. 根据权利要求4所述的方法,其中,确定与所述目标音乐相匹配的多张动画组成图片包括:The method according to claim 4, wherein determining a plurality of animation composition pictures matching the target music includes:
    确定与所述目标音乐的音乐类型相匹配的图片场景类型;Determine a picture scene type that matches the music type of the target music;
    确定符合所述图片场景类型的多张动画组成图片。It is determined that a plurality of animation composition pictures conforming to the picture scene type.
  6. 根据权利要求1所述的方法,其中,确定与各张动画组成图片相 匹配的动画播放效果包括:The method according to claim 1, wherein the determination of the animation playback effect matching each animation composition picture comprises:
    依据所述目标音乐的音乐要素特征对所述目标音乐进行分段处理,得到多个音乐片段;Segmenting the target music according to the characteristics of the music elements of the target music to obtain multiple music fragments;
    依据各个音乐片段分别对应的音乐要素特征,确定各个音乐段内对应的各张动画组成图片的动画播放效果,所述播放效果包括转场模式、动画特效以及滤镜模式中的至少一项。According to the characteristics of the music elements corresponding to the respective music segments, the animation playback effect of each animation corresponding to each music segment is determined, and the playback effect includes at least one of a transition mode, an animation special effect, and a filter mode.
  7. 一种动画生成装置,包括:An animation generating device, including:
    第一确定模块,用于通过预定的语音识别方法确定目标音乐的音乐要素特征;The first determining module is used to determine the music element characteristics of the target music through a predetermined voice recognition method;
    第二确定模块,用于确定多张动画组成图片,并根据所述第一确定模块确定的所述目标音乐的音乐要素特征确定与各张动画组成图片相匹配的动画播放效果;A second determining module, configured to determine a plurality of animation composition pictures, and determine an animation playing effect matching each animation composition picture according to the music element characteristics of the target music determined by the first determination module;
    动画生成模块,用于根据所述第二确定模块确定的所述多张动画组成图片及所述与各张动画组成图片相匹配的动画播放效果生成目标动画;An animation generation module, configured to generate a target animation according to the plurality of animation component pictures determined by the second determination module and the animation playback effect matching each animation component picture;
    合成处理模块,用于将所述目标音乐与所述动画生成模块生成的所述目标动画进行合成处理,以使得所述目标音乐与所述目标动画能相应地同步播放展示。The synthesis processing module is used for synthesizing the target music and the target animation generated by the animation generating module, so that the target music and the target animation can be synchronously played and displayed accordingly.
  8. 根据权利要求7所述的装置,其中,所述第一确定模块包括:The apparatus according to claim 7, wherein the first determination module comprises:
    第一提取单元,用于提取所述目标音乐的音频信息;A first extraction unit for extracting audio information of the target music;
    第二提取单元,用于对所述第一提取单元提取到的所述音频信息进行声学特征提取,得到相应的声学特征信息;A second extraction unit, configured to perform acoustic feature extraction on the audio information extracted by the first extraction unit to obtain corresponding acoustic feature information;
    第一确定单元,用于基于所述第二提取单元得到的所述声学特征信息确定所述目标音乐的音乐要素特征。The first determining unit is configured to determine the music element feature of the target music based on the acoustic feature information obtained by the second extracting unit.
  9. 一种电子设备,包括:An electronic device, including:
    处理器;processor;
    存储器,所述存储器存储有至少一个应用程序,当所述至少一个应用程序被所述处理器执行时,使得所述电子设备执行根据权利要求1至6中任一项所述的动画生成方法。A memory that stores at least one application program, and when the at least one application program is executed by the processor, causes the electronic device to execute the animation generation method according to any one of claims 1 to 6.
  10. 一种计算机可读存储介质,所述计算机存储介质用于存储计算机指令,当该计算机指令在计算机上运行时,使得计算机执行权利要求1至6中任一项所述的动画生成方法。A computer-readable storage medium for storing computer instructions, which when executed on a computer, causes the computer to execute the animation generation method according to any one of claims 1 to 6.
PCT/CN2018/125392 2018-12-07 2018-12-29 Animation generation method and apparatus, electronic device, and computer-readable storage medium WO2020113733A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811496521.5 2018-12-07
CN201811496521.5A CN109615682A (en) 2018-12-07 2018-12-07 Animation producing method, device, electronic equipment and computer readable storage medium

Publications (1)

Publication Number Publication Date
WO2020113733A1 true WO2020113733A1 (en) 2020-06-11

Family

ID=66008353

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/125392 WO2020113733A1 (en) 2018-12-07 2018-12-29 Animation generation method and apparatus, electronic device, and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN109615682A (en)
WO (1) WO2020113733A1 (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110097618B (en) * 2019-05-09 2023-05-12 广州小鹏汽车科技有限公司 Music animation control method and device, vehicle and storage medium
CN110209844B (en) * 2019-05-17 2021-08-31 腾讯音乐娱乐科技(深圳)有限公司 Multimedia data matching method, device and storage medium
CN111611430A (en) * 2020-05-26 2020-09-01 广州酷狗计算机科技有限公司 Song playing method, device, terminal and storage medium
CN113852521A (en) * 2020-06-09 2021-12-28 广东美的制冷设备有限公司 Household appliance, control method thereof and computer readable storage medium
CN113938751B (en) * 2020-06-29 2023-12-22 抖音视界有限公司 Video transition type determining method, device and storage medium
CN113938744B (en) * 2020-06-29 2024-01-23 抖音视界有限公司 Video transition type processing method, device and storage medium
CN111857923B (en) * 2020-07-17 2022-10-28 北京字节跳动网络技术有限公司 Special effect display method and device, electronic equipment and computer readable medium
CN112164128A (en) * 2020-09-07 2021-01-01 广州汽车集团股份有限公司 Music visual interaction method and computer equipment for vehicle-mounted multimedia
CN112804578A (en) * 2021-01-28 2021-05-14 广州虎牙科技有限公司 Atmosphere special effect generation method and device, electronic equipment and storage medium
CN113365134B (en) * 2021-06-02 2022-11-01 北京字跳网络技术有限公司 Audio sharing method, device, equipment and medium
CN116152393A (en) * 2021-11-18 2023-05-23 脸萌有限公司 Video generation method, device, equipment and storage medium
CN116800908A (en) * 2022-03-18 2023-09-22 北京字跳网络技术有限公司 Video generation method and device, electronic equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101826217A (en) * 2010-05-07 2010-09-08 上海交通大学 Rapid generation method for facial animation
US20140178043A1 (en) * 2012-12-20 2014-06-26 International Business Machines Corporation Visual summarization of video for quick understanding
CN107172485A (en) * 2017-04-25 2017-09-15 北京百度网讯科技有限公司 A kind of method and apparatus for being used to generate short-sighted frequency
CN108428441A (en) * 2018-02-09 2018-08-21 咪咕音乐有限公司 Multimedia file producting method, electronic equipment and storage medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4660861B2 (en) * 2006-09-06 2011-03-30 富士フイルム株式会社 Music image synchronized video scenario generation method, program, and apparatus
CN101853668B (en) * 2010-03-29 2014-10-29 北京中星微电子有限公司 Method and system for transforming MIDI music into cartoon
CN101901595B (en) * 2010-05-05 2014-10-29 北京中星微电子有限公司 Method and system for generating animation according to audio music
CN105224581B (en) * 2014-07-03 2019-06-21 北京三星通信技术研究有限公司 The method and apparatus of picture are presented when playing music
CN105227864A (en) * 2015-10-16 2016-01-06 南阳师范学院 A kind of picture generates animation and splices with video segment the video editing method synthesized
CN105550251A (en) * 2015-12-08 2016-05-04 小米科技有限责任公司 Picture play method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101826217A (en) * 2010-05-07 2010-09-08 上海交通大学 Rapid generation method for facial animation
US20140178043A1 (en) * 2012-12-20 2014-06-26 International Business Machines Corporation Visual summarization of video for quick understanding
CN107172485A (en) * 2017-04-25 2017-09-15 北京百度网讯科技有限公司 A kind of method and apparatus for being used to generate short-sighted frequency
CN108428441A (en) * 2018-02-09 2018-08-21 咪咕音乐有限公司 Multimedia file producting method, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN109615682A (en) 2019-04-12

Similar Documents

Publication Publication Date Title
WO2020113733A1 (en) Animation generation method and apparatus, electronic device, and computer-readable storage medium
WO2020253806A1 (en) Method and apparatus for generating display video, device and storage medium
CN109543064B (en) Lyric display processing method and device, electronic equipment and computer storage medium
WO2020098115A1 (en) Subtitle adding method, apparatus, electronic device, and computer readable storage medium
CN111798821B (en) Sound conversion method, device, readable storage medium and electronic equipment
CN111782576B (en) Background music generation method and device, readable medium and electronic equipment
WO2021259300A1 (en) Sound effect adding method and apparatus, storage medium, and electronic device
WO2021057740A1 (en) Video generation method and apparatus, electronic device, and computer readable medium
CN112153460B (en) Video dubbing method and device, electronic equipment and storage medium
JP2019091014A (en) Method and apparatus for reproducing multimedia
WO2023051246A1 (en) Video recording method and apparatus, device, and storage medium
JP2019015951A (en) Wake up method for electronic device, apparatus, device and computer readable storage medium
US11272136B2 (en) Method and device for processing multimedia information, electronic equipment and computer-readable storage medium
CN112908292A (en) Text voice synthesis method and device, electronic equipment and storage medium
WO2023040520A1 (en) Method and apparatus for performing music matching of video, and computer device and storage medium
WO2022237665A1 (en) Speech synthesis method and apparatus, electronic device, and storage medium
WO2022160603A1 (en) Song recommendation method and apparatus, electronic device, and storage medium
JP2023541182A (en) Custom tone singing voice synthesis method, device, electronic equipment and storage medium
CN110413834A (en) Voice remark method of modifying, system, medium and electronic equipment
WO2024001548A1 (en) Song list generation method and apparatus, and electronic device and storage medium
WO2023061229A1 (en) Video generation method and device
WO2022143530A1 (en) Audio processing method and apparatus, computer device, and storage medium
CN109495786B (en) Pre-configuration method and device of video processing parameter information and electronic equipment
CN115619897A (en) Image processing method, image processing device, electronic equipment and storage medium
KR102431737B1 (en) Method of searching highlight in multimedia data and apparatus therof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18941956

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 22.09.2021)

122 Ep: pct application non-entry in european phase

Ref document number: 18941956

Country of ref document: EP

Kind code of ref document: A1