CN107122393A

CN107122393A - Electron album generation method and device

Info

Publication number: CN107122393A
Application number: CN201710138877.0A
Authority: CN
Inventors: 沙安澜
Original assignee: Beijing Small Mutual Entertainment Technology Co Ltd
Current assignee: Beijing Small Mutual Entertainment Technology Co Ltd
Priority date: 2017-03-09
Filing date: 2017-03-09
Publication date: 2017-09-01
Anticipated expiration: 2037-03-09
Also published as: CN107122393B

Abstract

The invention discloses a kind of electron album generation method and device.Wherein method includes：Content analysis is carried out to photo, the descriptive information of photo is generated；And Video Composition is carried out according to photo and descriptive information, to generate target electronic photograph album.Thus, in the generating process of electron album, the making of electron album also can intelligently be completed without requiring that user possesses image procossing knowledge, while manual manufacture cost is saved, photograph album content is enriched, photograph album is improved interesting, imagination space has been expanded, and has improved Consumer's Experience.

Description

Electron album generation method and device

Technical field

The present invention relates to technical field of image processing, more particularly to a kind of electron album generation method and device.

Background technology

With the development of computer technology and multimedia technology, the multimedia resource that people touch becomes increasingly abundant.With People interest is widened, and many people directly shoot video with video camera or digital camera now, on computers using playing Device software watches video frequency program, has become a kind of very universal study, leisure, entertainment way of user.

At the same time, some simply marvelous pictures can be concerned about on film, and want to preserve, by these Excellent picture constitutes an exquisite electron album.Electron album is to produce and generate video by one group of given photo, is led to Reason condition can be with background music and descriptive text.In correlation technique, Most electronic photograph album Making programme is all user's base In Video editing software, descriptive text and background music are mixed on photo to complete electron album.

But, the problem of presently, there are is：Based on Video editing software to complete electron album, this production method belongs to Pure artifact is, it is necessary to which the user for possessing certain professional knowledge could complete descriptive text of being arranged in pairs or groups on photo, significantly Manual manufacture cost is increased, it is not intelligent.

The content of the invention

The purpose of the present invention is intended at least solve one of above-mentioned technical problem to a certain extent.

Therefore, first purpose of the present invention is to propose a kind of electron album generation method.This method is saving artificial While cost of manufacture, photograph album content is enriched, photograph album interest is improved, has expanded imagination space, and improve user's body Test.

Second object of the present invention is to propose a kind of electron album generating means.

To reach above-mentioned purpose, the electron album generation method that first aspect present invention embodiment is proposed, including：To photo Content analysis is carried out, the descriptive information of the photo is generated；And regarded according to the photo and the descriptive information Frequency is synthesized, to generate target electronic photograph album.

The electron album generation method of the embodiment of the present invention, can generate retouching for photo by carrying out content analysis to photo The property stated information, and Video Composition is carried out according to photo and descriptive information, to generate target electronic photograph album.It is based on photograph image Content analyzed, and appropriate descriptive information is provided for the photo based on analysis result automatically, so, in electron album Generating process in, also can intelligently complete the making of electron album without requiring that user possesses image procossing knowledge, section While less manpower cost of manufacture, photograph album content is enriched, photograph album interest is improved, has expanded imagination space, and improve Consumer's Experience.

To reach above-mentioned purpose, the electron album generating means that second aspect of the present invention embodiment is proposed, including：It is descriptive Information generating module, for carrying out content analysis to photo, generates the descriptive information of the photo；And electron album generation Module, for carrying out Video Composition according to the photo and the descriptive information, to generate target electronic photograph album.

The electron album generating means of the embodiment of the present invention, can carry out content by descriptive information generation module to photo Analysis, generates the descriptive information of photo, and electron album generation module carries out Video Composition according to photo and descriptive information, with Generate target electronic photograph album.I.e. the content based on photograph image is analyzed, and is provided automatically for the photo based on analysis result Appropriate descriptive information, so, in the generating process of electron album, also can without requiring that user possesses image procossing knowledge The making of electron album is intelligently completed, while manual manufacture cost is saved, photograph album content is enriched, improves photograph album Interest, has expanded imagination space, and improve Consumer's Experience.

The additional aspect of the present invention and advantage will be set forth in part in the description, and will partly become from the following description Obtain substantially, or recognized by the practice of the present invention.

Brief description of the drawings

The above-mentioned and/or additional aspect and advantage of the present invention will become from description of the accompanying drawings below to embodiment is combined Substantially and be readily appreciated that, wherein：

Fig. 1 is the flow chart of electron album generation method according to an embodiment of the invention；

Fig. 2 is the flow chart of the electron album generation method according to a specific embodiment of the invention；

Fig. 3 (a), (b), (c) are the signal of the classification results of object type in photo according to embodiments of the present invention with (d) Figure；

Fig. 3 (e) is the schematic diagram of the vector representation of word " present " according to embodiments of the present invention；

Fig. 4 is the exemplary plot of vgg16 model structures according to embodiments of the present invention；

Fig. 5 is the flow chart of the alternative descriptive information set of generation according to an embodiment of the invention；

Fig. 6 is the flow chart of generation descriptive information according to an embodiment of the invention；

Fig. 7 is the structural representation of electron album generating means according to an embodiment of the invention；

Fig. 8 is the structural representation of the electron album generating means according to a specific embodiment of the invention；

Fig. 9 is the structural representation of generation submodule according to an embodiment of the invention；

Figure 10 is the structural representation of the electron album generating means according to another specific embodiment of the invention；

Figure 11 is the structural representation of the electron album generating means according to another specific embodiment of the invention.

Embodiment

Embodiments of the invention are described below in detail, the example of the embodiment is shown in the drawings, wherein from beginning to end Same or similar label represents same or similar element or the element with same or like function.Below with reference to attached The embodiment of figure description is exemplary, it is intended to for explaining the present invention, and be not considered as limiting the invention.

Below with reference to the accompanying drawings the electron album generation method and device of the embodiment of the present invention described.

Fig. 1 is the flow chart of electron album generation method according to an embodiment of the invention.It should be noted that this hair The electron album generation method of bright embodiment can be applied to the electron album generating means of the embodiment of the present invention.The electron album is given birth to Terminal device can be configured in into device, the terminal device can be that (such as mobile phone, tablet personal computer, individual digital are helped mobile terminal The hardware devices such as reason), PC etc..

As shown in figure 1, the electron album generation method can include：

S110, content analysis is carried out to photo, generates the descriptive information of photo.

It is appreciated that the content analysis to photo for convenience, can enter before content analysis is carried out to photo to photo Row pretreatment.For example, original photo can have situations such as oversized, length-width ratio is inconsistent, it is necessary to carry out size change over processing, In the case of keeping the ratio of width to height immovable in photo, the width and height dimensions of suitable target electronic photograph album are converted to.

After being pre-processed to photo, content analysis can be carried out to pretreated photo, and based on analysis result Generate the descriptive information of photo.Specifically, CNN (Convolutional Neural Network, convolutional Neural net can be used Network) method, load object in the good model comparison piece of training in advance and analyzed, obtain the object type in photo, afterwards, can The descriptive information of the photo is selected from the alternative descriptive information set previously generated according to object type.It is specific real Existing mode can be found in the description of subsequent embodiment.

It should be noted that the language form of descriptive information is not limited to Chinese, English, German, Russian etc. can also be； The file format of descriptive information can be text and/or voice etc..That is, arranging in pairs or groups to the description on target electronic photograph album Property information can be Chinese text, can also be other spoken and written languages, also can also be Chinese speech etc., specific presentation mode can Determine, be not especially limited herein according to the actual requirements.

S120, carries out Video Composition, to generate target electronic photograph album according to photo and descriptive information.

Specifically, the Video Composition of electron album can be carried out by video coding technique according to photo and descriptive information, Finally give target electronic photograph album.Concrete implementation process can be found in the description of subsequent embodiment.

In order that the realization of the present invention can clearly be understood by obtaining those skilled in the art, below in conjunction with Fig. 2 Electron album generation method to the present invention is described further.

Fig. 2 is the flow chart of the electron album generation method according to a specific embodiment of the invention.As shown in Fig. 2 should Electron album generation method can include：

S210, photo is identified according to default disaggregated model, to obtain the object type in photo.

For example, so that descriptive information is Chinese text as an example, after being pre-processed to photo, CNN can be used The good disaggregated model of method loading training in advance, classifies to the object in photo.It is appreciated that during classification results are photo Object belong to the probability of each classification, threshold value min_prob (for example, can be set as 0.5) can be pre-set, each class is taken Maximum in other probability, if the value is more than min_prob, takes the corresponding Chinese label (label) of the category, otherwise Extract label failures, i.e. object type and obtain failure.If the object type multiple Chinese label of correspondence in photo, take each One label of word frequency highest in label.

It is for instance possible to use the object in the good vgg16 model comparison pieces of pre-training is classified, classification results are photo In object belong to the probability of 1000 classifications.As shown in Fig. 3 (a), prob is the probability of identification classification, predict_index For vgg16 classification results sequence numbers, cn_label is by the corresponding Chinese label of the former classifications of vgg16 1000.As can be seen that Fig. 3 (a) most probable value is 0.628759 in, more than threshold value 0.5, then can assert that main object is hamster in the photo.

It should be noted that the present invention uses CNN as sorting algorithm, it is because current CNN classification accuracies have reached To a certain extent, as shown in Fig. 3 (b)-Fig. 3 (d), to carry out the classification that object classification is obtained to each photo using vgg16 models As a result.Wherein, most probable value is 0.374364 in Fig. 3 (b), is the maximum in these each class probabilities, then can assert Main object is chihuahua in the photo；Most probable value is 0.403069 in Fig. 3 (c), in being these each class probabilities Maximum, and multiple label (i.e. billiard table, pool table) are now corresponded to, because the occurrence number of billiard table is more than pool table Occurrence number, so the object predominantly billiard table in the photo can be assert；Most probable value is 0.756128 in Fig. 3 (d), is Maximum in these each class probabilities, then can assert that main object is upright piano in the photo.

As a kind of example, as shown in figure 4, being vgg16 model structures, the model is 16 layers of convolutional neural networks.Need It is noted that this step can also use other CNN disaggregated models, many class probability results of photo only need to be obtained, This is not especially limited.

S220, according to object type, chooses the descriptive letter of photo from the alternative descriptive information set previously generated Breath.

It should be noted that in an embodiment of the present invention, alternative descriptive information set can be previously generated.Make For a kind of example, as shown in figure 5, the alternative descriptive information set can be previously generated by following steps：

S510, obtains the descriptive words list in high frequency word list, wherein, descriptive words list includes multiple describe Word sample and multiple adverbial word samples.

For example, so that descriptive information is Chinese text as an example, Chinese high frequency words can be got by internet collection Descriptive words list in list, the descriptive words list may include multiple adjective samples and multiple adverbial word samples.Example Such as, the adjective sample in Chinese high frequency word list can be taken, such as：It is " good new general high long basic old complete few fast low big more than size Measure the easy freely strong actively difficult closely complicated identical many many rapid red vast simply remote abundant substantially objective happinesss of old morning of nature It is maximum beautiful to being also clear that the black very huge necessary stable bad deep great heat normal extensively of universal advanced balance conscientiously writes Name ... "；The adverbial word sample in Chinese high frequency word list is taken, the adverbial word sample of the positive degree of description is obtained by artificial filter, Such as：" very very especially quite most very very too many is so exactly extremely tangible well extremely very really extremely the most special really It is definitely how rather abnormal enough ".

S520, obtains the vector representation of each adjective sample and each adverbial word sample respectively, calculates each adjective sample Originally the second similarity between each adverbial word sample.

Specifically, continue so that descriptive information is Chinese text as an example, Chinese corpus can be obtained by internet collection, By Chinese Word Segmentation, by neural metwork training, the vector representation of word in corpus is obtained.For example：Word2vec can be used Word in language material is handled, the K dimension real number vector representations of generation word.For example, " present " one word, is counted by word2vec Calculation can obtain its vector representation for shown in Fig. 3 (e).So, from the corpus term vector set, each can be obtained respectively The vector representation of adjective sample and each adverbial word sample, calculates between each adjective sample and each adverbial word sample Two similarities (such as cosine cosine similarities).

S530, for each adjective sample, obtains the N number of adverbial word sample of the second similarity highest with each adjective sample This, wherein, N is positive integer.

That is, each adjective sample acquisition and the N number of adverbial word sample of its similarity highest.For example, N is 3, you can obtain Take 3 adverbial word samples of similarity highest with each adjective sample.

S540, each adjective sample is combined to generate alternative descriptive information with corresponding N number of adverbial word sample Set.

That is, each adjective sample is combined with as one group of Collocation with corresponding N number of adverbial word sample Language, such as：" extremely excellent, how glorious, enough beautiful " etc., finally, all collocation words is gathered together, alternatively retouched The property stated information aggregate.It is appreciated that the collocation word obtained by above-mentioned acquisition modes, as meets the language material speech habits Text is described, photo text description alternately.

It should be added that, the composition of corpus may decide that the style of generation descriptive information.For example, can come From the then daily description of output of encyclopaedia language material, can come from social networks output has neologisms term of network style etc..

It should also be noted that, in one embodiment of the invention, can also constantly be obtained from internet and newly enter text, with Update corpus.Thus, the continuous renewal to corpus is passed through, it is ensured that the freshness of word in corpus, so as to improve The degree of accuracy of the descriptive information of generation and flexibility ratio.

Thus, alternative descriptive information set can be generated by above-mentioned steps S510-S540.So, in this step, It can be chosen by neural LISP program LISP NLP technologies, according to object type from the alternative descriptive information set previously generated The descriptive information of photo, i.e., generate descriptive information to the photo.As a kind of example, as shown in fig. 6, according to object class Not, from the alternative descriptive information set previously generated choose photo descriptive information implement process may include as Lower step：

S610, obtains the text term vector of object type.

Specifically, the text of object type can be handled by word2vec, to generate the text of the object type Term vector.

S620, obtains high frequency corresponding with object type and describes word list, wherein, high frequency is described in word list with multiple High frequency adjective.

As a kind of example, it can be obtained from the adjective sample in the descriptive words list got in advance with being somebody's turn to do The corresponding high frequency of object type describes word list.

S630, the text term vector of calculating object type is similar to first between the adjectival term vector of each high frequency Degree.

S640, obtains the target adjective that the first similarity is more than predetermined threshold value.

It is appreciated that predetermined threshold value min_similar can be pre-defined, for example, the predetermined threshold value can value 0.3.

S650, descriptive information is chosen according to target adjective from alternative descriptive information set.

Specifically, adjective sample corresponding with the target adjective can be found out first from alternative descriptive information set With the matched combined of adverbial word sample, afterwards, can be according to the adjective sample the second phase corresponding with the matched combined of adverbial word sample Determine that the target adjective is directed to the descriptive information of the photo like degree.

Alternatively, in one embodiment of the invention, according to target adjective from alternative descriptive information set Choose before descriptive information, can also choose wherein the first phase from the first similarity is more than the target adjective of predetermined threshold value Like degree M adjective of highest as target adjective, M is positive integer, and such as M is 3.So, according to the M target adjective Descriptive information is chosen from alternative descriptive information set.

For example, by taking the upright piano object shown in Fig. 3 (d) as an example, as shown in Table 1 below, Label->Adj is photograph Piece label texts and target adjective top3 similarities, Adj->Adv is the collocations in alternative descriptive information set, Description is the photo descriptive information selected.So, the descriptive information of photo is obtained by the present invention, even if Some recognition effects are not absolutely accurate, but to describe effect still good for text.

S230, video segment is converted to by the descriptive information of photo and photo.

For example, so that the number of descriptive information is 3 as an example, using predefined electron album target size as mark Standard, it is random from upper left 1/3, upper right 1/3, lower-left 1/3, bottom right 1/3 to take three positions to occur as the text of descriptive information Position.So, the descriptive information of photo and photo can be converted into video segment in the following manner：First, with Original Photo Piece is background, replicates the frame of 20 frame number 1~20, position occurs in each frame correspondence text, using predefined font according to from small To small font size change is arrived again greatly, first descriptive information is drawn out；Again using the 20th frame as background, replicate 20 frame numbers 21~ 40 frames, mode same as above draws second descriptive information；The 3rd descriptive information is finally drawn by background of the 40th frame； Finally, 60 frame assembly codings are the video segment that 10~20 frame per second is played by more than.

S240, is linked to generate target electronic photograph album to video segment.

Specifically, all videos fragment is chained up, i.e., the video segment generated each photo is linked at one Rise, composition sequence photograph video, and the video after link mixes background music, and head are mixed in video, so as to complete The making of target electronic photograph album.

The electron album generation method of the embodiment of the present invention, synthesis used multinomial technology to complete, i.e., using image at Reason technology pre-processed to photo, to photo generate descriptive words, using video coding and decoding technology using CNN+NLP technologies Electron album Video Composition is carried out, target electronic photograph album is finally given, in the generating process of electron album, without requiring user Possessing image procossing knowledge also can intelligently complete the making of electron album, while manual manufacture cost is saved, enrich Photograph album content, it is interesting to improve photograph album, has expanded imagination space, and improve Consumer's Experience.

Corresponding with the electron album generation method that above-mentioned several embodiments are provided, a kind of embodiment of the invention is also provided A kind of electron album generating means, because electron album generating means provided in an embodiment of the present invention are carried with above-mentioned several embodiments The electron album generation method of confession is corresponding, therefore is also applied for this implementation in the embodiment of aforementioned electronic album creating method The electron album generating means that example is provided, are not described in detail in the present embodiment.Fig. 7 is according to an embodiment of the invention The structural representation of electron album generating means.As shown in fig. 7, the electron album generating means can include：Descriptive information Generation module 700 and electron album generation module 800.

Specifically, descriptive information generation module 700 is used to carry out content analysis to photo, generates the descriptive letter of photo Breath.

After being pre-processed to photo, descriptive information generation module 700 can carry out interior to pretreated photo Hold analysis, and based on the descriptive information of analysis result generation photo.Specifically, CNN methods can be used, loading training in advance is good Model comparison piece in object analyzed, obtain the object type in photo, afterwards, can be according to object type from previously generating Alternative descriptive information set in select the descriptive information of the photo.Concrete implementation mode can be found in subsequent embodiment Description.

Electron album generation module 800 is used to carry out Video Composition according to photo and descriptive information, to generate target electricity Sub- photograph album.More specifically, electron album generation module 800 can be entered according to photo and descriptive information by video coding technique The Video Composition of row electron album, finally gives target electronic photograph album.Concrete implementation process can be found in retouching for subsequent embodiment State.

Fig. 8 is the structural representation of the electron album generating means according to a specific embodiment of the invention.Such as Fig. 8 institutes Show, the electron album generating means can include：Descriptive information generation module 700 and electron album generation module 800.Its In, the descriptive information generation module 700 may include：Recognize submodule 710 and generation submodule 720.

Wherein, identification submodule 710 is used to photo is identified according to default disaggregated model, to obtain in photo Object type.

Generating submodule 720 is used for according to object type, chooses and shines from the alternative descriptive information set previously generated The descriptive information of piece.As a kind of example, as shown in figure 9, the generation submodule 720 may include：First acquisition unit 721, Second acquisition unit 722, computing unit 723, the 3rd acquiring unit 724 and generation unit 725.

Wherein, first acquisition unit 721 is used for the text term vector for obtaining object type.

Second acquisition unit 722 is used for acquisition high frequency corresponding with object type and describes word list, wherein, high frequency adjective There are multiple high frequency adjectives in list.

Computing unit 723 is used to calculate between the text term vector of object type and the adjectival term vector of each high frequency First similarity.

3rd acquiring unit 724 is used to obtain the target adjective that the first similarity is more than predetermined threshold value.

Generation unit 725 is used to choose descriptive information from alternative descriptive information set according to target adjective.

It should be noted that in an embodiment of the present invention, the alternative descriptive information set can be previously generated. As a kind of example, as shown in Figure 10, the electron album generating means may also include：Module 900 is anticipated, for pre- Mr. Into alternative descriptive information set.Wherein, as shown in Figure 10, this is anticipated module 900 and can included：First acquisition submodule 910th, calculating sub module 920, the second acquisition submodule 930 and generation submodule 940.

Wherein, the first acquisition submodule 910 is used to obtain the descriptive words list in high frequency word list, wherein, description Property word list include multiple adjective samples and multiple adverbial word samples.

Calculating sub module 920 is used to obtain each adjective sample and the vector representation of each adverbial word sample respectively, calculates The second similarity between each adjective sample and each adverbial word sample.

Second acquisition submodule 930 is used to be directed to each adjective sample, obtains the second phase with each adjective sample Like the degree N number of adverbial word sample of highest, wherein, N is positive integer.

Generation submodule 940 is standby to generate for each adjective sample and corresponding N number of adverbial word sample to be combined Select descriptive information set.

As a kind of example, as shown in figure 11, the electron album generation module 800 can include：The He of transform subblock 810 Generate submodule 820.Wherein, transform subblock 810 is used to the descriptive information of photo and photo being converted to video segment.It is raw It is used to link video segment to generate target electronic photograph album into submodule 820.

The electron album generating means of the embodiment of the present invention, synthesis used multinomial technology to complete, i.e., using image at Reason technology pre-processed to photo, to photo generate descriptive words, using video coding and decoding technology using CNN+NLP technologies Electron album Video Composition is carried out, target electronic photograph album is finally given, in the generating process of electron album, without requiring user Possessing image procossing knowledge also can intelligently complete the making of electron album, while manual manufacture cost is saved, enrich Photograph album content, it is interesting to improve photograph album, has expanded imagination space, and improve Consumer's Experience.

In the description of the invention, it is to be understood that term " first ", " second " are only used for describing purpose, and can not It is interpreted as indicating or implies relative importance or the implicit quantity for indicating indicated technical characteristic.Thus, define " the One ", at least one this feature can be expressed or be implicitly included to the feature of " second ".In the description of the invention, " multiple " It is meant that at least two, such as two, three etc., unless otherwise specifically defined.

In the description of this specification, reference term " one embodiment ", " some embodiments ", " example ", " specifically show The description of example " or " some examples " etc. means to combine specific features, structure, material or the spy that the embodiment or example are described Point is contained at least one embodiment of the present invention or example.In this manual, to the schematic representation of above-mentioned term not Identical embodiment or example must be directed to.Moreover, specific features, structure, material or the feature of description can be with office Combined in an appropriate manner in one or more embodiments or example.In addition, in the case of not conflicting, the skill of this area Art personnel can be tied the not be the same as Example or the feature of example and non-be the same as Example or example described in this specification Close and combine.

Any process described otherwise above or method description are construed as in flow chart or herein, represent to include Module, fragment or the portion of the code of one or more executable instructions for the step of realizing specific logical function or process Point, and the scope of the preferred embodiment of the present invention includes other realization, wherein can not be by shown or discussion suitable Sequence, including according to involved function by it is basic simultaneously in the way of or in the opposite order, carry out perform function, this should be of the invention Embodiment person of ordinary skill in the field understood.

Represent in flow charts or logic and/or step described otherwise above herein, for example, being considered use In the order list for the executable instruction for realizing logic function, it may be embodied in any computer-readable medium, for Instruction execution system, device or equipment (such as computer based system including the system of processor or other can be held from instruction The system of row system, device or equipment instruction fetch and execute instruction) use, or combine these instruction execution systems, device or set It is standby and use.For the purpose of this specification, " computer-readable medium " can any can be included, store, communicate, propagate or pass Defeated program is for instruction execution system, device or equipment or the dress for combining these instruction execution systems, device or equipment and using Put.The more specifically example (non-exhaustive list) of computer-readable medium includes following：Electricity with one or more wirings Connecting portion (electronic installation), portable computer diskette box (magnetic device), random access memory (RAM), read-only storage (ROM), erasable edit read-only storage (EPROM or flash memory), fiber device, and portable optic disk is read-only deposits Reservoir (CDROM).In addition, can even is that can be in the paper of printing described program thereon or other are suitable for computer-readable medium Medium, because can then enter edlin, interpretation or if necessary with it for example by carrying out optical scanner to paper or other media His suitable method is handled electronically to obtain described program, is then stored in computer storage.

It should be appreciated that each several part of the present invention can be realized with hardware, software, firmware or combinations thereof.Above-mentioned In embodiment, the software that multiple steps or method can in memory and by suitable instruction execution system be performed with storage Or firmware is realized.If, and in another embodiment, can be with well known in the art for example, realized with hardware Any one of row technology or their combination are realized：With the logic gates for realizing logic function to data-signal Discrete logic, the application specific integrated circuit with suitable combinational logic gate circuit, programmable gate array (PGA), scene Programmable gate array (FPGA) etc..

Those skilled in the art are appreciated that to realize all or part of step that above-described embodiment method is carried Rapid to can be by program to instruct the hardware of correlation to complete, described program can be stored in a kind of computer-readable storage medium In matter, the program upon execution, including one or a combination set of the step of embodiment of the method.

In addition, each functional unit in each embodiment of the invention can be integrated in a processing module, can also That unit is individually physically present, can also two or more units be integrated in a module.Above-mentioned integrated mould Block can both be realized in the form of hardware, it would however also be possible to employ the form of software function module is realized.The integrated module is such as Fruit is realized using in the form of software function module and as independent production marketing or in use, can also be stored in a computer In read/write memory medium.

Storage medium mentioned above can be read-only storage, disk or CD etc..Although having been shown and retouching above Embodiments of the invention are stated, it is to be understood that above-described embodiment is exemplary, it is impossible to be interpreted as the limit to the present invention System, one of ordinary skill in the art can be changed to above-described embodiment, change, replace and become within the scope of the invention Type.

Claims

1. a kind of electron album generation method, it is characterised in that comprise the following steps：

Content analysis is carried out to photo, the descriptive information of the photo is generated；And

Video Composition is carried out according to the photo and the descriptive information, to generate target electronic photograph album.

2. electron album generation method as claimed in claim 1, it is characterised in that described that content analysis, life are carried out to photo Into the descriptive information of the photo, including：

The photo is identified according to default disaggregated model, to obtain the object type in the photo；

According to the object type, the descriptive letter of the photo is chosen from the alternative descriptive information set previously generated Breath.

3. electron album generation method as claimed in claim 2, it is characterised in that described according to the object type, from pre- The descriptive information of the photo is chosen in the alternative descriptive information set first generated, including：

Obtain the text term vector of the object type；

Obtain high frequency corresponding with the object type and describe word list, wherein, the high frequency is described in word list with multiple High frequency adjective；

Calculate the first similarity between the text term vector of the object type and the adjectival term vector of each high frequency；

Obtain the target adjective that the first similarity is more than predetermined threshold value；

The descriptive information is chosen from the alternative descriptive information set according to the target adjective.

4. electron album generation method as claimed in claim 2 or claim 3, it is characterised in that previously generate institute by following steps State alternative descriptive information set：

The descriptive words list in high frequency word list is obtained, wherein, the descriptive words list includes multiple adjective samples Originally with multiple adverbial word samples；

The vector representation of each adjective sample and each adverbial word sample is obtained respectively, calculates each described adjective sample and institute State the second similarity between each adverbial word sample；

For each adjective sample, the N number of adverbial word sample of the second similarity highest with each adjective sample is obtained, its In, N is positive integer；

Each adjective sample is combined with corresponding N number of adverbial word sample to generate the alternative descriptive letter Breath set.

5. electron album generation method as claimed in claim 1, it is characterised in that described according to the photo and the description Property information carry out Video Composition, to generate target electronic photograph album, including：

The descriptive information of the photo and the photo is converted into video segment；

The video segment is linked to generate the target electronic photograph album.

6. a kind of electron album generating means, it is characterised in that including：

Descriptive information generation module, for carrying out content analysis to photo, generates the descriptive information of the photo；And

Electron album generation module, for carrying out Video Composition according to the photo and the descriptive information, to generate target Electron album.

7. electron album generating means as claimed in claim 6, it is characterised in that the descriptive information generation module bag Include：

Submodule is recognized, for the photo to be identified according to default disaggregated model, to obtain pair in the photo As classification；

Submodule is generated, for according to the object type, choosing described from the alternative descriptive information set previously generated The descriptive information of photo.

8. electron album generating means as claimed in claim 7, it is characterised in that the generation submodule includes：

First acquisition unit, the text term vector for obtaining the object type；

Second acquisition unit, word list is described for obtaining high frequency corresponding with the object type, wherein, the high frequency is described There are multiple high frequency adjectives in word list；

Computing unit, for calculating between the text term vector of the object type and the adjectival term vector of each high frequency One similarity；

3rd acquiring unit, the target adjective of predetermined threshold value is more than for obtaining the first similarity；

Generation unit, for choosing the descriptive letter from the alternative descriptive information set according to the target adjective Breath.

9. electron album generating means as claimed in claim 7 or 8, it is characterised in that also include：

Module is anticipated, for previously generating the alternative descriptive information set；

Wherein, the module of anticipating includes：

First acquisition submodule, for obtaining the descriptive words list in high frequency word list, wherein, the descriptive words row Table includes multiple adjective samples and multiple adverbial word samples；

Calculating sub module, the vector representation for obtaining each adjective sample and each adverbial word sample respectively calculates described each The second similarity between individual adjective sample and each described adverbial word sample；

Second acquisition submodule, for for each adjective sample, obtaining similar to the second of each adjective sample The N number of adverbial word sample of highest is spent, wherein, N is positive integer；

Submodule is generated, for each adjective sample and corresponding N number of adverbial word sample to be combined to generate The alternative descriptive information set.

10. electron album generating means as claimed in claim 6, it is characterised in that the electron album generation module includes：

Transform subblock, for the descriptive information of the photo and the photo to be converted into video segment；

Submodule is generated, for being linked to the video segment to generate the target electronic photograph album.