CN105869447A

CN105869447A - Generating method and device of audiobook

Info

Publication number: CN105869447A
Application number: CN201610192366.2A
Authority: CN
Inventors: 吴建国; 刘超华; 张珩; 沈韡; 丁磊; 代红桥
Original assignee: Intelligent Technology (beijing) Co Ltd; LeTV Holding Beijing Co Ltd
Current assignee: Intelligent Technology (beijing) Co Ltd; LeTV Holding Beijing Co Ltd
Priority date: 2016-03-30
Filing date: 2016-03-30
Publication date: 2016-08-17

Abstract

The invention provides a generating method and device of an audiobook, relating to the technical field of information processing. The problems of low generation efficiency and poor generation flexibility of existing audiobooks are solved. The technical scheme of the invention is that the method comprises a step of obtaining background pictures and book audios of an audiobook to be generated, a step of receiving an instruction of adjusting a background picture display sequence, wherein, the instruction comprises background pictures to be adjusted and book voice time points corresponding to the background pictures to be adjusted, a step of adjusting the sequence of the background pictures to be adjusted to the corresponding book voice time points according to the instruction. The invention is mainly used for generating the audiobooks.

Description

The generation method and device of talking book

Technical field

The present embodiments relate to technical field of information processing, particularly relate to the generation side of a kind of talking book Method and device.

Background technology

Along with knowledge, the diversification of information acquiring pattern, especially constantly rush at emerging digitlization medium Hitting under traditional papery books ,newspapers and magazines, social system monitoring custom occurs to change the most to a certain extent, sound Reading matter arises at the historic moment under this scene.Wherein, talking book is the book of sound, such as sound news, Sound novel, children voice material etc., talking book has and intersects with digitlization and traditional publication and distinguish, There is the advantage of its uniqueness, the reading requirement of different user can be met by talking book.

At present, talking book is background video and the captions being made talking book by video software, then Obtain for the sound that this talking book typing is corresponding.But the background of talking book is done by video software Video and captions require a great deal of time and energy, and the background video of this talking book and captions are only Can be applied on this talking book, the formation efficiency of the most existing talking book is low, and flexibility is poor.

Summary of the invention

Embodiments provide the generation method and device of a kind of talking book, in order to solve existing skill In art the formation efficiency of talking book low and generate very flexible problem.

The problem existed for prior art, embodiments provides the generation side of a kind of talking book Method, including:

Obtain the background picture of talking book to be generated, reading matter voice；

Receive adjust background picture DISPLAY ORDER instruction, described instruction include background picture to be adjusted and with The reading matter Speech time point that described background picture to be adjusted is corresponding；

According to described instruction, the order of described background picture to be adjusted is adjusted to the reading matter Speech time of correspondence On point；

According to talking book described in the background picture order after described adjustment and described reading matter speech production.

Concrete, the background picture of described acquisition talking book to be generated, reading matter voice include:

Obtain the background music of talking book to be generated, described background music and the broadcasting of described reading matter voice Duration is equal；

Concrete, described according to described in the background picture order after described adjustment and described reading matter speech production Talking book includes:

According to the background picture order after described adjustment, described background music and described reading matter speech production institute State talking book.

Further, after the background music of described acquisition talking book to be generated, described method also includes:

Extracting the attribute tags in described reading matter voice, each attribute tags all correspondences have described reading matter voice In reproduction time scope, described attribute tags includes but are not limited to scene properties label, atmosphere attribute Label, character attribute label.

Further, after the attribute tags in described extraction described reading matter voice, described method also includes:

If described attribute tags is scene properties label, then from scene sound bank, obtain the scene back of the body of correspondence Jing Yin；

If described attribute tags is atmosphere attribute tags, then from atmosphere sound bank, obtain the atmosphere back of the body of correspondence Jing Yin；

If described attribute tags is personage's attribute tags, then the personage obtaining correspondence from personage's sound bank belongs to Property.

Background sound corresponding with described reproduction time scope in described background music is replaced to corresponding scene Background sound or atmosphere background sound.

Voice evil spirit sound corresponding with described reproduction time scope in described reading matter voice become corresponding personage belong to Property.

Embodiments provide the generating means of a kind of talking book, including:

Acquiring unit, for obtaining the background picture of talking book to be generated, reading matter voice；

Receiving unit, be used for receiving adjustment background picture DISPLAY ORDER instruction, described instruction includes waiting to adjust Whole background picture and the reading matter Speech time point corresponding with described background picture to be adjusted；

Adjustment unit, for being adjusted to correspondence according to described instruction by the order of described background picture to be adjusted Reading matter Speech time point on；

Signal generating unit, for according to the background picture order after described adjustment and described reading matter speech production institute State talking book.

Described acquiring unit, is additionally operable to obtain the background music of talking book to be generated, described background music Equal with the playing duration of described reading matter voice；

Described signal generating unit, specifically for according to the background picture order after described adjustment, described background sound Talking book described in happy described reading matter speech production.

Further, described device also includes:

Extraction unit, for extracting the attribute tags in described reading matter voice, each attribute tags is the most corresponding Having the reproduction time scope in described reading matter voice, described attribute tags includes but are not limited to scene properties Label, atmosphere attribute tags, character attribute label.

Described acquiring unit, if being additionally operable to described attribute tags is scene properties label, then from scene voice Storehouse obtains the scene background sound of correspondence；

Described acquiring unit, if being additionally operable to described attribute tags is atmosphere attribute tags, then from atmosphere voice Storehouse obtains the atmosphere background sound of correspondence；

Described acquiring unit, if being additionally operable to described attribute tags is personage's attribute tags, then from personage's voice Storehouse obtains the character attribute of correspondence.

Further, described device also includes:

Replacement unit, for replacing background sound corresponding with described reproduction time scope in described background music Change scene background sound or the atmosphere background sound of correspondence into.

Evil spirit sound unit, for by voice evil spirit sound corresponding with described reproduction time scope in described reading matter voice Become corresponding character attribute.

The generation method and device of a kind of talking book that the embodiment of the present invention provides, first obtains to be generated The background picture of talking book, reading matter voice, then receive and adjust the instruction of background picture DISPLAY ORDER, institute State instruction and include background picture to be adjusted and the reading matter Speech time corresponding with described background picture to be adjusted Point, when being adjusted to corresponding reading matter voice further according to described instruction by the order of described background picture to be adjusted Between point on, finally according to after described adjustment background picture order and described reading matter speech production described in sound Reading matter.Make with the reading matter voice of the background video and recording of making talking book by video software at present Talking book compare, the background video of the talking book in the middle of the embodiment of the present invention is to pass through plurality of pictures Generate, therefore background video can be obtained flexibly and configure by the embodiment of the present invention, thus Solved the problem making talking book background video difficulty in prior art by the embodiment of the present invention, enter And improve the formation efficiency of talking book, and the flexibility that talking book generates.

Accompanying drawing explanation

In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to reality Execute the required accompanying drawing used in example or description of the prior art to be briefly described, it should be apparent that below, Accompanying drawing in description is some embodiments of the present invention, for those of ordinary skill in the art, not On the premise of paying creative work, it is also possible to obtain other accompanying drawing according to these accompanying drawings.

The generation method flow diagram of a kind of talking book that Fig. 1 provides for the embodiment of the present invention；

The generation method flow diagram of the another kind of talking book that Fig. 2 provides for the embodiment of the present invention；

The generating means structural representation of a kind of talking book that Fig. 3 provides for the embodiment of the present invention；

The generating means structural representation of the another kind of talking book that Fig. 4 provides for the embodiment of the present invention.

Detailed description of the invention

For making the purpose of the embodiment of the present invention, technical scheme and advantage clearer, below in conjunction with this Accompanying drawing in bright embodiment, is clearly and completely described the technical scheme in the embodiment of the present invention, Obviously, described embodiment is a part of embodiment of the present invention rather than whole embodiments.Based on Embodiment in the present invention, those of ordinary skill in the art are obtained under not making creative work premise The every other embodiment obtained, broadly falls into the scope of protection of the invention.

Embodiments provide a kind of generation method of talking book, as it is shown in figure 1, described method Including:

101, the background picture of talking book to be generated, reading matter voice are obtained.

Wherein, described background picture can be the photo of shooting, it is also possible to be the figure downloaded in the middle of network Sheet, it is also possible to being the picture etc. by Software on Drawing, the embodiment of the present invention is not specifically limited.Described reading Story sound is the voice that user records, it is also possible to the voice downloaded by network, and the embodiment of the present invention is not done Concrete restriction.

Such as, if user's talking book to be generated is children's story " small red cap ", then available by clapping The mode taking the photograph " small red cap " strip cartoon obtains the background picture of talking book, the most permissible as reading matter voice Obtain by recording the voice of " small red cap " story that user reads.

102, adjustment background picture DISPLAY ORDER instruction is received.

Wherein, described instruction includes background picture to be adjusted and corresponding with described background picture to be adjusted Reading matter Speech time point.In embodiments of the present invention, the reading matter time point that background picture to be adjusted is corresponding is User is configured, and user can be according to its reading matter speech play recorded order, from background picture Select the picture corresponding with its playing sequence, reach background picture and reading matter in the talking book generated with this The coupling of voice.Such as, the description in first 1-3 minute in " Snow White " reading matter voice that user records Be the appearance in Snow White's childhood, within 4-6 minute, tell about is the arrival of its stepmother, within 7-8 minute, tells about It is that stepmother hello Snow White eats poison apple, then according to the description of plot in reading matter voice, divides for 1-3 The picture in clock distribution Snow White's childhood, distributed, for 4-6 minute, the picture that stepmother arrives, for 7-8 minute Distribution stepmother feeds Snow White and eats poison apple picture, completes mating of reading matter voice and background picture with this.

103, according to described instruction, the order of described background picture to be adjusted is adjusted to the reading matter voice of correspondence On time point.

If it should be noted that not receiving adjustment background picture DISPLAY ORDER instruction, then background picture DISPLAY ORDER is the order uploading background picture, and the display duration of each background picture is identical.Example As, reading matter voice time a length of 10 minutes, background picture is 5, if adjusting the display of background picture The order i.e. playing duration of its correspondence, during the display of the most each background picture a length of 2 minutes, background picture The order that DISPLAY ORDER is uploading pictures.

104, according to sound reading described in the background picture order after described adjustment and described reading matter speech production Thing.

In embodiments of the present invention, the background picture after adjustment can play out by the way of lantern slide, While playing lantern slides background picture, configure the reading matter voice of correspondence, generate described talking book with this. In embodiments of the present invention, it is by many due to the background video of the talking book in the middle of the embodiment of the present invention Pictures generates, and therefore can be obtained background video flexibly by the embodiment of the present invention and configure, Thus solved by the embodiment of the present invention and prior art makes asking of talking book background video difficulty Topic, and then improve the formation efficiency of talking book, and the flexibility that talking book generates.

The generation method of a kind of talking book that the embodiment of the present invention provides, first obtains sound reading to be generated The background picture of thing, reading matter voice, then receive and adjust the instruction of background picture DISPLAY ORDER, described instruction Include background picture to be adjusted and the reading matter Speech time point corresponding with described background picture to be adjusted, then According to described instruction, the order of described background picture to be adjusted is adjusted to the reading matter Speech time point of correspondence On, finally according to talking book described in the background picture order after described adjustment and described reading matter speech production. Sound with what the reading matter voice of the background video and recording of making talking book by video software at present made Reading matter is compared, owing to the background video of the talking book in the middle of the embodiment of the present invention is raw by plurality of pictures Become, therefore background video can be obtained flexibly and configure by the embodiment of the present invention, thus logical Cross the embodiment of the present invention and solve the problem making talking book background video difficulty in prior art, and then Improve the formation efficiency of talking book, and the flexibility that talking book generates.

Embodiments provide the generation method of another kind of talking book, as in figure 2 it is shown, described side Method includes:

201, the background picture of talking book to be generated, reading matter voice, background music are obtained.

Wherein, the playing duration of described background music and described reading matter voice is equal, and described background music can Being that user makes, it is also possible to be that network is downloaded, it is also possible to being to record, the embodiment of the present invention is not It is specifically limited.If it should be noted that the playing duration of background music and reading matter voice is not desired to, then Can be by the way of intercepting background music so that the duration of background music is identical with reading matter voice.

In embodiments of the present invention, after step 201, described method also includes: extract described reading matter Attribute tags in voice, each attribute tags is all corresponding the reproduction time scope in described reading matter voice, Described attribute tags includes but are not limited to scene properties label, atmosphere attribute tags, character attribute label. In embodiments of the present invention, the process extracting the attribute tags in reading matter voice concrete can be: first knows Do not go out word corresponding to reading matter voice and the time point the most corresponding with each word, then by reading matter voice Attribute tags in corresponding word and preset attribute tag library is mated, wherein preset attribute tag library In attribute tags be to be set according to the actual requirements, such as scene properties label, atmosphere attribute tags, Character attribute labels etc., the embodiment of the present invention is not specifically limited.If the word that reading matter voice is corresponding is deposited Describe at certain section of word and mate with the attribute tags in preset attribute tag library, then obtain this section of word and reading Reproduction time section corresponding in story sound.

Such as, after " small red cap " reading matter voice is carried out speech recognition, get 1-5 before in reading matter voice Minute word to describe rough idea be that small red cap is walked on woodland path, local environment gentle breeze blows slowly bird's twitter The fragrance of a flower, then extracting the attribute tags of 1-5 minute in reading matter voice according to preset attribute tag library is that scene belongs to Property label, this scene properties label is specifically as follows the label of various sound in the woodland corresponding with its linguistic context； Getting the word of 6-10 minute in reading matter voice and describing rough idea is small red cap and disguise oneself as the big of grandmother Grey wolf is talked with, then extracting in reading matter voice according to preset attribute tag library 6-10 minute is character attribute mark Signing, wherein character attribute label is specially animal tag and little girl's label.

In embodiments of the present invention, after the attribute tags in described extraction described reading matter voice, described side Method also includes: if described attribute tags is scene properties label, then obtain correspondence from scene sound bank Scene background sound；If described attribute tags is atmosphere attribute tags, then from atmosphere sound bank, obtain correspondence Atmosphere background sound；If described attribute tags is personage's attribute tags, then it is right to obtain from personage's sound bank The character attribute answered.Wherein, scene sound bank, atmosphere sound bank and personage's sound bank are all pre-configured with Alright, described scene sound bank includes various types of scene background sound, such as rainy day scene, arena Scape, scene in summer etc.；Described atmosphere sound bank includes various types of atmosphere background sound, as cheerful and light-hearted Background sound, sad background sound, gloomy background sound etc.；Described personage's sound bank includes all kinds Personage, such as the sound of children, the sound of old man, the sound of woman, the sound etc. of animal, the present invention Embodiment is not specifically limited.

For the embodiment of the present invention, after obtaining the voice of correspondence in various types of voice storehouse, described method is also Including: background sound corresponding with described reproduction time scope in described background music is replaced to corresponding field Scape background sound or atmosphere background sound.By voice corresponding with described reproduction time scope in described reading matter voice Evil spirit sound becomes corresponding character attribute.In embodiments of the present invention, by reading matter voice with described reproduction time Voice evil spirit sound corresponding to scope becomes corresponding character attribute, can increase vividness that talking book reads and Interesting.As can be by " small red cap " reading matter voice, the dialogue evil spirit sound of small red cap becomes the sound of little girl Sound, the dialogue evil spirit sound of lobo becomes to comprise the sound of wolf characteristic.

202, adjustment background picture DISPLAY ORDER instruction is received.

Wherein, described instruction includes background picture to be adjusted and corresponding with described background picture to be adjusted Reading matter Speech time point.In embodiments of the present invention, the reading matter time point that background picture to be adjusted is corresponding is User is configured, and user can be according to its reading matter speech play recorded order, from background picture Select the picture corresponding with its playing sequence, reach background picture and reading matter in the talking book generated with this The coupling of voice.

203, according to described instruction, the order of described background picture to be adjusted is adjusted to the reading matter language of correspondence On sound time point.

If it should be noted that not receiving adjustment background picture DISPLAY ORDER instruction, then background picture DISPLAY ORDER is the order uploading background picture, and the display duration of each background picture is identical.Example As, reading matter voice time a length of 20 minutes, background picture is 10, if adjusting the aobvious of background picture Show the order i.e. playing duration of its correspondence, during the display of the most each background picture a length of 2 minutes, Background The DISPLAY ORDER of sheet is the order of uploading pictures.

204, raw according to the background picture order after described adjustment, described background music and described reading matter voice Become described talking book.

For the embodiment of the present invention, according to the plot in reading matter voice, the scene background sound that will obtain Corresponding with atmosphere background sound it is inserted in background music, the most also reading matter voice will comprise personage's characteristic Dialogue evil spirit sound becomes corresponding personage, thus the talking book generated by the embodiment of the present invention can increase reading Interest and vividness.

The generation method of the another kind of talking book that the embodiment of the present invention provides, first obtains to be generated sound The background picture of reading matter, reading matter voice, then receive and adjust the instruction of background picture DISPLAY ORDER, described finger Order includes background picture to be adjusted and the reading matter Speech time point corresponding with described background picture to be adjusted, Further according to described instruction, the order of described background picture to be adjusted is adjusted to the reading matter Speech time point of correspondence On, finally according to talking book described in the background picture order after described adjustment and described reading matter speech production. Sound with what the reading matter voice of the background video and recording of making talking book by video software at present made Reading matter is compared, owing to the background video of the talking book in the middle of the embodiment of the present invention is raw by plurality of pictures Become, therefore background video can be obtained flexibly and configure by the embodiment of the present invention, thus logical Cross the embodiment of the present invention and solve the problem making talking book background video difficulty in prior art, and then Improve the formation efficiency of talking book, and the flexibility that talking book generates.

Further, as implementing of method described in Fig. 1, embodiments providing one has The generating means of sound reading matter, as it is shown on figure 3, described device includes: acquiring unit 31, receive unit 32, Adjustment unit 33, signal generating unit 34.

Acquiring unit 31, for obtaining the background picture of talking book to be generated, reading matter voice；Wherein, Described background picture can be the photo of shooting, it is also possible to be the picture downloaded in the middle of network, it is also possible to Being the picture etc. by Software on Drawing, the embodiment of the present invention is not specifically limited.Described reading matter voice is to use The voice that family is recorded, it is also possible to the voice downloaded by network, the embodiment of the present invention is not specifically limited.

Receiving unit 32, be used for receiving adjustment background picture DISPLAY ORDER instruction, described instruction includes treating Adjust background picture and the reading matter Speech time point corresponding with described background picture to be adjusted；Real in the present invention Executing in example, the reading matter time point that background picture to be adjusted is corresponding is that user is configured, and user can root According to its reading matter speech play recorded order, from background picture, select the picture corresponding with its playing sequence, Background picture and the coupling of reading matter voice in the talking book generated is reached with this.

Adjustment unit 33, right for the order of described background picture to be adjusted being adjusted to according to described instruction On the reading matter Speech time point answered；If it should be noted that not receiving adjustment background picture DISPLAY ORDER Instruction, then the DISPLAY ORDER of background picture is the order uploading background picture, and each background picture Display duration identical

Signal generating unit 34, for according to the background picture order after described adjustment and described reading matter speech production Described talking book.In embodiments of the present invention, the background picture after adjustment can be by the side of lantern slide Formula plays out, and configures the reading matter voice of correspondence, generate with this while playing lantern slides background picture Described talking book.In embodiments of the present invention, due to the back of the body of the talking book in the middle of the embodiment of the present invention Scape video is generated by plurality of pictures, therefore background video can be carried out spirit by the embodiment of the present invention The acquisition lived and configuration, thus solved by the embodiment of the present invention and prior art makes the talking book back of the body The problem of scape video difficulty, and then improve the formation efficiency of talking book, and the spirit that talking book generates Activity.

It should be noted that it is each involved by the generating means of a kind of talking book of embodiment of the present invention offer Other of functional unit describe accordingly, the corresponding description being referred in Fig. 1, do not repeat them here.This Inventive embodiments can be passed through hardware processor (hardware processor) and realize correlation function Module.

The generating means of a kind of talking book that the embodiment of the present invention provides, first obtains sound reading to be generated The background picture of thing, reading matter voice, then receive and adjust the instruction of background picture DISPLAY ORDER, described instruction Include background picture to be adjusted and the reading matter Speech time point corresponding with described background picture to be adjusted, then According to described instruction, the order of described background picture to be adjusted is adjusted to the reading matter Speech time point of correspondence On, finally according to talking book described in the background picture order after described adjustment and described reading matter speech production. With background video and the talking book phase of reading matter voice making being made talking book at present by video software Ratio, owing to the background video of the talking book in the middle of the embodiment of the present invention is generated by plurality of pictures, Therefore background video can be obtained flexibly and configure by the embodiment of the present invention, thus by this Bright embodiment solves the problem making talking book background video difficulty in prior art, and then improves The formation efficiency of talking book, and the flexibility that talking book generates.

Further, as implementing of method described in Fig. 2, another kind is embodiments provided The generating means of talking book, as shown in Figure 4, described device includes: acquiring unit 41, reception unit 42, adjustment unit 43, signal generating unit 44.

Acquiring unit 41, for obtaining the background picture of talking book to be generated, reading matter voice；Wherein, Described background picture can be the photo of shooting, it is also possible to be the picture downloaded in the middle of network, it is also possible to Being the picture etc. by Software on Drawing, the embodiment of the present invention is not specifically limited.Described reading matter voice is to use The voice that family is recorded, it is also possible to the voice downloaded by network, the embodiment of the present invention is not specifically limited.

Receiving unit 42, be used for receiving adjustment background picture DISPLAY ORDER instruction, described instruction includes treating Adjust background picture and the reading matter Speech time point corresponding with described background picture to be adjusted；Real in the present invention Executing in example, the reading matter time point that background picture to be adjusted is corresponding is that user is configured, and user can root According to its reading matter speech play recorded order, from background picture, select the picture corresponding with its playing sequence, Background picture and the coupling of reading matter voice in the talking book generated is reached with this.

Adjustment unit 43, right for the order of described background picture to be adjusted being adjusted to according to described instruction On the reading matter Speech time point answered；If it should be noted that not receiving adjustment background picture DISPLAY ORDER Instruction, then the DISPLAY ORDER of background picture is the order uploading background picture, and each background picture Display duration identical

Signal generating unit 44, for according to the background picture order after described adjustment and described reading matter speech production Described talking book.In embodiments of the present invention, the background picture after adjustment can be by the side of lantern slide Formula plays out, and configures the reading matter voice of correspondence, generate with this while playing lantern slides background picture Described talking book.In embodiments of the present invention, due to the back of the body of the talking book in the middle of the embodiment of the present invention Scape video is generated by plurality of pictures, therefore background video can be carried out spirit by the embodiment of the present invention The acquisition lived and configuration, thus solved by the embodiment of the present invention and prior art makes the talking book back of the body The problem of scape video difficulty, and then improve the formation efficiency of talking book, and the spirit that talking book generates Activity.

Described acquiring unit 41, is additionally operable to obtain the background music of talking book to be generated, described background sound The playing duration of happy described reading matter voice is equal；Described background music can be that user makes, it is possible to To be network download, it is also possible to being to record, the embodiment of the present invention is not specifically limited.Need explanation If the playing duration of background music and reading matter voice is not desired to, then can be by intercepting background music Mode so that the duration of background music is identical with reading matter voice.

Described signal generating unit 44, specifically for according to the background picture order after described adjustment, described background Talking book described in music and described reading matter speech production.

Further, described device also includes:

Extraction unit 45, for extracting the attribute tags in described reading matter voice, each attribute tags is the most right Should have the reproduction time scope in described reading matter voice, described attribute tags includes but are not limited to scene and belongs to Property label, atmosphere attribute tags, character attribute label.In embodiments of the present invention, reading matter voice is extracted In the concrete process of attribute tags can be: first identify word corresponding to reading matter voice and and each The time point that word is the most corresponding, then by word corresponding for reading matter voice and preset attribute tag library Attribute tags is mated, and wherein the attribute tags in preset attribute tag library is to carry out according to the actual requirements Setting, such as scene properties label, atmosphere attribute tags, character attribute label etc., the embodiment of the present invention is not It is specifically limited.If the word that reading matter voice is corresponding existing certain section of word describe and preset attribute tag library In attribute tags coupling, then obtain the reproduction time section that this section of word is corresponding in reading matter voice.

Described acquiring unit 41, if being additionally operable to described attribute tags is scene properties label, then from scene language Sound storehouse obtains the scene background sound of correspondence；

Described acquiring unit 41, if being additionally operable to described attribute tags is atmosphere attribute tags, then from atmosphere language Sound storehouse obtains the atmosphere background sound of correspondence；

Described acquiring unit 41, if being additionally operable to described attribute tags is personage's attribute tags, then from people's story Sound storehouse obtains the character attribute of correspondence.Wherein, scene sound bank, atmosphere sound bank and personage's sound bank Being all pre-configured, described scene sound bank includes various types of scene background sound, such as the rainy day Scene, match scene, scene in summer etc.；Described atmosphere sound bank includes various types of atmosphere background Sound, such as cheerful and light-hearted background sound, sad background sound, gloomy background sound etc.；In described personage's sound bank Including various types of personages, such as the sound of children, the sound of old man, the sound of woman, the sound of animal Sounds etc., the embodiment of the present invention is not specifically limited.

Further, described device also includes:

Replacement unit 46, for by background sound corresponding with described reproduction time scope in described background music Replace to scene background sound or the atmosphere background sound of correspondence.

Evil spirit sound unit 47, for by voice evil spirit corresponding with described reproduction time scope in described reading matter voice Sound becomes corresponding character attribute.In embodiments of the present invention, by reading matter voice with described reproduction time model The voice evil spirit sound enclosing correspondence becomes the character attribute of correspondence, can increase vividness and interest that talking book is read Taste.

It should be noted that involved by the generating means of the another kind of talking book of embodiment of the present invention offer Other of each functional unit describe accordingly, the corresponding description being referred in Fig. 2, do not repeat them here. The embodiment of the present invention can realize related function module by hardware processor.

The generating means of the another kind of talking book that the embodiment of the present invention provides, first obtains to be generated sound The background picture of reading matter, reading matter voice, then receive and adjust the instruction of background picture DISPLAY ORDER, described finger Order includes background picture to be adjusted and the reading matter Speech time point corresponding with described background picture to be adjusted, Further according to described instruction, the order of described background picture to be adjusted is adjusted to the reading matter Speech time point of correspondence On, finally according to talking book described in the background picture order after described adjustment and described reading matter speech production. Sound with what the reading matter voice of the background video and recording of making talking book by video software at present made Reading matter is compared, owing to the background video of the talking book in the middle of the embodiment of the present invention is raw by plurality of pictures Become, therefore background video can be obtained flexibly and configure by the embodiment of the present invention, thus logical Cross the embodiment of the present invention and solve the problem making talking book background video difficulty in prior art, and then Improve the formation efficiency of talking book, and the flexibility that talking book generates.

Device embodiment described above is only schematically, wherein said illustrates as separating component Unit can be or may not be physically separate, the parts shown as unit can be or Person may not be physical location, i.e. may be located at a place, or can also be distributed to multiple network On unit.Some or all of module therein can be selected according to the actual needs to realize the present embodiment The purpose of scheme.Those of ordinary skill in the art are not in the case of paying performing creative labour, the most permissible Understand and implement.

Through the above description of the embodiments, those skilled in the art is it can be understood that arrive each reality The mode of executing can add the mode of required general hardware platform by software and realize, naturally it is also possible to by firmly Part.Based on such understanding, the portion that prior art is contributed by technique scheme the most in other words Dividing and can embody with the form of software product, this computer software product can be stored in computer can Read in storage medium, such as ROM/RAM, magnetic disc, CD etc., including some instructions with so that one Computer equipment (can be personal computer, server, or the network equipment etc.) performs each to be implemented The method described in some part of example or embodiment.

Last it is noted that above example is only in order to illustrate technical scheme, rather than to it Limit；Although the present invention being described in detail with reference to previous embodiment, the ordinary skill of this area Personnel it is understood that the technical scheme described in foregoing embodiments still can be modified by it, or Person carries out equivalent to wherein portion of techniques feature；And these amendments or replacement, do not make corresponding skill The essence of art scheme departs from the spirit and scope of various embodiments of the present invention technical scheme.

Claims

1. the generation method of a talking book, it is characterised in that including:

Method the most according to claim 1, it is characterised in that described acquisition talking book to be generated Background picture, reading matter voice includes:

Described according to talking book described in the background picture order after described adjustment and described reading matter speech production Including:

Method the most according to claim 2, it is characterised in that described acquisition talking book to be generated Background music after, described method also includes:

Method the most according to claim 3, it is characterised in that in described extraction described reading matter voice Attribute tags after, described method also includes:

Method the most according to claim 4, it is characterised in that described acquisition talking book to be generated Background music after, described method also includes:

Method the most according to claim 4, it is characterised in that described method also includes:

7. the generating means of a talking book, it is characterised in that including:

Device the most according to claim 7, it is characterised in that

Device the most according to claim 8, it is characterised in that described device also includes:

Device the most according to claim 9, it is characterised in that

11. devices according to claim 10, it is characterised in that described device also includes:

12. devices according to claim 10, it is characterised in that described device also includes: