CN101984490B

CN101984490B - Word-for-word synchronous lyric file generating method and system thereof

Info

Publication number: CN101984490B
Application number: CN2010105572583A
Authority: CN
Inventors: 翟海平; 林健; 李想
Original assignee: Yeelion Online Network Technology Beijing Co Ltd
Current assignee: Yeelion Online Network Technology Beijing Co Ltd
Priority date: 2010-11-23
Filing date: 2010-11-23
Publication date: 2012-06-27
Anticipated expiration: 2030-11-23
Also published as: CN101984490A

Abstract

The invention discloses a word-for-word synchronous lyric file generating method, including: an audio file is loaded, and audio data and time information are extracted; a time shaft is generated; an audio waveform diagram is generated and displayed; lyric is acquired, and initial position of each character in the lyric corresponding to the time shaft is determined and displayed; a lyric dragging handle for each character is generated; the audio file is played, and playing progress is prompted on the audio waveform diagram; a request of user for adjusting the position of the lyric dragging handle is received, and the position of the lyric dragging handle is adjusted; time information of each character in the lyric is determined; and the determined time information of each character in the lyric is stored, and a word-for-word synchronous lyric file is generated. The invention also discloses a word-for-word synchronous lyric file generating system. By adopting the method or system of the invention, reference for making word-for-word synchronous lyric file can be provided in audition and vision, and accuracy and making speed of word-for-word synchronous lyric file can be improved.

Description

A kind of generation method and system of word for word lyrics synchronized file

Technical field

The present invention relates to copy editor's technical field, particularly relate to a kind of generation method and system of word for word lyrics synchronized file.

Background technology

As everyone knows, most of song all has the lyrics.The voice playing instrument can be through loading the lyrics file of specific format, in played songs, the lyrics of song also is shown to the user.In the practical application, some users not only hope to see the lyrics in played songs, also hope to obtain the synchronous dynamic prompting of these lyrics.For this reason, lyrics synchronized file line by line occurred, this lyrics file is the temporal information of the unit record lyrics with the sentence, and the music instrument just can show the lyrics with sentence by sentence form in displaying audio file synchronously like this.But when the user was higher to the requirement of lyrics promptings, when recreation such as for example playing Karaoka, the lyrics that show with form sentence by sentence obviously can not satisfy user's requirement.

Therefore, occurred word for word lyrics synchronized file in the prior art, through loading this word for word lyrics synchronized file, the voice playing instrument just can be in played songs, and the lyrics are shown with word for word form synchronously.Obviously, the quality of lyrics file has determined the effect of synchronous lyrics.In the prior art, word for word the generation method of lyrics synchronized file mainly is:

At first try to achieve averaging time divided by the number of words of these lyrics with the duration of every lyrics; Again with the duration of this averaging time as each word in these lyrics; Zero-time in conjunction with each word calculates the concluding time, generates word for word lyrics synchronized file thus.Obviously, the file of the word for word lyrics synchronized file that this method generates, the temporal information of each word is very inaccurate.

Summary of the invention

The generation method and system that the purpose of this invention is to provide a kind of word for word lyrics synchronized file, can be from the sense of hearing and visually provide make lyrics synchronized file word for word with reference to foundation, improve the word for word accuracy of lyrics synchronized file.

For realizing above-mentioned purpose, the invention provides following scheme:

A kind of generation method of word for word lyrics synchronized file comprises:

Load audio file, extract the voice data and the temporal information of said audio file;

According to the temporal information of audio file, the rise time axle;

According to the voice data of audio file, corresponding said time shaft generates the audio volume control figure of said audio file and shows;

Obtain the lyrics of said audio file, confirm that each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics;

For generating the lyrics, each literal in the lyrics drags handle;

Displaying audio file, and on audio volume control figure, point out playing progress rate;

Receive the user the said lyrics are dragged the request that the handle position is adjusted, the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;

Preserve the temporal information of each word in the lyrics after confirming, generate word for word lyrics synchronized file.

Preferably, the said lyrics that obtain said audio file comprise:

Obtain the lyrics of user's input.

Preferably, the said lyrics that obtain said audio file comprise:

Load the lyrics in the existing lyrics file.

Preferably, also comprise:

Parse every lyrics time information corresponding in the said lyrics file of loading;

Each literal comprises corresponding to the initial position and the demonstration of said time shaft in said definite lyrics: confirm that according to every lyrics time information corresponding in the said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics.

Preferably, also comprise:

According to the banner word in every the lyrics of every lyrics time corresponding identification that parse and finish word, to the banner word in every lyrics with finish word, distinguish demonstration, with the said lyrics in other literal distinguish mutually.

Preferably, also comprise:

According to the banner word in every the lyrics of every lyrics time corresponding identification that parse and finish word, the banner word in every lyrics and the lyrics that finish word are dragged handle, distinguish demonstration, with the said lyrics in the lyrics of other literal drag handle and distinguish mutually.

Preferably, saidly drag handle and comprise for each literal in the lyrics generates lyrics:

The lyrics that generate corresponding to the zero-time of this literal for each literal drag handle;

Preferably, saidly drag handle and also comprise for each literal in the lyrics generates lyrics:

The lyrics that generate corresponding to concluding time of this literal for each literal drag handle.

Preferably, the said playing progress rate of on audio volume control figure, pointing out comprises:

On audio volume control figure, adopt the playing progress rate pointer to point out, and/or, adopt the part of broadcast corresponding on the various colors identification audio oscillogram and do not play part.

A kind of generation system of word for word lyrics synchronized file comprises:

The audio file extraction unit is used to load audio file, extracts the voice data and the temporal information of said audio file;

The time shaft generation unit is used for the temporal information according to audio file, the rise time axle;

Audio volume control figure generation unit is used for the voice data according to audio file, and corresponding said time shaft generates the audio volume control figure of said audio file and shows;

Lyrics acquiring unit is used to obtain the lyrics of said audio file, confirms that each literal in the lyrics is corresponding to the initial position of said time shaft and show;

The lyrics drag the handle generation unit, and each literal generation lyrics that are used in the lyrics drag handle;

Playing control unit is used for displaying audio file, and on audio volume control figure, points out playing progress rate;

Lyrics adjustment unit is used to receive the user the said lyrics is dragged the request that the handle position is adjusted, and the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;

The lyrics file generation unit is used for preserving the temporal information of each word of the lyrics after confirming, generates word for word lyrics synchronized file.

Preferably, said lyrics acquiring unit comprises:

Lyric characters obtains subelement, is used to obtain the lyrics of user's input.

Preferably, said lyrics acquiring unit comprises:

Lyrics file loads subelement, is used for loading the lyrics of existing lyrics file.

Preferably, said lyrics acquiring unit also comprises:

Lyrics file is resolved subelement, is used for parsing every lyrics time corresponding of said lyrics file of loading;

The lyrics generate subelement, are used for confirming that according to every lyrics time corresponding of said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics.

Preferably, said lyrics acquiring unit also comprises:

Head and the tail block molecular cell is used for according to the banner word of every the lyrics of every lyrics time corresponding identification that parse and finishes word, to the banner word in every lyrics with finish word, distinguish demonstration, with the said lyrics in other literal distinguish mutually.

Preferably, the said lyrics drag the handle generation unit and also comprise:

The head and the tail word lyrics drag handle and distinguish subelement; Be used for banner word and end word according to every the lyrics of every lyrics time corresponding identification that parse; The banner word in every lyrics and the lyrics that finish word are dragged handle; Distinguish demonstration, with the said lyrics in the lyrics of other literal drag handle and distinguish mutually.

Preferably, the said lyrics drag the handle generation unit and comprise:

Zero-time drags handle and generates subelement, is used to the lyrics that each literal generates corresponding to the zero-time of this literal and drags handle;

Preferably, the said lyrics drag the handle generation unit and also comprise:

Concluding time drags handle and generates subelement, is used to the lyrics that each literal generates corresponding to concluding time of this literal and drags handle.

Preferably, said playing control unit comprises:

Progress pointer prompting subelement and/or oscillogram color tips subelement;

Said progress pointer prompting subelement is used on audio volume control figure, adopting the playing progress rate of playing progress rate pointer prompt tone frequency file;

Said oscillogram color tips subelement is used to adopt the part of broadcast corresponding on the various colors identification audio oscillogram and does not play part.

According to specific embodiment provided by the invention; The invention discloses following technique effect: through extracting the voice data of audio file; Generate audio volume control figure, show each literal in the lyrics, and drag handle for each literal generation lyrics corresponding to time shaft and the audio volume control figure of audio file; For the user simultaneously from the sense of hearing and visually provide make lyrics synchronized file word for word with reference to foundation, improved the word for word accuracy of lyrics synchronized file.

Description of drawings

In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art; To do to introduce simply to the accompanying drawing of required use among the embodiment below; Obviously, the accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills; Under the prerequisite of not paying creative work property, can also obtain other accompanying drawing according to these accompanying drawings.

Fig. 1 is the method for making first pass figure of the said word for word lyrics synchronized of embodiment of the invention file;

Fig. 2 is for adopting a kind of software interface synoptic diagram of the said method of the embodiment of the invention;

Fig. 3 is method for making second process flow diagram of the said word for word lyrics synchronized of embodiment of the invention file;

Fig. 4 is the manufacturing system structural drawing of the said word for word lyrics synchronized of embodiment of the invention file;

Fig. 5 is the said lyrics acquiring unit of an embodiment of the invention structural drawing.

Embodiment

To combine the accompanying drawing in the embodiment of the invention below, the technical scheme in the embodiment of the invention is carried out clear, intactly description, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills are not making the every other embodiment that is obtained under the creative work prerequisite, all belong to the scope of the present invention's protection.

Referring to Fig. 1, be the method for making process flow diagram of the said word for word lyrics synchronized of embodiment of the invention file.As shown in Figure 1, the method comprising the steps of:

S101: load audio file, extract the voice data and the temporal information of said audio file;

S102: according to the temporal information of audio file, the rise time axle;

S103: according to the voice data of audio file, corresponding said time shaft generates the audio volume control figure of said audio file and shows;

S104: obtain the lyrics of said audio file, confirm that each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics;

S105: drag handle for each literal in the lyrics generates the lyrics;

S106: displaying audio file, and on audio volume control figure, point out playing progress rate;

S107: receive the user the said lyrics are dragged the request that the handle position is adjusted, the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;

S108: preserve the temporal information of each word in the lyrics after confirming, generate word for word lyrics synchronized file.

Among the step S101, the audio file of loading can be various forms, for example MP3, WMA, APE or the like.After the loading, adopt corresponding demoder to extract the voice data and the temporal information of said audio file.Wherein temporal information specifically can be the time span of audio file.

Among the step S102, the time shaft of generation can show on user interface.

The audio volume control figure that generates among the step S103, the temporal information of each literal provides visual reference frame in the lyrics for the user adjusts.

Concrete, with reference to Fig. 2, for adopting a kind of software interface synoptic diagram of the said method of the embodiment of the invention.Usually, in an audio file, the singer sings the sound big (because like this could give prominence to voice, make hearer hear the clearly lyrics) of the sound of the lyrics than accompaniment.Therefore, the amplitude of singing the audio volume control of lyrics part can be bigger than the amplitude of accompaniment audio volume control partly, forms a vibration more significantly.That is to say that each word corresponding audio waveform roughly can form a kind of like this vibration: this vibration and accompaniment oscillating phase ratio partly, amplitude is bigger, relatively significantly; And each vibration all is to be starting point with less amplitude, begins then to increase gradually, reduces gradually after reaching peak value again, finishes with less amplitude; Wherein, each vibration initial and finish the initial and end of just corresponding the lyric characters of singing.Therefore, utilize this characteristics, just can confirm the initial and concluding time of the corresponding lyric characters of singing according to the initial and end position of each vibration on the audio volume control figure.

In addition, need to prove that there is corresponding relation in the audio volume control figure that generates among the step S103 with the time shaft of audio file.Any place's audio volume control all has corresponding with it time zone on the audio volume control figure on time shaft.

Among the step S104, obtaining the lyrics of said audio file, can be to obtain the manually lyrics of input of user, also can load the lyrics in the existing lyrics file.

When obtaining the user manually during the lyrics of input: can an initial value be set, for example 1 second to the duration of each word of input.Can be with first word acquiescence corresponding between 0 second to 1 second of audio file, second word is corresponding between 1 second to 2 seconds of audio file, and the rest may be inferred.When the input lyrics number of words more for a long time, can adjust the duration of each word, the principle of adjustment is: the summation of the duration of all literal is no more than the time span of this audio file in the lyrics.

When loading the lyrics in the existing lyrics file: existing lyrics file can be the file that common suffix is called the .1rc type.Usually this existing lyrics file is sentence by sentence synchronous, also promptly, has included the temporal information of the lyrics in this lyrics file, and only this temporal information only is to be directed against every lyrics.

In order to make full use of the temporal information in the existing file of lyrics synchronized sentence by sentence, the said method of the embodiment of the invention also comprises: parse every lyrics time corresponding in the said lyrics file of loading; Confirm that according to every lyrics time corresponding in the said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics.

Wherein, every lyrics time corresponding comprises the zero-time and the concluding time of these lyrics in the said lyrics file that parses.Confirm that according to every lyrics time corresponding in the said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics; Specifically can be: deduct the duration that said zero-time obtains these lyrics with the said concluding time; Divided by the literal number of these lyrics, obtain the average duration of each word; In conjunction with the zero-time of these lyrics, can calculate the expectation zero-time and the expected concluding time of each literal in these lyrics successively; In lyrics viewing area, show each literal.The viewing area of each literal wherein is corresponding to the expectation duration scope (being the expectation zero-time of this literal on the time shaft and the scope between the expected concluding time) of this literal on the time shaft.

After loading the existing file of lyrics synchronized sentence by sentence; Can also and finish word according to the banner word in every the lyrics of every lyrics time corresponding identification that parse; To the banner word in every lyrics with finish word, distinguish demonstration, with the said lyrics in other literal distinguish mutually.Concrete, can banner word in every lyrics and the font size that finishes word be adjusted, use than the big font of other literal in these lyrics, show banner word and finish word; Also can with the banner word in every lyrics with finish word, use with these lyrics in other literal various colors, show.

Among the step S105,, each literal in the lyrics drags handle for generating the lyrics.Each literal can correspond respectively to two said lyrics and drag handle, to confirm the zero-time and the concluding time of each word respectively.Concrete, the lyrics drag the position of handle and there is corresponding relation in the time point on the time shaft.As shown in Figure 2, it is corresponding with the zero-time of this word that the lyrics on each literal left side drag handle, and it is corresponding with the concluding time of this word that the lyrics on the right drag handle.The zero-time or the concluding time of corresponding word can be regulated in the position that drags handle through the adjustment lyrics.

Among the step S105, also can be only drag handle corresponding to the lyrics of the zero-time of this literal for each literal generation.In this case, the lyrics between two adjacent words drag handle, except the zero-time of that word of expression back, also represent the concluding time of before word.

The difference of the lyrics file that two kinds of situation generate is, the former is for the demonstration time of each word in the lyrics, representes that this literal should sing in the demonstration time; The latter is for the demonstration time of each word in the lyrics, except representing the singing time of this literal, possibly represent also that this word is sung to finish, but next word do not begin that section accompaniment time of singing as yet.

The method for making of the said word for word lyrics synchronized of embodiment of the invention file; Can also and finish word according to the banner word in every the lyrics of every lyrics time corresponding identification that parse; The banner word in every lyrics and the lyrics that finish word are dragged handle; Distinguish demonstration, with the said lyrics in the lyrics of other literal drag handle and distinguish mutually.For example: the banner word in every lyrics and the lyrics that finish word are dragged the color of handle, be arranged to drag the handle various colors with the lyrics of other word; Perhaps the banner word in every lyrics and the lyrics that finish word are dragged the shape of handle, be arranged to drag the different shape of handle with the lyrics of other word.

Among the step S106, the method for prompting playing progress rate on audio volume control figure can be on audio volume control figure, to adopt the playing progress rate pointer to point out, and also can be to adopt the part of broadcast corresponding on the various colors identification audio oscillogram and do not play part.

Wherein, the playing progress rate pointer can be the vertical line (can certainly be other shapes) on audio volume control figure.This playing progress rate pointer position on audio volume control figure indicates where this audio file has played to.The user can also drag this playing progress rate pointer, with the playing progress rate of adjustment audio file.Adopt the part of broadcast corresponding on the various colors identification audio oscillogram and do not play part, for instance: can be designated green with having play part on the audio volume control figure, not play part and be designated redness.

In addition, adopt playing progress rate pointer and the method that adopts these two kinds prompting playing progress rates on audio volume control figure of various colors sign, can use separately, also can use simultaneously.

By on can know, the preferred embodiment of the method for making of word for word lyrics synchronized file according to the invention, as shown in Figure 3, comprise step:

S201: load audio file, extract the voice data and the temporal information of said audio file;

S202: according to the temporal information of audio file, the rise time axle;

S203: according to the voice data of audio file, corresponding said time shaft generates the audio volume control figure of said audio file and shows;

S204: load the lyrics in the existing lyrics file;

S205: parse every lyrics time corresponding in the said lyrics file of loading, confirm that according to every lyrics time corresponding in the said lyrics file each literal in the lyrics is corresponding to the initial position of said time shaft and show;

S206:,, distinguish demonstration to banner word in every lyrics and end word according to banner word and the end word in every the lyrics of every lyrics time corresponding identification that parse;

S207: drag handle for each literal in the lyrics generates the lyrics;

S208: banner word and end word according in every the lyrics of every lyrics time corresponding identification that parse, the banner word in every lyrics and the lyrics that finish word are dragged handle, distinguish demonstration;

S209: displaying audio file, and on audio volume control figure, adopt playing progress rate pointer prompting playing progress rate, adopt the part of broadcast corresponding on the various colors identification audio oscillogram and do not play part;

S210: receive the user the said lyrics are dragged the request that the handle position is adjusted, the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;

S211: preserve the temporal information of each word in the lyrics after confirming, generate word for word lyrics synchronized file.

Corresponding with the method for making of the said word for word lyrics synchronized of embodiment of the invention file, the embodiment of the invention also discloses a kind of manufacturing system of word for word lyrics synchronized file.

Referring to Fig. 4, be the manufacturing system structural drawing of the said word for word lyrics synchronized of embodiment of the invention file.This system comprises:

Audio file extraction unit 401 is used to load audio file, extracts the voice data and the temporal information of said audio file;

Time shaft generation unit 402 is used for the temporal information according to audio file, the rise time axle;

Audio volume control figure generation unit 403 is used for the voice data according to audio file, and corresponding said time shaft generates the audio volume control figure of said audio file and shows;

Lyrics acquiring unit 404 is used to obtain the lyrics of said audio file, confirms that each literal in the lyrics is corresponding to the initial position of said time shaft and show;

The lyrics drag handle generation unit 405, and each literal generation lyrics that are used in the lyrics drag handle;

Playing control unit 406 is used for displaying audio file, and on audio volume control figure, points out playing progress rate;

Lyrics adjustment unit 407 is used to receive the user the said lyrics is dragged the request that the handle position is adjusted, and the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;

Lyrics file generation unit 408 is used for preserving the temporal information of each word of the lyrics after confirming, generates word for word lyrics synchronized file.

Wherein, the audio volume control figure of audio volume control figure generation unit 403 generations has following characteristics:

Usually, in an audio file, the singer sings the sound big (because like this could give prominence to voice, make hearer hear the clearly lyrics) of the sound of the lyrics than accompaniment.Therefore, the amplitude of singing the audio volume control of lyrics part can be bigger than the amplitude of accompaniment audio volume control partly, forms a vibration more significantly.That is to say that each word corresponding audio waveform roughly can form a kind of like this vibration: this vibration and accompaniment oscillating phase ratio partly, amplitude is bigger, relatively significantly; And each vibration all is to be starting point with less amplitude, begins then to increase gradually, reduces gradually after reaching peak value again, finishes with less amplitude; Wherein, each vibration initial and finish the initial and end of just corresponding the lyric characters of singing.Therefore, utilize this characteristics, just can confirm the initial and concluding time of the corresponding lyric characters of singing according to the initial and end position of each vibration on the audio volume control figure.

As shown in Figure 5, lyrics acquiring unit 404 can comprise that lyric characters obtains subelement 4041, is used to obtain the lyrics of user's input; Can comprise that also lyrics file loads subelement 4042, is used for loading the lyrics of existing lyrics file.Existing lyrics file can be the lyrics file of various forms, for example: the file of common suffix .lrc type by name.

When adopting lyric characters to obtain subelement 4041, can an initial value be set, for example 1 second to the duration of each word of input.Can be with first word acquiescence corresponding between 0 second to 1 second of audio file, second word is corresponding between 1 second to 2 seconds of audio file, and the rest may be inferred.When the input lyrics number of words more for a long time, can adjust the duration of each word, the principle of adjustment is: the summation of the duration of all literal is no more than the time span of this audio file in the lyrics.

Lyrics acquiring unit 404 can also comprise:

Lyrics file is resolved subelement 4043, is used for parsing every lyrics time corresponding of said lyrics file of loading;

The lyrics generate subelement 4044, are used for confirming that according to every lyrics time corresponding of said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics.

Wherein, lyrics file is resolved in the said lyrics file that subelement 4043 parses zero-time and the concluding time that every lyrics time corresponding comprises these lyrics.The lyrics generate subelement 4044 and confirm that according to every lyrics time corresponding in the said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics; Specifically can be: deduct the duration that said zero-time obtains these lyrics with the said concluding time; Divided by the literal number of these lyrics, obtain the average duration of each word; In conjunction with the zero-time of these lyrics, can calculate the expectation zero-time and the expected concluding time of each literal in these lyrics successively; In lyrics viewing area, show each literal.The viewing area of each literal wherein is corresponding to the expectation duration scope (being the expectation zero-time of this literal on the time shaft and the scope between the expected concluding time) of this literal on the time shaft.

Concrete, for example: lyrics file is resolved subelement 4043 and is parsed that a certain sentence lyrics time corresponding is between 30 seconds to 40 seconds in the lyrics file of loading, and these lyrics have ten words; Then lyrics generation subelement 4044 can show 30 second to 31 second the interval of first word corresponding to time shaft, and with 31 second to 32 second the interval demonstration of second word corresponding to time shaft, the rest may be inferred.

In order to let user easier distinguish the banner word and end word of every lyrics, the said lyrics acquiring unit 404 of the embodiment of the invention can also comprise:

Head and the tail block molecular cell 4045; Be used for banner word and end word according to every the lyrics of every lyrics time corresponding identification that parse; To the banner word in every lyrics with finish word, distinguish demonstration, with the said lyrics in other literal distinguish mutually.

Concrete, can banner word in every lyrics and the font size that finishes word be adjusted, use than the big font of other literal in these lyrics, show banner word and finish word; Also can with the banner word in every lyrics with finish word, use with these lyrics in other literal various colors, show.

Similarly, the said lyrics of the embodiment of the invention drag handle generation unit 405 and can also comprise:

For example: the banner word in every lyrics and the lyrics that finish word are dragged the color of handle, be arranged to drag the handle various colors with the lyrics of other word; Perhaps the banner word in every lyrics and the lyrics that finish word are dragged the shape of handle, be arranged to drag the different shape of handle with the lyrics of other word.

The said lyrics of the embodiment of the invention drag handle generation unit 405, can comprise:

The lyrics drag handle generation unit 405, can also comprise:

Concrete; Include only zero-time and drag handle when generating subelement when the lyrics drag handle generation unit 405; The said system of the embodiment of the invention can all generate lyrics for each literal in the lyrics and drag handle, and these lyrics drag handle is represented this word corresponding to the position on the time shaft zero-time.The lyrics between two adjacent lyric characters drag the zero-time of handle except that word of expression back, also represent the concluding time of before word.

When dragging handle generation unit 405, the lyrics comprise that simultaneously zero-time drags handle when generating subelement and dragging handle generation subelement with the concluding time; The said system of the embodiment of the invention can all generate two lyrics for each literal in the lyrics and drag handle, corresponds respectively to the zero-time and the concluding time of this literal.In this case, the zero-time of each literal and concluding time all can be adjusted separately, can the temporal information of adjacent literal not impacted.

The said playing control unit 406 of the embodiment of the invention can comprise:

Progress pointer prompting subelement, and/or, oscillogram color tips subelement;

Wherein, the playing progress rate pointer of progress pointer prompting subelement can be the vertical line (can certainly be other shapes) on audio volume control figure.This playing progress rate pointer position on audio volume control figure indicates where this audio file has played to.The user can also drag this playing progress rate pointer, with the playing progress rate of adjustment audio file.Oscillogram color tips subelement adopts the part of broadcast corresponding on the various colors identification audio oscillogram and does not play part, for instance: can be designated green with having play part on the audio volume control figure, not play part and be designated redness.

More than to a kind of video checking method provided by the present invention and system, carried out detailed introduction.Used concrete example among this paper principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, part all can change on embodiment and range of application.In sum, this description should not be construed as limitation of the present invention.

Claims

1. the generation method of lyrics synchronized file word for word is characterized in that, comprising:

According to the temporal information of audio file, the rise time axle;

Obtain the lyrics of said audio file, comprise the lyrics that obtain the manual input of user or load the lyrics in the existing lyrics file;

Confirm that each literal in the lyrics corresponding to the initial position of said time shaft and show, specifically comprises: when the lyrics that obtain said audio file are to obtain the user manually during the lyrics of input, duration of each word of input is provided with an initial value; When the lyrics that obtain said audio file when loading the lyrics in the existing lyrics file, every lyrics time corresponding comprises the zero-time and the concluding time of these lyrics in the said lyrics file that parses; Deduct the duration that said zero-time obtains these lyrics with the said concluding time,, obtain the average duration of each word divided by the literal number of these lyrics; In conjunction with the zero-time of these lyrics, can calculate the expectation zero-time and the expected concluding time of each literal in these lyrics successively; In lyrics viewing area, show each literal;

For generating the lyrics, each literal in the lyrics drags handle;

2. method according to claim 1 is characterized in that, also comprises:

After loading the existing file of lyrics synchronized sentence by sentence; According to banner word and the end word in every the lyrics of every lyrics time corresponding identification that parse; To the banner word in every lyrics with finish word, distinguish demonstration, with the said lyrics in other literal distinguish mutually.

3. method according to claim 1 is characterized in that, also comprises:

4. method according to claim 1 is characterized in that, saidly drags handle and comprises for each literal in the lyrics generates lyrics:

5. method according to claim 4 is characterized in that, saidly drags handle and also comprises for each literal in the lyrics generates lyrics:

6. method according to claim 1 is characterized in that, the said playing progress rate of on audio volume control figure, pointing out comprises:

7. the generation system of lyrics synchronized file word for word is characterized in that, comprising:

Lyrics acquiring unit is used to obtain the lyrics of said audio file, confirms that each literal in the lyrics is corresponding to the initial position of said time shaft and show; Said lyrics acquiring unit comprises: lyric characters obtains subelement or/and lyrics file loads subelement, and lyric characters obtains the lyrics that subelement is used to obtain user's input, and lyrics file loads the lyrics that subelement is used for loading existing lyrics file; Said lyrics acquiring unit comprises that also lyrics file is resolved subelement and the lyrics generate subelement, and lyrics file is resolved every lyrics time corresponding of said lyrics file that subelement is used for parsing loading; The lyrics generate subelement and are used for confirming that according to every lyrics time corresponding of said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics; Each literal is corresponding to the initial position of said time shaft and show and specifically comprise in said definite lyrics: when the lyrics that obtain said audio file are to obtain the user manually during the lyrics of input, duration of each word of input is provided with an initial value; When the lyrics that obtain said audio file when loading the lyrics in the existing lyrics file, every lyrics time corresponding comprises the zero-time and the concluding time of these lyrics in the said lyrics file that parses; Deduct the duration that said zero-time obtains these lyrics with the said concluding time,, obtain the average duration of each word divided by the literal number of these lyrics; In conjunction with the zero-time of these lyrics, can calculate the expectation zero-time and the expected concluding time of each literal in these lyrics successively; In lyrics viewing area, show each literal;

8. system according to claim 7 is characterized in that, said lyrics acquiring unit also comprises:

9. system according to claim 7 is characterized in that, the said lyrics drag the handle generation unit and also comprise:

10. system according to claim 7 is characterized in that, the said lyrics drag the handle generation unit and comprise:

11. system according to claim 10 is characterized in that, the said lyrics drag the handle generation unit and also comprise:

12. system according to claim 7 is characterized in that, said playing control unit comprises:

Progress pointer prompting subelement and/or oscillogram color tips subelement;