CN101984490B - Word-for-word synchronous lyric file generating method and system thereof - Google Patents

Word-for-word synchronous lyric file generating method and system thereof Download PDF

Info

Publication number
CN101984490B
CN101984490B CN2010105572583A CN201010557258A CN101984490B CN 101984490 B CN101984490 B CN 101984490B CN 2010105572583 A CN2010105572583 A CN 2010105572583A CN 201010557258 A CN201010557258 A CN 201010557258A CN 101984490 B CN101984490 B CN 101984490B
Authority
CN
China
Prior art keywords
lyrics
word
file
time
literal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN2010105572583A
Other languages
Chinese (zh)
Other versions
CN101984490A (en
Inventor
翟海平
林健
李想
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yeelion Online Network Technology Beijing Co Ltd
Original Assignee
Yeelion Online Network Technology Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yeelion Online Network Technology Beijing Co Ltd filed Critical Yeelion Online Network Technology Beijing Co Ltd
Priority to CN2010105572583A priority Critical patent/CN101984490B/en
Publication of CN101984490A publication Critical patent/CN101984490A/en
Application granted granted Critical
Publication of CN101984490B publication Critical patent/CN101984490B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses a word-for-word synchronous lyric file generating method, including: an audio file is loaded, and audio data and time information are extracted; a time shaft is generated; an audio waveform diagram is generated and displayed; lyric is acquired, and initial position of each character in the lyric corresponding to the time shaft is determined and displayed; a lyric dragging handle for each character is generated; the audio file is played, and playing progress is prompted on the audio waveform diagram; a request of user for adjusting the position of the lyric dragging handle is received, and the position of the lyric dragging handle is adjusted; time information of each character in the lyric is determined; and the determined time information of each character in the lyric is stored, and a word-for-word synchronous lyric file is generated. The invention also discloses a word-for-word synchronous lyric file generating system. By adopting the method or system of the invention, reference for making word-for-word synchronous lyric file can be provided in audition and vision, and accuracy and making speed of word-for-word synchronous lyric file can be improved.

Description

A kind of generation method and system of word for word lyrics synchronized file
Technical field
The present invention relates to copy editor's technical field, particularly relate to a kind of generation method and system of word for word lyrics synchronized file.
Background technology
As everyone knows, most of song all has the lyrics.The voice playing instrument can be through loading the lyrics file of specific format, in played songs, the lyrics of song also is shown to the user.In the practical application, some users not only hope to see the lyrics in played songs, also hope to obtain the synchronous dynamic prompting of these lyrics.For this reason, lyrics synchronized file line by line occurred, this lyrics file is the temporal information of the unit record lyrics with the sentence, and the music instrument just can show the lyrics with sentence by sentence form in displaying audio file synchronously like this.But when the user was higher to the requirement of lyrics promptings, when recreation such as for example playing Karaoka, the lyrics that show with form sentence by sentence obviously can not satisfy user's requirement.
Therefore, occurred word for word lyrics synchronized file in the prior art, through loading this word for word lyrics synchronized file, the voice playing instrument just can be in played songs, and the lyrics are shown with word for word form synchronously.Obviously, the quality of lyrics file has determined the effect of synchronous lyrics.In the prior art, word for word the generation method of lyrics synchronized file mainly is:
At first try to achieve averaging time divided by the number of words of these lyrics with the duration of every lyrics; Again with the duration of this averaging time as each word in these lyrics; Zero-time in conjunction with each word calculates the concluding time, generates word for word lyrics synchronized file thus.Obviously, the file of the word for word lyrics synchronized file that this method generates, the temporal information of each word is very inaccurate.
Summary of the invention
The generation method and system that the purpose of this invention is to provide a kind of word for word lyrics synchronized file, can be from the sense of hearing and visually provide make lyrics synchronized file word for word with reference to foundation, improve the word for word accuracy of lyrics synchronized file.
For realizing above-mentioned purpose, the invention provides following scheme:
A kind of generation method of word for word lyrics synchronized file comprises:
Load audio file, extract the voice data and the temporal information of said audio file;
According to the temporal information of audio file, the rise time axle;
According to the voice data of audio file, corresponding said time shaft generates the audio volume control figure of said audio file and shows;
Obtain the lyrics of said audio file, confirm that each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics;
For generating the lyrics, each literal in the lyrics drags handle;
Displaying audio file, and on audio volume control figure, point out playing progress rate;
Receive the user the said lyrics are dragged the request that the handle position is adjusted, the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;
Preserve the temporal information of each word in the lyrics after confirming, generate word for word lyrics synchronized file.
Preferably, the said lyrics that obtain said audio file comprise:
Obtain the lyrics of user's input.
Preferably, the said lyrics that obtain said audio file comprise:
Load the lyrics in the existing lyrics file.
Preferably, also comprise:
Parse every lyrics time information corresponding in the said lyrics file of loading;
Each literal comprises corresponding to the initial position and the demonstration of said time shaft in said definite lyrics: confirm that according to every lyrics time information corresponding in the said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics.
Preferably, also comprise:
According to the banner word in every the lyrics of every lyrics time corresponding identification that parse and finish word, to the banner word in every lyrics with finish word, distinguish demonstration, with the said lyrics in other literal distinguish mutually.
Preferably, also comprise:
According to the banner word in every the lyrics of every lyrics time corresponding identification that parse and finish word, the banner word in every lyrics and the lyrics that finish word are dragged handle, distinguish demonstration, with the said lyrics in the lyrics of other literal drag handle and distinguish mutually.
Preferably, saidly drag handle and comprise for each literal in the lyrics generates lyrics:
The lyrics that generate corresponding to the zero-time of this literal for each literal drag handle;
Preferably, saidly drag handle and also comprise for each literal in the lyrics generates lyrics:
The lyrics that generate corresponding to concluding time of this literal for each literal drag handle.
Preferably, the said playing progress rate of on audio volume control figure, pointing out comprises:
On audio volume control figure, adopt the playing progress rate pointer to point out, and/or, adopt the part of broadcast corresponding on the various colors identification audio oscillogram and do not play part.
A kind of generation system of word for word lyrics synchronized file comprises:
The audio file extraction unit is used to load audio file, extracts the voice data and the temporal information of said audio file;
The time shaft generation unit is used for the temporal information according to audio file, the rise time axle;
Audio volume control figure generation unit is used for the voice data according to audio file, and corresponding said time shaft generates the audio volume control figure of said audio file and shows;
Lyrics acquiring unit is used to obtain the lyrics of said audio file, confirms that each literal in the lyrics is corresponding to the initial position of said time shaft and show;
The lyrics drag the handle generation unit, and each literal generation lyrics that are used in the lyrics drag handle;
Playing control unit is used for displaying audio file, and on audio volume control figure, points out playing progress rate;
Lyrics adjustment unit is used to receive the user the said lyrics is dragged the request that the handle position is adjusted, and the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;
The lyrics file generation unit is used for preserving the temporal information of each word of the lyrics after confirming, generates word for word lyrics synchronized file.
Preferably, said lyrics acquiring unit comprises:
Lyric characters obtains subelement, is used to obtain the lyrics of user's input.
Preferably, said lyrics acquiring unit comprises:
Lyrics file loads subelement, is used for loading the lyrics of existing lyrics file.
Preferably, said lyrics acquiring unit also comprises:
Lyrics file is resolved subelement, is used for parsing every lyrics time corresponding of said lyrics file of loading;
The lyrics generate subelement, are used for confirming that according to every lyrics time corresponding of said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics.
Preferably, said lyrics acquiring unit also comprises:
Head and the tail block molecular cell is used for according to the banner word of every the lyrics of every lyrics time corresponding identification that parse and finishes word, to the banner word in every lyrics with finish word, distinguish demonstration, with the said lyrics in other literal distinguish mutually.
Preferably, the said lyrics drag the handle generation unit and also comprise:
The head and the tail word lyrics drag handle and distinguish subelement; Be used for banner word and end word according to every the lyrics of every lyrics time corresponding identification that parse; The banner word in every lyrics and the lyrics that finish word are dragged handle; Distinguish demonstration, with the said lyrics in the lyrics of other literal drag handle and distinguish mutually.
Preferably, the said lyrics drag the handle generation unit and comprise:
Zero-time drags handle and generates subelement, is used to the lyrics that each literal generates corresponding to the zero-time of this literal and drags handle;
Preferably, the said lyrics drag the handle generation unit and also comprise:
Concluding time drags handle and generates subelement, is used to the lyrics that each literal generates corresponding to concluding time of this literal and drags handle.
Preferably, said playing control unit comprises:
Progress pointer prompting subelement and/or oscillogram color tips subelement;
Said progress pointer prompting subelement is used on audio volume control figure, adopting the playing progress rate of playing progress rate pointer prompt tone frequency file;
Said oscillogram color tips subelement is used to adopt the part of broadcast corresponding on the various colors identification audio oscillogram and does not play part.
According to specific embodiment provided by the invention; The invention discloses following technique effect: through extracting the voice data of audio file; Generate audio volume control figure, show each literal in the lyrics, and drag handle for each literal generation lyrics corresponding to time shaft and the audio volume control figure of audio file; For the user simultaneously from the sense of hearing and visually provide make lyrics synchronized file word for word with reference to foundation, improved the word for word accuracy of lyrics synchronized file.
Description of drawings
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art; To do to introduce simply to the accompanying drawing of required use among the embodiment below; Obviously, the accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills; Under the prerequisite of not paying creative work property, can also obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the method for making first pass figure of the said word for word lyrics synchronized of embodiment of the invention file;
Fig. 2 is for adopting a kind of software interface synoptic diagram of the said method of the embodiment of the invention;
Fig. 3 is method for making second process flow diagram of the said word for word lyrics synchronized of embodiment of the invention file;
Fig. 4 is the manufacturing system structural drawing of the said word for word lyrics synchronized of embodiment of the invention file;
Fig. 5 is the said lyrics acquiring unit of an embodiment of the invention structural drawing.
Embodiment
To combine the accompanying drawing in the embodiment of the invention below, the technical scheme in the embodiment of the invention is carried out clear, intactly description, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills are not making the every other embodiment that is obtained under the creative work prerequisite, all belong to the scope of the present invention's protection.
Referring to Fig. 1, be the method for making process flow diagram of the said word for word lyrics synchronized of embodiment of the invention file.As shown in Figure 1, the method comprising the steps of:
S101: load audio file, extract the voice data and the temporal information of said audio file;
S102: according to the temporal information of audio file, the rise time axle;
S103: according to the voice data of audio file, corresponding said time shaft generates the audio volume control figure of said audio file and shows;
S104: obtain the lyrics of said audio file, confirm that each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics;
S105: drag handle for each literal in the lyrics generates the lyrics;
S106: displaying audio file, and on audio volume control figure, point out playing progress rate;
S107: receive the user the said lyrics are dragged the request that the handle position is adjusted, the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;
S108: preserve the temporal information of each word in the lyrics after confirming, generate word for word lyrics synchronized file.
Among the step S101, the audio file of loading can be various forms, for example MP3, WMA, APE or the like.After the loading, adopt corresponding demoder to extract the voice data and the temporal information of said audio file.Wherein temporal information specifically can be the time span of audio file.
Among the step S102, the time shaft of generation can show on user interface.
The audio volume control figure that generates among the step S103, the temporal information of each literal provides visual reference frame in the lyrics for the user adjusts.
Concrete, with reference to Fig. 2, for adopting a kind of software interface synoptic diagram of the said method of the embodiment of the invention.Usually, in an audio file, the singer sings the sound big (because like this could give prominence to voice, make hearer hear the clearly lyrics) of the sound of the lyrics than accompaniment.Therefore, the amplitude of singing the audio volume control of lyrics part can be bigger than the amplitude of accompaniment audio volume control partly, forms a vibration more significantly.That is to say that each word corresponding audio waveform roughly can form a kind of like this vibration: this vibration and accompaniment oscillating phase ratio partly, amplitude is bigger, relatively significantly; And each vibration all is to be starting point with less amplitude, begins then to increase gradually, reduces gradually after reaching peak value again, finishes with less amplitude; Wherein, each vibration initial and finish the initial and end of just corresponding the lyric characters of singing.Therefore, utilize this characteristics, just can confirm the initial and concluding time of the corresponding lyric characters of singing according to the initial and end position of each vibration on the audio volume control figure.
In addition, need to prove that there is corresponding relation in the audio volume control figure that generates among the step S103 with the time shaft of audio file.Any place's audio volume control all has corresponding with it time zone on the audio volume control figure on time shaft.
Among the step S104, obtaining the lyrics of said audio file, can be to obtain the manually lyrics of input of user, also can load the lyrics in the existing lyrics file.
When obtaining the user manually during the lyrics of input: can an initial value be set, for example 1 second to the duration of each word of input.Can be with first word acquiescence corresponding between 0 second to 1 second of audio file, second word is corresponding between 1 second to 2 seconds of audio file, and the rest may be inferred.When the input lyrics number of words more for a long time, can adjust the duration of each word, the principle of adjustment is: the summation of the duration of all literal is no more than the time span of this audio file in the lyrics.
When loading the lyrics in the existing lyrics file: existing lyrics file can be the file that common suffix is called the .1rc type.Usually this existing lyrics file is sentence by sentence synchronous, also promptly, has included the temporal information of the lyrics in this lyrics file, and only this temporal information only is to be directed against every lyrics.
In order to make full use of the temporal information in the existing file of lyrics synchronized sentence by sentence, the said method of the embodiment of the invention also comprises: parse every lyrics time corresponding in the said lyrics file of loading; Confirm that according to every lyrics time corresponding in the said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics.
Wherein, every lyrics time corresponding comprises the zero-time and the concluding time of these lyrics in the said lyrics file that parses.Confirm that according to every lyrics time corresponding in the said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics; Specifically can be: deduct the duration that said zero-time obtains these lyrics with the said concluding time; Divided by the literal number of these lyrics, obtain the average duration of each word; In conjunction with the zero-time of these lyrics, can calculate the expectation zero-time and the expected concluding time of each literal in these lyrics successively; In lyrics viewing area, show each literal.The viewing area of each literal wherein is corresponding to the expectation duration scope (being the expectation zero-time of this literal on the time shaft and the scope between the expected concluding time) of this literal on the time shaft.
After loading the existing file of lyrics synchronized sentence by sentence; Can also and finish word according to the banner word in every the lyrics of every lyrics time corresponding identification that parse; To the banner word in every lyrics with finish word, distinguish demonstration, with the said lyrics in other literal distinguish mutually.Concrete, can banner word in every lyrics and the font size that finishes word be adjusted, use than the big font of other literal in these lyrics, show banner word and finish word; Also can with the banner word in every lyrics with finish word, use with these lyrics in other literal various colors, show.
Among the step S105,, each literal in the lyrics drags handle for generating the lyrics.Each literal can correspond respectively to two said lyrics and drag handle, to confirm the zero-time and the concluding time of each word respectively.Concrete, the lyrics drag the position of handle and there is corresponding relation in the time point on the time shaft.As shown in Figure 2, it is corresponding with the zero-time of this word that the lyrics on each literal left side drag handle, and it is corresponding with the concluding time of this word that the lyrics on the right drag handle.The zero-time or the concluding time of corresponding word can be regulated in the position that drags handle through the adjustment lyrics.
Among the step S105, also can be only drag handle corresponding to the lyrics of the zero-time of this literal for each literal generation.In this case, the lyrics between two adjacent words drag handle, except the zero-time of that word of expression back, also represent the concluding time of before word.
The difference of the lyrics file that two kinds of situation generate is, the former is for the demonstration time of each word in the lyrics, representes that this literal should sing in the demonstration time; The latter is for the demonstration time of each word in the lyrics, except representing the singing time of this literal, possibly represent also that this word is sung to finish, but next word do not begin that section accompaniment time of singing as yet.
The method for making of the said word for word lyrics synchronized of embodiment of the invention file; Can also and finish word according to the banner word in every the lyrics of every lyrics time corresponding identification that parse; The banner word in every lyrics and the lyrics that finish word are dragged handle; Distinguish demonstration, with the said lyrics in the lyrics of other literal drag handle and distinguish mutually.For example: the banner word in every lyrics and the lyrics that finish word are dragged the color of handle, be arranged to drag the handle various colors with the lyrics of other word; Perhaps the banner word in every lyrics and the lyrics that finish word are dragged the shape of handle, be arranged to drag the different shape of handle with the lyrics of other word.
Among the step S106, the method for prompting playing progress rate on audio volume control figure can be on audio volume control figure, to adopt the playing progress rate pointer to point out, and also can be to adopt the part of broadcast corresponding on the various colors identification audio oscillogram and do not play part.
Wherein, the playing progress rate pointer can be the vertical line (can certainly be other shapes) on audio volume control figure.This playing progress rate pointer position on audio volume control figure indicates where this audio file has played to.The user can also drag this playing progress rate pointer, with the playing progress rate of adjustment audio file.Adopt the part of broadcast corresponding on the various colors identification audio oscillogram and do not play part, for instance: can be designated green with having play part on the audio volume control figure, not play part and be designated redness.
In addition, adopt playing progress rate pointer and the method that adopts these two kinds prompting playing progress rates on audio volume control figure of various colors sign, can use separately, also can use simultaneously.
By on can know, the preferred embodiment of the method for making of word for word lyrics synchronized file according to the invention, as shown in Figure 3, comprise step:
S201: load audio file, extract the voice data and the temporal information of said audio file;
S202: according to the temporal information of audio file, the rise time axle;
S203: according to the voice data of audio file, corresponding said time shaft generates the audio volume control figure of said audio file and shows;
S204: load the lyrics in the existing lyrics file;
S205: parse every lyrics time corresponding in the said lyrics file of loading, confirm that according to every lyrics time corresponding in the said lyrics file each literal in the lyrics is corresponding to the initial position of said time shaft and show;
S206:,, distinguish demonstration to banner word in every lyrics and end word according to banner word and the end word in every the lyrics of every lyrics time corresponding identification that parse;
S207: drag handle for each literal in the lyrics generates the lyrics;
S208: banner word and end word according in every the lyrics of every lyrics time corresponding identification that parse, the banner word in every lyrics and the lyrics that finish word are dragged handle, distinguish demonstration;
S209: displaying audio file, and on audio volume control figure, adopt playing progress rate pointer prompting playing progress rate, adopt the part of broadcast corresponding on the various colors identification audio oscillogram and do not play part;
S210: receive the user the said lyrics are dragged the request that the handle position is adjusted, the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;
S211: preserve the temporal information of each word in the lyrics after confirming, generate word for word lyrics synchronized file.
Corresponding with the method for making of the said word for word lyrics synchronized of embodiment of the invention file, the embodiment of the invention also discloses a kind of manufacturing system of word for word lyrics synchronized file.
Referring to Fig. 4, be the manufacturing system structural drawing of the said word for word lyrics synchronized of embodiment of the invention file.This system comprises:
Audio file extraction unit 401 is used to load audio file, extracts the voice data and the temporal information of said audio file;
Time shaft generation unit 402 is used for the temporal information according to audio file, the rise time axle;
Audio volume control figure generation unit 403 is used for the voice data according to audio file, and corresponding said time shaft generates the audio volume control figure of said audio file and shows;
Lyrics acquiring unit 404 is used to obtain the lyrics of said audio file, confirms that each literal in the lyrics is corresponding to the initial position of said time shaft and show;
The lyrics drag handle generation unit 405, and each literal generation lyrics that are used in the lyrics drag handle;
Playing control unit 406 is used for displaying audio file, and on audio volume control figure, points out playing progress rate;
Lyrics adjustment unit 407 is used to receive the user the said lyrics is dragged the request that the handle position is adjusted, and the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;
Lyrics file generation unit 408 is used for preserving the temporal information of each word of the lyrics after confirming, generates word for word lyrics synchronized file.
Wherein, the audio volume control figure of audio volume control figure generation unit 403 generations has following characteristics:
Usually, in an audio file, the singer sings the sound big (because like this could give prominence to voice, make hearer hear the clearly lyrics) of the sound of the lyrics than accompaniment.Therefore, the amplitude of singing the audio volume control of lyrics part can be bigger than the amplitude of accompaniment audio volume control partly, forms a vibration more significantly.That is to say that each word corresponding audio waveform roughly can form a kind of like this vibration: this vibration and accompaniment oscillating phase ratio partly, amplitude is bigger, relatively significantly; And each vibration all is to be starting point with less amplitude, begins then to increase gradually, reduces gradually after reaching peak value again, finishes with less amplitude; Wherein, each vibration initial and finish the initial and end of just corresponding the lyric characters of singing.Therefore, utilize this characteristics, just can confirm the initial and concluding time of the corresponding lyric characters of singing according to the initial and end position of each vibration on the audio volume control figure.
As shown in Figure 5, lyrics acquiring unit 404 can comprise that lyric characters obtains subelement 4041, is used to obtain the lyrics of user's input; Can comprise that also lyrics file loads subelement 4042, is used for loading the lyrics of existing lyrics file.Existing lyrics file can be the lyrics file of various forms, for example: the file of common suffix .lrc type by name.
When adopting lyric characters to obtain subelement 4041, can an initial value be set, for example 1 second to the duration of each word of input.Can be with first word acquiescence corresponding between 0 second to 1 second of audio file, second word is corresponding between 1 second to 2 seconds of audio file, and the rest may be inferred.When the input lyrics number of words more for a long time, can adjust the duration of each word, the principle of adjustment is: the summation of the duration of all literal is no more than the time span of this audio file in the lyrics.
Lyrics acquiring unit 404 can also comprise:
Lyrics file is resolved subelement 4043, is used for parsing every lyrics time corresponding of said lyrics file of loading;
The lyrics generate subelement 4044, are used for confirming that according to every lyrics time corresponding of said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics.
Wherein, lyrics file is resolved in the said lyrics file that subelement 4043 parses zero-time and the concluding time that every lyrics time corresponding comprises these lyrics.The lyrics generate subelement 4044 and confirm that according to every lyrics time corresponding in the said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics; Specifically can be: deduct the duration that said zero-time obtains these lyrics with the said concluding time; Divided by the literal number of these lyrics, obtain the average duration of each word; In conjunction with the zero-time of these lyrics, can calculate the expectation zero-time and the expected concluding time of each literal in these lyrics successively; In lyrics viewing area, show each literal.The viewing area of each literal wherein is corresponding to the expectation duration scope (being the expectation zero-time of this literal on the time shaft and the scope between the expected concluding time) of this literal on the time shaft.
Concrete, for example: lyrics file is resolved subelement 4043 and is parsed that a certain sentence lyrics time corresponding is between 30 seconds to 40 seconds in the lyrics file of loading, and these lyrics have ten words; Then lyrics generation subelement 4044 can show 30 second to 31 second the interval of first word corresponding to time shaft, and with 31 second to 32 second the interval demonstration of second word corresponding to time shaft, the rest may be inferred.
In order to let user easier distinguish the banner word and end word of every lyrics, the said lyrics acquiring unit 404 of the embodiment of the invention can also comprise:
Head and the tail block molecular cell 4045; Be used for banner word and end word according to every the lyrics of every lyrics time corresponding identification that parse; To the banner word in every lyrics with finish word, distinguish demonstration, with the said lyrics in other literal distinguish mutually.
Concrete, can banner word in every lyrics and the font size that finishes word be adjusted, use than the big font of other literal in these lyrics, show banner word and finish word; Also can with the banner word in every lyrics with finish word, use with these lyrics in other literal various colors, show.
Similarly, the said lyrics of the embodiment of the invention drag handle generation unit 405 and can also comprise:
The head and the tail word lyrics drag handle and distinguish subelement; Be used for banner word and end word according to every the lyrics of every lyrics time corresponding identification that parse; The banner word in every lyrics and the lyrics that finish word are dragged handle; Distinguish demonstration, with the said lyrics in the lyrics of other literal drag handle and distinguish mutually.
For example: the banner word in every lyrics and the lyrics that finish word are dragged the color of handle, be arranged to drag the handle various colors with the lyrics of other word; Perhaps the banner word in every lyrics and the lyrics that finish word are dragged the shape of handle, be arranged to drag the different shape of handle with the lyrics of other word.
The said lyrics of the embodiment of the invention drag handle generation unit 405, can comprise:
Zero-time drags handle and generates subelement, is used to the lyrics that each literal generates corresponding to the zero-time of this literal and drags handle;
The lyrics drag handle generation unit 405, can also comprise:
Concluding time drags handle and generates subelement, is used to the lyrics that each literal generates corresponding to concluding time of this literal and drags handle.
Concrete; Include only zero-time and drag handle when generating subelement when the lyrics drag handle generation unit 405; The said system of the embodiment of the invention can all generate lyrics for each literal in the lyrics and drag handle, and these lyrics drag handle is represented this word corresponding to the position on the time shaft zero-time.The lyrics between two adjacent lyric characters drag the zero-time of handle except that word of expression back, also represent the concluding time of before word.
When dragging handle generation unit 405, the lyrics comprise that simultaneously zero-time drags handle when generating subelement and dragging handle generation subelement with the concluding time; The said system of the embodiment of the invention can all generate two lyrics for each literal in the lyrics and drag handle, corresponds respectively to the zero-time and the concluding time of this literal.In this case, the zero-time of each literal and concluding time all can be adjusted separately, can the temporal information of adjacent literal not impacted.
The said playing control unit 406 of the embodiment of the invention can comprise:
Progress pointer prompting subelement, and/or, oscillogram color tips subelement;
Said progress pointer prompting subelement is used on audio volume control figure, adopting the playing progress rate of playing progress rate pointer prompt tone frequency file;
Said oscillogram color tips subelement is used to adopt the part of broadcast corresponding on the various colors identification audio oscillogram and does not play part.
Wherein, the playing progress rate pointer of progress pointer prompting subelement can be the vertical line (can certainly be other shapes) on audio volume control figure.This playing progress rate pointer position on audio volume control figure indicates where this audio file has played to.The user can also drag this playing progress rate pointer, with the playing progress rate of adjustment audio file.Oscillogram color tips subelement adopts the part of broadcast corresponding on the various colors identification audio oscillogram and does not play part, for instance: can be designated green with having play part on the audio volume control figure, not play part and be designated redness.
More than to a kind of video checking method provided by the present invention and system, carried out detailed introduction.Used concrete example among this paper principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, part all can change on embodiment and range of application.In sum, this description should not be construed as limitation of the present invention.

Claims (12)

1. the generation method of lyrics synchronized file word for word is characterized in that, comprising:
Load audio file, extract the voice data and the temporal information of said audio file;
According to the temporal information of audio file, the rise time axle;
According to the voice data of audio file, corresponding said time shaft generates the audio volume control figure of said audio file and shows;
Obtain the lyrics of said audio file, comprise the lyrics that obtain the manual input of user or load the lyrics in the existing lyrics file;
Confirm that each literal in the lyrics corresponding to the initial position of said time shaft and show, specifically comprises: when the lyrics that obtain said audio file are to obtain the user manually during the lyrics of input, duration of each word of input is provided with an initial value; When the lyrics that obtain said audio file when loading the lyrics in the existing lyrics file, every lyrics time corresponding comprises the zero-time and the concluding time of these lyrics in the said lyrics file that parses; Deduct the duration that said zero-time obtains these lyrics with the said concluding time,, obtain the average duration of each word divided by the literal number of these lyrics; In conjunction with the zero-time of these lyrics, can calculate the expectation zero-time and the expected concluding time of each literal in these lyrics successively; In lyrics viewing area, show each literal;
For generating the lyrics, each literal in the lyrics drags handle;
Displaying audio file, and on audio volume control figure, point out playing progress rate;
Receive the user the said lyrics are dragged the request that the handle position is adjusted, the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;
Preserve the temporal information of each word in the lyrics after confirming, generate word for word lyrics synchronized file.
2. method according to claim 1 is characterized in that, also comprises:
After loading the existing file of lyrics synchronized sentence by sentence; According to banner word and the end word in every the lyrics of every lyrics time corresponding identification that parse; To the banner word in every lyrics with finish word, distinguish demonstration, with the said lyrics in other literal distinguish mutually.
3. method according to claim 1 is characterized in that, also comprises:
According to the banner word in every the lyrics of every lyrics time corresponding identification that parse and finish word, the banner word in every lyrics and the lyrics that finish word are dragged handle, distinguish demonstration, with the said lyrics in the lyrics of other literal drag handle and distinguish mutually.
4. method according to claim 1 is characterized in that, saidly drags handle and comprises for each literal in the lyrics generates lyrics:
The lyrics that generate corresponding to the zero-time of this literal for each literal drag handle;
5. method according to claim 4 is characterized in that, saidly drags handle and also comprises for each literal in the lyrics generates lyrics:
The lyrics that generate corresponding to concluding time of this literal for each literal drag handle.
6. method according to claim 1 is characterized in that, the said playing progress rate of on audio volume control figure, pointing out comprises:
On audio volume control figure, adopt the playing progress rate pointer to point out, and/or, adopt the part of broadcast corresponding on the various colors identification audio oscillogram and do not play part.
7. the generation system of lyrics synchronized file word for word is characterized in that, comprising:
The audio file extraction unit is used to load audio file, extracts the voice data and the temporal information of said audio file;
The time shaft generation unit is used for the temporal information according to audio file, the rise time axle;
Audio volume control figure generation unit is used for the voice data according to audio file, and corresponding said time shaft generates the audio volume control figure of said audio file and shows;
Lyrics acquiring unit is used to obtain the lyrics of said audio file, confirms that each literal in the lyrics is corresponding to the initial position of said time shaft and show; Said lyrics acquiring unit comprises: lyric characters obtains subelement or/and lyrics file loads subelement, and lyric characters obtains the lyrics that subelement is used to obtain user's input, and lyrics file loads the lyrics that subelement is used for loading existing lyrics file; Said lyrics acquiring unit comprises that also lyrics file is resolved subelement and the lyrics generate subelement, and lyrics file is resolved every lyrics time corresponding of said lyrics file that subelement is used for parsing loading; The lyrics generate subelement and are used for confirming that according to every lyrics time corresponding of said lyrics file each literal is corresponding to the initial position and the demonstration of said time shaft in the lyrics; Each literal is corresponding to the initial position of said time shaft and show and specifically comprise in said definite lyrics: when the lyrics that obtain said audio file are to obtain the user manually during the lyrics of input, duration of each word of input is provided with an initial value; When the lyrics that obtain said audio file when loading the lyrics in the existing lyrics file, every lyrics time corresponding comprises the zero-time and the concluding time of these lyrics in the said lyrics file that parses; Deduct the duration that said zero-time obtains these lyrics with the said concluding time,, obtain the average duration of each word divided by the literal number of these lyrics; In conjunction with the zero-time of these lyrics, can calculate the expectation zero-time and the expected concluding time of each literal in these lyrics successively; In lyrics viewing area, show each literal;
The lyrics drag the handle generation unit, and each literal generation lyrics that are used in the lyrics drag handle;
Playing control unit is used for displaying audio file, and on audio volume control figure, points out playing progress rate;
Lyrics adjustment unit is used to receive the user the said lyrics is dragged the request that the handle position is adjusted, and the position that the said lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the said time shaft according to the adjusted said lyrics, confirm the temporal information of each word in the lyrics;
The lyrics file generation unit is used for preserving the temporal information of each word of the lyrics after confirming, generates word for word lyrics synchronized file.
8. system according to claim 7 is characterized in that, said lyrics acquiring unit also comprises:
Head and the tail block molecular cell is used for according to the banner word of every the lyrics of every lyrics time corresponding identification that parse and finishes word, to the banner word in every lyrics with finish word, distinguish demonstration, with the said lyrics in other literal distinguish mutually.
9. system according to claim 7 is characterized in that, the said lyrics drag the handle generation unit and also comprise:
The head and the tail word lyrics drag handle and distinguish subelement; Be used for banner word and end word according to every the lyrics of every lyrics time corresponding identification that parse; The banner word in every lyrics and the lyrics that finish word are dragged handle; Distinguish demonstration, with the said lyrics in the lyrics of other literal drag handle and distinguish mutually.
10. system according to claim 7 is characterized in that, the said lyrics drag the handle generation unit and comprise:
Zero-time drags handle and generates subelement, is used to the lyrics that each literal generates corresponding to the zero-time of this literal and drags handle;
11. system according to claim 10 is characterized in that, the said lyrics drag the handle generation unit and also comprise:
Concluding time drags handle and generates subelement, is used to the lyrics that each literal generates corresponding to concluding time of this literal and drags handle.
12. system according to claim 7 is characterized in that, said playing control unit comprises:
Progress pointer prompting subelement and/or oscillogram color tips subelement;
Said progress pointer prompting subelement is used on audio volume control figure, adopting the playing progress rate of playing progress rate pointer prompt tone frequency file;
Said oscillogram color tips subelement is used to adopt the part of broadcast corresponding on the various colors identification audio oscillogram and does not play part.
CN2010105572583A 2010-11-23 2010-11-23 Word-for-word synchronous lyric file generating method and system thereof Active CN101984490B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010105572583A CN101984490B (en) 2010-11-23 2010-11-23 Word-for-word synchronous lyric file generating method and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010105572583A CN101984490B (en) 2010-11-23 2010-11-23 Word-for-word synchronous lyric file generating method and system thereof

Publications (2)

Publication Number Publication Date
CN101984490A CN101984490A (en) 2011-03-09
CN101984490B true CN101984490B (en) 2012-06-27

Family

ID=43641660

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010105572583A Active CN101984490B (en) 2010-11-23 2010-11-23 Word-for-word synchronous lyric file generating method and system thereof

Country Status (1)

Country Link
CN (1) CN101984490B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106294547A (en) * 2016-07-25 2017-01-04 惠州Tcl移动通信有限公司 A kind of method and system intercepting audio file

Families Citing this family (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102290081A (en) * 2011-06-27 2011-12-21 深圳市基思瑞科技有限公司 Language study play control method
CN102324191B (en) * 2011-09-28 2015-01-07 Tcl集团股份有限公司 Method and system for synchronously displaying audio book word by word
CN102568527A (en) * 2011-12-20 2012-07-11 广东步步高电子工业有限公司 Method and system for easily cutting audio files and applied mobile handheld device
CN102820027B (en) * 2012-06-21 2014-04-16 福建星网视易信息系统有限公司 Accompaniment subtitle display system and method
CN102881309B (en) * 2012-09-24 2016-08-24 广东欧珀移动通信有限公司 Lyrics file generates method and device
CN105609121B (en) * 2014-11-20 2019-03-12 广州酷狗计算机科技有限公司 Multimedia progress monitoring method and device
CN104575542A (en) * 2014-12-15 2015-04-29 天脉聚源(北京)科技有限公司 Method and device for realizing audio regional play
CN104751870B (en) * 2015-03-24 2018-07-06 联想(北京)有限公司 A kind of information processing method and electronic equipment
CN105893430A (en) * 2015-12-08 2016-08-24 乐视移动智能信息技术(北京)有限公司 Lyrics matching method and device
CN105868307A (en) * 2016-03-26 2016-08-17 深圳市金立通信设备有限公司 An audio frequency information display method and a terminal
CN105827997A (en) * 2016-04-26 2016-08-03 厦门幻世网络科技有限公司 Method and device for dubbing audio and visual digital media
CN106297847B (en) * 2016-08-12 2019-06-04 青岛海信移动通信技术股份有限公司 A kind of reminding method and equipment of lyrics playing duration
CN106652983B (en) * 2016-09-18 2021-04-02 福建网龙计算机网络信息技术有限公司 Subtitle making method and system
CN108206029A (en) * 2016-12-16 2018-06-26 北京酷我科技有限公司 A kind of method and system for realizing the word for word lyrics
CN108563650B (en) * 2017-12-15 2019-10-15 维沃移动通信有限公司 A kind of adjusting method and mobile terminal of audio file playback progress
CN109543064B (en) * 2018-11-30 2020-12-18 北京微播视界科技有限公司 Lyric display processing method and device, electronic equipment and computer storage medium
CN109558510A (en) * 2018-12-06 2019-04-02 北京微播视界科技有限公司 Lyrics analytic method, device, electronic equipment and computer storage medium
CN109672832B (en) * 2018-12-20 2021-07-02 四川湖山电器股份有限公司 Processing method for realizing dynamic special effect display of digital film song-inserting lyrics captions
CN110087122A (en) * 2019-05-06 2019-08-02 北京字节跳动网络技术有限公司 For handling system, the method and apparatus of information
CN110867180B (en) * 2019-10-15 2022-03-29 北京雷石天地电子技术有限公司 System and method for generating word-by-word lyric file based on K-means clustering algorithm
CN114064963A (en) * 2020-08-03 2022-02-18 北京字跳网络技术有限公司 Information display method and device
CN112115283A (en) * 2020-08-25 2020-12-22 天津洪恩完美未来教育科技有限公司 Method, device and equipment for processing picture book data
CN112347298A (en) * 2020-11-13 2021-02-09 广州酷狗计算机科技有限公司 Character information display method, device, terminal and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3662969B2 (en) * 1995-03-06 2005-06-22 富士通株式会社 Karaoke system
JP4994623B2 (en) * 2005-08-31 2012-08-08 富士通株式会社 Text editing / playback device, content editing / playback device, and text editing / playback method
JPWO2007066813A1 (en) * 2005-12-09 2009-05-21 ソニー株式会社 Music editing apparatus, music editing information creation method, and recording medium on which music editing information is recorded

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106294547A (en) * 2016-07-25 2017-01-04 惠州Tcl移动通信有限公司 A kind of method and system intercepting audio file

Also Published As

Publication number Publication date
CN101984490A (en) 2011-03-09

Similar Documents

Publication Publication Date Title
CN101984490B (en) Word-for-word synchronous lyric file generating method and system thereof
TWI576822B (en) Processing method of making song request and system thereof
CN106448630B (en) Method and device for generating digital music score file of song
CN105006234A (en) Karaoke processing method and apparatus
CN101694772A (en) Method for converting text into rap music and device thereof
CN104978973B (en) A kind of audio-frequency processing method and device
CN102915725A (en) Human-computer interaction song singing system and method
EP1746576A3 (en) Method and apparatus for outputting audio data and musical score image
JP2017513049A (en) How to provide users with feedback on the performance of karaoke songs
WO2016188211A1 (en) Audio processing method, apparatus and system
CN107481735A (en) A kind of method, server and the computer-readable recording medium of transducing audio sounding
CN101615417B (en) Synchronous Chinese lyrics display method which is accurate to words
CN108109652A (en) A kind of method of K songs chorus recording
JP6452229B2 (en) Karaoke sound effect setting system
JP5598516B2 (en) Voice synthesis system for karaoke and parameter extraction device
CN101930732B (en) Music producing method and device based on user input voice and intelligent terminal
CN108346418A (en) A kind of method, system and terminal that song generates
JP4171680B2 (en) Information setting device, information setting method, and information setting program for music playback device
JP2010237260A (en) Karaoke machine emphasizing main voice part of chorus music
JP6589521B2 (en) Singing standard data correction device, karaoke system, program
JP7117228B2 (en) karaoke system, karaoke machine
JP6596903B2 (en) Information providing system and information providing method
JP2017116899A (en) Karaoke generation by voice input
CN106652983B (en) Subtitle making method and system
JP6144593B2 (en) Singing scoring system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant