CN101984490A - Word-for-word synchronous lyric file generating method and system thereof - Google Patents

Word-for-word synchronous lyric file generating method and system thereof Download PDF

Info

Publication number
CN101984490A
CN101984490A CN 201010557258 CN201010557258A CN101984490A CN 101984490 A CN101984490 A CN 101984490A CN 201010557258 CN201010557258 CN 201010557258 CN 201010557258 A CN201010557258 A CN 201010557258A CN 101984490 A CN101984490 A CN 101984490A
Authority
CN
China
Prior art keywords
lyrics
word
file
literal
handle
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 201010557258
Other languages
Chinese (zh)
Other versions
CN101984490B (en
Inventor
翟海平
林健
李想
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Yeelion Online Network Technology Beijing Co Ltd
Original Assignee
Yeelion Online Network Technology Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Yeelion Online Network Technology Beijing Co Ltd filed Critical Yeelion Online Network Technology Beijing Co Ltd
Priority to CN2010105572583A priority Critical patent/CN101984490B/en
Publication of CN101984490A publication Critical patent/CN101984490A/en
Application granted granted Critical
Publication of CN101984490B publication Critical patent/CN101984490B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses a word-for-word synchronous lyric file generating method, including: an audio file is loaded, and audio data and time information are extracted; a time shaft is generated; an audio waveform diagram is generated and displayed; lyric is acquired, and initial position of each character in the lyric corresponding to the time shaft is determined and displayed; a lyric dragging handle for each character is generated; the audio file is played, and playing progress is prompted on the audio waveform diagram; a request of user for adjusting the position of the lyric dragging handle is received, and the position of the lyric dragging handle is adjusted; time information of each character in the lyric is determined; and the determined time information of each character in the lyric is stored, and a word-for-word synchronous lyric file is generated. The invention also discloses a word-for-word synchronous lyric file generating system. By adopting the method or system of the invention, reference for making word-for-word synchronous lyric file can be provided in audition and vision, and accuracy and making speed of word-for-word synchronous lyric file can be improved.

Description

A kind of generation method and system of word for word synchronized lyrics file
Technical field
The present invention relates to copy editor's technical field, particularly relate to a kind of generation method and system of word for word synchronized lyrics file.
Background technology
As everyone knows, most of song all has the lyrics.The voice playing instrument can be by loading the lyrics file of specific format, in played songs, the lyrics of song also is shown to the user.In the practical application, some users not only wish to see the lyrics in played songs, also wish to obtain the synchronous dynamic prompting of these lyrics.For this reason, synchronized lyrics file line by line occurred, this lyrics file is the temporal information of the unit record lyrics with the sentence, and the music instrument just can show the lyrics with sentence by sentence form in displaying audio file synchronously like this.But when the user was higher to the requirement of lyrics promptings, when recreation such as for example playing Karaoka, the lyrics that show with form sentence by sentence obviously can not satisfy user's requirement.
Therefore, occurred word for word synchronized lyrics file in the prior art, by loading this word for word synchronized lyrics file, the voice playing instrument just can be in played songs, and the lyrics are shown synchronously with word for word form.Obviously, the quality of lyrics file has determined the effect of synchronous lyrics.In the prior art, word for word the generation method of synchronized lyrics file mainly is:
At first try to achieve averaging time divided by the number of words of these lyrics with the duration of every lyrics, again with the duration of this averaging time as each word in these lyrics, zero-time in conjunction with each word calculates the concluding time, generates word for word synchronized lyrics file thus.Obviously, the file of the word for word synchronized lyrics file that this method generates, the temporal information of each word is very inaccurate.
Summary of the invention
The generation method and system that the purpose of this invention is to provide a kind of word for word synchronized lyrics file, can be from the sense of hearing and visually provide make synchronized lyrics file word for word with reference to foundation, improve the word for word accuracy of synchronized lyrics file.
For achieving the above object, the invention provides following scheme:
A kind of generation method of word for word synchronized lyrics file comprises:
Load audio file, extract the voice data and the temporal information of described audio file;
According to the temporal information of audio file, the rise time axle;
According to the voice data of audio file, corresponding described time shaft generates the audio volume control figure of described audio file and shows;
Obtain the lyrics of described audio file, determine that each literal is corresponding to the initial position and the demonstration of described time shaft in the lyrics;
For generating the lyrics, each literal in the lyrics drags handle;
Displaying audio file, and on audio volume control figure, point out playing progress rate;
Receive the user the described lyrics are dragged the request that the handle position is adjusted, the position that the described lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the described time shaft according to the adjusted described lyrics, determine the temporal information of each word in the lyrics;
Preserve the temporal information of each word in the lyrics after determining, generate word for word synchronized lyrics file.
Preferably, the described lyrics that obtain described audio file comprise:
Obtain the lyrics of user's input.
Preferably, the described lyrics that obtain described audio file comprise:
Load the lyrics in the existing lyrics file.
Preferably, also comprise:
Parse every lyrics time information corresponding in the described lyrics file of loading;
Each literal comprises corresponding to the initial position and the demonstration of described time shaft in described definite lyrics: determine that according to every lyrics time information corresponding in the described lyrics file each literal is corresponding to the initial position and the demonstration of described time shaft in the lyrics.
Preferably, also comprise:
According to the banner word in every the lyrics of every lyrics time corresponding identification that parse and finish word, to the banner word in every lyrics with finish word, distinguish demonstration, with the described lyrics in other literal distinguish mutually.
Preferably, also comprise:
According to the banner word in every the lyrics of every lyrics time corresponding identification that parse and finish word, the banner word in every lyrics and the lyrics that finish word are dragged handle, distinguish demonstration, with the described lyrics in the lyrics of other literal drag handle and distinguish mutually.
Preferably, describedly drag handle and comprise for each literal in the lyrics generates lyrics:
The lyrics that generate corresponding to the zero-time of this literal for each literal drag handle;
Preferably, describedly drag handle and also comprise for each literal in the lyrics generates lyrics:
The lyrics that generate corresponding to concluding time of this literal for each literal drag handle.
Preferably, the described playing progress rate of pointing out on audio volume control figure comprises:
On audio volume control figure, adopt the playing progress rate pointer to point out, and/or, adopt the part of broadcast corresponding on the different colour code audio volume control figure and do not play part.
A kind of generation system of word for word synchronized lyrics file comprises:
The audio file extraction unit is used to load audio file, extracts the voice data and the temporal information of described audio file;
The time shaft generation unit is used for the temporal information according to audio file, the rise time axle;
Audio volume control figure generation unit is used for the voice data according to audio file, and corresponding described time shaft generates the audio volume control figure of described audio file and shows;
Lyrics acquiring unit is used to obtain the lyrics of described audio file, determines that each literal in the lyrics is corresponding to the initial position of described time shaft and show;
The lyrics drag the handle generation unit, are used to each literal generation lyrics in the lyrics to drag handle;
Playing control unit is used for displaying audio file, and points out playing progress rate on audio volume control figure;
Lyrics adjustment unit is used to receive the user the described lyrics is dragged the request that the handle position is adjusted, and the position that the described lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the described time shaft according to the adjusted described lyrics, determine the temporal information of each word in the lyrics;
The lyrics file generation unit is used for preserving the temporal information of each word of the lyrics after determining, generates word for word synchronized lyrics file.
Preferably, described lyrics acquiring unit comprises:
Lyric characters obtains subelement, is used to obtain the lyrics of user's input.
Preferably, described lyrics acquiring unit comprises:
Lyrics file loads subelement, is used for loading the lyrics of existing lyrics file.
Preferably, described lyrics acquiring unit also comprises:
Lyrics file is resolved subelement, is used for parsing every lyrics time corresponding of described lyrics file of loading;
The lyrics generate subelement, are used for determining that according to every lyrics time corresponding of described lyrics file each literal is corresponding to the initial position and the demonstration of described time shaft in the lyrics.
Preferably, described lyrics acquiring unit also comprises:
Head and the tail block molecular cell is used for according to the banner word of every the lyrics of every lyrics time corresponding identification that parse and finishes word, to the banner word in every lyrics with finish word, distinguishes demonstration, with the described lyrics in other literal distinguish mutually.
Preferably, the described lyrics drag the handle generation unit and also comprise:
The head and the tail word lyrics drag handle and distinguish subelement, be used for banner word and end word according to every the lyrics of every lyrics time corresponding identification that parse, the lyrics to banner word in every lyrics and end word drag handle, distinguish demonstration, with the described lyrics in the lyrics of other literal drag handle and distinguish mutually.
Preferably, the described lyrics drag the handle generation unit and comprise:
Zero-time drags handle and generates subelement, is used to each literal generation to drag handle corresponding to the lyrics of the zero-time of this literal;
Preferably, the described lyrics drag the handle generation unit and also comprise:
Concluding time drags handle and generates subelement, is used to each literal generation to drag handle corresponding to the lyrics of the concluding time of this literal.
Preferably, described playing control unit comprises:
Progress pointer prompting subelement and/or oscillogram color tips subelement;
Described progress pointer prompting subelement is used for adopting the playing progress rate of playing progress rate pointer prompt tone frequency file on audio volume control figure;
Described oscillogram color tips subelement is used to adopt the part of broadcast corresponding on the different colour code audio volume control figure and does not play part.
According to specific embodiment provided by the invention, the invention discloses following technique effect: by extracting the voice data of audio file, generate audio volume control figure, show each literal in the lyrics corresponding to the time shaft of audio file and audio volume control figure, and, each literal drags handle for generating lyrics, for the user simultaneously from the sense of hearing and visually provide make synchronized lyrics file word for word with reference to foundation, improved the word for word accuracy of synchronized lyrics file.
Description of drawings
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art, to do to introduce simply to the accompanying drawing of required use among the embodiment below, apparently, accompanying drawing in describing below only is some embodiments of the present invention, for those of ordinary skills, under the prerequisite of not paying creative work, can also obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the method for making first pass figure of the described word for word synchronized lyrics of embodiment of the invention file;
Fig. 2 is for adopting a kind of software interface synoptic diagram of the described method of the embodiment of the invention;
Fig. 3 is method for making second process flow diagram of the described word for word synchronized lyrics of embodiment of the invention file;
Fig. 4 is the manufacturing system structural drawing of the described word for word synchronized lyrics of embodiment of the invention file;
Fig. 5 is the described lyrics acquiring unit of an embodiment of the invention structural drawing.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the invention, the technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making the every other embodiment that is obtained under the creative work prerequisite.
Referring to Fig. 1, be the method for making process flow diagram of the described word for word synchronized lyrics of embodiment of the invention file.As shown in Figure 1, the method comprising the steps of:
S101: load audio file, extract the voice data and the temporal information of described audio file;
S102: according to the temporal information of audio file, the rise time axle;
S103: according to the voice data of audio file, corresponding described time shaft generates the audio volume control figure of described audio file and shows;
S104: obtain the lyrics of described audio file, determine that each literal is corresponding to the initial position and the demonstration of described time shaft in the lyrics;
S105: drag handle for each literal in the lyrics generates the lyrics;
S106: displaying audio file, and on audio volume control figure, point out playing progress rate;
S107: receive the user the described lyrics are dragged the request that the handle position is adjusted, the position that the described lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the described time shaft according to the adjusted described lyrics, determine the temporal information of each word in the lyrics;
S108: preserve the temporal information of each word in the lyrics after determining, generate word for word synchronized lyrics file.
Among the step S101, the audio file of loading can be various forms, for example MP3, WMA, APE or the like.After the loading, adopt corresponding demoder to extract the voice data and the temporal information of described audio file.Wherein temporal information specifically can be the time span of audio file.
Among the step S102, the time shaft of generation can show on user interface.
The audio volume control figure that generates among the step S103, the temporal information of each literal provides visual reference frame in the lyrics for the user adjusts.
Concrete, with reference to Fig. 2, for adopting a kind of software interface synoptic diagram of the described method of the embodiment of the invention.Usually, in an audio file, the singer sings the sound big (because like this could give prominence to voice, make hearer hear the clearly lyrics) of the sound of the lyrics than accompaniment.Therefore, the amplitude of singing the audio volume control of lyrics part can be bigger than the amplitude of accompaniment audio volume control partly, forms a vibration more significantly.That is to say that each word corresponding audio waveform roughly can form a kind of like this vibration: this vibration and accompaniment oscillating phase ratio partly, amplitude is bigger, relatively significantly; And each vibration all is to be starting point with less amplitude, begins then to increase gradually, reduces gradually after reaching peak value again, finishes with less amplitude; Wherein, each vibration initial and finish the initial and end of just corresponding the lyric characters of singing.Therefore, utilize this characteristics, just can determine the initial and concluding time of the corresponding lyric characters of singing according to the initial and end position of each vibration on the audio volume control figure.
In addition, need to prove that there is corresponding relation in the audio volume control figure that generates among the step S103 with the time shaft of audio file.Any place's audio volume control all has corresponding with it time zone on the audio volume control figure on time shaft.
Among the step S104, obtaining the lyrics of described audio file, can be to obtain the manually lyrics of input of user, also can load the lyrics in the existing lyrics file.
When obtaining the user manually during the lyrics of input: can an initial value be set, for example 1 second to the duration of each word of input.Can be with first word acquiescence corresponding between 0 second to 1 second of audio file, second word is corresponding between 1 second to 2 seconds of audio file, and the rest may be inferred.When the input lyrics number of words more for a long time, can adjust the duration of each word, the principle of adjustment is: the summation of the duration of all literal is no more than the time span of this audio file in the lyrics.
When loading the lyrics in the existing lyrics file: existing lyrics file can be the file that common suffix is called the .1rc type.Usually this existing lyrics file is sentence by sentence synchronous, also, has included the temporal information of the lyrics in this lyrics file, and only this temporal information is only at every lyrics.
In order to make full use of the temporal information in the existing file of synchronized lyrics sentence by sentence, the described method of the embodiment of the invention also comprises: parse every lyrics time corresponding in the described lyrics file of loading; Determine that according to every lyrics time corresponding in the described lyrics file each literal is corresponding to the initial position and the demonstration of described time shaft in the lyrics.
Wherein, every lyrics time corresponding comprises the zero-time and the concluding time of these lyrics in the described lyrics file that parses.Determine that according to every lyrics time corresponding in the described lyrics file each literal is corresponding to the initial position and the demonstration of described time shaft in the lyrics, specifically can be: deduct the duration that described zero-time obtains these lyrics with the described concluding time, divided by the literal number of these lyrics, obtain the average duration of each word; In conjunction with the zero-time of these lyrics, can calculate the expectation zero-time and the expected concluding time of each literal in these lyrics successively; In lyrics viewing area, show each literal.The viewing area of each literal wherein is corresponding to the expectation duration scope (being the expectation zero-time of this literal on the time shaft and the scope between the expected concluding time) of this literal on the time shaft.
After loading the existing file of synchronized lyrics sentence by sentence, can also and finish word according to the banner word in every the lyrics of every lyrics time corresponding identification that parse, to the banner word in every lyrics with finish word, distinguish demonstration, with the described lyrics in other literal distinguish mutually.Concrete, the font size of banner word in every lyrics and end word can be adjusted, use than the big font of other literal in these lyrics, show banner word and finish word; Also can and finish word with the banner word in every lyrics, use with these lyrics in the different color of other literal, show.
Among the step S105,, each literal in the lyrics drags handle for generating the lyrics.Each literal can correspond respectively to two described lyrics and drag handle, to determine the zero-time and the concluding time of each word respectively.Concrete, the lyrics drag the position of handle and there is corresponding relation in the time point on the time shaft.As shown in Figure 2, it is corresponding with the zero-time of this word that the lyrics on each literal left side drag handle, and it is corresponding with the concluding time of this word that the lyrics on the right drag handle.Can regulate the zero-time or the concluding time of corresponding word by adjusting position that the lyrics drag handle.
Among the step S105, also can be only drag handle corresponding to the lyrics of the zero-time of this literal for each literal generation.In this case, the lyrics between two adjacent words drag handle, except the zero-time of that word of expression back, also represent the concluding time of before word.
The difference of the lyrics file that two kinds of situations generate is, the former is for the demonstration time of each word in the lyrics, represents that this literal should sing in the demonstration time; The latter is for the demonstration time of each word in the lyrics, except representing the singing time of this literal, may represent also that this word is sung to finish, but next word do not begin that section accompaniment time of singing as yet.
The method for making of the described word for word synchronized lyrics of embodiment of the invention file, can also and finish word according to the banner word in every the lyrics of every lyrics time corresponding identification that parse, the lyrics to banner word in every lyrics and end word drag handle, distinguish demonstration, with the described lyrics in the lyrics of other literal drag handle and distinguish mutually.For example: the banner word in every lyrics and the lyrics that finish word are dragged the color of handle, be arranged to drag the different color of handle with the lyrics of other word; Perhaps the lyrics of banner word in every lyrics and end word are dragged the shape of handle, be arranged to drag the different shape of handle with the lyrics of other word.
Among the step S106, the method for prompting playing progress rate on audio volume control figure can be to adopt the playing progress rate pointer to point out on audio volume control figure, also can be to adopt the part of broadcast corresponding on the different colour code audio volume control figure and do not play part.
Wherein, the playing progress rate pointer can be the vertical line (can certainly be other shapes) on audio volume control figure.This playing progress rate pointer position on audio volume control figure indicates where this audio file has played to.The user can also drag this playing progress rate pointer, to adjust the playing progress rate of audio file.Adopt the part of broadcast corresponding on the different colour code audio volume control figure and do not play part, for instance: can be designated green with having play part on the audio volume control figure, not play part and be designated redness.
In addition, adopt playing progress rate pointer and the method that adopts these two kinds prompting playing progress rates on audio volume control figure of different colour codes, can use separately, also can use simultaneously.
As from the foregoing, the preferred embodiment of the method for making of word for word synchronized lyrics file of the present invention as shown in Figure 3, comprises step:
S201: load audio file, extract the voice data and the temporal information of described audio file;
S202: according to the temporal information of audio file, the rise time axle;
S203: according to the voice data of audio file, corresponding described time shaft generates the audio volume control figure of described audio file and shows;
S204: load the lyrics in the existing lyrics file;
S205: parse every lyrics time corresponding in the described lyrics file of loading, determine that according to every lyrics time corresponding in the described lyrics file each literal in the lyrics is corresponding to the initial position of described time shaft and show;
S206:,, distinguish demonstration to banner word in every lyrics and end word according to banner word and the end word in every the lyrics of every lyrics time corresponding identification that parse;
S207: drag handle for each literal in the lyrics generates the lyrics;
S208: banner word and end word according in every the lyrics of every lyrics time corresponding identification that parse, the banner word in every lyrics and the lyrics that finish word are dragged handle, distinguish demonstration;
S209: displaying audio file, and on audio volume control figure, adopt playing progress rate pointer prompting playing progress rate, adopt the part of broadcast corresponding on the different colour code audio volume control figure and do not play part;
S210: receive the user the described lyrics are dragged the request that the handle position is adjusted, the position that the described lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the described time shaft according to the adjusted described lyrics, determine the temporal information of each word in the lyrics;
S211: preserve the temporal information of each word in the lyrics after determining, generate word for word synchronized lyrics file.
Corresponding with the method for making of the described word for word synchronized lyrics of embodiment of the invention file, the embodiment of the invention also discloses a kind of manufacturing system of word for word synchronized lyrics file.
Referring to Fig. 4, be the manufacturing system structural drawing of the described word for word synchronized lyrics of embodiment of the invention file.This system comprises:
Audio file extraction unit 401 is used to load audio file, extracts the voice data and the temporal information of described audio file;
Time shaft generation unit 402 is used for the temporal information according to audio file, the rise time axle;
Audio volume control figure generation unit 403 is used for the voice data according to audio file, and corresponding described time shaft generates the audio volume control figure of described audio file and shows;
Lyrics acquiring unit 404 is used to obtain the lyrics of described audio file, determines that each literal in the lyrics is corresponding to the initial position of described time shaft and show;
The lyrics drag handle generation unit 405, are used to each literal generation lyrics in the lyrics to drag handle;
Playing control unit 406 is used for displaying audio file, and points out playing progress rate on audio volume control figure;
Lyrics adjustment unit 407 is used to receive the user the described lyrics is dragged the request that the handle position is adjusted, and the position that the described lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the described time shaft according to the adjusted described lyrics, determine the temporal information of each word in the lyrics;
Lyrics file generation unit 408 is used for preserving the temporal information of each word of the lyrics after determining, generates word for word synchronized lyrics file.
Wherein, the audio volume control figure of audio volume control figure generation unit 403 generations has following characteristics:
Usually, in an audio file, the singer sings the sound big (because like this could give prominence to voice, make hearer hear the clearly lyrics) of the sound of the lyrics than accompaniment.Therefore, the amplitude of singing the audio volume control of lyrics part can be bigger than the amplitude of accompaniment audio volume control partly, forms a vibration more significantly.That is to say that each word corresponding audio waveform roughly can form a kind of like this vibration: this vibration and accompaniment oscillating phase ratio partly, amplitude is bigger, relatively significantly; And each vibration all is to be starting point with less amplitude, begins then to increase gradually, reduces gradually after reaching peak value again, finishes with less amplitude; Wherein, each vibration initial and finish the initial and end of just corresponding the lyric characters of singing.Therefore, utilize this characteristics, just can determine the initial and concluding time of the corresponding lyric characters of singing according to the initial and end position of each vibration on the audio volume control figure.
As shown in Figure 5, lyrics acquiring unit 404 can comprise that lyric characters obtains subelement 4041, is used to obtain the lyrics of user's input; Can comprise that also lyrics file loads subelement 4042, is used for loading the lyrics of existing lyrics file.Existing lyrics file can be the lyrics file of various forms, for example: the file of common suffix .lrc type by name.
When adopting lyric characters to obtain subelement 4041, can an initial value be set, for example 1 second to the duration of each word of input.Can be with first word acquiescence corresponding between 0 second to 1 second of audio file, second word is corresponding between 1 second to 2 seconds of audio file, and the rest may be inferred.When the input lyrics number of words more for a long time, can adjust the duration of each word, the principle of adjustment is: the summation of the duration of all literal is no more than the time span of this audio file in the lyrics.
Lyrics acquiring unit 404 can also comprise:
Lyrics file is resolved subelement 4043, is used for parsing every lyrics time corresponding of described lyrics file of loading;
The lyrics generate subelement 4044, are used for determining that according to every lyrics time corresponding of described lyrics file each literal is corresponding to the initial position and the demonstration of described time shaft in the lyrics.
Wherein, lyrics file is resolved in the described lyrics file that subelement 4043 parses zero-time and the concluding time that every lyrics time corresponding comprises these lyrics.The lyrics generate subelement 4044 and determine that according to every lyrics time corresponding in the described lyrics file each literal is corresponding to the initial position and the demonstration of described time shaft in the lyrics, specifically can be: deduct the duration that described zero-time obtains these lyrics with the described concluding time, divided by the literal number of these lyrics, obtain the average duration of each word; In conjunction with the zero-time of these lyrics, can calculate the expectation zero-time and the expected concluding time of each literal in these lyrics successively; In lyrics viewing area, show each literal.The viewing area of each literal wherein is corresponding to the expectation duration scope (being the expectation zero-time of this literal on the time shaft and the scope between the expected concluding time) of this literal on the time shaft.
Concrete, for example: lyrics file is resolved subelement 4043 and is parsed that a certain sentence lyrics time corresponding is between 30 seconds to 40 seconds in the lyrics file of loading, and these lyrics have ten words; Then lyrics generation subelement 4044 can show 30 second to 31 second the interval of first word corresponding to time shaft, and with 31 second to 32 second the interval demonstration of second word corresponding to time shaft, the rest may be inferred.
In order to allow the user distinguish the banner word of every lyrics easily and to finish word, the described lyrics acquiring unit 404 of the embodiment of the invention can also comprise:
Head and the tail block molecular cell 4045, be used for banner word and end word according to every the lyrics of every lyrics time corresponding identification that parse, to the banner word in every lyrics with finish word, distinguish demonstration, with the described lyrics in other literal distinguish mutually.
Concrete, the font size of banner word in every lyrics and end word can be adjusted, use than the big font of other literal in these lyrics, show banner word and finish word; Also can and finish word with the banner word in every lyrics, use with these lyrics in the different color of other literal, show.
Similarly, the described lyrics of the embodiment of the invention drag handle generation unit 405 and can also comprise:
The head and the tail word lyrics drag handle and distinguish subelement, be used for banner word and end word according to every the lyrics of every lyrics time corresponding identification that parse, the lyrics to banner word in every lyrics and end word drag handle, distinguish demonstration, with the described lyrics in the lyrics of other literal drag handle and distinguish mutually.
For example: the banner word in every lyrics and the lyrics that finish word are dragged the color of handle, be arranged to drag the different color of handle with the lyrics of other word; Perhaps the lyrics of banner word in every lyrics and end word are dragged the shape of handle, be arranged to drag the different shape of handle with the lyrics of other word.
The described lyrics of the embodiment of the invention drag handle generation unit 405, can comprise:
Zero-time drags handle and generates subelement, is used to each literal generation to drag handle corresponding to the lyrics of the zero-time of this literal;
The lyrics drag handle generation unit 405, can also comprise:
Concluding time drags handle and generates subelement, is used to each literal generation to drag handle corresponding to the lyrics of the concluding time of this literal.
Concrete, include only zero-time and drag handle when generating subelement when the lyrics drag handle generation unit 405, the described system of the embodiment of the invention can all generate lyrics for each literal in the lyrics and drag handle, and these lyrics drag handle is represented this word corresponding to the position on the time shaft zero-time.The lyrics between two adjacent lyric characters drag the zero-time of handle except that word of expression back, also represent the concluding time of before word.
When dragging handle generation unit 405, the lyrics comprise that simultaneously zero-time drags handle and generates subelement and concluding time and drag handle when generating subelement, the described system of the embodiment of the invention can all generate two lyrics for each literal in the lyrics and drag handle, corresponds respectively to the zero-time and the concluding time of this literal.In this case, the zero-time of each literal and concluding time all can be adjusted separately, can the temporal information of adjacent literal not impacted.
The described playing control unit 406 of the embodiment of the invention can comprise:
Progress pointer prompting subelement, and/or, oscillogram color tips subelement;
Described progress pointer prompting subelement is used for adopting the playing progress rate of playing progress rate pointer prompt tone frequency file on audio volume control figure;
Described oscillogram color tips subelement is used to adopt the part of broadcast corresponding on the different colour code audio volume control figure and does not play part.
Wherein, the playing progress rate pointer of progress pointer prompting subelement can be the vertical line (can certainly be other shapes) on audio volume control figure.This playing progress rate pointer position on audio volume control figure indicates where this audio file has played to.The user can also drag this playing progress rate pointer, to adjust the playing progress rate of audio file.Oscillogram color tips subelement adopts the part of broadcast corresponding on the different colour code audio volume control figure and does not play part, for instance: can be designated green with having play part on the audio volume control figure, not play part and be designated redness.
More than to a kind of video checking method provided by the present invention and system, be described in detail.Used specific case herein principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, part in specific embodiments and applications all can change.In sum, this description should not be construed as limitation of the present invention.

Claims (18)

1. the generation method of synchronized lyrics file word for word is characterized in that, comprising:
Load audio file, extract the voice data and the temporal information of described audio file;
According to the temporal information of audio file, the rise time axle;
According to the voice data of audio file, corresponding described time shaft generates the audio volume control figure of described audio file and shows;
Obtain the lyrics of described audio file, determine that each literal is corresponding to the initial position and the demonstration of described time shaft in the lyrics;
For generating the lyrics, each literal in the lyrics drags handle;
Displaying audio file, and on audio volume control figure, point out playing progress rate;
Receive the user the described lyrics are dragged the request that the handle position is adjusted, the position that the described lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the described time shaft according to the adjusted described lyrics, determine the temporal information of each word in the lyrics;
Preserve the temporal information of each word in the lyrics after determining, generate word for word synchronized lyrics file.
2. method according to claim 1 is characterized in that, the described lyrics that obtain described audio file comprise:
Obtain the lyrics of user's input.
3. method according to claim 1 is characterized in that, the described lyrics that obtain described audio file comprise:
Load the lyrics in the existing lyrics file.
4. method according to claim 3 is characterized in that, also comprises:
Parse every lyrics time information corresponding in the described lyrics file of loading;
Each literal comprises corresponding to the initial position and the demonstration of described time shaft in described definite lyrics: determine that according to every lyrics time information corresponding in the described lyrics file each literal is corresponding to the initial position and the demonstration of described time shaft in the lyrics.
5. method according to claim 4 is characterized in that, also comprises:
According to the banner word in every the lyrics of every lyrics time corresponding identification that parse and finish word, to the banner word in every lyrics with finish word, distinguish demonstration, with the described lyrics in other literal distinguish mutually.
6. method according to claim 4 is characterized in that, also comprises:
According to the banner word in every the lyrics of every lyrics time corresponding identification that parse and finish word, the banner word in every lyrics and the lyrics that finish word are dragged handle, distinguish demonstration, with the described lyrics in the lyrics of other literal drag handle and distinguish mutually.
7. method according to claim 1 is characterized in that, describedly drags handle and comprises for each literal in the lyrics generates lyrics:
The lyrics that generate corresponding to the zero-time of this literal for each literal drag handle;
8. method according to claim 7 is characterized in that, describedly drags handle and also comprises for each literal in the lyrics generates lyrics:
The lyrics that generate corresponding to concluding time of this literal for each literal drag handle.
9. method according to claim 1 is characterized in that, the described playing progress rate of pointing out on audio volume control figure comprises:
On audio volume control figure, adopt the playing progress rate pointer to point out, and/or, adopt the part of broadcast corresponding on the different colour code audio volume control figure and do not play part.
10. the generation system of synchronized lyrics file word for word is characterized in that, comprising:
The audio file extraction unit is used to load audio file, extracts the voice data and the temporal information of described audio file;
The time shaft generation unit is used for the temporal information according to audio file, the rise time axle;
Audio volume control figure generation unit is used for the voice data according to audio file, and corresponding described time shaft generates the audio volume control figure of described audio file and shows;
Lyrics acquiring unit is used to obtain the lyrics of described audio file, determines that each literal in the lyrics is corresponding to the initial position of described time shaft and show;
The lyrics drag the handle generation unit, are used to each literal generation lyrics in the lyrics to drag handle;
Playing control unit is used for displaying audio file, and points out playing progress rate on audio volume control figure;
Lyrics adjustment unit is used to receive the user the described lyrics is dragged the request that the handle position is adjusted, and the position that the described lyrics is dragged handle is adjusted; Drag the relative position of each point on handle and the described time shaft according to the adjusted described lyrics, determine the temporal information of each word in the lyrics;
The lyrics file generation unit is used for preserving the temporal information of each word of the lyrics after determining, generates word for word synchronized lyrics file.
11. system according to claim 10 is characterized in that, described lyrics acquiring unit comprises:
Lyric characters obtains subelement, is used to obtain the lyrics of user's input.
12. system according to claim 10 is characterized in that, described lyrics acquiring unit comprises:
Lyrics file loads subelement, is used for loading the lyrics of existing lyrics file.
13. system according to claim 12 is characterized in that, described lyrics acquiring unit also comprises:
Lyrics file is resolved subelement, is used for parsing every lyrics time corresponding of described lyrics file of loading;
The lyrics generate subelement, are used for determining that according to every lyrics time corresponding of described lyrics file each literal is corresponding to the initial position and the demonstration of described time shaft in the lyrics.
14. system according to claim 13 is characterized in that, described lyrics acquiring unit also comprises:
Head and the tail block molecular cell is used for according to the banner word of every the lyrics of every lyrics time corresponding identification that parse and finishes word, to the banner word in every lyrics with finish word, distinguishes demonstration, with the described lyrics in other literal distinguish mutually.
15. system according to claim 13 is characterized in that, the described lyrics drag the handle generation unit and also comprise:
The head and the tail word lyrics drag handle and distinguish subelement, be used for banner word and end word according to every the lyrics of every lyrics time corresponding identification that parse, the lyrics to banner word in every lyrics and end word drag handle, distinguish demonstration, with the described lyrics in the lyrics of other literal drag handle and distinguish mutually.
16. system according to claim 10 is characterized in that, the described lyrics drag the handle generation unit and comprise:
Zero-time drags handle and generates subelement, is used to each literal generation to drag handle corresponding to the lyrics of the zero-time of this literal;
17. system according to claim 16 is characterized in that, the described lyrics drag the handle generation unit and also comprise:
Concluding time drags handle and generates subelement, is used to each literal generation to drag handle corresponding to the lyrics of the concluding time of this literal.
18. system according to claim 10 is characterized in that, described playing control unit comprises:
Progress pointer prompting subelement and/or oscillogram color tips subelement;
Described progress pointer prompting subelement is used for adopting the playing progress rate of playing progress rate pointer prompt tone frequency file on audio volume control figure;
Described oscillogram color tips subelement is used to adopt the part of broadcast corresponding on the different colour code audio volume control figure and does not play part.
CN2010105572583A 2010-11-23 2010-11-23 Word-for-word synchronous lyric file generating method and system thereof Active CN101984490B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010105572583A CN101984490B (en) 2010-11-23 2010-11-23 Word-for-word synchronous lyric file generating method and system thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010105572583A CN101984490B (en) 2010-11-23 2010-11-23 Word-for-word synchronous lyric file generating method and system thereof

Publications (2)

Publication Number Publication Date
CN101984490A true CN101984490A (en) 2011-03-09
CN101984490B CN101984490B (en) 2012-06-27

Family

ID=43641660

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010105572583A Active CN101984490B (en) 2010-11-23 2010-11-23 Word-for-word synchronous lyric file generating method and system thereof

Country Status (1)

Country Link
CN (1) CN101984490B (en)

Cited By (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102290081A (en) * 2011-06-27 2011-12-21 深圳市基思瑞科技有限公司 Language study play control method
CN102324191A (en) * 2011-09-28 2012-01-18 Tcl集团股份有限公司 Method and system for synchronously displaying audio book word by word
CN102568527A (en) * 2011-12-20 2012-07-11 广东步步高电子工业有限公司 Method and system for easily cutting audio files and applied mobile handheld device
CN102820027A (en) * 2012-06-21 2012-12-12 福建星网视易信息系统有限公司 Accompaniment subtitle display system and method
CN102881309A (en) * 2012-09-24 2013-01-16 广东欧珀移动通信有限公司 Lyric file generating and correcting method and device
CN104575542A (en) * 2014-12-15 2015-04-29 天脉聚源(北京)科技有限公司 Method and device for realizing audio regional play
CN104751870A (en) * 2015-03-24 2015-07-01 联想(北京)有限公司 Information processing method and electronic equipment
CN105609121A (en) * 2014-11-20 2016-05-25 深圳市腾讯计算机系统有限公司 Method and device for controlling multimedia playing progress
CN105868307A (en) * 2016-03-26 2016-08-17 深圳市金立通信设备有限公司 An audio frequency information display method and a terminal
CN105893430A (en) * 2015-12-08 2016-08-24 乐视移动智能信息技术(北京)有限公司 Lyrics matching method and device
CN106297847A (en) * 2016-08-12 2017-01-04 青岛海信移动通信技术股份有限公司 The reminding method of a kind of lyrics playing duration and equipment
CN106652983A (en) * 2016-09-18 2017-05-10 福建网龙计算机网络信息技术有限公司 Subtitling method and subtitling system
WO2017186015A1 (en) * 2016-04-26 2017-11-02 厦门幻世网络科技有限公司 Method and device for dubbing audio-visual digital media
CN108206029A (en) * 2016-12-16 2018-06-26 北京酷我科技有限公司 A kind of method and system for realizing the word for word lyrics
CN108563650A (en) * 2017-12-15 2018-09-21 维沃移动通信有限公司 A kind of adjusting method and mobile terminal of audio file playing progress rate
CN109543064A (en) * 2018-11-30 2019-03-29 北京微播视界科技有限公司 Lyrics display processing method, device, electronic equipment and computer storage medium
CN109558510A (en) * 2018-12-06 2019-04-02 北京微播视界科技有限公司 Lyrics analytic method, device, electronic equipment and computer storage medium
CN109672832A (en) * 2018-12-20 2019-04-23 四川湖山电器股份有限公司 The processing method of digital movie interlude lyrics subtitle realization dynamic Special display effect
CN110087122A (en) * 2019-05-06 2019-08-02 北京字节跳动网络技术有限公司 For handling system, the method and apparatus of information
CN110867180A (en) * 2019-10-15 2020-03-06 北京雷石天地电子技术有限公司 System and method for generating word-by-word lyric file based on K-means clustering algorithm
CN112115283A (en) * 2020-08-25 2020-12-22 天津洪恩完美未来教育科技有限公司 Method, device and equipment for processing picture book data
CN112347298A (en) * 2020-11-13 2021-02-09 广州酷狗计算机科技有限公司 Character information display method, device, terminal and storage medium
CN114064963A (en) * 2020-08-03 2022-02-18 北京字跳网络技术有限公司 Information display method and device

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106294547A (en) * 2016-07-25 2017-01-04 惠州Tcl移动通信有限公司 A kind of method and system intercepting audio file

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5880388A (en) * 1995-03-06 1999-03-09 Fujitsu Limited Karaoke system for synchronizing and reproducing a performance data, and karaoke system configuration method
CN101322179A (en) * 2005-12-09 2008-12-10 索尼株式会社 Music edit device, music edit information creating method, and recording medium where music edit information is recorded
US7681115B2 (en) * 2005-08-31 2010-03-16 Fujitsu Limited Text editing and reproduction apparatus, content editing and reproduction apparatus, and text editing and reproduction method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5880388A (en) * 1995-03-06 1999-03-09 Fujitsu Limited Karaoke system for synchronizing and reproducing a performance data, and karaoke system configuration method
US7681115B2 (en) * 2005-08-31 2010-03-16 Fujitsu Limited Text editing and reproduction apparatus, content editing and reproduction apparatus, and text editing and reproduction method
CN101322179A (en) * 2005-12-09 2008-12-10 索尼株式会社 Music edit device, music edit information creating method, and recording medium where music edit information is recorded

Cited By (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102290081A (en) * 2011-06-27 2011-12-21 深圳市基思瑞科技有限公司 Language study play control method
CN102324191B (en) * 2011-09-28 2015-01-07 Tcl集团股份有限公司 Method and system for synchronously displaying audio book word by word
CN102324191A (en) * 2011-09-28 2012-01-18 Tcl集团股份有限公司 Method and system for synchronously displaying audio book word by word
CN102568527A (en) * 2011-12-20 2012-07-11 广东步步高电子工业有限公司 Method and system for easily cutting audio files and applied mobile handheld device
CN102820027A (en) * 2012-06-21 2012-12-12 福建星网视易信息系统有限公司 Accompaniment subtitle display system and method
CN102820027B (en) * 2012-06-21 2014-04-16 福建星网视易信息系统有限公司 Accompaniment subtitle display system and method
CN102881309A (en) * 2012-09-24 2013-01-16 广东欧珀移动通信有限公司 Lyric file generating and correcting method and device
CN102881309B (en) * 2012-09-24 2016-08-24 广东欧珀移动通信有限公司 Lyrics file generates method and device
CN105609121A (en) * 2014-11-20 2016-05-25 深圳市腾讯计算机系统有限公司 Method and device for controlling multimedia playing progress
CN105609121B (en) * 2014-11-20 2019-03-12 广州酷狗计算机科技有限公司 Multimedia progress monitoring method and device
CN104575542A (en) * 2014-12-15 2015-04-29 天脉聚源(北京)科技有限公司 Method and device for realizing audio regional play
CN104751870A (en) * 2015-03-24 2015-07-01 联想(北京)有限公司 Information processing method and electronic equipment
CN104751870B (en) * 2015-03-24 2018-07-06 联想(北京)有限公司 A kind of information processing method and electronic equipment
CN105893430A (en) * 2015-12-08 2016-08-24 乐视移动智能信息技术(北京)有限公司 Lyrics matching method and device
CN105868307A (en) * 2016-03-26 2016-08-17 深圳市金立通信设备有限公司 An audio frequency information display method and a terminal
WO2017186015A1 (en) * 2016-04-26 2017-11-02 厦门幻世网络科技有限公司 Method and device for dubbing audio-visual digital media
CN106297847A (en) * 2016-08-12 2017-01-04 青岛海信移动通信技术股份有限公司 The reminding method of a kind of lyrics playing duration and equipment
CN106297847B (en) * 2016-08-12 2019-06-04 青岛海信移动通信技术股份有限公司 A kind of reminding method and equipment of lyrics playing duration
CN106652983A (en) * 2016-09-18 2017-05-10 福建网龙计算机网络信息技术有限公司 Subtitling method and subtitling system
CN106652983B (en) * 2016-09-18 2021-04-02 福建网龙计算机网络信息技术有限公司 Subtitle making method and system
CN108206029A (en) * 2016-12-16 2018-06-26 北京酷我科技有限公司 A kind of method and system for realizing the word for word lyrics
CN108563650B (en) * 2017-12-15 2019-10-15 维沃移动通信有限公司 A kind of adjusting method and mobile terminal of audio file playback progress
CN108563650A (en) * 2017-12-15 2018-09-21 维沃移动通信有限公司 A kind of adjusting method and mobile terminal of audio file playing progress rate
CN109543064A (en) * 2018-11-30 2019-03-29 北京微播视界科技有限公司 Lyrics display processing method, device, electronic equipment and computer storage medium
CN109543064B (en) * 2018-11-30 2020-12-18 北京微播视界科技有限公司 Lyric display processing method and device, electronic equipment and computer storage medium
CN109558510A (en) * 2018-12-06 2019-04-02 北京微播视界科技有限公司 Lyrics analytic method, device, electronic equipment and computer storage medium
CN109672832A (en) * 2018-12-20 2019-04-23 四川湖山电器股份有限公司 The processing method of digital movie interlude lyrics subtitle realization dynamic Special display effect
CN110087122A (en) * 2019-05-06 2019-08-02 北京字节跳动网络技术有限公司 For handling system, the method and apparatus of information
CN110867180A (en) * 2019-10-15 2020-03-06 北京雷石天地电子技术有限公司 System and method for generating word-by-word lyric file based on K-means clustering algorithm
CN110867180B (en) * 2019-10-15 2022-03-29 北京雷石天地电子技术有限公司 System and method for generating word-by-word lyric file based on K-means clustering algorithm
CN114064963A (en) * 2020-08-03 2022-02-18 北京字跳网络技术有限公司 Information display method and device
CN112115283A (en) * 2020-08-25 2020-12-22 天津洪恩完美未来教育科技有限公司 Method, device and equipment for processing picture book data
CN112347298A (en) * 2020-11-13 2021-02-09 广州酷狗计算机科技有限公司 Character information display method, device, terminal and storage medium

Also Published As

Publication number Publication date
CN101984490B (en) 2012-06-27

Similar Documents

Publication Publication Date Title
CN101984490B (en) Word-for-word synchronous lyric file generating method and system thereof
TWI576822B (en) Processing method of making song request and system thereof
CN106652997B (en) Audio synthesis method and terminal
CN104978973B (en) A kind of audio-frequency processing method and device
CN105006234A (en) Karaoke processing method and apparatus
CN101694772A (en) Method for converting text into rap music and device thereof
JP2017513049A (en) How to provide users with feedback on the performance of karaoke songs
WO2016188211A1 (en) Audio processing method, apparatus and system
CN101615417B (en) Synchronous Chinese lyrics display method which is accurate to words
CN108109652A (en) A kind of method of K songs chorus recording
JP6452229B2 (en) Karaoke sound effect setting system
CN106611603A (en) Audio processing method and audio processing device
WO2023207472A1 (en) Audio synthesis method, electronic device and readable storage medium
JP5598516B2 (en) Voice synthesis system for karaoke and parameter extraction device
CN101930732B (en) Music producing method and device based on user input voice and intelligent terminal
CN108346418A (en) A kind of method, system and terminal that song generates
JP4171680B2 (en) Information setting device, information setting method, and information setting program for music playback device
JP6589521B2 (en) Singing standard data correction device, karaoke system, program
JP7117228B2 (en) karaoke system, karaoke machine
CN113096689A (en) Song singing evaluation method, equipment and medium
JP2014013340A (en) Music composition support device, music composition support method, music composition support program, recording medium storing music composition support program and melody retrieval device
JP6144593B2 (en) Singing scoring system
JP7419768B2 (en) Music generation method and music generation system
KR102077269B1 (en) Method for analyzing song and apparatus using the same
CN112489607A (en) Method and device for recording songs, electronic equipment and readable storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant