CN101615417A

CN101615417A - A kind of Chinese synchronously displaying lyrics method that is accurate to word

Info

Publication number: CN101615417A
Application number: CN200910089572A
Authority: CN
Inventors: 史岩
Original assignee: Beijing Haier IC Design Co Ltd
Current assignee: Beijing Haier IC Design Co Ltd
Priority date: 2009-07-24
Filing date: 2009-07-24
Publication date: 2009-12-30
Anticipated expiration: 2029-07-24
Also published as: CN101615417B

Abstract

The present invention relates to field of audio play, relate in particular to a kind of Chinese synchronously displaying lyrics method that is accurate to word.The present invention is divided into several portions by the voice with every lyrics, and the quantity of this several portions equals this lyrics number of words and adds an ending ventilation, and the every part voice that are divided into are mated respectively and then obtain matching attribute α _xAnd then with every kind cut apart the voice that obtain in turn with these lyrics in each word carry out the phoneme coupling, and obtain corresponding matching degree β _xChoose λ * α at last _x+ (1-λ) * β _xValue is maximum as optimal dividing, and wherein λ is weight coefficient and satisfies 0≤λ≤1.The inventive method has solved the problem that synchronously displaying lyrics can not be accurate to word, waits in the equipment that needs synchronously displaying lyrics in Karaoke to have significant application value.

Description

A kind of Chinese synchronously displaying lyrics method that is accurate to word

Technical field

The present invention relates to field of audio play, relate in particular to the method for synchronously displaying lyrics in the audio frequency broadcast system.

Background technology

The lyrics Presentation Function of playout software makes people can see the lyrics of audio file when hearing graceful melody, and present many playout softwares all have the function of synchronously displaying lyrics.Concrete grammar is that the lyrics are stored in the text-only file, and a time tag that presents with [MM:SS] form before beginning, every lyrics is arranged, wherein MM is the time minute value that is played song, SS is the value in second, when the lyrics be played to MM divide SS during second playout software just can show these lyrics, and then the lyrics and the voice that make to show are synchronous.

The above conventional synchronization shows that lyrics method is by sentence writing time, gives each word with every lyrics by reallocation after the equal divisional processing of time then, can only be accurate to and can not be accurate to word so the lyrics show.Yet a lot of application scenarios are arranged at present as Karaoke (TV song accompaniment apparatus) etc., all need a kind ofly correctly to show the playout software of the lyrics, and present synchronously displaying lyrics method precision is very poor, almost can not correctly show the time of each word in the lyrics by word.

Summary of the invention

The invention provides the Chinese synchronously displaying lyrics method that is accurate to word in a kind of audio frequency broadcast system that can overcome the above problems.

In first aspect, the invention provides a kind of Chinese synchronously displaying lyrics method that is accurate to word, this method at first is divided into several portions with the voice of every lyrics, the quantity of this several portions equals this lyrics number of words and adds an ending ventilation, and the every part voice that are divided into are mated respectively and then obtain matching attribute α _xAnd then with every kind cut apart the voice that obtain in turn with these lyrics in each word carry out the phoneme coupling, and obtain corresponding matching degree β _xChoose λ * α at last _x+ (1-λ) * β _xValue is maximum as optimal dividing, and wherein λ is weight coefficient and satisfies 0≤λ≤1.

In one embodiment of the invention, with in the optimal dividing the zero-time of corresponding each part as the zero-time of each word in the lyrics, and this time is kept in the text-only file of store lyrics.

In another embodiment of the present invention, the zero-time of lyrics word in the manual adjustment text-only file so that the demonstration time of this lyrics word can be synchronized with this lyrics word more.

The present invention utilizes precisely original lyrics of sentence, the voice of every lyrics are divided into the section identical with this lyrics syllable, and comprehensively the section of cutting apart coupling obtains optimal dividing with the matching degree that phoneme mates.And then solved the problem that synchronously displaying lyrics can not be accurate to word, wait in the equipment that needs the synchronously displaying lyrics word in Karaoke to have significant application value.

Description of drawings

Below with reference to accompanying drawings specific embodiments of the present invention is described in detail, in the accompanying drawings:

Fig. 1 is the Chinese synchronously displaying lyrics process flow diagram that is accurate to word.

Embodiment

In step 110, the lyrics are divided into some sentences, lyrics of each correspondence.

Preferably, adopt elimination musical sound algorithm to eliminate or to weaken musical sound and outstanding voice in step 111 pair every song, described elimination musical sound algorithm can adopt any one voice enhancement algorithm.

In step 120, according to the hop count of every lyrics of lyrics content statistics, this hop count comprises the ventilation in every when ending song, and promptly this hop count number of words of equaling every lyrics adds an ending ventilation.

In step 130, the voice of every lyrics are divided into the hop count voice that step 120 statistics obtains, and each voice after cutting apart are mated, and then obtain a plurality of matching attributes.

Particularly, according to speech recognition algorithm the voice of described every lyrics are divided into several portions, the concrete quantity of described several portions equals this lyrics hop count that step 120 statistics obtains, and optimum segmentation comprises a complete syllable i.e. a Chinese character or an ending ventilation for each part.

N kind different feasible cutting apart arranged in speech recognition algorithm is cut apart the process of every lyrics voice, and each is cut apart resulting syllable and all has matching attribute α corresponding with it, and then obtains the multiple different matching attribute α of these lyrics voice ₁, α ₂, α ₃..., α _nThis α value is used to estimate the corresponding quality of cutting apart with it, and the big more then explanation of α value is cut apart accurately more.

In step 140, carry out phoneme (being each Chinese character) coupling, obtain different matching degree β.

Particularly, n kind in the step 130 is cut apart every kind cut apart resulting syllable in order with this song in the phoneme of each word mate, the matching degree that obtains is β, obtains matching degree β respectively so the n kind is cut apart ₁, β ₂, β ₃..., β _nDescribed matching degree method can be any one voice match algorithm.

In step 150, with α and β according to certain weight, and by setting threshold and then definite optimal dividing.

Particularly, the minimum threshold of choosing α is α min, and the minimum threshold of β is β min, and sets weight coefficient λ (0≤λ≤1).Choose and make λ * α _x+ (1-λ) * β _xValue is maximum, and satisfies α _x＞α min and β _xThe pairing optimal dividing that is divided into of the x of＞β min.Just do not satisfy α simultaneously if do not exist in these lyrics _x＞α min and β _xThe x of＞β min then directly chooses and makes λ * α _x+ (1-λ) * β _xMaximum x institute correspondence is divided into optimal dividing.

In step 160, determine the zero-time of each word in the lyrics.

Particularly, in the optimal dividing that step 150 is obtained the zero-time of corresponding each part as the zero-time of each word in the lyrics, and this time is remained in the text-only file of these lyrics of storage.

Preferably, in the zero-time of step 161, and then reach the more accurately purpose of synchronously displaying lyrics by some lyrics word (words in the inaccurate lyrics of time) in the text of the described store lyrics of manual adjustment.

Obviously, under the prerequisite that does not depart from true spirit of the present invention and scope, the present invention described here can have many variations.Therefore, the change that all it will be apparent to those skilled in the art that all should be included within the scope that these claims contain.The present invention's scope required for protection is only limited by described claims.

Claims

1. Chinese synchronously displaying lyrics method that is accurate to word comprises:

Step a is divided into several portions with the voice of every lyrics, and the quantity of this several portions equals this lyrics number of words and adds an ending ventilation, and the every part voice that are divided into are mated respectively and then obtain matching attribute α _x

Step b, with described every kind cut apart the voice that obtain in turn with these lyrics in each word carry out the phoneme coupling, and obtain corresponding matching degree β _x

Step c chooses λ * α _x+ (1-λ) * β _xValue is maximum as optimal dividing, and wherein λ is weight coefficient and satisfies 0≤λ≤1.

2. a kind of Chinese synchronously displaying lyrics method that is accurate to word as claimed in claim 1 is characterized in that, comprises before step a:

Steps d is divided into some sentences with the lyrics, lyrics of each correspondence, and the musical sound algorithm is eliminated in every song employing given prominence to voice to subdue musical sound.

3. a kind of Chinese synchronously displaying lyrics method that is accurate to word as claimed in claim 1 is characterized in that, the optimum segmentation among the step a is that each part that is divided into all comprises a complete syllable.

4. a kind of Chinese synchronously displaying lyrics method that is accurate to word as claimed in claim 1 is characterized in that, the minimum threshold of setting α in step c is α min, and the minimum threshold of β is β min, and satisfies α _x＞α min and β _x＞β min.

5. a kind of Chinese synchronously displaying lyrics method that is accurate to word as claimed in claim 1 is characterized in that, comprises after step c:

Step e: with in the described optimal dividing the zero-time of corresponding each part as the zero-time of each word in the lyrics, and this time is kept in the text-only file of the described lyrics of storage.

6. a kind of Chinese synchronously displaying lyrics method that is accurate to word as claimed in claim 5 is characterized in that, comprises after step e:

Step f: the zero-time of lyrics word in the described text-only file of manual adjustment so that the demonstration time of this lyrics word can be synchronized with this lyrics word more.

7. Chinese synchronously displaying lyrics device that is accurate to word comprises:

The voice of every lyrics are divided into several portions, and the every part voice that are divided into are mated and then obtain matching attribute α _xModule, the quantity of described several portions equals this lyrics number of words and adds an ending ventilation;

And with described every kind cut apart the voice that obtain in turn with these lyrics in each word carry out the phoneme coupling, and obtain corresponding matching degree β _xModule;

And with λ * α _x+ (1-λ) * β _xThe maximum module as optimal dividing of value, wherein λ is weight coefficient and satisfies 0≤λ≤1.