CN106294310B

CN106294310B - Tibetan tone prediction method and system

Info

Publication number: CN106294310B
Application number: CN201510325742.6A
Authority: CN
Inventors: 祖漪清; 尹大勇; 高杰; 朱荣华; 王影; 胡国平; 胡郁; 刘庆峰
Original assignee: Xun Feizhi Metamessage Science And Technology Ltd
Current assignee: Xun Feizhi Metamessage Science And Technology Ltd
Priority date: 2015-06-12
Filing date: 2015-06-12
Publication date: 2019-05-03
Anticipated expiration: 2035-06-12
Also published as: CN106294310A

Abstract

The invention discloses a Tibetan tone prediction method and system. The method comprises: receiving Tibetan text to be processed; performing word segmentation on the text to obtain word units; determining the part of speech of each word unit from its context information in the text; predicting the prosodic (rhythm) boundaries of the text and, according to the part of speech of the word unit at each prosodic boundary, adjusting the word-unit boundary there; and, according to the part of speech of each word unit, predicting the tone of the syllable units of the text after the word-unit boundaries have been adjusted, thereby obtaining the tone information of the text. The invention addresses the tone change of multi-tone-pattern words at different prosodic boundaries and effectively improves the performance of Tibetan speech systems.

Description

Tibetan tone prediction method and system

Technical field

The present invention relates to the field of Tibetan information processing, and in particular to a Tibetan tone prediction method and system.

Background art

Speech synthesis is an important part of language information processing. It is the process of converting text into output speech, with the synthesized speech made as natural and intelligible as possible; text processing is the front-end text analysis of a speech synthesis system. Because of the distinctive pronunciation of Tibetan, tone prediction is a research focus in Tibetan text processing.

Tibetan includes the Lhasa, Kham, and Amdo dialects, of which the Lhasa dialect is the principal one. A prominent phonetic feature of the Lhasa dialect is tone, and "Tibetan" below refers mainly to the Lhasa dialect. Tibetan speech synthesis mainly involves letter-to-sound conversion and tone prediction. Put simply, tone is the pitch of a sound. Owing to the particular nature of Tibetan, a multi-tone-pattern word carries different tone information at different prosodic boundaries, and a change of tone can seriously affect semantic understanding. If the tone type of Tibetan cannot be predicted accurately, the effectiveness of Tibetan letter-to-sound conversion is reduced. Existing Tibetan tone prediction methods are generally rule-based: the initial and final of a syllable are classified, and the tone type of the syllable is obtained by looking up a tone-type table with the combination of the classified initial and final. However, existing methods analyze the mechanism of tone-type change incompletely and do not take the characteristics of Tibetan itself into account, so they cannot accurately predict the tone of multi-tone-pattern words at different prosodic boundaries. This reduces the naturalness of synthesized Tibetan speech and can even impair its intelligibility.

Summary of the invention

Embodiments of the present invention provide a Tibetan tone prediction method and system that solve the problem of multi-tone-pattern words in Tibetan having different tones at different prosodic boundaries, so that synthesized Tibetan speech sounds more natural.

To this end, embodiments of the present invention provide the following technical solutions:

A Tibetan tone prediction method, comprising:

receiving Tibetan text to be processed;

performing word segmentation on the Tibetan text to be processed to obtain word units;

determining the part of speech of each word unit according to its context information in the Tibetan text to be processed;

predicting the prosodic boundaries of the Tibetan text to be processed, and adjusting the word-unit boundary at each prosodic boundary according to the part of speech of the word unit at that boundary;

performing, according to the part of speech of each word unit, tone prediction on the syllable units of the Tibetan text to be processed after the word-unit boundaries have been adjusted, to obtain the tone information of the Tibetan text to be processed.

Preferably, determining the part of speech of each word unit according to its context information in the Tibetan text to be processed comprises:

dividing the Tibetan text to be processed into sentences;

predicting the part of speech of each word unit within its sentence;

determining the type of each word unit;

adjusting the part of speech of each word unit according to its type.

Preferably, dividing the Tibetan text to be processed into sentences comprises:

predicting the level-1 part of speech of each word unit, the level-1 parts of speech comprising: verb, content word, pronoun, function word, general affix, and verb configuration affix;

if the level-1 part of speech of the word unit immediately preceding a single shad (the single vertical stroke mark) is verb or verb configuration affix, taking the single shad as a sentence boundary;

if the level-1 part of speech of the word unit immediately preceding the single shad is neither verb nor verb configuration affix, predicting the sentence boundary by statistical modeling.

Preferably, predicting the level-1 part of speech of each word unit comprises:

obtaining the candidate level-1 parts of speech of each word unit;

extracting the context-related features of the current word unit;

determining, by statistical modeling and according to the context-related features of the current word unit, the level-1 part of speech of the current word unit from among its candidate level-1 parts of speech.

Preferably, the type of a word unit comprises any one or more of the following: multi-tone-pattern word, function word, affix, and conventional word.

Preferably, adjusting the word-unit boundary at a prosodic boundary according to the part of speech of the word unit at that boundary comprises:

when the word unit at the prosodic boundary is a multi-tone-pattern word unit whose part of speech is verb or adjective, splitting the multi-tone-pattern word unit into syllables and performing the subsequent tone prediction on the syllable units obtained by the split.

A Tibetan tone prediction system, comprising:

a receiving module, configured to receive Tibetan text to be processed;

a word segmentation module, configured to perform word segmentation on the Tibetan text to be processed to obtain word units;

a part-of-speech determination module, configured to determine the part of speech of each word unit according to its context information in the Tibetan text to be processed;

a word-unit boundary adjustment module, configured to predict the prosodic boundaries of the Tibetan text to be processed and to adjust the word-unit boundary at each prosodic boundary according to the part of speech of the word unit at that boundary;

a tone prediction module, configured to perform, according to the part of speech of each word unit, tone prediction on the syllable units of the Tibetan text to be processed after the word-unit boundaries have been adjusted, to obtain the tone information of the Tibetan text to be processed.

Preferably, the part-of-speech determination module comprises:

a sentence division unit, configured to divide the Tibetan text to be processed into sentences;

a part-of-speech prediction unit, configured to predict the part of speech of each word unit within its sentence;

a word type determination unit, configured to determine the type of each word unit;

a part-of-speech adjustment unit, configured to adjust the part of speech of each word unit according to its type.

Preferably, the sentence division unit comprises:

a level-1 part-of-speech prediction subunit, configured to predict the level-1 part of speech of each word unit, the level-1 parts of speech comprising: verb, content word, pronoun, function word, general affix, and verb configuration affix;

a first boundary determination subunit, configured to take a single shad as a sentence boundary if the level-1 part of speech of the word unit immediately preceding the single shad is verb or verb configuration affix;

a second boundary determination subunit, configured to predict the sentence boundary by statistical modeling if the level-1 part of speech of the word unit immediately preceding the single shad is neither verb nor verb configuration affix.

Preferably, the level-1 part-of-speech prediction subunit is specifically configured to: obtain the candidate level-1 parts of speech of each word unit by table lookup; extract the context-related features of the current word unit; and determine, by statistical modeling and according to those features, the level-1 part of speech of the current word unit from among its candidate level-1 parts of speech.

Preferably, the word-unit boundary adjustment module is specifically configured to: when the word unit at a prosodic boundary is a multi-tone-pattern word unit whose part of speech is verb or adjective, split the multi-tone-pattern word unit into syllables and perform the subsequent tone prediction on the syllable units obtained by the split.

In the Tibetan tone prediction method and system provided by embodiments of the present invention, the received Tibetan text to be processed is segmented into word units, the type of each word unit is determined, the word-unit boundaries at the predicted prosodic boundaries are adjusted according to the type of the word unit, and the tone of the word units of the text is predicted after this adjustment. Because the word-unit boundaries at prosodic boundaries are adjusted according to part of speech during tone prediction, and the tone of a word unit at a prosodic boundary is predicted from the part of speech of the adjusted word unit, the problem of multi-tone-pattern words in Tibetan having different tones at different prosodic boundaries is solved, the naturalness of synthesized Tibetan speech is improved, and its intelligibility is ensured.

Brief description of the drawings

To explain the technical solutions in the embodiments of the present application or in the prior art more clearly, the drawings needed in the embodiments are briefly introduced below. The drawings described below are only some embodiments recorded in the present invention; those of ordinary skill in the art can obtain other drawings from them.

Fig. 1 is a flow chart of the Tibetan tone prediction method provided by an embodiment of the present invention;

Fig. 2 is a schematic structural diagram of a Tibetan tone prediction system provided by an embodiment of the present invention;

Fig. 3 is a schematic structural diagram of a Tibetan text processing system provided by an embodiment of the present invention.

Detailed description of the embodiments

To help those skilled in the art better understand the solutions of the embodiments, the present invention is described in further detail below with reference to the drawings and embodiments. The following embodiments are exemplary; they serve only to explain the invention and are not to be construed as limiting it.

For a better understanding of the present invention, Tibetan tone prediction in the prior art is first described briefly. Existing methods usually predict the tone of the text to be processed with rules, for example by looking up a tone-type table with the combination of a syllable's initial and final. In Tibetan, the tone of a single syllable is determined by its initial and final. The initial consists of the prefix letter, superscript letter, root letter, and subscript letter; its tone falls into two classes, high and low, which indicate whether the starting pitch of the syllable tone is high or low. In general, the initial is high when the root letter is voiceless and low when the root letter is voiced, and a prefix or superscript letter can raise the tone of the root letter. The final consists of the vowel sign, the suffix letter, and the second suffix letter; according to its ending, a final falls into one of three classes: long final, checked final, or single-vowel final. When predicting the tone of a syllable in the current word unit, the initial-final type combination of the syllable is first determined from an initial-final type table, which is generally constructed by domain experts. The tone-type table is then searched to determine the tone type of the syllable for that combination. The tone-type table contains the tone types of the various combinations of initial and final and is generally built from rules that describe the phonetic characteristics of Tibetan. For example, for a syllable whose root letter and prefix letter give an initial of the high class (the prefix letter does not change the class of the initial) and whose vowel sign and suffix letter give a checked final, the initial-final combination is a high initial with a checked final, and looking up the tone-type table gives a falling tone (f). However, the prior art does not take the characteristics of Tibetan itself into account; for example, it does not consider that the tone of a multi-tone-pattern word at different prosodic boundaries differs with its part of speech, which degrades the performance of speech systems.
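The rule-based lookup described above can be sketched as a small table keyed by initial class and final class. This is only an illustration: the text confirms a single entry (high initial with checked final gives the falling tone f), and every other entry, along with the names `TONE_TABLE` and `lookup_tone`, is a placeholder assumed here to make the sketch complete.

```python
# Hypothetical tone-type table keyed by (initial class, final class).
# Only ("high", "checked") -> "f" comes from the text; all other entries
# are placeholders assumed for illustration, not expert-built values.
TONE_TABLE = {
    ("high", "checked"): "f",       # stated in the text's worked example
    ("high", "long"): "h",          # assumed
    ("high", "single_vowel"): "h",  # assumed
    ("low", "checked"): "r",        # assumed
    ("low", "long"): "l",           # assumed
    ("low", "single_vowel"): "l",   # assumed
}


def lookup_tone(initial_class: str, final_class: str) -> str:
    """Return the tone label of one syllable from the tone-type table."""
    key = (initial_class, final_class)
    if key not in TONE_TABLE:
        raise ValueError(f"no tone-type entry for {key!r}")
    return TONE_TABLE[key]
```

A lookup of this kind sees only the syllable itself, which is exactly the limitation the paragraph above points out: part of speech and prosodic position play no role.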

In the Tibetan tone prediction method and system provided by the present invention, the Tibetan text to be processed is segmented into word units and the part of speech of each word unit is predicted. The prosodic boundaries of the text are then obtained, and the word boundary of the word unit at each prosodic boundary is adjusted according to its part of speech, which addresses the problem of multi-tone-pattern words having different tones at different prosodic boundaries. Tone prediction is then performed on the text after the word-unit boundaries have been adjusted, giving the tone information of the text. Because the boundaries of word units at prosodic boundaries are adjusted, and tone is predicted from the part of speech of the word unit after adjustment, the tone change of multi-tone-pattern words at different prosodic boundaries is handled, the predicted tones are more accurate, and the performance of speech systems is effectively improved.

For a better understanding of the technical solutions and technical effects of the present invention, a detailed description is given below with reference to the flow chart and specific embodiments.

Fig. 1 shows the flow chart of the Tibetan tone prediction method provided by an embodiment of the present invention, which comprises the following steps:

Step 101: receive Tibetan text to be processed.

Step 102: perform word segmentation on the Tibetan text to be processed to obtain word units.

In this embodiment, segmentation can follow the segmentation principles of Tibetan.

Specifically, the text is first divided into segments using the double shad (double vertical stroke) as the marker; within each segment, the text is then divided using the double shad or the single shad (single vertical stroke) as the marker, giving the segment strings of the text to be processed that are bounded by a double shad or a single shad.
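The two-pass split on shad marks can be sketched as follows. This is a minimal sketch that assumes the text is Unicode Tibetan, where the single shad is U+0F0D and the double shad appears either as U+0F0E or as two consecutive U+0F0D characters; the function names are hypothetical and the patent does not prescribe an encoding.

```python
import re

SHAD = "\u0f0d"       # single shad (single vertical stroke)
NYIS_SHAD = "\u0f0e"  # double shad encoded as one code point


def split_paragraphs(text: str) -> list[str]:
    """First pass: split on the double shad (U+0F0E or two U+0F0D)."""
    parts = re.split(f"{NYIS_SHAD}|{SHAD}{{2}}", text)
    return [p.strip() for p in parts if p.strip()]


def split_segments(paragraph: str) -> list[str]:
    """Second pass: split one paragraph on the single shad."""
    return [s.strip() for s in paragraph.split(SHAD) if s.strip()]


def coarse_segments(text: str) -> list[list[str]]:
    """Return, per paragraph, the strings bounded by shad marks."""
    return [split_segments(p) for p in split_paragraphs(text)]
```

The result is only the coarse segment strings; dividing those strings into word units would still need a Tibetan word segmenter, which is outside this sketch.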

Further, after the segment strings of the text to be processed are obtained, case particles in them that may be adhesive case particles can be marked. An adhesive case particle is a case particle with adhesive properties that follows a word unit: it can merge, in the form of a suffix letter, with the final syllable of the word unit it attaches to, forming a single syllable, and in pronunciation it becomes the ending of that syllable's final.

Step 103: determine the part of speech of each word unit according to its context information in the Tibetan text to be processed.

In practice, the part of speech of each word unit can be determined by table lookup, by statistical modeling, or by similar means. In this embodiment, the part of speech of each word unit is predicted by statistical modeling from the context-related features of the word unit within its sentence, and is then adjusted according to the type of the word unit to ensure that the resulting part of speech is accurate. This can specifically include:

predicting the part of speech of each word unit within its sentence;

adjusting the part of speech of each word unit according to its type.

Step a): divide the Tibetan text to be processed into sentences

The basic structure of a Tibetan sentence is subject-object-predicate, so dividing the text into sentences first requires locating the predicate verb. This embodiment uses the level-1 part of speech of the word unit immediately preceding a single shad to decide whether the boundary at that single shad is a sentence boundary. The level-1 parts of speech comprise: verb, content word, pronoun, function word, general affix, and verb configuration affix. Dividing the text into sentences may include:

predicting the level-1 part of speech of each word unit;

determining the sentence boundaries according to the level-1 part of speech of each word unit.

In practice, the level-1 part of speech of a word unit can be predicted by statistical modeling. First, the candidate level-1 parts of speech of each word unit are obtained by table lookup. Next, the context-related features of the current word unit are extracted; these may be the candidate level-1 parts of speech of the preceding and/or following word unit together with those of the current word unit. Finally, according to these features, a statistical model such as a decision tree determines the level-1 part of speech of the current word unit from among its candidates. The level-1 parts of speech are specifically:

1. content word, comprising: noun, adjective, adverb, numeral;

2. pronoun, for example the pronouns meaning "I", "you", and "he";

3. function word, comprising: case particle, conjunction, preposition;

4. verb;

5. general affix, comprising: noun affix, adjective affix, adverb affix, and verb morphological affix; a verb morphological affix is an affix that, added after a verb, changes the original meaning of the verb;

6. verb configuration affix, comprising: suffix, modal particle, auxiliary verb, and verb affix string; a verb configuration affix is generally an affix that, added after a verb, does not change the original meaning of the verb and instead expresses a syntactic function. A verb affix string is the attached form in which several verb configuration affixes occur together.
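The candidate lookup and context features described above can be sketched as below. This is a hedged illustration: the `lexicon` mapping, the `choose` callable standing in for the statistical model (the text names a decision tree as one option), and the shortcut for unambiguous words are all assumptions of this sketch, not details from the patent.

```python
from enum import Enum


class Level1(Enum):
    CONTENT = "content word"
    PRONOUN = "pronoun"
    FUNCTION = "function word"
    VERB = "verb"
    GENERAL_AFFIX = "general affix"
    VERB_CONFIG_AFFIX = "verb configuration affix"


def context_features(words, index, lexicon):
    """Candidate level-1 tags of the previous, current and next word unit."""
    def candidates(i):
        if 0 <= i < len(words):
            return tuple(sorted(t.name for t in lexicon.get(words[i], ())))
        return ("<pad>",)  # position outside the sentence
    return {
        "prev": candidates(index - 1),
        "cur": candidates(index),
        "next": candidates(index + 1),
    }


def predict_level1(words, index, lexicon, choose):
    """Pick one level-1 tag for words[index] from its lexicon candidates.

    `choose(features, candidates)` is a stand-in for the trained model.
    Returning a sole candidate directly is an assumption of this sketch.
    """
    cands = lexicon.get(words[index], set())
    if len(cands) == 1:
        return next(iter(cands))
    return choose(context_features(words, index, lexicon), cands)
```

Keeping the model behind a plain callable lets the same feature extraction feed a decision tree or any other classifier.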

When sentence boundaries are determined from the level-1 part of speech of each word unit, the procedure can be as follows: judge whether the level-1 part of speech of the word unit immediately preceding a single shad is verb or verb configuration affix; if so, the single shad is taken to be a sentence boundary; otherwise the sentence boundary is predicted by a statistical model such as a decision tree. This yields the sentence boundaries of the Tibetan text to be processed.

It should be noted that, in this embodiment, the boundary at a double shad is both a paragraph boundary and a sentence boundary, whereas the boundary at a single shad is not necessarily a sentence boundary; it may also be a phrase boundary or a word boundary.
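The decision at each single shad can be sketched as a rule with a model fallback. The representation of the input as `(text, level-1 tag)` pairs and the `model` callable are assumptions made for this sketch; the patent only states that a statistical model, such as a decision tree, handles the cases the rule does not cover.

```python
SHAD = "\u0f0d"  # single shad
PREDICATE_TAGS = {"verb", "verb configuration affix"}


def sentence_boundaries(units, model):
    """Return the indices of single shads that are sentence boundaries.

    units: list of (text, level1_tag); shad entries carry tag None.
    model: callable (units, index) -> bool, a stand-in for the
           statistical model used when the rule does not fire.
    """
    boundaries = []
    for i, (text, _tag) in enumerate(units):
        if text != SHAD:
            continue
        prev_tag = units[i - 1][1] if i > 0 else None
        if prev_tag in PREDICATE_TAGS:
            boundaries.append(i)   # rule: shad after verb material
        elif model(units, i):
            boundaries.append(i)   # fallback: model decides
    return boundaries
```

Putting the rule first reflects the subject-object-predicate order: a shad that follows verb material almost certainly closes a sentence, so only the remaining cases go to the model.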

Step b): predict the part of speech of each word unit within its sentence

A level-1 part of speech covers several different parts of speech; content word, for example, covers noun, adjective, adverb, and numeral. The level-1 part of speech of a word unit and its tone are therefore not in one-to-one correspondence. The tone of a multi-tone-pattern word may differ with its part of speech; for instance, its tone as a noun may differ from its tone as an adjective. After sentence division, this embodiment therefore predicts the specific part of speech of each word unit, i.e. its level-2 part of speech, from the contextual features of the word unit within its sentence. The level-2 part of speech is the part of speech in the ordinary sense, such as noun or adjective. Level-2 prediction is carried out for each word unit in each sentence with the same methods as the prior art, for example statistical modeling, giving the level-2 part of speech of each word unit.

Step c): determine the type of each word unit

The same word unit can take different parts of speech, and hence show different tones, in different contexts, and level-2 prediction over a whole sentence covers a wide range, so the predicted level-2 part of speech is prone to error. To predict the tone information of the Tibetan text more accurately, the accuracy of the level-2 parts of speech obtained in step b) must be ensured. The level-2 part of speech predicted in step b) can be further adjusted according to the type of the word unit, so the type of the word unit must be determined first.

In practice, word units can be divided into four types according to how their tone behaves in different contexts: multi-tone-pattern word, function word, affix, and conventional word, as follows:

1. Function word: as an isolated syllable, the tone of a function word depends on its initial. With a high initial, the tone is h (high) or f (falling), depending on the suffix letter; with a low initial, the tone is l (low) or r (rising), depending on the suffix letter. In running speech, almost all function words are read as l or r; case particles, conjunctions, prepositions, state words, and modal particles, for example, are all read low;

2. Multi-tone-pattern word: the tone of a multi-tone-pattern word unit differs with its part of speech. For example, the word whose Latin transliteration is khrom skor has the tone pattern hh as a noun and fr as a verb.

3. Affix: pronounced in a weak reading, similar to the neutral tone of Mandarin Chinese, for example ba, wa, bo, pa, po, ma, mo.

4. Conventional word: its tone follows inherent tone-change rules. For example, when two syllables that are each read with the low tone l in isolation form a word, the word is not read ll but is often read lh, i.e. the low tone of the second syllable becomes high.
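The tone-change rule given for conventional words can be written as a one-rule sketch. Only the ll to lh change is taken from the text, which says it happens "often"; treating it as unconditional, and the function name, are simplifying assumptions here.

```python
def apply_conventional_sandhi(tones: list[str]) -> list[str]:
    """Apply the disyllabic low-low rule for conventional words.

    Two syllables read "l" in isolation are read "l", "h" as a word.
    Any other tone sequence is returned unchanged, as a copy.
    """
    if tones == ["l", "l"]:
        return ["l", "h"]
    return list(tones)
```

Further rules for conventional words, if available, would be added as additional cases; none are invented here.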

In this embodiment, the type of a word unit can be obtained by looking it up in dictionaries of the various word-unit types, or by manually annotating the types on the Tibetan text to be processed in advance.

Step d): adjust the part of speech of each word unit according to its type.

In this embodiment, the part of speech of each word unit is adjusted according to its type to ensure that the level-2 part of speech is accurate. For example, the part of speech can be adjusted according to the type of the word unit and its context-related features; it can also be adjusted by statistical modeling. Specifically, the adjustment falls into the following cases:

1. Part-of-speech adjustment of function word units

Function words connect the different constituents of a sentence. Function words with different roles can be identical in form yet take different parts of speech in actual context; that is, function words can be ambiguous in part of speech. In this embodiment, the part of speech of function words is adjusted by statistical modeling according to the context of the word unit, so as to obtain an accurate level-2 part of speech for each word unit. For example, a word that is usually a case particle may also be a conjunction; the relations among the constituents of the sentence determine whether, at a given position in the sentence, it acts as a case particle or as a conjunction.

2. Part-of-speech adjustment of multi-tone-pattern word units

With the adjusted function words as boundaries, the text to be processed is divided into fragments, and the part of speech of multi-tone-pattern word units is adjusted, specifically according to the context or by statistical modeling. For example, the part of speech of one multi-tone-pattern word unit (marked between two "/" symbols) differs in the following two sentences:

①.

Here the word unit is a verb at the end of the sentence, meaning "to occur, to arise", and its tone pattern is rl;

②.

Here the word unit is a noun, meaning "history, biography", and its tone pattern is lr.

During part-of-speech adjustment, the part of speech is adjusted according to the context in which the word unit occurs.
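Once the part of speech of a multi-tone-pattern word has been settled, its tone pattern can be read from a lexicon keyed by part of speech. In the sketch below, the single entry is the khrom skor example given earlier in the text (hh as a noun, fr as a verb); the lexicon structure and the function name are hypothetical.

```python
# Hypothetical lexicon: transliteration -> {part of speech: tone pattern}.
# The only entry reproduces the khrom skor example from the text.
MULTI_TONE_LEXICON = {
    "khrom skor": {"noun": "hh", "verb": "fr"},
}


def tone_pattern(word: str, pos: str):
    """Return the tone pattern for (word, pos), or None if not listed."""
    return MULTI_TONE_LEXICON.get(word, {}).get(pos)
```

For instance, `tone_pattern("khrom skor", "verb")` returns `"fr"`, while an unlisted word returns `None` so the caller can fall back to the general prediction.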

3. Part-of-speech adjustment of affix units and conventional word units

The word unit containing an affix unit, and conventional word units, directly use the level-2 part of speech of each word unit; this embodiment does not adjust them. It should be noted that affixes are of two kinds: independent affixes, and affix parts permanently attached to other word units. In this embodiment, the two are given the same part of speech; for example, the part of speech of both can be affix.

Step 104: predict the prosodic boundaries of the Tibetan text to be processed, and adjust the word-unit boundary at each prosodic boundary according to the part of speech of the word unit at that boundary.

A prosodic boundary is a pause between words that occurs in spoken communication in order to convey semantic information. The part of speech of a multi-tone-pattern word at a prosodic boundary is related to that boundary, and its tone changes with its part of speech; the word-unit boundary at the prosodic boundary therefore needs to be adjusted according to the part of speech of the word unit there.

In this embodiment, prosodic boundary prediction is first performed on the text to be processed; predicting pause positions, for example, amounts to predicting prosodic boundaries. The prediction process is the same as in the prior art; for example, prosodic boundaries can be predicted by statistical modeling from the context-related information of the word units. The word-unit boundary at each prosodic boundary is then adjusted according to the adjusted level-2 part of speech of the word unit there. Specifically, when the word unit at a prosodic boundary is of the multi-tone-pattern type and its level-2 part of speech is verb or adjective, the multi-tone-pattern word unit is split into syllables, and tone prediction is performed on the syllable units obtained by the split.
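The boundary adjustment can be sketched as below. Several details are assumptions of this sketch rather than statements of the patent: syllables are split on the tsheg (U+0F0B, the usual syllable delimiter in Tibetan script), each resulting syllable keeps the part of speech and type of the original word unit, and only the last syllable keeps the prosodic-boundary flag.

```python
from dataclasses import dataclass

TSHEG = "\u0f0b"  # Tibetan inter-syllable mark


@dataclass
class WordUnit:
    text: str
    pos: str                          # adjusted level-2 part of speech
    word_type: str                    # e.g. "multi_tone", "conventional"
    at_prosodic_boundary: bool = False


def adjust_boundaries(units):
    """Split multi-tone verbs/adjectives at prosodic boundaries into syllables."""
    out = []
    for u in units:
        split_needed = (
            u.at_prosodic_boundary
            and u.word_type == "multi_tone"
            and u.pos in {"verb", "adjective"}
        )
        if not split_needed:
            out.append(u)
            continue
        syllables = [s for s in u.text.split(TSHEG) if s]
        for k, s in enumerate(syllables):
            is_last = k == len(syllables) - 1
            # Assumption: only the final syllable stays at the boundary.
            out.append(WordUnit(s, u.pos, u.word_type, is_last))
    return out
```

After the split, each syllable is handled as its own unit in the tone prediction step, so a fixed whole-word tone pattern is no longer imposed on the word at the boundary.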

Step 105: according to the part of speech of each word unit, perform tone prediction on the syllable units of the Tibetan text to be processed after the word-unit boundaries have been adjusted, obtaining the tone information of the text.

In practice, the syllable unit can be taken as the tone-bearing unit, and tone prediction is performed on every syllable unit of every word unit in the text. For example, the tone of each syllable unit is determined by looking up the tone-type table according to tone prediction features. These features can be the initial class of the current syllable, the final class of the current syllable, the initial and final classes of the preceding and following syllables, the position of the current syllable within its word unit, the length of the word unit containing the current syllable, the part of speech of that word unit, and so on. In addition, the tone of a word unit can be adjusted according to the tone rules of Tibetan pronunciation; for example, all affixes are set to the weak reading, i.e. every syllable unit of an affix is set to the weak reading. Common affixes include ba, wa, bo, pa, po, ma, and mo.
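The feature list and the weak-reading rule for affixes can be sketched as follows. The dictionary layout, the `"weak"` label, and the function names are hypothetical; the features themselves are the ones enumerated in the paragraph above.

```python
WEAK = "weak"  # hypothetical label for the weak reading


def tone_features(syllables, k, pos):
    """Build tone prediction features for syllable k of one word unit.

    syllables: list of (initial_class, final_class) pairs for the word unit.
    pos: part of speech of the word unit.
    """
    pad = (None, None)  # used when there is no neighbouring syllable
    prev = syllables[k - 1] if k > 0 else pad
    nxt = syllables[k + 1] if k + 1 < len(syllables) else pad
    return {
        "initial": syllables[k][0],
        "final": syllables[k][1],
        "prev": prev,
        "next": nxt,
        "position": k,
        "word_length": len(syllables),
        "pos": pos,
    }


def apply_weak_reading(units):
    """Set every syllable of every affix unit to the weak reading.

    units: list of dicts with keys "type" and "tones"; a new list is returned.
    """
    out = []
    for u in units:
        if u["type"] == "affix":
            tones = [WEAK] * len(u["tones"])
        else:
            tones = list(u["tones"])
        out.append({**u, "tones": tones})
    return out
```

This sketch looks at neighbouring syllables only within the same word unit; whether the neighbours should also cross word-unit boundaries is not specified in the text.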

Further, the tone of each syllable unit obtained by this embodiment can be applied in the field of speech synthesis. For example, the tone prediction process of this embodiment can be carried out after Tibetan making character fonts has been completed, so that the finally synthesized Tibetan speech is more natural.

The Tibetan tone prediction method provided by the embodiment of the present invention performs word segmentation on the received Tibetan text to be processed to obtain each word unit, determines the type of each word unit, and then adjusts the word unit boundaries at the predicted rhythm boundaries according to the type of the word unit. As a result, when tone prediction is performed on the Tibetan text with adjusted word unit boundaries according to the part of speech of the word units, the influence of the part of speech of multi-tone-pattern words on the tone of word units at rhythm boundaries is taken into account. This solves the problem that multi-tone-pattern words in Tibetan take different tones at different rhythm boundaries, and makes synthesized Tibetan speech more natural.

Correspondingly, the present invention also provides a Tibetan tone prediction system which, as shown in Figure 2, comprises:

Receiving module 201, for receiving the Tibetan text to be processed;

Word segmentation module 202, for performing word segmentation on the Tibetan text to be processed to obtain each word unit;

Part of speech determining module 203, for determining the part of speech of the word unit according to the context information of the word unit in the Tibetan text to be processed;

Word unit boundary adjusting module 204, for predicting the rhythm boundaries of the Tibetan text to be processed and adjusting the word unit boundary at each rhythm boundary according to the part of speech of the word unit at that boundary;

Tone prediction module 205, for performing, according to the part of speech of each word unit, tone prediction on the syllable units of the Tibetan text to be processed after the word unit boundaries have been adjusted, to obtain the tone information of the Tibetan text to be processed.
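The way modules 201 to 205 hand data to one another can be sketched as a simple pipeline. The function name `run_tone_prediction` and its callable parameters are hypothetical, and each stage is passed in as a plain function so that any concrete segmenter, tagger, boundary adjuster, or tone predictor could be substituted.

```python
from typing import Callable, List, Tuple

TaggedWord = Tuple[str, str]   # (word unit, part of speech)
TonedUnit = Tuple[str, str]    # (syllable or word unit, tone label)


def run_tone_prediction(
    text: str,
    segment: Callable[[str], List[str]],
    tag_pos: Callable[[List[str]], List[TaggedWord]],
    adjust: Callable[[List[TaggedWord]], List[TaggedWord]],
    predict_tones: Callable[[List[TaggedWord]], List[TonedUnit]],
) -> List[TonedUnit]:
    """Chain the stages in the order of modules 201-205."""
    words = segment(text)            # module 202: word segmentation
    tagged = tag_pos(words)          # module 203: part-of-speech determination
    adjusted = adjust(tagged)        # module 204: rhythm boundary prediction and adjustment
    return predict_tones(adjusted)   # module 205: tone prediction on syllable units
```

In this sketch the receiving module 201 corresponds to the `text` argument; the stages themselves are stand-ins and carry no Tibetan-specific logic.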

In practical applications, this embodiment determines the part of speech of each word unit from its context-related features in the sentence that contains it. The part of speech determining module 203 includes:

Word type determining unit, for determining the type of each word unit;

Wherein, sentence boundaries are predicted from the level-one part of speech of the word unit preceding a single shad (Dan Chuifu) "|" in the Tibetan text to be processed, or by statistical modeling, and the clause unit includes:

In practical applications, the first boundary determining subunit is specifically used for: obtaining the candidate level-one parts of speech of each word unit by table lookup; extracting the context-related features of the current word unit; and, according to the context-related features of the current word unit, determining the level-one part of speech of the current word unit from its candidate level-one parts of speech by statistical modeling. The features include, for example, the level-one part-of-speech information of the word units before and after the current word unit and the position of the current word unit in the sentence.
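The candidate lookup, the statistical selection, and the use of the result at a single shad can be sketched as follows. Everything concrete here is assumed for illustration: the candidate table, the part-of-speech labels, and the bigram counts are toy values, and the count-based scorer merely stands in for whatever statistical model is actually trained. The sketch uses only the preceding part of speech as a feature, whereas the text also mentions the following word and the position in the sentence.

```python
from collections import Counter
from typing import Callable, Dict, List

# Hypothetical lookup table: word unit -> candidate level-one parts of speech.
CANDIDATES: Dict[str, List[str]] = {
    "w1": ["verb", "notional"],
    "w2": ["function"],
}
DEFAULT_CANDIDATES = ["notional"]   # assumption for words missing from the table

# Toy stand-in for a trained model: counts of (previous POS, current POS) pairs.
BIGRAM_COUNTS = Counter({
    ("<s>", "notional"): 5,
    ("<s>", "verb"): 1,
    ("notional", "verb"): 8,
    ("notional", "notional"): 3,
})

# Level-one parts of speech that make a preceding shad a sentence boundary.
BOUNDARY_POS = {"verb", "verb_configuration_affix"}


def choose_level_one_pos(word: str, prev_pos: str) -> str:
    """Pick the candidate that the toy model scores highest in this context."""
    candidates = CANDIDATES.get(word, DEFAULT_CANDIDATES)
    return max(candidates, key=lambda pos: BIGRAM_COUNTS[(prev_pos, pos)])


def tag_sentence(words: List[str]) -> List[str]:
    """Assign a level-one part of speech to each word unit, left to right."""
    tags: List[str] = []
    prev = "<s>"   # sentence-start marker
    for word in words:
        prev = choose_level_one_pos(word, prev)
        tags.append(prev)
    return tags


def shad_is_sentence_boundary(prev_word_pos: str, fallback: Callable[[], bool]) -> bool:
    """Apply the rule first; defer to a statistical predictor otherwise."""
    if prev_word_pos in BOUNDARY_POS:
        return True
    return fallback()
```

The `fallback` callable represents the statistical boundary model used when the word before the shad is neither a verb nor a verb configuration affix.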

In this embodiment, the word unit boundary adjusting module 204 may specifically be used for: when the word unit at a rhythm boundary is a multi-tone-pattern word unit and its part of speech is verb or adjective, splitting the multi-tone-pattern word unit into syllables and using the resulting syllable units for tone prediction.

Finally, according to the part of speech of each word unit obtained by the part of speech determining module 203, the tone prediction module 205 performs tone prediction on the syllable units of the Tibetan text to be processed after the word unit boundaries have been adjusted, and obtains the tone information of the Tibetan text to be processed.

Of course, in practical applications the system may further comprise a memory module (not shown) for saving dictionary information, tone prediction results, and the like. This facilitates automatic computer processing of the Tibetan text to be processed and the storage of information related to the synthesized speech.

In practical applications, the system can be applied in the field of Tibetan speech synthesis. For example, as shown in Figure 3, the system 301 can serve, together with the making character fonts system 302 and the like, as a subsystem of the text processing system 400, and the text processing system processes the Tibetan text so as to improve the naturalness of Tibetan speech synthesis.

In the Tibetan tone prediction system provided by the embodiment of the present invention, the receiving module 201 receives the Tibetan text to be processed, the word segmentation module 202 performs word segmentation to obtain each word unit, the part of speech determining module 203 then determines the type of each word unit, and the word unit boundaries at the rhythm boundaries of the Tibetan text to be processed are adjusted according to the result of the part of speech determining module 203. As a result, when the system performs tone prediction on word units at rhythm boundaries according to the part of speech of the word units, the influence of the part of speech of multi-tone-pattern words on the tone of those word units is taken into account. This solves the problem that multi-tone-pattern words in Tibetan take different tones at different rhythm boundaries, and makes synthesized Tibetan speech more natural.

The embodiments in this specification are described in a progressive manner; identical and similar parts of the embodiments may be cross-referenced, and each embodiment focuses on its differences from the others. In particular, the system embodiment is described relatively briefly because it is substantially similar to the method embodiment; for relevant details, refer to the description of the method embodiment. The system embodiment described above is merely illustrative: units described as separate components may or may not be physically separate, and components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. A person of ordinary skill in the art can understand and implement it without creative effort.

The embodiments of the present invention have been described in detail above, and specific implementations are used herein to illustrate the invention. The above description of the embodiments is intended only to help in understanding the method and apparatus of the present invention. Meanwhile, for a person of ordinary skill in the art, the specific implementation and the scope of application may vary according to the idea of the present invention. In summary, the contents of this specification should not be construed as limiting the present invention.

Claims

1. A Tibetan tone prediction method, characterized by comprising:

receiving Tibetan text to be processed;

determining the part of speech of the word unit according to the context information of the word unit in the Tibetan text to be processed;

predicting the rhythm boundaries of the Tibetan text to be processed, and adjusting the word unit boundary at each rhythm boundary according to the part of speech of the word unit at that boundary;

performing, according to the part of speech of each word unit, tone prediction on the syllable units of the Tibetan text to be processed after the word unit boundaries have been adjusted, to obtain the tone information of the Tibetan text to be processed.

2. The method according to claim 1, characterized in that determining the part of speech of the word unit according to the context information of the word unit in the Tibetan text to be processed comprises:

predicting the part of speech of the word unit in the sentence;

determining the type of the word unit.

3. The method according to claim 2, characterized in that performing sentence segmentation on the Tibetan text to be processed comprises:

predicting the level-one part of speech of each word unit, the level-one part of speech including: verb, notional word, pronoun, function word, general affix, and verb configuration affix;

if the level-one part of speech of the word unit preceding a single shad (Dan Chuifu) "|" is verb or verb configuration affix, the single shad marks a sentence boundary;

if the level-one part of speech of the word unit preceding the single shad "|" is neither verb nor verb configuration affix, predicting the sentence boundary by statistical modeling.

4. The method according to claim 3, characterized in that predicting the level-one part of speech of each word unit comprises:

obtaining the candidate level-one parts of speech of each word unit;

extracting the context-related features of the current word unit;

determining, according to the context-related features of the current word unit, the level-one part of speech of the current word unit from its candidate level-one parts of speech by statistical modeling.

5. The method according to claim 2, characterized in that the type of the word unit includes any one or more of the following: multi-tone-pattern word, function word, affix, and conventional word.

6. The method according to claim 1, characterized in that adjusting the word unit boundary at the rhythm boundary according to the part of speech of the word unit at the rhythm boundary comprises:

when the word unit at the rhythm boundary is a multi-tone-pattern word unit and its part of speech is verb or adjective, splitting the multi-tone-pattern word unit into syllables, and performing subsequent tone prediction using the syllable units obtained after splitting.

7. A Tibetan tone prediction system, characterized by comprising:

a receiving module, for receiving Tibetan text to be processed;

a part of speech determining module, for determining the part of speech of the word unit according to the context information of the word unit in the Tibetan text to be processed;

a word unit boundary adjusting module, for predicting the rhythm boundaries of the Tibetan text to be processed and adjusting the word unit boundary at each rhythm boundary according to the part of speech of the word unit at that boundary;

a tone prediction module, for performing, according to the part of speech of each word unit, tone prediction on the syllable units of the Tibetan text to be processed after the word unit boundaries have been adjusted, to obtain the tone information of the Tibetan text to be processed.

8. The system according to claim 7, characterized in that the part of speech determining module comprises:

a word type determining unit, for determining the type of each word unit.

9. The system according to claim 8, characterized in that the clause unit comprises:

a first boundary determining subunit, for determining that a single shad (Dan Chuifu) "|" marks a sentence boundary if the level-one part of speech of the word unit preceding the single shad is verb or verb configuration affix;

a second boundary determining subunit, for predicting the sentence boundary by statistical modeling if the level-one part of speech of the word unit preceding the single shad "|" is neither verb nor verb configuration affix.

10. The system according to claim 9, characterized in that the first boundary determining subunit is specifically used for: obtaining the candidate level-one parts of speech of each word unit by table lookup; extracting the context-related features of the current word unit; and determining, according to the context-related features of the current word unit, the level-one part of speech of the current word unit from its candidate level-one parts of speech by statistical modeling.

11. The system according to claim 7, characterized in that the word unit boundary adjusting module is specifically used for: when the word unit at a rhythm boundary is a multi-tone-pattern word unit and its part of speech is verb or adjective, splitting the multi-tone-pattern word unit into syllables, and performing subsequent tone prediction using the syllable units obtained after splitting.