CN101008942A

CN101008942A - Machine translation device and method thereof

Info

Publication number: CN101008942A
Application number: CNA200610002940XA
Authority: CN
Inventors: 王亚军; 李大冀
Original assignee: JINYUANJIAN COMPUTER TECHNOLOGY Co Ltd BEIJING
Current assignee: JINYUANJIAN COMPUTER TECHNOLOGY Co Ltd BEIJING
Priority date: 2006-01-25
Filing date: 2006-01-25
Publication date: 2007-08-01

Abstract

This invention discloses one machine translation device, which comprises the following parts: sound information input unit to input first sound information; sound identification unit to identify first sound information and to convert it into text information; translation unit to translate text information into second one. This invention translation method comprises the following steps: sound information input unit to input first sound information; sound identification unit to identify first sound information and to convert it into text information; translation unit to translate text information into second one.

Description

Machine translation apparatus and machine translation method

Technical field

The present invention relates to a kind of machine translation apparatus and machine translation method.

Background technology

All the time, people are when the people who says foreign language carries out session and exchanges, and hope can have a kind of portable unit, can at any time the foreign language translation of hearing be become the mother tongue of oneself, oneself can be translated into foreign language again and say to the other side and listen.

In correlation technique, current have three kinds of technology: the one, and speech recognition technology utilizes this technology, and computing machine can be discerned the voice of input, the text that speech conversion is become to be convenient to handle; The 2nd, text-converted becomes voice technology, i.e. text voice conversion TTS (Text To Speech) technology is utilized this technology, and the voice that computing machine can become people to understand the text-converted after the translation are exported; The 3rd, machine translation mothod promptly utilizes calculation element that another language translated in a kind of language.

1. speech recognition technology

The computer speech identifying is consistent with the people to the voice recognition processing process basically.The speech recognition technology of main flow is based on the basic theories of statistical model identification at present.The implementation procedure of a typical speech recognition system 100 can roughly be divided into three parts as shown in Figure 1: (1) phonetic feature extraction unit 110: its objective is to extract time dependent phonetic feature sequence from speech waveform.(2) acoustic model (model bank 120) and pattern match 130 (recognizer): acoustic model produces the phonetic feature that obtains usually by learning algorithm.Phonetic feature with input when identification mates and compares with acoustics model (pattern), obtains best recognition result.(3) language model and Language Processing: language model comprises grammer network that is made of voice command recognition or the language model that is made of statistical method, and Language Processing can be carried out grammer, semantic analysis.To little vocabulary speech recognition system, often do not need the Language Processing part.

Acoustic model is the bottom model of recognition system, and is the part of most critical in the speech recognition system.The purpose of acoustic model provides a kind of feature vector sequence of effective method computing voice and the distance between each pronunciation template.The design of acoustic model is closely related with the language pronouncing characteristics.Acoustic model cell size (word pronunciation model, semitone joint model or phoneme model) is to voice training data volume size, system recognition rate, and dirigibility has bigger influence.Must determine the size of recognition unit according to the characteristics of different language, the size of recognition system vocabulary.

About speech recognition application systematic research exploitation, more existing practical speech recognition systems drop into commercial operation.The software that has occurred some speech recognitions on market helps people to utilize speech recognition technology to realize the text typing, and simple computer operation etc.For example, in some mobile phone, people have realized voice dial-up function; When some software can help people to use PC, use the microphone utterance that is connected on the PC is come the typing text.

2.TTS technology

The TTS technology claims the text voice switch technology again, the Word message of input that it produces computing machine oneself or outside change into can listen the technology of Chinese characters spoken language output that understand, fluent, be under the jurisdiction of phonetic synthesis.Phonetic synthesis is for producing the technology of artificial voice by method machinery, electronics.Be example with Fig. 2 below, the principle of TTS technology is described.Linguistics is handled (S210), in text-to-speech system, play an important role, main anthropomorphic dummy is to the understanding process of natural language---and text is regular, the cutting of speech, grammatical analysis and semantic analysis, computing machine can be understood fully to the text of input, and provide the needed various pronunciation promptings of back two parts.The rhythm is handled (S220), for synthetic speech is cooked up segment5al feature, as pitch, the duration of a sound and loudness of a sound etc., makes synthetic speech can correctly express the meaning of one's words, sounds more natural.Acoustic treatment (S230) is according to requirement output voice, the i.e. synthetic speech of preceding two parts result.

Occurred the software of a lot of TTS on the current market, and the TTS software of various language has been arranged, can become the voice of this language to export the text-converted of various language such as Chinese, English, Japanese by these softwares.For example we can hear the voice suggestion that utilizes the TTS technology export in the application of various hotlines, Games Software, instrument and meter etc.

3. machine translation mothod

Currently a large amount of mechanical translation software occurs, can be installed on the various devices such as PC, PDA, carried out other translation of various languages.For example the Wenquxing TMV5100 of company of Golden Global View can realize the translation of english vernacular.This device can carry out English-Chinese, Chinese-English two-way whole sentence translation to works and expressions for everyday use.When carrying out the English interchange,, just can obtain translation result to Wenquxing TMV5100 input Chinese or English.Works and expressions for everyday use are translated the cross discipline of multiple subjects such as relating to computational linguistics, artificial intelligence, natural language processing and technology.The works and expressions for everyday use translation comprises that mainly works and expressions for everyday use are resolved and translation.Works and expressions for everyday use are resolved main works and expressions for everyday use sentence pattern, automatic segmentation long sentence and the rewriting statement analyzed.The result that works and expressions for everyday use translations draws at semantic analysis utilizes the database of setting up in advance, determines to understand the second language of works and expressions for everyday useization.The automatic works and expressions for everyday use translation system of computing machine of exploitation has had much at present, has wherein introduced the works and expressions for everyday use translation on the Wenquxing TMV5100 as embedded system device.

Yet people a kind of machine translation apparatus wish to occur in the translation application of reality, can carry, and can understand people's word, and will hear and can say by opening after translating out.

In existing machine translation apparatus, utilized the machine translation mothod in the correlation technique, a kind of text translation of language can be become the text of another kind of language, or combine the TTS technology in the correlation technique, translation result can be said; But there are following problems in these machine translation apparatus, that is, these machine translation apparatus can not be listened people's word, more can not understand people's word, will hear then and can say by opening after translating out.

In addition, all there is a problem in above-mentioned correlation technique, that is, because the problem of the large vocabulary of language and the flexible and changeable semantic analysis that grammer brought.And this problem seems particularly outstanding in the middle of the words of the whole sentence of mechanical translation.

Summary of the invention

Therefore, the present invention relates to a kind of machine translation apparatus and machine translation method, it can solve well owing to the restriction of correlation technique and the not enough one or more problems that cause.

An advantage of the present invention is to provide a kind of machine translation apparatus and machine translation method, in particular to a kind of language that is used to translate by phonetic entry, then with the machine translation apparatus of translation result with voice output.

Another advantage of the present invention is can realize by embedded system device according to machine translation apparatus of the present invention, thus be easy to carry and cost very low.

Other features and advantages of the present invention will be set forth in the following description, and, partly from instructions, become apparent, perhaps understand by implementing the present invention.Purpose of the present invention and other advantages can realize and obtain by specifically noted structure in the instructions of being write, claims and accompanying drawing.

In order to realize that according to these purposes of the present invention and other advantage concrete and general description provides a kind of machine translation apparatus as institute in the literary composition, it is characterized in that, comprising: the voice messaging input block is used to import the voice messaging of first language; Voice recognition unit is used to discern the voice messaging of the first language of being imported, and converts thereof into text message; And translation unit, be used for the text message of being changed is translated into the text message of second language.

According to machine translation apparatus of the present invention, also can comprise: the text voice converting unit, the text message that is used for the second language that will be translated converts voice messaging to; And the voice messaging output unit, be used for the voice messaging output that will be changed.

According to machine translation apparatus of the present invention, also can comprise: judging unit is used to judge that the text message of being changed is word or sentence; And the dictionary unit, be used for translation of words; Wherein, when the text of changing when described judgment unit judges is word, described word translation is become the text message of second language by described dictionary unit; And the text of changing when described judgment unit judges becomes described sentence translation by described translation unit the text message of second language when being sentence.

According to machine translation apparatus of the present invention, also can comprise: the text message output unit is used to export the text message of text message of being changed and the second language of being translated; And the text message input block, the text message that is used to import the text message of first language and revises the second language of being changed.

According to machine translation apparatus of the present invention, available computers realizes.

According to machine translation apparatus of the present invention, available embedded system device is realized.

Described scheduled instruction can be defined as the works and expressions for everyday use in a kind of existing language, the industry term in perhaps a kind of existing language.

According to a further aspect of the invention, provide a kind of machine translation method, it is characterized in that may further comprise the steps: the voice messaging input step is used to import the voice messaging of first language; Speech recognition steps is used to discern the voice messaging of the first language of being imported, and converts thereof into text message; And translation steps, the text translation that is used for being changed becomes the text message of second language.

According to machine translation method of the present invention, also can may further comprise the steps: the text voice switch process, the text message that is used for the second language that will be translated converts voice messaging to; And voice messaging output step, be used for the voice messaging output that will be changed.

According to machine translation method of the present invention, also can may further comprise the steps: determining step is used to judge that the text message of being changed is word or sentence; And the word translation step, be used for translation of words, wherein when judging that the text message of being changed is word, described word translation is become the text message of second language by described word translation step; And when judging that the text message of being changed is sentence, described sentence translation is become the text message of second language by described translation steps.

According to machine translation method of the present invention, also can may further comprise the steps: text message output step is used to export the text message of text message of being changed and the second language of being translated; And the text message input step, be used to import the text message of first language and revise the text message of being changed.

By following explanation to specific embodiments of the invention, and in conjunction with the accompanying drawings, it is obvious that other aspects of the present invention and feature will become concerning those skilled in the art person.

Description of drawings

Below in conjunction with accompanying drawing, embodiments of the invention are described, should be appreciated that these embodiment are used to illustrate the present invention, rather than the present invention is limited, wherein:

Fig. 1 is the synoptic diagram that the realization of speech recognition is shown;

Fig. 2 is the synoptic diagram that the realization of TTS is shown;

Fig. 3 is the schematic diagram that illustrates according to machine translation apparatus of the present invention;

Fig. 4 is the block scheme that machine translation apparatus according to an embodiment of the invention is shown;

Fig. 5 is the process flow diagram that machine translation method according to an embodiment of the invention is shown;

Fig. 6 is the block scheme that machine translation apparatus according to another embodiment of the invention is shown; And

Fig. 7 is the process flow diagram that machine translation method according to another embodiment of the invention is shown.

Embodiment

Below will be in detail with reference to the preferred embodiments of the present invention, the example is shown in the drawings.

People a kind of machine translation apparatus wish to occur in the foreign language session of reality, can carry, and can understand people's word, and will hear and can say by opening after translating out.

In order to achieve this end, need to solve the problem of three aspects.The one, understand people's word, the 2nd, the words of hearing are translated out, the 3rd, the translation result opening is said.

Fig. 3 is the schematic diagram that illustrates according to machine translation apparatus of the present invention.As shown in Figure 3, according to principle of the present invention, speech recognition technology, TTS technology and the machine translation mothod of correlation technique are combined, promptly utilize speech recognition technology that the speech conversion of input is become text (understanding people's word) (S410), utilize machine translation mothod then, be about to a kind of language and translate into another language (will hear and translate out) (S420), utilize the TTS technology at last, the language conversion that translation is come out becomes voice output (the translation result opening is said) (S430).

Yet, in speech recognition technology, TTS technology and the machine translation mothod of correlation technique, have a problem, promptly because the problem of the large vocabulary of language and the flexible and changeable semantic analysis that grammer brought.And this problem seems particularly outstanding in the middle of the words of the whole sentence of mechanical translation.

Because the semantic huge complicacy of language, make the semantic analysis particular importance, to adopt the algorithm of optimization on the one hand, on the other hand, be defined as specific area in the language by reducing the language that to translate, can reduce the calculated amount of semantic analysis significantly, make the combination of above correlation technique become and realize easily.

Therefore, according to principle of the present invention,, the language that translate is defined in specific field in order to solve the problem of above-mentioned semantic analysis.

Preferably, for example, be directed to session and exchange, the language that translate can be defined in the works and expressions for everyday use translation, this day term translation.

Alternatively, if be directed to some sector application, for example the communications field also can be defined in the language that will translate the communication term.

By with upper type, can reduce the calculated amount of semantic analysis significantly, and accelerate translation speed significantly, and owing to the requirement that has reduced machine translation apparatus arithmetic capability and storage capacity, so strengthened striding the hardware platform performance significantly, for example not only PC can be applied to, and various embedded systems such as single-chip microcomputer can be applied to.

Alternatively, in order to improve portability, realize that better session anywhere or anytime exchanges, and can realize with 32 single-chip microcomputers of FLASH storer according to machine translation apparatus of the present invention.

Describe machine translation apparatus and machine translation method according to an embodiment of the invention in detail below in conjunction with accompanying drawing.

Fig. 4 is the block scheme that machine translation apparatus 500 according to an embodiment of the invention is shown.

Machine translation apparatus 500 comprises: voice messaging input block 510 is used to import the voice messaging of first language; Voice recognition unit 520 is used to discern the voice messaging of the first language of being imported, and converts thereof into text message; And translation unit 530, be used for the text message of being changed is translated into the text message of second language.

Fig. 5 is the process flow diagram that machine translation method according to an embodiment of the invention is shown.

This machine translation method may further comprise the steps: voice messaging input step S610 is used to import the voice messaging of first language; Speech recognition steps S620 is used to discern the voice messaging of the first language of being imported, and converts thereof into text message; And translation steps S630, the text translation that is used for being changed becomes the text message of second language.

Fig. 6 is the block scheme that machine translation apparatus 700 according to an embodiment of the invention is shown.

As shown in Figure 6, machine translation apparatus 700 according to the present invention comprises: microphone 710 is used to import user's voice; Voice recognition unit 720 utilizes speech recognition technology to discern the voice of being imported, and converts thereof into text; Judging unit 730 is used to judge that the text of being changed is a word or works and expressions for everyday use; Dictionary unit 740 is used for translation of words; Works and expressions for everyday use translation unit 750 is used to translate works and expressions for everyday use; Display screen 760 is used to show translation result; TTS unit 770 is used for converting translation result to voice messaging; Loudspeaker 780 is used to export voice messaging; And keyboard 790, the text message that is used to import the text message of first language and revises the second language of being changed.

Alternatively, this machine translation apparatus can be realized with PC, also can realize with embedded system.It should be noted that machine translation apparatus according to the present invention is not limited to specific hardware platform.

Alternatively, this machine translation apparatus can be used for Sino-British works and expressions for everyday use paginal translation, Sino-Japan works and expressions for everyday use paginal translation etc.It should be noted that machine translation apparatus according to the present invention is not limited to one or more language.

Alternatively, this machine translation apparatus can be used for works and expressions for everyday use translation, the translation of industry term etc.It should be noted that machine translation apparatus according to the present invention is not limited to the term of specific area.

Below the operating process of machine translation apparatus 700 according to another embodiment of the invention will be described according to Fig. 7.

Fig. 7 is the process flow diagram that machine translation apparatus 700 according to an embodiment of the invention is shown.

In S810, the user speaks to the microphone of machine translation apparatus with predetermined a kind of language, i.e. input needs the voice messaging of translation; In S820, utilize speech recognition technology to discern the voice of being imported, and convert thereof into text; In S830, judge that the text of being changed is a word or works and expressions for everyday use; If word then forwards S840 to, in S840, translation of words; If works and expressions for everyday use then forward S850 to, in S850, the translation works and expressions for everyday use; In S860, show translation result; In S870, convert translation result to voice messaging; In S880, export voice messaging to the user; And in S880, directly input text information can be used for importing the text message of first language and the text message of the second language that modification is changed.

By machine translation apparatus of the present invention, an one advantage is not only can carry out mechanical translation, and can directly understand people's word, no longer needs text input manually.

According to machine translation apparatus of the present invention, its another advantage be can be enough embedded system device realize, thereby be easy to carry and with low cost.

By machine translation apparatus of the present invention, its another advantage is and the result of mechanical translation directly can also be said.

By machine translation apparatus of the present invention, its another advantage is not only can translation of words, can also translate the whole sentence of specific area, for example works and expressions for everyday use or industry term.

By machine translation apparatus of the present invention, realized that people in the foreign language session of reality, can understand people's word, and will hear and to say by opening after translating out.Generally speaking, carry such machine translation apparatus just as being equipped with a translator with oneself.

The above is the preferred embodiments of the present invention only, is not limited to the present invention, and for a person skilled in the art, the present invention can have various changes and variation.Within the spirit and principles in the present invention all, any modification of being done, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.

Claims

1. a machine translation apparatus is characterized in that, comprising:

The voice messaging input block is used to import the voice messaging of first language;

Voice recognition unit is used to discern the voice messaging of the first language of being imported, and converts thereof into text message; And

Translation unit is used for the text message of being changed is translated into the text message of second language.

2. machine translation apparatus according to claim 1 is characterized in that, also comprises:

The text voice converting unit, the text message that is used for the second language that will be translated converts voice messaging to; And

The voice messaging output unit is used for the voice messaging output that will be changed.

3. machine translation apparatus according to claim 1 is characterized in that, also comprises:

Judging unit is used to judge that the text message of being changed is word or sentence; And

The dictionary unit is used for translation of words; Wherein,

When the text of changing when described judgment unit judges is word, described word translation is become the text message of second language by described dictionary unit; And

When the text of changing when described judgment unit judges is sentence, described sentence translation is become the text message of second language by described translation unit.

4. according to each described machine translation apparatus in the claim 1 to 3, it is characterized in that, also comprise:

The text message output unit is used to export the text message of text message of being changed and the second language of being translated; And

The text message input block, the text message that is used to import the text message of first language and revises the second language of being changed.

5. according to each described machine translation apparatus in the claim 1 to 4, it is characterized in that, realize with computing machine.

6. according to each described machine translation apparatus in the claim 1 to 4, it is characterized in that, realize with embedded system device.

7. according to each described machine translation apparatus in the claim 1 to 4, it is characterized in that described first language is defined as the works and expressions for everyday use in a kind of existing language.

8. according to each described machine translation apparatus in the claim 1 to 4, it is characterized in that described first language is defined as the industry term in a kind of existing language.

9. a machine translation method is characterized in that, may further comprise the steps:

The voice messaging input step is used to import the voice messaging of first language;

Speech recognition steps is used to discern the voice messaging of the first language of being imported, and converts thereof into text message; And

Translation steps, the text translation that is used for being changed becomes the text message of second language.

10. method according to claim 9 is characterized in that, and is further comprising the steps of:

The text voice switch process, the text message that is used for the second language that will be translated converts voice messaging to; And

Voice messaging output step is used for the voice messaging output that will be changed.

11. method according to claim 9 is characterized in that, and is further comprising the steps of:

Determining step is used to judge that the text message of being changed is word or sentence; And

The word translation step is used for translation of words, wherein

When judging that the text message of being changed is word, described word translation is become the text message of second language by described word translation step; And

When judging that the text message of being changed is sentence, described sentence translation is become the text message of second language by described translation steps.

12. according to each described method in the claim 9 to 11, it is characterized in that, further comprising the steps of:

Text message output step is used to export the text message of text message of being changed and the second language of being translated; And

The text message input step is used to import the text message of first language and revises the text message of being changed.

13., it is characterized in that described scheduled instruction is defined as the works and expressions for everyday use in a kind of existing language according to each described method in the claim 9 to 12.

14., it is characterized in that described scheduled instruction is defined as the industry term in a kind of existing language according to each described method in the claim 9 to 12.