CN101008942A - Machine translation device and method thereof - Google Patents
Machine translation device and method thereof Download PDFInfo
- Publication number
- CN101008942A CN101008942A CNA200610002940XA CN200610002940A CN101008942A CN 101008942 A CN101008942 A CN 101008942A CN A200610002940X A CNA200610002940X A CN A200610002940XA CN 200610002940 A CN200610002940 A CN 200610002940A CN 101008942 A CN101008942 A CN 101008942A
- Authority
- CN
- China
- Prior art keywords
- language
- text message
- translation
- changed
- voice messaging
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Abstract
This invention discloses one machine translation device, which comprises the following parts: sound information input unit to input first sound information; sound identification unit to identify first sound information and to convert it into text information; translation unit to translate text information into second one. This invention translation method comprises the following steps: sound information input unit to input first sound information; sound identification unit to identify first sound information and to convert it into text information; translation unit to translate text information into second one.
Description
Technical field
The present invention relates to a kind of machine translation apparatus and machine translation method.
Background technology
All the time, people are when the people who says foreign language carries out session and exchanges, and hope can have a kind of portable unit, can at any time the foreign language translation of hearing be become the mother tongue of oneself, oneself can be translated into foreign language again and say to the other side and listen.
In correlation technique, current have three kinds of technology: the one, and speech recognition technology utilizes this technology, and computing machine can be discerned the voice of input, the text that speech conversion is become to be convenient to handle; The 2nd, text-converted becomes voice technology, i.e. text voice conversion TTS (Text To Speech) technology is utilized this technology, and the voice that computing machine can become people to understand the text-converted after the translation are exported; The 3rd, machine translation mothod promptly utilizes calculation element that another language translated in a kind of language.
1. speech recognition technology
The computer speech identifying is consistent with the people to the voice recognition processing process basically.The speech recognition technology of main flow is based on the basic theories of statistical model identification at present.The implementation procedure of a typical speech recognition system 100 can roughly be divided into three parts as shown in Figure 1: (1) phonetic feature extraction unit 110: its objective is to extract time dependent phonetic feature sequence from speech waveform.(2) acoustic model (model bank 120) and pattern match 130 (recognizer): acoustic model produces the phonetic feature that obtains usually by learning algorithm.Phonetic feature with input when identification mates and compares with acoustics model (pattern), obtains best recognition result.(3) language model and Language Processing: language model comprises grammer network that is made of voice command recognition or the language model that is made of statistical method, and Language Processing can be carried out grammer, semantic analysis.To little vocabulary speech recognition system, often do not need the Language Processing part.
Acoustic model is the bottom model of recognition system, and is the part of most critical in the speech recognition system.The purpose of acoustic model provides a kind of feature vector sequence of effective method computing voice and the distance between each pronunciation template.The design of acoustic model is closely related with the language pronouncing characteristics.Acoustic model cell size (word pronunciation model, semitone joint model or phoneme model) is to voice training data volume size, system recognition rate, and dirigibility has bigger influence.Must determine the size of recognition unit according to the characteristics of different language, the size of recognition system vocabulary.
About speech recognition application systematic research exploitation, more existing practical speech recognition systems drop into commercial operation.The software that has occurred some speech recognitions on market helps people to utilize speech recognition technology to realize the text typing, and simple computer operation etc.For example, in some mobile phone, people have realized voice dial-up function; When some software can help people to use PC, use the microphone utterance that is connected on the PC is come the typing text.
2.TTS technology
The TTS technology claims the text voice switch technology again, the Word message of input that it produces computing machine oneself or outside change into can listen the technology of Chinese characters spoken language output that understand, fluent, be under the jurisdiction of phonetic synthesis.Phonetic synthesis is for producing the technology of artificial voice by method machinery, electronics.Be example with Fig. 2 below, the principle of TTS technology is described.Linguistics is handled (S210), in text-to-speech system, play an important role, main anthropomorphic dummy is to the understanding process of natural language---and text is regular, the cutting of speech, grammatical analysis and semantic analysis, computing machine can be understood fully to the text of input, and provide the needed various pronunciation promptings of back two parts.The rhythm is handled (S220), for synthetic speech is cooked up segment5al feature, as pitch, the duration of a sound and loudness of a sound etc., makes synthetic speech can correctly express the meaning of one's words, sounds more natural.Acoustic treatment (S230) is according to requirement output voice, the i.e. synthetic speech of preceding two parts result.
Occurred the software of a lot of TTS on the current market, and the TTS software of various language has been arranged, can become the voice of this language to export the text-converted of various language such as Chinese, English, Japanese by these softwares.For example we can hear the voice suggestion that utilizes the TTS technology export in the application of various hotlines, Games Software, instrument and meter etc.
3. machine translation mothod
Currently a large amount of mechanical translation software occurs, can be installed on the various devices such as PC, PDA, carried out other translation of various languages.For example the Wenquxing TMV5100 of company of Golden Global View can realize the translation of english vernacular.This device can carry out English-Chinese, Chinese-English two-way whole sentence translation to works and expressions for everyday use.When carrying out the English interchange,, just can obtain translation result to Wenquxing TMV5100 input Chinese or English.Works and expressions for everyday use are translated the cross discipline of multiple subjects such as relating to computational linguistics, artificial intelligence, natural language processing and technology.The works and expressions for everyday use translation comprises that mainly works and expressions for everyday use are resolved and translation.Works and expressions for everyday use are resolved main works and expressions for everyday use sentence pattern, automatic segmentation long sentence and the rewriting statement analyzed.The result that works and expressions for everyday use translations draws at semantic analysis utilizes the database of setting up in advance, determines to understand the second language of works and expressions for everyday useization.The automatic works and expressions for everyday use translation system of computing machine of exploitation has had much at present, has wherein introduced the works and expressions for everyday use translation on the Wenquxing TMV5100 as embedded system device.
Yet people a kind of machine translation apparatus wish to occur in the translation application of reality, can carry, and can understand people's word, and will hear and can say by opening after translating out.
In existing machine translation apparatus, utilized the machine translation mothod in the correlation technique, a kind of text translation of language can be become the text of another kind of language, or combine the TTS technology in the correlation technique, translation result can be said; But there are following problems in these machine translation apparatus, that is, these machine translation apparatus can not be listened people's word, more can not understand people's word, will hear then and can say by opening after translating out.
In addition, all there is a problem in above-mentioned correlation technique, that is, because the problem of the large vocabulary of language and the flexible and changeable semantic analysis that grammer brought.And this problem seems particularly outstanding in the middle of the words of the whole sentence of mechanical translation.
Summary of the invention
Therefore, the present invention relates to a kind of machine translation apparatus and machine translation method, it can solve well owing to the restriction of correlation technique and the not enough one or more problems that cause.
An advantage of the present invention is to provide a kind of machine translation apparatus and machine translation method, in particular to a kind of language that is used to translate by phonetic entry, then with the machine translation apparatus of translation result with voice output.
Another advantage of the present invention is can realize by embedded system device according to machine translation apparatus of the present invention, thus be easy to carry and cost very low.
Other features and advantages of the present invention will be set forth in the following description, and, partly from instructions, become apparent, perhaps understand by implementing the present invention.Purpose of the present invention and other advantages can realize and obtain by specifically noted structure in the instructions of being write, claims and accompanying drawing.
In order to realize that according to these purposes of the present invention and other advantage concrete and general description provides a kind of machine translation apparatus as institute in the literary composition, it is characterized in that, comprising: the voice messaging input block is used to import the voice messaging of first language; Voice recognition unit is used to discern the voice messaging of the first language of being imported, and converts thereof into text message; And translation unit, be used for the text message of being changed is translated into the text message of second language.
According to machine translation apparatus of the present invention, also can comprise: the text voice converting unit, the text message that is used for the second language that will be translated converts voice messaging to; And the voice messaging output unit, be used for the voice messaging output that will be changed.
According to machine translation apparatus of the present invention, also can comprise: judging unit is used to judge that the text message of being changed is word or sentence; And the dictionary unit, be used for translation of words; Wherein, when the text of changing when described judgment unit judges is word, described word translation is become the text message of second language by described dictionary unit; And the text of changing when described judgment unit judges becomes described sentence translation by described translation unit the text message of second language when being sentence.
According to machine translation apparatus of the present invention, also can comprise: the text message output unit is used to export the text message of text message of being changed and the second language of being translated; And the text message input block, the text message that is used to import the text message of first language and revises the second language of being changed.
According to machine translation apparatus of the present invention, available computers realizes.
According to machine translation apparatus of the present invention, available embedded system device is realized.
Described scheduled instruction can be defined as the works and expressions for everyday use in a kind of existing language, the industry term in perhaps a kind of existing language.
According to a further aspect of the invention, provide a kind of machine translation method, it is characterized in that may further comprise the steps: the voice messaging input step is used to import the voice messaging of first language; Speech recognition steps is used to discern the voice messaging of the first language of being imported, and converts thereof into text message; And translation steps, the text translation that is used for being changed becomes the text message of second language.
According to machine translation method of the present invention, also can may further comprise the steps: the text voice switch process, the text message that is used for the second language that will be translated converts voice messaging to; And voice messaging output step, be used for the voice messaging output that will be changed.
According to machine translation method of the present invention, also can may further comprise the steps: determining step is used to judge that the text message of being changed is word or sentence; And the word translation step, be used for translation of words, wherein when judging that the text message of being changed is word, described word translation is become the text message of second language by described word translation step; And when judging that the text message of being changed is sentence, described sentence translation is become the text message of second language by described translation steps.
According to machine translation method of the present invention, also can may further comprise the steps: text message output step is used to export the text message of text message of being changed and the second language of being translated; And the text message input step, be used to import the text message of first language and revise the text message of being changed.
Described scheduled instruction can be defined as the works and expressions for everyday use in a kind of existing language, the industry term in perhaps a kind of existing language.
By following explanation to specific embodiments of the invention, and in conjunction with the accompanying drawings, it is obvious that other aspects of the present invention and feature will become concerning those skilled in the art person.
Description of drawings
Below in conjunction with accompanying drawing, embodiments of the invention are described, should be appreciated that these embodiment are used to illustrate the present invention, rather than the present invention is limited, wherein:
Fig. 1 is the synoptic diagram that the realization of speech recognition is shown;
Fig. 2 is the synoptic diagram that the realization of TTS is shown;
Fig. 3 is the schematic diagram that illustrates according to machine translation apparatus of the present invention;
Fig. 4 is the block scheme that machine translation apparatus according to an embodiment of the invention is shown;
Fig. 5 is the process flow diagram that machine translation method according to an embodiment of the invention is shown;
Fig. 6 is the block scheme that machine translation apparatus according to another embodiment of the invention is shown; And
Fig. 7 is the process flow diagram that machine translation method according to another embodiment of the invention is shown.
Embodiment
Below will be in detail with reference to the preferred embodiments of the present invention, the example is shown in the drawings.
People a kind of machine translation apparatus wish to occur in the foreign language session of reality, can carry, and can understand people's word, and will hear and can say by opening after translating out.
In order to achieve this end, need to solve the problem of three aspects.The one, understand people's word, the 2nd, the words of hearing are translated out, the 3rd, the translation result opening is said.
Fig. 3 is the schematic diagram that illustrates according to machine translation apparatus of the present invention.As shown in Figure 3, according to principle of the present invention, speech recognition technology, TTS technology and the machine translation mothod of correlation technique are combined, promptly utilize speech recognition technology that the speech conversion of input is become text (understanding people's word) (S410), utilize machine translation mothod then, be about to a kind of language and translate into another language (will hear and translate out) (S420), utilize the TTS technology at last, the language conversion that translation is come out becomes voice output (the translation result opening is said) (S430).
Yet, in speech recognition technology, TTS technology and the machine translation mothod of correlation technique, have a problem, promptly because the problem of the large vocabulary of language and the flexible and changeable semantic analysis that grammer brought.And this problem seems particularly outstanding in the middle of the words of the whole sentence of mechanical translation.
Because the semantic huge complicacy of language, make the semantic analysis particular importance, to adopt the algorithm of optimization on the one hand, on the other hand, be defined as specific area in the language by reducing the language that to translate, can reduce the calculated amount of semantic analysis significantly, make the combination of above correlation technique become and realize easily.
Therefore, according to principle of the present invention,, the language that translate is defined in specific field in order to solve the problem of above-mentioned semantic analysis.
Preferably, for example, be directed to session and exchange, the language that translate can be defined in the works and expressions for everyday use translation, this day term translation.
Alternatively, if be directed to some sector application, for example the communications field also can be defined in the language that will translate the communication term.
By with upper type, can reduce the calculated amount of semantic analysis significantly, and accelerate translation speed significantly, and owing to the requirement that has reduced machine translation apparatus arithmetic capability and storage capacity, so strengthened striding the hardware platform performance significantly, for example not only PC can be applied to, and various embedded systems such as single-chip microcomputer can be applied to.
Alternatively, in order to improve portability, realize that better session anywhere or anytime exchanges, and can realize with 32 single-chip microcomputers of FLASH storer according to machine translation apparatus of the present invention.
Describe machine translation apparatus and machine translation method according to an embodiment of the invention in detail below in conjunction with accompanying drawing.
Fig. 4 is the block scheme that machine translation apparatus 500 according to an embodiment of the invention is shown.
Fig. 5 is the process flow diagram that machine translation method according to an embodiment of the invention is shown.
This machine translation method may further comprise the steps: voice messaging input step S610 is used to import the voice messaging of first language; Speech recognition steps S620 is used to discern the voice messaging of the first language of being imported, and converts thereof into text message; And translation steps S630, the text translation that is used for being changed becomes the text message of second language.
Fig. 6 is the block scheme that machine translation apparatus 700 according to an embodiment of the invention is shown.
As shown in Figure 6, machine translation apparatus 700 according to the present invention comprises: microphone 710 is used to import user's voice; Voice recognition unit 720 utilizes speech recognition technology to discern the voice of being imported, and converts thereof into text; Judging unit 730 is used to judge that the text of being changed is a word or works and expressions for everyday use; Dictionary unit 740 is used for translation of words; Works and expressions for everyday use translation unit 750 is used to translate works and expressions for everyday use; Display screen 760 is used to show translation result; TTS unit 770 is used for converting translation result to voice messaging; Loudspeaker 780 is used to export voice messaging; And keyboard 790, the text message that is used to import the text message of first language and revises the second language of being changed.
Alternatively, this machine translation apparatus can be realized with PC, also can realize with embedded system.It should be noted that machine translation apparatus according to the present invention is not limited to specific hardware platform.
Alternatively, this machine translation apparatus can be used for Sino-British works and expressions for everyday use paginal translation, Sino-Japan works and expressions for everyday use paginal translation etc.It should be noted that machine translation apparatus according to the present invention is not limited to one or more language.
Alternatively, this machine translation apparatus can be used for works and expressions for everyday use translation, the translation of industry term etc.It should be noted that machine translation apparatus according to the present invention is not limited to the term of specific area.
Below the operating process of machine translation apparatus 700 according to another embodiment of the invention will be described according to Fig. 7.
Fig. 7 is the process flow diagram that machine translation apparatus 700 according to an embodiment of the invention is shown.
In S810, the user speaks to the microphone of machine translation apparatus with predetermined a kind of language, i.e. input needs the voice messaging of translation; In S820, utilize speech recognition technology to discern the voice of being imported, and convert thereof into text; In S830, judge that the text of being changed is a word or works and expressions for everyday use; If word then forwards S840 to, in S840, translation of words; If works and expressions for everyday use then forward S850 to, in S850, the translation works and expressions for everyday use; In S860, show translation result; In S870, convert translation result to voice messaging; In S880, export voice messaging to the user; And in S880, directly input text information can be used for importing the text message of first language and the text message of the second language that modification is changed.
By machine translation apparatus of the present invention, an one advantage is not only can carry out mechanical translation, and can directly understand people's word, no longer needs text input manually.
According to machine translation apparatus of the present invention, its another advantage be can be enough embedded system device realize, thereby be easy to carry and with low cost.
By machine translation apparatus of the present invention, its another advantage is and the result of mechanical translation directly can also be said.
By machine translation apparatus of the present invention, its another advantage is not only can translation of words, can also translate the whole sentence of specific area, for example works and expressions for everyday use or industry term.
By machine translation apparatus of the present invention, realized that people in the foreign language session of reality, can understand people's word, and will hear and to say by opening after translating out.Generally speaking, carry such machine translation apparatus just as being equipped with a translator with oneself.
The above is the preferred embodiments of the present invention only, is not limited to the present invention, and for a person skilled in the art, the present invention can have various changes and variation.Within the spirit and principles in the present invention all, any modification of being done, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.
Claims (14)
1. a machine translation apparatus is characterized in that, comprising:
The voice messaging input block is used to import the voice messaging of first language;
Voice recognition unit is used to discern the voice messaging of the first language of being imported, and converts thereof into text message; And
Translation unit is used for the text message of being changed is translated into the text message of second language.
2. machine translation apparatus according to claim 1 is characterized in that, also comprises:
The text voice converting unit, the text message that is used for the second language that will be translated converts voice messaging to; And
The voice messaging output unit is used for the voice messaging output that will be changed.
3. machine translation apparatus according to claim 1 is characterized in that, also comprises:
Judging unit is used to judge that the text message of being changed is word or sentence; And
The dictionary unit is used for translation of words; Wherein,
When the text of changing when described judgment unit judges is word, described word translation is become the text message of second language by described dictionary unit; And
When the text of changing when described judgment unit judges is sentence, described sentence translation is become the text message of second language by described translation unit.
4. according to each described machine translation apparatus in the claim 1 to 3, it is characterized in that, also comprise:
The text message output unit is used to export the text message of text message of being changed and the second language of being translated; And
The text message input block, the text message that is used to import the text message of first language and revises the second language of being changed.
5. according to each described machine translation apparatus in the claim 1 to 4, it is characterized in that, realize with computing machine.
6. according to each described machine translation apparatus in the claim 1 to 4, it is characterized in that, realize with embedded system device.
7. according to each described machine translation apparatus in the claim 1 to 4, it is characterized in that described first language is defined as the works and expressions for everyday use in a kind of existing language.
8. according to each described machine translation apparatus in the claim 1 to 4, it is characterized in that described first language is defined as the industry term in a kind of existing language.
9. a machine translation method is characterized in that, may further comprise the steps:
The voice messaging input step is used to import the voice messaging of first language;
Speech recognition steps is used to discern the voice messaging of the first language of being imported, and converts thereof into text message; And
Translation steps, the text translation that is used for being changed becomes the text message of second language.
10. method according to claim 9 is characterized in that, and is further comprising the steps of:
The text voice switch process, the text message that is used for the second language that will be translated converts voice messaging to; And
Voice messaging output step is used for the voice messaging output that will be changed.
11. method according to claim 9 is characterized in that, and is further comprising the steps of:
Determining step is used to judge that the text message of being changed is word or sentence; And
The word translation step is used for translation of words, wherein
When judging that the text message of being changed is word, described word translation is become the text message of second language by described word translation step; And
When judging that the text message of being changed is sentence, described sentence translation is become the text message of second language by described translation steps.
12. according to each described method in the claim 9 to 11, it is characterized in that, further comprising the steps of:
Text message output step is used to export the text message of text message of being changed and the second language of being translated; And
The text message input step is used to import the text message of first language and revises the text message of being changed.
13., it is characterized in that described scheduled instruction is defined as the works and expressions for everyday use in a kind of existing language according to each described method in the claim 9 to 12.
14., it is characterized in that described scheduled instruction is defined as the industry term in a kind of existing language according to each described method in the claim 9 to 12.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA200610002940XA CN101008942A (en) | 2006-01-25 | 2006-01-25 | Machine translation device and method thereof |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA200610002940XA CN101008942A (en) | 2006-01-25 | 2006-01-25 | Machine translation device and method thereof |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101008942A true CN101008942A (en) | 2007-08-01 |
Family
ID=38697376
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA200610002940XA Pending CN101008942A (en) | 2006-01-25 | 2006-01-25 | Machine translation device and method thereof |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101008942A (en) |
Cited By (69)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102239517A (en) * | 2009-01-28 | 2011-11-09 | 三菱电机株式会社 | Speech recognition device |
CN102237083A (en) * | 2010-04-23 | 2011-11-09 | 广东外语外贸大学 | Portable interpretation system based on WinCE platform and language recognition method thereof |
CN102467908A (en) * | 2010-11-17 | 2012-05-23 | 英业达股份有限公司 | Multilingual voice control system and method thereof |
CN102650960A (en) * | 2012-03-31 | 2012-08-29 | 奇智软件(北京)有限公司 | Method and device for eliminating faults of terminal equipment |
CN101923858B (en) * | 2009-06-17 | 2012-11-21 | 劳英杰 | Real-time and synchronous mutual translation voice terminal |
CN103514153A (en) * | 2012-06-29 | 2014-01-15 | 株式会社东芝 | Speech translation apparatus, method and program |
CN103838714A (en) * | 2012-11-22 | 2014-06-04 | 北大方正集团有限公司 | Method and device for converting voice information |
CN104049960A (en) * | 2013-03-16 | 2014-09-17 | 上海能感物联网有限公司 | Method for remotely controlling computer program operation through foreign-language voice |
CN104380284A (en) * | 2012-03-06 | 2015-02-25 | 苹果公司 | Handling speech synthesis of content for multiple languages |
CN104679733A (en) * | 2013-11-26 | 2015-06-03 | 中国移动通信集团公司 | Voice conversation translation method, device and system |
CN104899192A (en) * | 2014-03-07 | 2015-09-09 | 韩国电子通信研究院 | Apparatus and method for automatic interpretation |
CN106156012A (en) * | 2016-06-28 | 2016-11-23 | 乐视控股(北京)有限公司 | A kind of method for generating captions and device |
CN106649295A (en) * | 2017-01-04 | 2017-05-10 | 携程旅游网络技术(上海)有限公司 | Text translation method for mobile terminal |
US9668024B2 (en) | 2014-06-30 | 2017-05-30 | Apple Inc. | Intelligent automated assistant for TV user interactions |
CN106919559A (en) * | 2015-12-25 | 2017-07-04 | 松下知识产权经营株式会社 | Machine translation method and machine translation system |
US9865248B2 (en) | 2008-04-05 | 2018-01-09 | Apple Inc. | Intelligent text-to-speech conversion |
CN107704456A (en) * | 2016-08-09 | 2018-02-16 | 松下知识产权经营株式会社 | Identify control method and identification control device |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9966060B2 (en) | 2013-06-07 | 2018-05-08 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
US9971774B2 (en) | 2012-09-19 | 2018-05-15 | Apple Inc. | Voice-based media searching |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US9986419B2 (en) | 2014-09-30 | 2018-05-29 | Apple Inc. | Social reminders |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10079014B2 (en) | 2012-06-08 | 2018-09-18 | Apple Inc. | Name recognition system |
US10083690B2 (en) | 2014-05-30 | 2018-09-25 | Apple Inc. | Better resolution when referencing to concepts |
CN108595443A (en) * | 2018-03-30 | 2018-09-28 | 浙江吉利控股集团有限公司 | Simultaneous interpreting method, device, intelligent vehicle mounted terminal and storage medium |
US10089072B2 (en) | 2016-06-11 | 2018-10-02 | Apple Inc. | Intelligent device arbitration and control |
US10169329B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Exemplar-based natural language processing |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
CN109448698A (en) * | 2018-10-17 | 2019-03-08 | 深圳壹账通智能科技有限公司 | Simultaneous interpretation method, apparatus, computer equipment and storage medium |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10269345B2 (en) | 2016-06-11 | 2019-04-23 | Apple Inc. | Intelligent task discovery |
US10283110B2 (en) | 2009-07-02 | 2019-05-07 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US10297253B2 (en) | 2016-06-11 | 2019-05-21 | Apple Inc. | Application integration with a digital assistant |
US10318871B2 (en) | 2005-09-08 | 2019-06-11 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US10332518B2 (en) | 2017-05-09 | 2019-06-25 | Apple Inc. | User interface for correcting recognition errors |
US10356243B2 (en) | 2015-06-05 | 2019-07-16 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10354011B2 (en) | 2016-06-09 | 2019-07-16 | Apple Inc. | Intelligent automated assistant in a home environment |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US10410637B2 (en) | 2017-05-12 | 2019-09-10 | Apple Inc. | User-specific acoustic models |
CN110309289A (en) * | 2019-08-23 | 2019-10-08 | 深圳市优必选科技股份有限公司 | A kind of sentence generation method, sentence generation device and smart machine |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US10482874B2 (en) | 2017-05-15 | 2019-11-19 | Apple Inc. | Hierarchical belief states for digital assistants |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10521466B2 (en) | 2016-06-11 | 2019-12-31 | Apple Inc. | Data driven natural language event detection and classification |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
WO2020048143A1 (en) * | 2018-09-05 | 2020-03-12 | 满金坝(深圳)科技有限公司 | Machine learning-based simultaneous interpretation method and device |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10706841B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Task flow identification based on user intent |
US10733993B2 (en) | 2016-06-10 | 2020-08-04 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US10755703B2 (en) | 2017-05-11 | 2020-08-25 | Apple Inc. | Offline personal assistant |
US10789945B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Low-latency intelligent automated assistant |
US10791176B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Synchronization and task delegation of a digital assistant |
US10795541B2 (en) | 2009-06-05 | 2020-10-06 | Apple Inc. | Intelligent organization of tasks items |
US10810274B2 (en) | 2017-05-15 | 2020-10-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
CN112055876A (en) * | 2018-04-27 | 2020-12-08 | 语享路有限责任公司 | Multi-party dialogue recording/outputting method using voice recognition technology and apparatus therefor |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US11080012B2 (en) | 2009-06-05 | 2021-08-03 | Apple Inc. | Interface for a virtual digital assistant |
US11217255B2 (en) | 2017-05-16 | 2022-01-04 | Apple Inc. | Far-field extension for digital assistant services |
US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
-
2006
- 2006-01-25 CN CNA200610002940XA patent/CN101008942A/en active Pending
Cited By (86)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10318871B2 (en) | 2005-09-08 | 2019-06-11 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US9865248B2 (en) | 2008-04-05 | 2018-01-09 | Apple Inc. | Intelligent text-to-speech conversion |
CN102239517A (en) * | 2009-01-28 | 2011-11-09 | 三菱电机株式会社 | Speech recognition device |
CN102239517B (en) * | 2009-01-28 | 2013-05-08 | 三菱电机株式会社 | Speech recognition device |
US10795541B2 (en) | 2009-06-05 | 2020-10-06 | Apple Inc. | Intelligent organization of tasks items |
US11080012B2 (en) | 2009-06-05 | 2021-08-03 | Apple Inc. | Interface for a virtual digital assistant |
CN101923858B (en) * | 2009-06-17 | 2012-11-21 | 劳英杰 | Real-time and synchronous mutual translation voice terminal |
US10283110B2 (en) | 2009-07-02 | 2019-05-07 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US11423886B2 (en) | 2010-01-18 | 2022-08-23 | Apple Inc. | Task flow identification based on user intent |
US10706841B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Task flow identification based on user intent |
CN102237083A (en) * | 2010-04-23 | 2011-11-09 | 广东外语外贸大学 | Portable interpretation system based on WinCE platform and language recognition method thereof |
CN102467908A (en) * | 2010-11-17 | 2012-05-23 | 英业达股份有限公司 | Multilingual voice control system and method thereof |
CN102467908B (en) * | 2010-11-17 | 2016-01-06 | 英业达股份有限公司 | Multilingual voice control system and method thereof |
CN104380284A (en) * | 2012-03-06 | 2015-02-25 | 苹果公司 | Handling speech synthesis of content for multiple languages |
CN104380284B (en) * | 2012-03-06 | 2018-01-30 | 苹果公司 | For the phonetic synthesis of multilingual process content |
CN102650960B (en) * | 2012-03-31 | 2015-04-15 | 北京奇虎科技有限公司 | Method and device for eliminating faults of terminal equipment |
CN102650960A (en) * | 2012-03-31 | 2012-08-29 | 奇智软件(北京)有限公司 | Method and device for eliminating faults of terminal equipment |
US10079014B2 (en) | 2012-06-08 | 2018-09-18 | Apple Inc. | Name recognition system |
CN103514153A (en) * | 2012-06-29 | 2014-01-15 | 株式会社东芝 | Speech translation apparatus, method and program |
US9971774B2 (en) | 2012-09-19 | 2018-05-15 | Apple Inc. | Voice-based media searching |
CN103838714A (en) * | 2012-11-22 | 2014-06-04 | 北大方正集团有限公司 | Method and device for converting voice information |
CN104049960A (en) * | 2013-03-16 | 2014-09-17 | 上海能感物联网有限公司 | Method for remotely controlling computer program operation through foreign-language voice |
US9966060B2 (en) | 2013-06-07 | 2018-05-08 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
CN104679733A (en) * | 2013-11-26 | 2015-06-03 | 中国移动通信集团公司 | Voice conversation translation method, device and system |
CN104679733B (en) * | 2013-11-26 | 2018-02-23 | 中国移动通信集团公司 | A kind of voice dialogue interpretation method, apparatus and system |
CN104899192B (en) * | 2014-03-07 | 2018-03-27 | 韩国电子通信研究院 | For the apparatus and method interpreted automatically |
CN104899192A (en) * | 2014-03-07 | 2015-09-09 | 韩国电子通信研究院 | Apparatus and method for automatic interpretation |
US10169329B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Exemplar-based natural language processing |
US10083690B2 (en) | 2014-05-30 | 2018-09-25 | Apple Inc. | Better resolution when referencing to concepts |
US10904611B2 (en) | 2014-06-30 | 2021-01-26 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9668024B2 (en) | 2014-06-30 | 2017-05-30 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9986419B2 (en) | 2014-09-30 | 2018-05-29 | Apple Inc. | Social reminders |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US10356243B2 (en) | 2015-06-05 | 2019-07-16 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US11500672B2 (en) | 2015-09-08 | 2022-11-15 | Apple Inc. | Distributed personal assistant |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US11526368B2 (en) | 2015-11-06 | 2022-12-13 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
CN106919559A (en) * | 2015-12-25 | 2017-07-04 | 松下知识产权经营株式会社 | Machine translation method and machine translation system |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
US11069347B2 (en) | 2016-06-08 | 2021-07-20 | Apple Inc. | Intelligent automated assistant for media exploration |
US10354011B2 (en) | 2016-06-09 | 2019-07-16 | Apple Inc. | Intelligent automated assistant in a home environment |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10733993B2 (en) | 2016-06-10 | 2020-08-04 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US11037565B2 (en) | 2016-06-10 | 2021-06-15 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10297253B2 (en) | 2016-06-11 | 2019-05-21 | Apple Inc. | Application integration with a digital assistant |
US10089072B2 (en) | 2016-06-11 | 2018-10-02 | Apple Inc. | Intelligent device arbitration and control |
US10269345B2 (en) | 2016-06-11 | 2019-04-23 | Apple Inc. | Intelligent task discovery |
US10521466B2 (en) | 2016-06-11 | 2019-12-31 | Apple Inc. | Data driven natural language event detection and classification |
US11152002B2 (en) | 2016-06-11 | 2021-10-19 | Apple Inc. | Application integration with a digital assistant |
CN106156012A (en) * | 2016-06-28 | 2016-11-23 | 乐视控股(北京)有限公司 | A kind of method for generating captions and device |
CN107704456A (en) * | 2016-08-09 | 2018-02-16 | 松下知识产权经营株式会社 | Identify control method and identification control device |
CN107704456B (en) * | 2016-08-09 | 2023-08-29 | 松下知识产权经营株式会社 | Identification control method and identification control device |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US10553215B2 (en) | 2016-09-23 | 2020-02-04 | Apple Inc. | Intelligent automated assistant |
US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
CN106649295A (en) * | 2017-01-04 | 2017-05-10 | 携程旅游网络技术(上海)有限公司 | Text translation method for mobile terminal |
US10332518B2 (en) | 2017-05-09 | 2019-06-25 | Apple Inc. | User interface for correcting recognition errors |
US10755703B2 (en) | 2017-05-11 | 2020-08-25 | Apple Inc. | Offline personal assistant |
US10410637B2 (en) | 2017-05-12 | 2019-09-10 | Apple Inc. | User-specific acoustic models |
US10789945B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Low-latency intelligent automated assistant |
US10791176B2 (en) | 2017-05-12 | 2020-09-29 | Apple Inc. | Synchronization and task delegation of a digital assistant |
US11405466B2 (en) | 2017-05-12 | 2022-08-02 | Apple Inc. | Synchronization and task delegation of a digital assistant |
US10810274B2 (en) | 2017-05-15 | 2020-10-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
US10482874B2 (en) | 2017-05-15 | 2019-11-19 | Apple Inc. | Hierarchical belief states for digital assistants |
US11217255B2 (en) | 2017-05-16 | 2022-01-04 | Apple Inc. | Far-field extension for digital assistant services |
CN108595443A (en) * | 2018-03-30 | 2018-09-28 | 浙江吉利控股集团有限公司 | Simultaneous interpreting method, device, intelligent vehicle mounted terminal and storage medium |
CN112055876A (en) * | 2018-04-27 | 2020-12-08 | 语享路有限责任公司 | Multi-party dialogue recording/outputting method using voice recognition technology and apparatus therefor |
WO2020048143A1 (en) * | 2018-09-05 | 2020-03-12 | 满金坝(深圳)科技有限公司 | Machine learning-based simultaneous interpretation method and device |
CN109448698A (en) * | 2018-10-17 | 2019-03-08 | 深圳壹账通智能科技有限公司 | Simultaneous interpretation method, apparatus, computer equipment and storage medium |
CN110309289A (en) * | 2019-08-23 | 2019-10-08 | 深圳市优必选科技股份有限公司 | A kind of sentence generation method, sentence generation device and smart machine |
CN110309289B (en) * | 2019-08-23 | 2019-12-06 | 深圳市优必选科技股份有限公司 | Sentence generation method, sentence generation device and intelligent equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101008942A (en) | Machine translation device and method thereof | |
EP1089193A2 (en) | Translating apparatus and method, and recording medium used therewith | |
Neto et al. | Free tools and resources for Brazilian Portuguese speech recognition | |
CN111243599B (en) | Speech recognition model construction method, device, medium and electronic equipment | |
TW201517015A (en) | Method for building acoustic model, speech recognition method and electronic apparatus | |
US9026430B2 (en) | Electronic device and natural language analysis method thereof | |
El Ouahabi et al. | Toward an automatic speech recognition system for amazigh-tarifit language | |
Hämäläinen et al. | Multilingual speech recognition for the elderly: The AALFred personal life assistant | |
CN112489634A (en) | Language acoustic model training method and device, electronic equipment and computer medium | |
CN110852075B (en) | Voice transcription method and device capable of automatically adding punctuation marks and readable storage medium | |
JP2020060642A (en) | Speech synthesis system and speech synthesizer | |
Ronzhin et al. | Survey of russian speech recognition systems | |
CN100380442C (en) | System and method for mandarin chinese speech recogniton using an optimized phone set | |
Gilbert et al. | Intelligent virtual agents for contact center automation | |
Liu et al. | A maximum entropy based hierarchical model for automatic prosodic boundary labeling in mandarin | |
CN115019787A (en) | Interactive homophonic and heteronym word disambiguation method, system, electronic equipment and storage medium | |
JP2001117752A (en) | Information processor, information processing method and recording medium | |
RU80603U1 (en) | ELECTRONIC TRANSMISSION SYSTEM WITH THE FUNCTION OF SYNCHRONOUS TRANSLATION OF ORAL SPEECH FROM ONE LANGUAGE TO ANOTHER | |
Srun et al. | Development of speech recognition system based on cmusphinx for khmer language | |
Raheem et al. | Real-time speech recognition of arabic language | |
Hanane et al. | TTS-SA (A text-to-speech system based on standard arabic) | |
JP7012935B1 (en) | Programs, information processing equipment, methods | |
CN113515952B (en) | Combined modeling method, system and equipment for Mongolian dialogue model | |
Deng | Research on Online English Speech Interactive Recognition System Based on Nose Algorithm | |
Hassana et al. | Text to Speech Synthesis System in Yoruba Language |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |
Open date: 20070801 |