CN104199813A - Pseudo-feedback-based personalized machine translation system and method - Google Patents
Pseudo-feedback-based personalized machine translation system and method Download PDFInfo
- Publication number
- CN104199813A CN104199813A CN201410491100.9A CN201410491100A CN104199813A CN 104199813 A CN104199813 A CN 104199813A CN 201410491100 A CN201410491100 A CN 201410491100A CN 104199813 A CN104199813 A CN 104199813A
- Authority
- CN
- China
- Prior art keywords
- translation
- model
- postedit
- result
- preliminary
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Machine Translation (AREA)
Abstract
The invention relates to a pseudo-feedback-based personalized machine translation system and method. The existing traditional machine translation methods are unavailable for the obtaining of high-quality personalized translation systems, and the various translation demands of users cannot be met. The pseudo-feedback-based personalized machine translation system comprises a phrase table filter module, an input module, an initial translation module, a pseudo-feedback search module, a phrase table sorting module and a decoder module. The pseudo-feedback-based personalized machine translation method includes the steps: an inputting step, a user inputs a translation task S; an initial translation step, an initial machine translation result T' of the translation task is obtained with the initial translation module; a pseudo-feedback search step, the pseudo-feedback search module is used to search to obtain initial translation results and standard translations R of similar translation instances; a phrase table sorting step, a trained universal post-editing model is turned into a personalized post-editing model, and filtering is performed to obtain an optimized post-editing model; a decoder module decoding step, the optimized personalized post-editing model is used to decode the initial machine translation result T' of the translation task so as to obtain an optimal final translation result. The pseudo-feedback-based personalized machine translation system and method is applicable to the field of machine translation.
Description
Technical field
The present invention relates to a kind of personalization machine translation system and method, belong to mechanical translation field.
Background technology
Developing rapidly along with machine translation mothod in recent years, the quality of its translation has had lifting to a great extent, and the obstacle that some general translation on line services can help people to break through language at present removes to read and understand texts across language that some are commonly used.Yet further the quality of hoisting machine translation has but run into very large difficulty.On the one hand, because existing statistical machine translation technology major defect is, if complete personalized translation, needs a large amount of field feedbacks, and in these data, add up and train modeling, realize the Machine Translation Model of property one by one.And these acquisitions of training required field feedback are very difficult, and existing method cannot effectively utilize these feedback informations, thereby cannot obtain high-quality personalized translation system.Although can utilize field feedback by traditional postedit, due to can user's data less, cause the advantage of adding up postedit model to be difficult to bring into play.On the other hand, the optimization aim of traditional machine translation method is normally based on open field, rather than is directed to that specific translation duties carries out.Although have the research for domain-adaptive problem, but still belong to for professional colony, and in the face of extensively various mechanical translation user especially internet online user, cannot meet the various translate requirements of user.So further improve the quality of mechanical translation, be that we want a technical matters urgently to be resolved hurrily.
Summary of the invention
The object of the invention is cannot obtain high-quality personalized translation system in order to solve traditional machine translation method, cause the problem that can not meet the various translate requirements of user, and propose a kind of personalization machine translation system and interpretation method based on puppet feedback that can improve mechanical translation quality.
A personalization machine translation system based on puppet feedback, described translation system comprises:
For the phrase table filtering module that each general postedit model phrase table of exploitation collection data is filtered;
For obtaining the load module of the translation duties S of user's input;
For user is inputted, translate after translation duties S and obtain the preliminary mechanical translation result T ' of translation duties, the preliminary translation module that obtains the preliminary translation of the sentence T of translation instance translated in the source language sentence in the translation instance storehouse that local system is provided;
For in the translation instance storehouse of local system word alignment form, retrieve the pseudo-feedback searching module of the preliminary translation result and the standard translation translation R that obtain similar translation example;
For being classified, the phrase table of postedit model after training obtains the phrase table sort module of personalized postedit model;
The preliminary mechanical translation result that is used for pseudo-feedback searching module retrieval to obtain is decoded, and obtains the decoder module of final translation result.
A kind of personalization machine interpretation method based on puppet feedback, before user inputs translation duties S, utilize preliminary mechanical translation sentence T and the standard translation translation R of the translation instance in translation memory to adopt statistical method to train general postedit model, complete the training process of general postedit model; Described personalization machine interpretation method is realized by following steps:
Step 1, phrase table filters mold process: utilize phrase table filtering module to filter each general postedit model phrase table of exploitation collection data;
Utilize the result after filtering to adopt default-weight to each sentence D in exploitation collection data
idecode, produce n-best translation result; Then, n-best translation result is carried out to combination; Finally, the whole ginseng of adjusting of n-best translation result after using MERT instrument to combination, can also realization character parameter optimisation procedure;
Step 2, input process: user inputs to load module by translation duties S;
Step 3, preliminary translation process: described preliminary translation process comprises that user inputs translation duties S before and user inputs translation duties S two parts afterwards;
Before user inputs translation duties S, the transcription platform that utilizes the machine translation system of local system to build, the source language sentence in the translation instance storehouse that local system is provided is tentatively translated, and obtains the preliminary translation of the sentence T of translation instance;
Meanwhile, obtain the translation duties S of user's input by load module after, utilize preliminary translation module translation to obtain the preliminary mechanical translation result T ' of translation duties;
Step 4, pseudo-feedback searching process: according to the preliminary mechanical translation sentence of the translation instance obtaining in step 3 T, in the translation instance storehouse of local word alignment form, utilize pseudo-feedback searching module to carry out the retrieval of cosine similarity with source language word bag model, obtain preliminary translation result and the standard translation translation R of similar translation example, and select the most similar front 900-1100 from the preliminary translation result of similar translation example and the result for retrieval of standard translation translation R;
Wherein, described cosine similarity CS calculates according to take the vector space model that source language word bag model is unit, and the computing method of described cosine similarity CS are:
Wherein, Vec (S
example) be the source language sentence subvector of translation instance, Vec (S
input) be translation duties vector, Vec (S
input) Vec (S
example) be two vectorial inner products, || || be vectorial norm;
Step 5, phrase table assorting process: the preliminary translation result of front 900-1100 the most similar similar translation example of selecting according to step 4 and standard translation translation R, utilize phrase table sort module that the phrase table of the general postedit model after training is categorized as to the passive phrase that contributes to promote the positive phrase of translation quality and final translation result is incorporated to noise, make the general postedit model after training become personalized postedit model, again by the preliminary translation result of the similar translation example that in the positive phrase in personalized postedit model and passive phrase and step 4, pseudo-feedback searching procedural retrieval goes out and standard translation translation R contrast, described passive phrase is filtered out from personalized postedit model phrase table, thereby obtain the personalized postedit model of an optimization,
Step 6, decoder module decode procedure: using the personalized postedit model optimized in step 5 as translation model, utilize demoder to use traditional mechanical translation coding/decoding method to decode to the preliminary mechanical translation result T ' of the translation duties of step 3 acquisition, obtain the final translation result of goodization.
Beneficial effect of the present invention is: the present invention utilizes pseudo-feedback searching module to retrieve carrying out similar translation example in translation instance storehouse, by phrase table sort module, general postedit phrase is classified again, filter out passive postedit phrase, select postedit rule and obtain the personalized postedit model of optimizing, thus the quality of hoisting machine translation.In addition, application characteristic parameter optimisation procedure when building postedit model in preliminary translation process, and in characteristic parameter optimizing process, for given exploitation collection data, carrying out to input decoded respectively, then carry out integral body and adjust ginseng, there is the benefit of effective Optimal Parameters, elevator system performance.Especially, utilizing pseudo-feedback searching module in the concentrated retrieving of local translation instance database data, obtain the parallel statement similar with the preliminary translation result of sentence to be translated of being inputted acquisition by user to replacing feedback information, thereby solved, be difficult to obtain this problem of field feedback.
In addition, the inventive method has well been utilized feedback information, on initial translation model, set up postedit model effectively, the translation result that the personalization machine translation system of feeding back based on puppet of the present invention and method obtain and the translation result of Google contrast, and its translation quality has improved 19.5%; The translation result of the machine translation system training with Moses instrument contrasts, and its translation quality has improved 14.1%
Accompanying drawing explanation
Fig. 1 is translation flow schematic diagram of the present invention.
Embodiment
Embodiment one:
The personalization machine translation system based on puppet feedback of present embodiment, described translation system comprises:
For the phrase table filtering module that each general postedit model phrase table of exploitation collection data is filtered;
For obtaining the load module of the translation duties S of user's input;
For user is inputted, translate after translation duties S and obtain the preliminary mechanical translation result T ' of translation duties, the preliminary translation module that obtains the preliminary translation of the sentence T of translation instance translated in the source language sentence in the translation instance storehouse that local system is provided;
For in the translation instance storehouse of local system word alignment form, retrieve the pseudo-feedback searching module of the preliminary translation result and the standard translation translation R that obtain similar translation example;
For being classified, the phrase table of postedit model after training obtains the phrase table sort module of personalized postedit model;
The preliminary mechanical translation result that is used for pseudo-feedback searching module retrieval to obtain is decoded, and obtains the decoder module of final translation result.
Embodiment two:
Different from embodiment one, the personalization machine translation system based on puppet feedback described in present embodiment, described phrase table filtering module is contained in described phrase table sort module.
Embodiment three:
The personalization machine interpretation method based on puppet feedback of present embodiment, before user inputs translation duties S, utilize preliminary mechanical translation sentence T and the standard translation translation R of the translation instance in translation memory to adopt statistical method to train general postedit model, complete the training process of general postedit model; Described personalization machine interpretation method is realized by following steps:
Step 1, phrase table filters mold process: utilize phrase table filtering module to filter each general postedit model phrase table of exploitation collection data;
Utilize the result after filtering to adopt default-weight to each sentence D in exploitation collection data
idecode, produce n-best translation result; Then, n-best translation result is carried out to combination; Finally, the whole ginseng of adjusting of n-best translation result after using MERT instrument to combination, can also realization character parameter optimisation procedure;
Step 2, input process: user inputs to load module by translation duties S;
Step 3, preliminary translation process: described preliminary translation process comprises that user inputs translation duties S before and user inputs translation duties S two parts afterwards;
Before user inputs translation duties S, the transcription platform that utilizes the machine translation system of local system to build, the source language sentence in the translation instance storehouse that local system is provided is tentatively translated, and obtains the preliminary translation of the sentence T of translation instance;
Meanwhile, obtain the translation duties S of user's input by load module after, utilize preliminary translation module translation to obtain the preliminary mechanical translation result T ' of translation duties;
Step 4, pseudo-feedback searching process: according to the preliminary mechanical translation sentence of the translation instance obtaining in step 3 T, in the translation instance storehouse of local word alignment form, utilize pseudo-feedback searching module to carry out the retrieval of cosine similarity with source language word bag model, obtain preliminary translation result and the standard translation translation R of similar translation example, and select the most similar front 900-1100 from the preliminary translation result of similar translation example and the result for retrieval of standard translation translation R;
Wherein, described cosine similarity CS calculates according to take the vector space model that source language word bag model is unit, and the computing method of described cosine similarity CS are:
Wherein, Vec (S
example) be the source language sentence subvector of translation instance, Vec (S
input) be translation duties vector, Vec (S
input) Vec (S
example) be two vectorial inner products, || || be vectorial norm;
Step 5, phrase table assorting process: the preliminary translation result of front 900-1100 the most similar similar translation example of selecting according to step 4 and standard translation translation R, utilize phrase table sort module that the phrase table of the general postedit model after training is categorized as to the passive phrase that contributes to promote the positive phrase of translation quality and final translation result is incorporated to noise, make the general postedit model after training become personalized postedit model, again by the preliminary translation result of the similar translation example that in the positive phrase in personalized postedit model and passive phrase and step 4, pseudo-feedback searching procedural retrieval goes out and standard translation translation R contrast, described passive phrase is filtered out from personalized postedit model phrase table, thereby obtain the personalized postedit model of an optimization,
Step 6, decoder module decode procedure: using the personalized postedit model optimized in step 5 as translation model, utilize demoder to use traditional mechanical translation coding/decoding method to decode to the preliminary mechanical translation result T ' of the translation duties of step 3 acquisition, obtain the final translation result of goodization.
Embodiment four:
Different from embodiment three, the personalization machine interpretation method based on puppet feedback described in present embodiment, decode procedure utilizes formula described in step 6:
the preliminary mechanical translation result T ' that processes translation duties obtains the final translation result of goodization; In formula, P (T " | T ') be the translation probability of general postedit model; P (S|T ", T ') in general postedit model, utilize phrase to (T "; T ') the preliminary mechanical translation sentence T ' of the translation duties of the translation duties S of given input is carried out to the probability of postedit model; defining its probable value is 1 or 0, then by following two kinds of methods, obtains the value of P (S|T ", T '):
1) phrase in the personalized postedit model of optimizing is to (P
t, P
r) in two phrases respectively with the preliminary mechanical translation result T ' of translation duties and standard translation translation R in while having at least one phrase to match, the probable value of P (S|T ", T ') gets 1, otherwise gets 0;
2) phrase in the personalized postedit model of optimizing is to (P
t, P
r) in phrase P
rwhen having at least one phrase to match in standard translation translation R, the probable value of P (S|T ", T ') gets 1, otherwise gets 0.
Embodiment five:
Different from embodiment three or four is, personalization machine interpretation method based on puppet feedback described in present embodiment, while carrying out described in step 4 pseudo-feedback searching process, from the preliminary translation result of similar translation example and the result for retrieval of standard translation translation R, select the most similar first 1000.
Adopt the Olympics of IWSLT2012 as the translation duties of user's input, utilize personalization machine translation system and the method based on puppet feedback of this translation duties data test the present invention design, the training data that the translation duties of user's input provides is the spoken field of tourism, covered the concrete application scenario such as traffic under Olympic Games application background, food and drink, stadiums, commercial affairs, comprise altogether 52,603 pairs of Chinese-English bilingual sentences are right, be specially 495,638 Chinese words and 527,599 English words, the local translation instance of the personalization storehouse using it as user.Adopted and comprised the exploitation collection that 2, the 057 pairs of Chinese-English bilingual sentences are right and comprise 998 pairs of test sets that Chinese-English bilingual sentence is right; Preliminary translation module has been used Google's translation on line system, has crawled the translation result of above-mentioned language material from Google's translation on line system, and translation quality evaluation criterion adopts BLEU-4, by the test result obtaining directly and Google's translation result contrast.Meanwhile, the machine translation system that the Moses instrument that use is increased income trains, as second group of control test, contrasts.
With BLEU-4, must be divided into evaluation criterion, the translation result contrast of the personalization machine translation system based on puppet feedback of the present invention's design and the translation result that method obtains and Google's translation on line system, its translation quality has improved 19.5%; The translation result of the machine translation system training with Moses instrument contrasts, and its translation quality has improved 14.1%, and test result is as shown in table 1:
Table 1: the translation quality contrast of the personalized translation result based on puppet feedback and other system translation result.
Claims (5)
1. the personalization machine translation system based on puppet feedback, is characterized in that, described translation system comprises:
For the phrase table filtering module that each general postedit model phrase table of exploitation collection data is filtered;
For obtaining the load module of the translation duties S of user's input;
For user is inputted, translate after translation duties S and obtain the preliminary mechanical translation result T ' of translation duties, the preliminary translation module that obtains the preliminary translation of the sentence T of translation instance translated in the source language sentence in the translation instance storehouse that local system is provided;
For in the translation instance storehouse of local system word alignment form, retrieve the pseudo-feedback searching module of the preliminary translation result and the standard translation translation R that obtain similar translation example;
For being classified, the phrase table of postedit model after training obtains the phrase table sort module of personalized postedit model;
The preliminary mechanical translation result that is used for pseudo-feedback searching module retrieval to obtain is decoded, and obtains the decoder module of final translation result.
2. the personalization machine translation system of feeding back based on puppet according to claim 1, is characterized in that, described phrase table filtering module is contained in described phrase table sort module.
3. the personalization machine interpretation method of the personalization machine translation system based on above-mentioned pseudo-feedback, it is characterized in that: before user inputs translation duties S, utilize preliminary mechanical translation sentence T and the standard translation translation R of the translation instance in translation memory to adopt statistical method to train general postedit model, complete the training process of general postedit model; Described personalization machine interpretation method is realized by following steps:
Step 1, phrase table filter process: utilize phrase table filtering module to filter each general postedit model phrase table of exploitation collection data;
Utilize the result after filtering to adopt default-weight to each sentence D in exploitation collection data
idecode, produce n-best translation result; Then, n-best translation result is carried out to combination; Finally, the whole ginseng of adjusting of n-best translation result after using MERT instrument to combination, can also realization character parameter optimisation procedure;
Step 2, input process: user inputs to load module by translation duties S;
Step 3, preliminary translation process: described preliminary translation process comprises that user inputs translation duties S before and user inputs translation duties S two parts afterwards;
Before user inputs translation duties S, the transcription platform that utilizes the machine translation system of local system to build, the source language sentence in the translation instance storehouse that local system is provided is tentatively translated, and obtains the preliminary translation of the sentence T of translation instance;
Meanwhile, obtain the translation duties S of user's input by load module after, utilize preliminary translation module translation to obtain the preliminary mechanical translation result T ' of translation duties;
Step 4, pseudo-feedback searching process: according to the preliminary mechanical translation sentence of the translation instance obtaining in step 3 T, in the translation instance storehouse of local word alignment form, utilize pseudo-feedback searching module to carry out the retrieval of cosine similarity with source language word bag model, obtain preliminary translation result and the standard translation translation R of similar translation example, and select the most similar front 900-1100 from the preliminary translation result of similar translation example and the result for retrieval of standard translation translation R;
Wherein, described cosine similarity CS calculates according to take the vector space model that source language word bag model is unit, and the computing method of described cosine similarity CS are:
Wherein, Vec (S
example) be the source language sentence subvector of translation instance, Vec (S
input) be translation duties vector, Vec (S
input) Vec (S
example) be two vectorial inner products, || || be vectorial norm;
Step 5, phrase table assorting process: the preliminary translation result of front 900-1100 the most similar similar translation example of selecting according to step 4 and standard translation translation R, utilize phrase table sort module that the phrase table of the general postedit model after training is categorized as to the passive phrase that contributes to promote the positive phrase of translation quality and final translation result is incorporated to noise, make the general postedit model after training become personalized postedit model, again by the preliminary translation result of the similar translation example that in the positive phrase in personalized postedit model and passive phrase and step 4, pseudo-feedback searching procedural retrieval goes out and standard translation translation R contrast, described passive phrase is filtered out from personalized postedit model phrase table, thereby obtain the personalized postedit model of an optimization,
Step 6, decoder module decode procedure: using the personalized postedit model optimized in step 5 as translation model, utilize demoder to use traditional mechanical translation coding/decoding method to decode to the preliminary mechanical translation result T ' of the translation duties of step 3 acquisition, obtain the final translation result of goodization.
4. the personalization machine interpretation method feeding back based on puppet according to claim 3, is characterized in that: described in step 6, decode procedure utilizes formula:
the preliminary mechanical translation result T ' that processes translation duties obtains the final translation result of goodization; In formula, P (T " | T ') be the translation probability of general postedit model; P (S|T ", T ') in general postedit model, utilize phrase to (T "; T ') the preliminary mechanical translation sentence T ' of the translation duties of the translation duties S of given input is carried out to the probability of postedit model; defining its probable value is 1 or 0, then by following two kinds of methods, obtains the value of P (S|T ", T '):
1) phrase in the personalized postedit model of optimizing is to (P
t, P
r) in two phrases respectively with the preliminary mechanical translation result T ' of translation duties and standard translation translation R in while having at least one phrase to match, the probable value of P (S|T ", T ') gets 1, otherwise gets 0;
2) phrase in the personalized postedit model of optimizing is to (P
t, P
r) in phrase P
rwhen having at least one phrase to match in standard translation translation R, the probable value of P (S|T ", T ') gets 1, otherwise gets 0.
5. according to the personalization machine interpretation method based on puppet feedback described in claim 3 or 4, it is characterized in that: while carrying out described in step 4 pseudo-feedback searching process, from the preliminary translation result of similar translation example and the result for retrieval of standard translation translation R, select the most similar first 1000.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410491100.9A CN104199813B (en) | 2014-09-24 | 2014-09-24 | Pseudo-feedback-based personalized machine translation system and method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410491100.9A CN104199813B (en) | 2014-09-24 | 2014-09-24 | Pseudo-feedback-based personalized machine translation system and method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN104199813A true CN104199813A (en) | 2014-12-10 |
CN104199813B CN104199813B (en) | 2017-05-24 |
Family
ID=52085108
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410491100.9A Active CN104199813B (en) | 2014-09-24 | 2014-09-24 | Pseudo-feedback-based personalized machine translation system and method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104199813B (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107301173A (en) * | 2017-06-22 | 2017-10-27 | 北京理工大学 | A kind of automatic post-editing system and method for multi-source neutral net that mode is remixed based on splicing |
WO2020039807A1 (en) * | 2018-08-24 | 2020-02-27 | 株式会社Nttドコモ | Machine translation control device |
WO2020039808A1 (en) * | 2018-08-24 | 2020-02-27 | 株式会社Nttドコモ | Machine translation control device |
CN111274827A (en) * | 2020-01-20 | 2020-06-12 | 南京新一代人工智能研究院有限公司 | Suffix translation method based on multi-target learning of word bag |
CN113807106A (en) * | 2021-08-31 | 2021-12-17 | 北京百度网讯科技有限公司 | Translation model training method and device, electronic equipment and storage medium |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100138210A1 (en) * | 2008-12-02 | 2010-06-03 | Electronics And Telecommunications Research Institute | Post-editing apparatus and method for correcting translation errors |
CN103793368A (en) * | 2012-10-31 | 2014-05-14 | 上海勇金懿信息科技有限公司 | Method for automatically protecting marks in marking language in automatic translation processing |
-
2014
- 2014-09-24 CN CN201410491100.9A patent/CN104199813B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20100138210A1 (en) * | 2008-12-02 | 2010-06-03 | Electronics And Telecommunications Research Institute | Post-editing apparatus and method for correcting translation errors |
CN103793368A (en) * | 2012-10-31 | 2014-05-14 | 上海勇金懿信息科技有限公司 | Method for automatically protecting marks in marking language in automatic translation processing |
Non-Patent Citations (2)
Title |
---|
NISHTHA JAISWAL等: "A Step towards Human-Machine Unification Using Translation Memory and Machine Translation System", 《2011 INTERNATIONAL CONFERENCE ON LANGUAGES, LITERATURE AND LINGUISTICS》 * |
黄河燕等: "一种智能译后编辑器的设计及其实现算法", 《软件学报》 * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107301173A (en) * | 2017-06-22 | 2017-10-27 | 北京理工大学 | A kind of automatic post-editing system and method for multi-source neutral net that mode is remixed based on splicing |
CN107301173B (en) * | 2017-06-22 | 2019-10-25 | 北京理工大学 | A kind of automatic post-editing system and method for multi-source neural network remixing mode based on splicing |
WO2020039807A1 (en) * | 2018-08-24 | 2020-02-27 | 株式会社Nttドコモ | Machine translation control device |
WO2020039808A1 (en) * | 2018-08-24 | 2020-02-27 | 株式会社Nttドコモ | Machine translation control device |
CN111274827A (en) * | 2020-01-20 | 2020-06-12 | 南京新一代人工智能研究院有限公司 | Suffix translation method based on multi-target learning of word bag |
CN113807106A (en) * | 2021-08-31 | 2021-12-17 | 北京百度网讯科技有限公司 | Translation model training method and device, electronic equipment and storage medium |
CN113807106B (en) * | 2021-08-31 | 2023-03-07 | 北京百度网讯科技有限公司 | Translation model training method and device, electronic equipment and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN104199813B (en) | 2017-05-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Cussens | Part-of-speech tagging using Progol | |
CN102169495B (en) | Industry dictionary generating method and device | |
CN108363704A (en) | A kind of neural network machine translation corpus expansion method based on statistics phrase table | |
CN108460027A (en) | A kind of spoken language instant translation method and system | |
CN104199813A (en) | Pseudo-feedback-based personalized machine translation system and method | |
Li et al. | Text compression-aided transformer encoding | |
CN110276071A (en) | A kind of text matching technique, device, computer equipment and storage medium | |
CN104268133B (en) | machine translation method and system | |
Paraschiv et al. | UPB at GermEval-2019 Task 2: BERT-Based Offensive Language Classification of German Tweets. | |
CN109408806A (en) | A kind of Event Distillation method based on English grammar rule | |
Agić et al. | Baselines and test data for cross-lingual inference | |
CN111814451A (en) | Text processing method, device, equipment and storage medium | |
CN110502759B (en) | Method for processing Chinese-Yue hybrid network neural machine translation out-of-set words fused into classification dictionary | |
Kang | Spoken language to sign language translation system based on HamNoSys | |
CN106156013A (en) | The two-part machine translation method that a kind of regular collocation type phrase is preferential | |
Lo et al. | Cool English: A grammatical error correction system based on large learner corpora | |
Abid | The SADID evaluation datasets for low-resource spoken language machine translation of Arabic dialects | |
CN111597827B (en) | Method and device for improving accuracy of machine translation | |
Tran et al. | A classifier-based preordering approach for english-vietnamese statistical machine translation | |
Dinarelli et al. | Concept segmentation and labeling for conversational speech | |
Liang et al. | DUKE: Distance Fusion and Knowledge Enhanced Framework for Chinese Spelling Check | |
Viet et al. | Dependency-based pre-ordering for English-Vietnamese statistical machine translation | |
Shaalan et al. | Automatic rule induction in Arabic to English machine translation framework | |
Tran et al. | A reordering model for Vietnamese-English statistical machine translation using dependency information | |
CN108153743A (en) | Intelligent offline translation machine based on similarity |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20200327 Address after: 150001 No. 118 West straight street, Nangang District, Heilongjiang, Harbin Patentee after: Harbin University of technology high tech Development Corporation Address before: 150001 Harbin, Nangang, West District, large straight street, No. 92 Patentee before: HARBIN INSTITUTE OF TECHNOLOGY |