CN103885942B - A kind of rapid translation device and method - Google Patents

Info

Publication number
CN103885942B
CN103885942B CN201410100000.9A CN201410100000A CN103885942B CN 103885942 B CN103885942 B CN 103885942B CN 201410100000 A CN201410100000 A CN 201410100000A CN 103885942 B CN103885942 B CN 103885942B
Authority
CN
China
Prior art keywords
translated
simple sentence
module
file
mark
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410100000.9A
Other languages
Chinese (zh)
Other versions
CN103885942A (en)
Inventor
张马成
王兴强
杨明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chengdu Excellent Translation Information Technology Ltd By Share Ltd
Original Assignee
Chengdu Excellent Translation Information Technology Ltd By Share Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2014-03-18
Filing date
2014-03-18
Publication date
2017-09-05
Application filed by Chengdu Excellent Translation Information Technology Ltd By Share Ltd
Priority to CN201410100000.9A
Publication of CN103885942A
Application granted
Publication of CN103885942B
Active legal status
Anticipated expiration

Links

Abstract

A rapid translation method comprises a step of dividing a file to be translated into single sentences in advance, and further comprises: a step of comparing each sentence to be translated with the sentences in a historical trace library; a first-screening step of judging whether the sentence to be translated occurs for the first time in the file to be translated; and a restoration step in which, after translation is complete, the translations of all sentences in the historical trace library and the first-screening library are returned to their corresponding positions in the file to be translated according to the correspondence. A rapid translation device comprises a historical trace library, an input distribution module, a first-occurrence judging module, a comparison module, a first-screening library, a marking module, a restoration module and a display device. The comparison module is connected to the historical trace library and compares the sentences in the library with each sentence to be translated produced by the input distribution module; the first-occurrence judging module judges each sentence to be translated produced by the input distribution module. The rapid translation device and method of the invention reduce the number of words to be translated while ensuring translation quality.

Description

A rapid translation device and method
Technical field
The invention belongs to the software field and relates to language translation software, in particular to a rapid translation device and method.
Background art
Since the mid-1980s, with the wide use of corpus-based and multi-engine machine translation methods, the performance and efficiency of translation software have improved markedly, and translation software of every kind has sprung up like mushrooms after rain. Translating with a pre-written software program greatly increases the speed of text translation.
Because of the particular nature of linguistic expression, the translation quality of translation software has been criticized repeatedly. Translation software works by storing the semantics of two languages in one-to-one correspondence and mechanically calling up replacements during translation. Because linguistic expression is diverse, each character or word often has more than one meaning, so a translation produced by software usually cannot fully and correctly convey the meaning of the original. Human translation therefore remains the guarantee of high translation quality.
The inherent drawback of human translation is that it is too slow: the translator must first translate word by word and then combine the results into sentences that follow the conventions of the target language, which takes a long time and cannot be replaced by translation software.
Summary of the invention
To overcome the poor translation quality of translation software and the slow speed of human translation, the invention discloses a rapid translation device and method.
The rapid translation device of the invention comprises a historical trace library, an input distribution module, a first-occurrence judging module, a comparison module, a first-screening library, a marking module, a restoration module and a display device;
The file to be translated is input to the input distribution module and divided in units of whole sentences;
The first-occurrence judging module judges each sentence to be translated produced by the input distribution module, determining whether that sentence occurs for the first time in the file to be translated; if so, the sentence is stored in the first-screening library; otherwise the marking module is instructed to give the sentence a do-not-translate mark;
The comparison module is connected to the historical trace library and compares the sentences in the library with each sentence to be translated produced by the input distribution module; if they are identical, it instructs the marking module to give the sentence to be translated a do-not-translate mark;
The display device displays the file to be translated after all do-not-translate marks have been made;
The restoration module is used, after translation is complete, to return the translations of all sentences in the historical trace library and the first-screening library to their corresponding positions in the file to be translated according to the correspondence.
Preferably, the input distribution module also identifies letter abbreviations in the file to be translated as single sentences.
Specifically, the do-not-translate mark made by the marking module is a display color different from the normal display, together with a non-editable state.
The invention also discloses a rapid translation method, comprising a step of dividing the file to be translated into single sentences in advance, and further comprising:
I1, a step of comparing the sentences to be translated with the sentences in the historical trace library: the sentences in the library are compared with each sentence to be translated produced by the input distribution module; if they are identical, the marking module is instructed to give the sentence to be translated a do-not-translate mark; sentences given a do-not-translate mark do not proceed to step I2;
I2, a first-screening step: judging whether the sentence to be translated occurs for the first time in the file to be translated; if so, it is stored in the first-screening library; otherwise the marking module is instructed to give the sentence a do-not-translate mark;
I3, a restoration step: after translation is complete, the translations of all sentences in the historical trace library and the first-screening library are returned to their corresponding positions in the file to be translated according to the correspondence.
Preferably, a matching array D is set up; for each sentence, if the same sentence is not already stored in D, the sentence is stored in D; otherwise the sentence is given a do-not-translate mark.
Specifically, the step of dividing the file to be translated into single sentences in advance includes identifying the letter abbreviations in the file to be translated as single sentences.
With the rapid translation device and method of the invention, the file to be translated is pre-screened against the historical file library and then filtered for repetitions by self-comparison, which reduces the number of words to be translated; comparison in units of sentences ensures the quality of the translation. In actual tests, the invention reduced the amount of translation by more than 30%.
Brief description of the drawings
Fig. 1 is a structural schematic diagram of one embodiment of the rapid translation device of the invention;
Fig. 2 is a structural schematic diagram of one embodiment of the rapid translation method of the invention.
Detailed description of the embodiments
The embodiments of the invention are described in further detail below with reference to the accompanying drawings.
The rapid translation device of the invention comprises a historical trace library, an input distribution module, a first-occurrence judging module, a comparison module, a first-screening library, a marking module, a restoration module and a display device;
The file to be translated is input to the input distribution module and divided in units of whole sentences;
The first-occurrence judging module judges each sentence to be translated produced by the input distribution module, determining whether that sentence occurs for the first time in the file to be translated; if so, the sentence is stored in the first-screening library; otherwise the marking module is instructed to give the sentence a do-not-translate mark;
The comparison module is connected to the historical trace library and compares the sentences in the library with each sentence to be translated produced by the input distribution module; if they are identical, it instructs the marking module to give the sentence to be translated a do-not-translate mark;
The display device displays the file to be translated after all do-not-translate marks have been made;
The restoration module is used, after translation is complete, to return the translations of all sentences in the historical trace library and the first-screening library to their corresponding positions in the file to be translated according to the correspondence.
When the invention is used, the file to be translated is first input to the input distribution module, which divides it into single sentences according to a given rule. A common approach is to use punctuation marks such as the period, question mark and ellipsis as sentence delimiters.
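The punctuation-based division described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `split_sentences` and the exact set of delimiter characters (ASCII and full-width period, question mark, exclamation mark, and the ellipsis character) are assumptions, since the text only names the period, question mark and ellipsis as examples.

```python
import re

# Assumed delimiter set: ASCII and full-width sentence-final punctuation.
# The patent text only gives period, question mark and ellipsis as examples.
_SENTENCE = re.compile(r"[^.?!。？！…]+[.?!。？！…]*")


def split_sentences(text):
    """Divide text into single sentences, keeping the closing punctuation."""
    pieces = _SENTENCE.findall(text)
    # Drop pieces that are only whitespace (e.g. trailing spaces or newlines).
    return [piece.strip() for piece in pieces if piece.strip()]


if __name__ == "__main__":
    for sentence in split_sentences("Hello world. How are you? Fine..."):
        print(sentence)
```

A rule this simple would also split inside dotted abbreviations such as "U.S.A."; a real segmenter would need extra handling for such cases.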
The single sentences obtained from the file to be translated are compared with the sentences pre-stored in the historical trace library. The comparison principle is complete identity: every word in the sentence, and the order of all the words, must be exactly the same. Sentences that are completely identical are given a do-not-translate mark.
When identifying single sentences, the invention preferably separates out letter abbreviations that have a commonly understood technical meaning as single sentences in their own right, regardless of whether the abbreviation is delimited by punctuation, for example WTO (World Trade Organization) and USA (United States of America). A letter abbreviation usually sits inside a sentence; in that case the abbreviation is compared first, and then the sentence containing it.
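As a rough sketch of the abbreviation handling described above, the following treats each letter abbreviation found inside a sentence as a comparison unit of its own and lists it before the sentence that contains it. The function name `units_for_sentence` is hypothetical, and detecting abbreviations as runs of two or more capital letters is an assumption: the text only says the abbreviations have a commonly understood technical meaning and does not say how they are recognized.

```python
import re

# Assumption: an abbreviation is a standalone run of two or more capital letters.
_ABBREVIATION = re.compile(r"\b[A-Z]{2,}\b")


def units_for_sentence(sentence):
    """Return the comparison units for one sentence: abbreviations first, then the sentence."""
    abbreviations = []
    for match in _ABBREVIATION.findall(sentence):
        if match not in abbreviations:  # list each abbreviation once
            abbreviations.append(match)
    return abbreviations + [sentence]


if __name__ == "__main__":
    print(units_for_sentence("The WTO and the USA signed it."))
    # ['WTO', 'USA', 'The WTO and the USA signed it.']
```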
Taking the most common case of English-to-Chinese translation as an example, the historical trace library is a database, built up from past work or from published English-Chinese documents, that is stored in units of single sentences or of phrases that express a complete meaning, and that contains English originals and Chinese translations in one-to-one correspondence. As is well known, an English word may have several meanings, but within a specific sentence its meaning is usually fixed, and the meaning of sentences and conventionally established phrases is also largely the same across different contexts.
The comparison module compares the sentences stored in the historical trace library with the sentence units divided out of the file to be translated and screens them according to the complete-identity principle; the identical sentences found are given a do-not-translate mark in the file to be translated. Sentences given a do-not-translate mark do not undergo the subsequent first-occurrence judgment.
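The complete-identity comparison against the historical trace library can be illustrated with a dictionary lookup. This is only a sketch under assumptions: the library is modeled as a Python `dict` mapping each source sentence to its stored translation, and `mark_against_history` is a hypothetical name. The do-not-translate mark is represented as a boolean.

```python
def mark_against_history(sentences, history):
    """Pair each sentence with True if an identical sentence is in the history library.

    history: dict mapping a source sentence to its stored translation (assumed layout).
    """
    marked = []
    for sentence in sentences:
        # Exact string equality: same words in the same order, same punctuation.
        marked.append((sentence, sentence in history))
    return marked


if __name__ == "__main__":
    history = {"Good morning.": "早上好。"}
    print(mark_against_history(["Good morning.", "Good morning!"], history))
    # [('Good morning.', True), ('Good morning!', False)]
```

Because the match is exact, even a change of punctuation leaves the sentence unmarked, which is consistent with the complete-identity principle stated in the text.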
After the comparison with the historical trace library is complete, the remaining sentences undergo the first-occurrence judgment, i.e. whether the sentence occurs for the first time in the file to be translated. The first-occurrence judging module screens the file to be translated against itself to filter out repeated sentences. Within a single article, which is one author's description of one subject, quite a few sentences or phrases may occur repeatedly. The first-occurrence judging module judges in turn whether each sentence occurs for the first time in the file to be translated: if it is not a first occurrence, it is given a do-not-translate mark; if it is a first occurrence, it is stored in the first-screening library.
The comparison with the historical trace library should precede the first-occurrence judgment, which reduces the amount of computation. For example, if a certain expression occurs for the first time in the text and also appears in the historical trace library, the historical comparison alone is enough to give it a do-not-translate mark. If the first-occurrence judgment were performed first, the historical comparison would still be needed to obtain the result. For a single article, the number of repeated sentences is generally smaller than the number of sentences that occur only once, and because the historical trace library accumulates a very large number of sentences, the sentences found in the library usually outnumber the repeated ones; the first-occurrence judgment should therefore come later.
A sentence that the system has given a do-not-translate mark should be displayed by the display module differently from other sentences, for example in a color different from the normal display; to prevent the translator from translating it or from operating on it by mistake, the sentence can also be set to a non-editable state. The translator only needs to translate, in the file to be translated shown by the display module, the sentences that require translation, together with the sentences that occur repeatedly in the file and were stored by the first-occurrence judging module.
After translation is complete, the result is a translation containing a number of gaps, which are the positions of the sentences given do-not-translate marks. According to the translations stored in the historical trace library and the sentence translations in the first-occurrence judging module, the system backfills the translations of the marked sentences into the gaps according to the correspondence, yielding the complete translation.
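The backfilling step can be sketched as a per-position lookup. The names `restore_translation`, `history` and `first_screening` are illustrative assumptions: both stores are modeled as dictionaries from source sentence to translation, with `first_screening` holding the translator's output for the sentences that occurred for the first time.

```python
def restore_translation(sentences, history, first_screening):
    """Rebuild the full translation, one entry per source sentence position.

    history:         source sentence -> translation stored in the history library
    first_screening: source sentence -> translation produced by the translator
    Both layouts are assumptions made for this sketch.
    """
    restored = []
    for sentence in sentences:
        if sentence in history:
            # Sentences matched in the history library never reach the translator.
            restored.append(history[sentence])
        else:
            # First occurrences and their repeats share one translated entry.
            restored.append(first_screening[sentence])
    return restored


if __name__ == "__main__":
    source = ["A.", "B.", "A."]
    print(restore_translation(source, {"B.": "乙。"}, {"A.": "甲。"}))
    # ['甲。', '乙。', '甲。']
```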
Fig. 2 shows an embodiment that processes files to be translated in batch and in real time using a nested-loop algorithm.
The system reads in N files to be translated at a time and applies the translation method of the invention to each file. The full text is first segmented into sentences, yielding C single sentences, and each sentence is judged and identified in turn. The judgment uses a loop with an incrementing counter: for the J-th sentence, the system first judges whether it appears in the historical trace library; if so, it is given a do-not-translate mark and the judgment proceeds to sentence J+1; if not, the system goes on to judge whether it is a first occurrence.
In this embodiment the first-occurrence judgment step is implemented as follows: a user-defined matching array D is created, which is initially empty. For each sentence, if it occurs for the first time, it is stored in matching array D and the judgment proceeds to sentence J+1; when the sentence occurs a second time, it is given a do-not-translate mark and the judgment proceeds to sentence J+1. Matching array D thus ends up holding all the non-repeated sentences of the file to be translated. Once the translator has actually translated all the sentences in matching array D, combining them with the historical translations stored in the historical trace library completes the whole translation of the file. Using a matching array keeps the data-processing logic simple and the program's resource consumption low.
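The per-file loop with matching array D might look like the sketch below. It follows the order given in the text (history comparison first, then the first-occurrence judgment), but the function name `screen_document`, the boolean representation of the do-not-translate mark, and the auxiliary `seen` set used for fast lookup are assumptions added for illustration.

```python
def screen_document(sentences, history):
    """Return (matching_array, marks) for one file.

    matching_array: the sentences the translator actually has to translate
    marks:          one boolean per sentence, True meaning "do not translate"
    history:        source sentence -> stored translation (assumed layout)
    """
    matching_array = []  # matching array D, initially empty
    seen = set()         # assumption: mirrors D for constant-time membership tests
    marks = []
    for sentence in sentences:       # J-th sentence, then J+1, and so on
        if sentence in history:
            marks.append(True)       # already in the history library
        elif sentence in seen:
            marks.append(True)       # repeat within this file
        else:
            seen.add(sentence)
            matching_array.append(sentence)
            marks.append(False)      # first occurrence: needs translating
    return matching_array, marks


if __name__ == "__main__":
    doc = ["A.", "B.", "A.", "C.", "B."]
    print(screen_document(doc, {"C.": "丙。"}))
    # (['A.', 'B.'], [False, False, True, True, True])
```

In this example only two of the five sentences reach the translator; processing a batch of N files would simply call the function once per file.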
The steps of the methods or algorithms described in the embodiments disclosed herein can be implemented directly in hardware, in a software module executed by a processor, or in a combination of the two. The software module can reside in random access memory (RAM), main memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, a hard disk, a removable disk, a CD-ROM, or any other form of storage medium known in the technical field.
The foregoing are preferred embodiments of the invention. Provided that the preferred embodiments are not plainly contradictory and that none presupposes another, they can be combined with one another in any way. The specific parameters in the embodiments serve only to set out clearly the inventors' verification of the invention and are not intended to limit the scope of patent protection, which remains defined by the claims. All equivalent structural changes made using the specification and drawings of the invention likewise fall within the scope of protection of the invention.

Claims (6)

1. A rapid translation device, characterized in that it comprises a historical trace library, an input distribution module, a first-occurrence judging module, a comparison module, a first-screening library, a marking module, a restoration module and a display device;
The file to be translated is input to the input distribution module and divided in units of whole sentences;
The first-occurrence judging module judges each sentence to be translated produced by the input distribution module, determining whether that sentence occurs for the first time in the file to be translated; if so, the sentence is stored in the first-screening library; otherwise the marking module is instructed to give the sentence a do-not-translate mark;
The comparison module is connected to the historical trace library and compares the sentences in the library with each sentence to be translated produced by the input distribution module; if they are identical, it instructs the marking module to give the sentence to be translated a do-not-translate mark;
The display device displays the file to be translated after all do-not-translate marks have been made;
The restoration module is used, after translation is complete, to return the translations of all sentences in the historical trace library and the first-screening library to their corresponding positions in the file to be translated according to the correspondence.
2. The rapid translation device as claimed in claim 1, characterized in that the input distribution module also identifies letter abbreviations in the file to be translated as single sentences.
3. The rapid translation device as claimed in claim 1, characterized in that the do-not-translate mark made by the marking module is a display color different from the normal display, together with a non-editable state.
4. A rapid translation method, comprising a step of dividing the file to be translated into single sentences in advance, characterized in that it further comprises:
I1, a step of comparing the sentences to be translated with the sentences in the historical trace library: the sentences in the library are compared with each sentence to be translated produced by the input distribution module; if they are identical, the marking module is instructed to give the sentence to be translated a do-not-translate mark; sentences given a do-not-translate mark do not proceed to step I2;
I2, a first-screening step: judging whether the sentence to be translated occurs for the first time in the file to be translated; if so, it is stored in the first-screening library; otherwise the marking module is instructed to give the sentence a do-not-translate mark;
I3, a restoration step: after translation is complete, the translations of all sentences in the historical trace library and the first-screening library are returned to their corresponding positions in the file to be translated according to the correspondence.
5. The rapid translation method as claimed in claim 4, characterized in that the first-screening step is: a matching array D is set up; for each sentence, if the same sentence is not already stored in D, the sentence is stored in D; otherwise the sentence is given a do-not-translate mark.
6. The rapid translation method as claimed in claim 4, characterized in that the step of dividing the file to be translated into single sentences in advance includes identifying the letter abbreviations in the file to be translated as single sentences.
CN201410100000.9A 2014-03-18 2014-03-18 A kind of rapid translation device and method Active CN103885942B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410100000.9A CN103885942B (en) 2014-03-18 2014-03-18 A kind of rapid translation device and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410100000.9A CN103885942B (en) 2014-03-18 2014-03-18 A kind of rapid translation device and method

Publications (2)

Publication Number Publication Date
CN103885942A CN103885942A (en) 2014-06-25
CN103885942B true CN103885942B (en) 2017-09-05

Family

ID=50954837

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410100000.9A Active CN103885942B (en) 2014-03-18 2014-03-18 A kind of rapid translation device and method

Country Status (1)

Country Link
CN (1) CN103885942B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104699669B (en) * 2015-03-31 2018-08-03 中译语通科技股份有限公司 A kind of method and device of text word counting
CN105183723A (en) * 2015-09-17 2015-12-23 成都优译信息技术有限公司 Associating method for translation software and language material searching
CN107451127B (en) * 2017-07-04 2020-11-06 广东小天才科技有限公司 Word translation method and system based on image and mobile device
CN107918904A (en) * 2017-11-15 2018-04-17 郑州中业科技股份有限公司 A kind of crowdsourcing interpretation method and platform
CN109992753B (en) * 2019-03-22 2023-09-08 维沃移动通信有限公司 Translation processing method and terminal equipment
CN111563389B (en) * 2020-04-20 2023-11-03 富途网络科技(深圳)有限公司 Translation method and device for original content of user
CN112784613A (en) * 2021-01-29 2021-05-11 语联网(武汉)信息技术有限公司 Document batch translation method and device, electronic equipment and storage medium

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS60124782A (en) * 1983-12-09 1985-07-03 Fujitsu Ltd Machine translation device
JPH09305610A (en) * 1996-05-20 1997-11-28 Matsushita Electric Ind Co Ltd Machine translation device
CN1606030A (en) * 2004-11-12 2005-04-13 无敌科技(西安)有限公司 Electronic photography translation paraphrasing method and apparatus
CN1661593A (en) * 2004-02-24 2005-08-31 北京中专翻译有限公司 Method for translating computer language and translation system
CN1952930A (en) * 2005-10-20 2007-04-25 英业达股份有限公司 Inquiry system and method of vocabulary
CN101221576A (en) * 2008-01-23 2008-07-16 腾讯科技(深圳)有限公司 Input method and device capable of implementing automatic translation
CN102662933A (en) * 2012-03-28 2012-09-12 成都优译信息技术有限公司 Distributive intelligent translation method
CN102708097A (en) * 2012-04-27 2012-10-03 曾立人 Online computer translation method and online computer translation system
CN102902667A (en) * 2012-10-12 2013-01-30 曾立人 Method for displaying translation memory match result
CN103020044A (en) * 2012-12-03 2013-04-03 江苏乐买到网络科技有限公司 Machine-aided webpage translation method and system thereof
CN103235775A (en) * 2013-04-25 2013-08-07 中国科学院自动化研究所 Statistics machine translation method integrating translation memory and phrase translation model

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
机器翻译与计算机辅助翻译比较分析;梁三云;《外语电化教学》;20041231;43-45 *

Also Published As

Publication number Publication date
CN103885942A (en) 2014-06-25

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 610000 B, building 4, building 200, Tianfu five street, Chengdu hi tech Zone, Sichuan,

Applicant after: CHENGDU UE INFORMATION TECHNOLOGY CO.,LTD.

Address before: 610000, No. 1, building 107, 1 West Bauhinia Road, Chengdu hi tech Zone, Sichuan, 6

Applicant before: CHENGDU UE INFORMATION TECHNOLOGY CO.,LTD.

COR Change of bibliographic data
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: A Fast Translation Device and Method

Effective date of registration: 20230526

Granted publication date: 20170905

Pledgee: Industrial Bank Limited by Share Ltd. Chengdu branch

Pledgor: CHENGDU UE INFORMATION TECHNOLOGY CO.,LTD.

Registration number: Y2023980041884

PE01 Entry into force of the registration of the contract for pledge of patent right