CN109783826A - A kind of document automatic translating method - Google Patents
A kind of document automatic translating method Download PDFInfo
- Publication number
- CN109783826A CN109783826A CN201910036423.1A CN201910036423A CN109783826A CN 109783826 A CN109783826 A CN 109783826A CN 201910036423 A CN201910036423 A CN 201910036423A CN 109783826 A CN109783826 A CN 109783826A
- Authority
- CN
- China
- Prior art keywords
- translation
- text
- document
- extracted
- languages
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 36
- 238000013519 translation Methods 0.000 claims abstract description 156
- 239000000203 mixture Substances 0.000 claims abstract description 11
- 238000001514 detection method Methods 0.000 claims description 11
- 238000012545 processing Methods 0.000 claims description 9
- 239000000284 extract Substances 0.000 claims description 4
- 238000007689 inspection Methods 0.000 claims 1
- 230000014616 translation Effects 0.000 description 135
- 230000000694 effects Effects 0.000 description 9
- 230000007257 malfunction Effects 0.000 description 4
- 238000000605 extraction Methods 0.000 description 3
- 230000004069 differentiation Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000003362 replicative effect Effects 0.000 description 2
- 230000007306 turnover Effects 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000001427 coherent effect Effects 0.000 description 1
- 150000001875 compounds Chemical class 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 230000004438 eyesight Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000010801 machine learning Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000011017 operating method Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 229910052701 rubidium Inorganic materials 0.000 description 1
- IGLNJRXAVVLDKE-UHFFFAOYSA-N rubidium atom Chemical compound [Rb] IGLNJRXAVVLDKE-UHFFFAOYSA-N 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 230000000699 topical effect Effects 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
Abstract
The invention discloses a kind of document automatic translating methods, comprising: uploads document, specifies target language;According to Doctype, corresponding Word Input tool is selected to extract the text in document;Detect the languages of document text;Analyze and record the composition information of extracted text;The text extracted is divided into the entry of several minimum units;Mark in extracted text non-translates element;The interpretative system that selection corresponds to the languages of text translates the entry of all divisions;With obtain retaining in each entry be labeled it is non-translate element, will be unless translating the partial translation of element into the translation of target language;Translation format is reconstructed with the text composition information of record, translation is compareed into displaying with original text.The present invention automatically locks non-element of translating, and translation result can be made more to meet the use habit of people, and that improves translation can be readability.Meanwhile translation of the present invention can compare displaying with original text, in order to which the control of user is read, verifies and is understood.
Description
Technical field
The present invention relates to machine translation field, especially a kind of document automatic translating method.
Background technique
Machine translation is that a kind of natural language (original language) is converted into another natural language (target language using computer
Speech) process.Because its translation efficiency is apparently higher than manually, user can be assisted more rapidly to obtain information, therefore have important
Practical value.It is predicted according to the maximum market survey library Research & Markets in the whole world, global machine translation city in 2023
Field scale will be up to 1.95 hundred million dollars.During 2017 to 2023 years, the compound annual growth rate in machine translation market is more than 6.0%, and
Reach 1.95 hundred million dollars within 2023.Existing machine translation software, for example, translation dog, Transgod using MT engine into
Row full text translation.
The engines such as TransGod are completed to need download online translation that can just check after translation, and current machine translation pair
It is " making no exception " for document, i.e., for the text in document, carries out full text translation, do not take into account that readding for translation
The bandwagon effect of the property read and translation, that is, do not take into account that whether translation meets the reading habit of people.
In the epoch that machine translation market increases rapidly, the precision and efficiency of machine translation are the Dai-ichi Mutual Life Insurances of translation software
Power.User experience is also important references point of the translation software in market competition simultaneously.
Summary of the invention
The present invention is directed to provide easier mode of operation for user, more intuitively translates effect and show.Hair of the invention
Improving eyesight is: for above-mentioned all or part of problem, providing a kind of document automatic translating method.To translate flexibly
, be suitble to people's reading habit translation.
The technical solution adopted by the invention is as follows:
A kind of document automatic translating method, comprising:
Document is uploaded, target language is specified;
Analysis Doctype selects corresponding Word Input tool to extract the text in document based on the analysis results;
According to the text extracted, the languages of document text are detected;
Analyze and record the composition information of extracted text;
The text extracted is subjected to minimum unit processing, divides the entry that extracted text is several minimum units;
Mark in extracted text it is non-translate element, it is described non-to translate the character that element is preliminary making;
According to the languages of document text, corresponding interpretative system is selected to translate extracted text;The knot of the translation
Fruit are as follows: retain it is labeled it is non-translate element, will be unless translating the partial translation of element into target language;
To the format of the result of translation, it is reconstructed with the text composition information of record.
The step of above method, in addition to apparent sequence limits, the successive requirement of no execution.
By the languages detection to original text, determines that specific interpretative system is targetedly translated, can be improved
The accuracy of translation;Corresponding Word Input tool is chosen according to Doctype, it can be ensured that for the extraction side of document text
Formula has specific aim, accurately to extract the character in document;Non- element of translating in document is locked, in a manner of original text
It is exported, so that the character of certain unsuitable translations is not translated, is more in line with the reading habit of people;Translation is compareed former
Literary typesetting is reconstructed, it is ensured that the contrastive of translation and original text reads convenient for control and translation detects;Because translation engine is
Automatic punctuate translation, is translated as unit of minimum unit and non-integral is translated, can be to avoid because of translation engine punctuate mistake
The case where causing translation result to malfunction, so that translation result is more accurate.
Further, above-mentioned interpretation method further include:
Original text is made of several parts, and when user selects bilingual journal display, each part of original text is respectively corresponded reconstruct
Each part of translation afterwards is shown.
Current machine translation mothod, does not support bilingual piecewise control, and people are only capable of by opening original text respectively and translating
Text carries out control reading.Each topical controls translation of original text is shown, original text and translation can be carried out in order to user
Control reading, verification and understanding, prominent verification, sentence learn and the effect of expression.Meanwhile passing through the selection of function, Ke Yi
Whether control show between be switched fast, simplified user's operation movement.
Further, it is above-mentioned by translation to being literally shown specifically: the control display mode specified according to user, with
Original text carries out control displaying in the preceding mode of preceding or translation, by original text and translation.
The reading habit that can satisfy different user is arranged in this, improves reading experience.
Further, above-mentioned with the preceding control displaying process of original text are as follows: duplication original text, and by each section after duplication
It falls, is respectively placed in original text and corresponds to paragraph and then by each paragraph after duplication, replace with corresponding translation;It is described to translate
The preceding control displaying process of text are as follows: the translation after duplication reconstruct, and by each paragraph after duplication, it is respectively placed in former translation
It corresponds to paragraph and then by each paragraph after duplication, replaces with corresponding original text.
Because of the translation being translated as unit of entry to original text, there is contrastive between the result and original text entry of translation.
It replicates original text or when translation, has set entry amount and corresponding content, then when row replacement, it is directly (or right to minimum unit
Answer translation) it is replaced.Compared to the mode (or opposite) for being inserted directly into translation behind original text part, more quickly, and
Because having strong contrastive, format and content is not easily caused to malfunction;Meanwhile being for each local operation while carrying out, at control
It manages more efficient.Further more, the mode replaced again is first replicated, when replicating original text or translation, to have primarily determined bandwagon effect
Typesetting when being replaced, not will cause the variation of typesetting, and treatment effect is more stable, and the mode for increasing translation paragraph by paragraph is being inserted
It just can determine final typesetting after the completion of entering.
Further, above-mentioned that the text extracted is carried out minimum unit processing, it is several for dividing extracted text
The entry of minimum unit are as follows: the text extracted is subjected to punctuate processing, dividing extracted text is several simple sentences.
Using simple sentence as minimum unit, on the one hand, can quickly, accurately complete the division of unit, on the other hand, also comply with and turn over
The case where mode for translating engine processing, prevents translation engine punctuate mistake, and translation is caused to malfunction.
Further, the above-mentioned languages according to document text, select corresponding interpretative system to turn over extracted text
It is translated into:
According to the languages of document text, corresponding corpus is called, each simple sentence of division is translated in order.
The contrast relationship being stored between a large amount of phrases, simple sentence and corresponding translation in corpus, passes through the match party of corpus
Formula can be rapidly completed the translation to simple sentence, while carry out unification to the translation result of phrase of the same race, simple sentence, and in corpus
It is all existing, the accurate translation of translation result, therefore, translation effect is more accurate.
Further, the above-mentioned languages according to document text, call corresponding corpus, in order to each list of division
Sentence is translated are as follows: it is directed to each simple sentence, it, will be non-on the translation basis that calling corpus is completed after going unless translating element
Translate the result that element is supplemented in after translation corresponding position;Or are as follows: to each complete simple sentence, call turning over for corpus completion
On the basis of translating, the non-translation for translating element is replaced with into the non-result translated after element.
In this manner, it is to have locked the non-translation effect for translating element that output result, which may be implemented, and then directly to this
As a result using without carrying out operation bidirectional.
Further, the text that above-mentioned basis extracts detects the languages of document text are as follows: complete by what is extracted from document
Portion or segment word are transferred to the detection application of third party's languages, and receive the languages of third party's languages detection application feedback.
The detection application of third party's languages is called directly, the differentiation of languages can be rapidly completed.Selection based on segment word,
It can reduce the calculation amount of languages differentiation, improve detection efficiency.
Further, above-mentioned upload document are as follows: in the translation area of prebuild, pass through dragging or the side of specified path
Formula uploads document.
Document is pulled to the mode of workspace, the operating procedure that user uploads document can be simplified, improve user experience.
Further, above-mentioned translation area is the suspended window constructed in PC machine desktop.Which is convenient for user's quick calling,
And in other cases, user will not be interfered.
In conclusion by adopting the above-described technical solution, the beneficial effects of the present invention are:
1, interpretation method of the invention selects optimal text based on the automatic identification to languages documents to be translated and type
Extraction and translation scheme, can effectively improve the accuracy of translation result.Meanwhile non-element of translating is locked (i.e. not automatically
Translated), translation result can be made more to meet the use habit of people, that improves translation can be readability.
2, the present invention carries out control paragraph by paragraph to original text and translation and shows for user's function that actively selection control is shown,
The effect that control is read, verifies, learnt can be increased, meanwhile, control is shown and is switched fast, the operation stream of user is saved
Journey.
3, in control displaying processing, by first unifying duplication, then the mode of unified replacement, on the one hand, it is based on minimum
Corresponding relationship between unit and translation can fast implement the operation for showing result, on the other hand, in duplication original text (duplication translation
When similarly), to show that document is in a relatively steady state, then when row replacement, document status will not be caused to arrange
Influence in version improves the stability that control shows processing.
4, the present invention is based on corpus translates document sentence by sentence, and translation is quicker, and translation result is more quasi-, meanwhile,
Corpus can be enriched again, provide corpus with the translation for other subsequent documents and support.
Detailed description of the invention
Examples of the present invention will be described by way of reference to the accompanying drawings, in which:
Fig. 1 and 2 is the different embodiments of two document automatic translating methods.
Specific embodiment
All features disclosed in this specification or disclosed all methods or in the process the step of, in addition to mutually exclusive
Feature and/or step other than, can combine in any way.
Any feature disclosed in this specification (including any accessory claim, abstract), unless specifically stated,
It is replaced by other equivalent or with similar purpose alternative features.That is, unless specifically stated, each feature is a series of
An example in equivalent or similar characteristics.
Embodiment one
As shown in the figures 1 and 2, present embodiment discloses a kind of document automatic translating methods, can voluntarily carry out to non-element of translating
Filtering, to translate the translation for meeting people's communicative habits;Interpretation method are as follows:
Document is uploaded, target language is specified.
Analysis Doctype selects corresponding Word Input tool to extract the text in document based on the analysis results.
According to the text extracted, the languages of document text are detected.
Analyze and record the composition information of extracted text.
The text extracted is subjected to minimum unit processing, divides the word that extracted text is several minimum units
Item.So-called minimum unit is one of word, word, phrase, simple sentence or a variety of patterns.
Mark in extracted text it is non-translate element, it is described non-to translate the character that element is preliminary making.It is non-to translate element
Refer in the case where not translating, more meets the content of context and common communicative habits.Such as document is English questionnaire,
In Q1, A1 etc., when translating into Chinese for " problem 1, replies 1 ", and this translation does not meet habit expression, thus the present invention by its
It filters out, without translation.
According to the languages of document text, corresponding interpretative system is selected to translate extracted text;The translation
Result are as follows: retain it is non-translates element, will be unless translating the partial translation of element into target language.
To the format of translation result, it is reconstructed with the text composition information of record.Translation is corresponded to the row of original text
Version format.
Original text is made of several parts, will be former when user selects bilingual journal (i.e. original text and translation contrasts) display
Each part that each part of text respectively corresponds the translation after reconstruct is shown.
The display of bilingual journal specifically: the control display mode specified according to user, with original text in preceding or translation preceding
Mode, original text and translation are subjected to control displaying.
In one embodiment, if user selects the preceding mode of original text, original text is replicated, and by each after duplication
Paragraph is respectively placed in original text and corresponds to paragraph and then by each paragraph after duplication, replace with corresponding translation.If user
Select the preceding mode of translation, then the translation after replicating reconstruct, and by each paragraph after duplication, it is respectively placed in former translation pair
It answers paragraph and then by each paragraph after duplication, replaces with corresponding original text.
Embodiment two
Referring to attached drawing 1 or 2, present embodiment discloses a kind of document automatic translating methods, belong to machine translation type, and method includes
Following steps:
In desktop, browser, the regions such as window for calling plug-in unit, translation area is constructed, passes through and pulls or the side of specified path
Formula uploads to transcription platform for documents to be translated;The target language of specified plan translation simultaneously.It, can be in desktop in PC machine
Suspended window is constructed as workspace, directly dragging document completes the upload of document to the suspended window region.
According to the suffix name of document, the type of document is judged, and according to the type of document, select corresponding Word Input work
Tool extracts the text in document.Such as office paper series, the entitled .doc (x) of suffix .ppt (x) .xls (x) etc.,
Such document belongs to text class file, and the text in document can be directly extracted by Open XML;For such as scanning PDF, picture
Equal documents, belong to picture class file, need to extract the text in document by corresponding tool (such as OCR tool);For text
Word document is such as carried out the document of PDF format conversion by this class PDF(), it is extracted in document by using tools such as PDFbox
Text.
The text extracted is input to and such as newly translates tool, receive its feedback language languages, using feedback result as
Original text languages documents to be translated.Such languages judge that tool belongs to existing application, by way of calling directly, can be completed
Judgement to document original text languages.It can be institute it should be noted that being transferred to the text that third party's languages judge tool herein
Whole texts of the document of extraction, are also possible to segment word.For the applications of calling, if single third-party application without
The complete all languages of method detection can then be realized (including numerous, simplified to Chinese to full languages by calling multiple applications
Detection) detection.
Using file parsing algorithm, analysis, which is done, extracts the composition informations such as position, pattern, the format of text in a document, and
The composition information of text is recorded.
The whole texts extracted are subjected to simple sentence and paragraph divides.This step can by detection text in comma,
Fullstop and carriage return mark are realized.
Non- based on prebuild translates element database, carries out non-translating rubidium marking to each simple sentence.It is non-translate element database according to
Experience or machine learning, the character picked out in a large amount of historical document, such as " Q: ", " F: ", " A: ", logo, enterprise name
Deng.This kind of character existing machine translation application in, will do it translation (by taking English translates Chinese as an example, " problem: " can be translated as,
" answer: " etc.), but translation result is not easy to the position of quick positioning question or answer, does not also meet people to similar file
Reading habit, therefore, this kind of characters in the present invention, without translation, directly quote original text.
Each simple sentence that will be divided calls corresponding corpus, in order to each of division according to original text languages
A simple sentence is translated.So-called corpus is the contrast relationship between the original text and translation that are recorded in history translation;Pass through
Matching between original text and corpus, calling directly corresponding translation can be completed translation to sentence.Which is relatively using such as Baidu
Machine translation, the translation results such as translation, Google's translation are more accurate, clear and coherent.When not being matched to corresponding corpus, need to call machine
Device translation application completes supplementary translation.It should be noted that the phrase in corpus, meeting real-time monitoring translation duties, are being supervised
When measuring the phrase for not including in corpus, the phrase and its translation can be stored automatically;In this way, corpus will obtain
It is continuous abundant.Be for consideration as the simple sentence that document is split into a sentence: 1, translation engine can not be translated simultaneously
One whole section of text can carry out sentence dismantling to whole section of text automatically, in this way it is possible to which sentence is arranged in partitioning site
Centre is translated the part of two sentences as a translation unit, and the result translated in this way can malfunction, and carry out sentence in advance
Son divides, and can overcome this kind of situation;2, the matching translation of corpus is drawn in advance using simple sentence or more junior unit as translation unit
Subordinate sentence, also complies with translation brief.
In one embodiment, the above-mentioned translation to each simple sentence calls corpus or machine after going unless translating element
On the translation basis that device translation application is completed, the result after element is supplemented in translation corresponding position is translated by non-.It takes out and non-translates member
Element is without translation, and after other parts are translated, the non-element of translating taken out before is backfilling into (the correspondence of corresponding position in translation
In the position in original text).In another embodiment, for each complete simple sentence, calling corpus or machine translation
Using on the translation basis of completion, the non-translation for translating element is replaced with into the non-result translated after element.Full text translation is first carried out,
The non-element of translating being translated is replaced with into original text again.
It is recorded according to composition information of the front to original text (i.e. the text of document), typesetting, format is carried out to the translation of document
The operation such as adjustment, so that translation is corresponding with the typesetting of original text.The reconstruct of format can be complete by existing document refinement algorithm
At.For example, first non-element of translating is placed in the record position of original text (such as on the basis of the first above-mentioned simple sentence translates embodiment
Relative to the coordinate in original text), it is recorded further according to the format to original text, the translation unless after translating element will be removed, with the lattice of record
Formula, pattern are filled into corresponding position.In this way, just completing the translation to document.
In order to improve the effect that user watches original text and translation, user can choose compares display using original text with translation
Mode is shown translation result.Usually in the upper, lower, left, or right side of each section of original text, control display translation.The present invention
For showing translation below original text (remaining control mode is similar), first below each paragraph of original text, original text is replicated, i.e.,
The form of paragraph A, paragraph A, paragraph B, paragraph B ..., then the paragraph (i.e. posterior to repeat paragraph) of duplication is replaced with into correspondence
, reconstruct after translation.To the displaying of original text and translation in the present embodiment, for the document newly constructed.If user does not select pair
According to display, then the translation after reconstruct is directly exported.
The invention is not limited to specific embodiments above-mentioned.The present invention, which expands to, any in the present specification to be disclosed
New feature or any new combination, and disclose any new method or process the step of or any new combination.
Claims (10)
1. a kind of document automatic translating method, which is characterized in that the interpretation method includes:
Document is uploaded, target language is specified;
Analysis Doctype selects corresponding Word Input tool to extract the text in document based on the analysis results;
According to the text extracted, the languages of document text are detected;
Analyze and record the composition information of extracted text;
The text extracted is subjected to minimum unit processing, divides the entry that extracted text is several minimum units;
Mark in extracted text it is non-translate element, it is described non-to translate the character that element is preliminary making;
According to the languages of document text, corresponding interpretative system is selected to translate the entry of all divisions;The translation
As a result are as follows: retain in each entry be labeled it is non-translate element, will be unless translating the partial translation of element into target language;
To the format of translation result, it is reconstructed with the text composition information of record.
2. interpretation method as described in claim 1, which is characterized in that further include:
Original text is made of several parts, and when user selects bilingual journal display, each part of original text is respectively corresponded reconstruct
Each part of translation afterwards is shown.
3. interpretation method as claimed in claim 2, which is characterized in that it is described by translation to being literally shown specifically:
Original text and translation are carried out control exhibition in such a way that original text is preceding in preceding or translation by the control display mode specified according to user
Show.
4. interpretation method as claimed in claim 3, which is characterized in that described with the preceding control displaying process of original text are as follows: multiple
Original text processed, and by each paragraph after duplication is respectively placed in original text and corresponds to paragraph and then by each section after duplication
It falls, replaces with corresponding translation;It is described with the preceding control displaying process of translation are as follows: duplication reconstruct after translation, and will duplication
Each paragraph afterwards is respectively placed in former translation and corresponds to paragraph and then by each paragraph after duplication, replace with corresponding
Original text.
5. the interpretation method as described in one of claim 1-4, which is characterized in that described that the text extracted is carried out minimum
Cell processing divides the entry that extracted text is several minimum units are as follows: carry out the text extracted at punctuate
Reason, dividing extracted text is several simple sentences.
6. interpretation method as claimed in claim 5, which is characterized in that the languages according to document text select corresponding
Interpretative system translates extracted text are as follows:
According to the languages of document text, corresponding corpus is called, each simple sentence of division is translated in order.
7. interpretation method as claimed in claim 6, which is characterized in that the languages according to document text are called corresponding
Corpus in order translates each simple sentence of division are as follows: is directed to each simple sentence, after going unless translating element, adjusts
On translation basis with corpus completion, the result after element is supplemented in translation corresponding position is translated by non-;Or are as follows: to each
The non-translation for translating element on the translation basis for calling corpus to complete, is replaced with the non-result translated after element by complete simple sentence.
8. the interpretation method as described in one of claim 1-4,5,6, which is characterized in that the text that the basis extracts, inspection
Measure the languages of document text are as follows: all or part of text that will be extracted from document is transferred to the detection application of third party's languages,
And receive the languages of third party's languages detection application feedback.
9. the interpretation method as described in one of claim 1-4,5,6, which is characterized in that the upload document are as follows: in prebuild
Translation area in, pull or specified path by way of, document is uploaded.
10. interpretation method as claimed in claim 9, which is characterized in that the translation area is to construct in PC machine desktop
Suspended window.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910036423.1A CN109783826B (en) | 2019-01-15 | 2019-01-15 | Automatic document translation method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910036423.1A CN109783826B (en) | 2019-01-15 | 2019-01-15 | Automatic document translation method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109783826A true CN109783826A (en) | 2019-05-21 |
CN109783826B CN109783826B (en) | 2023-11-21 |
Family
ID=66499416
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910036423.1A Active CN109783826B (en) | 2019-01-15 | 2019-01-15 | Automatic document translation method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109783826B (en) |
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110362834A (en) * | 2019-06-27 | 2019-10-22 | 右江民族医学院 | English courseware auxiliary generating method based on public translation engine |
CN110532526A (en) * | 2019-07-23 | 2019-12-03 | 平安科技(深圳)有限公司 | Data method for transformation, electronic device and computer readable storage medium |
CN110688863A (en) * | 2019-09-25 | 2020-01-14 | 六维联合信息科技(北京)有限公司 | Document translation system and document translation method |
CN110705319A (en) * | 2019-09-26 | 2020-01-17 | 张广睿 | Translation method |
CN110852114A (en) * | 2019-10-29 | 2020-02-28 | 杭州轻寻科技有限公司 | Intelligent translation system with full-version translation function |
CN111144070A (en) * | 2019-12-31 | 2020-05-12 | 北京迈迪培尔信息技术有限公司 | Document parsing translation method and device |
CN111159981A (en) * | 2019-12-31 | 2020-05-15 | 北京迈迪培尔信息技术有限公司 | Method and device for analyzing and translating Excel document |
CN111339789A (en) * | 2020-02-20 | 2020-06-26 | 北京字节跳动网络技术有限公司 | Translation model training method and device, electronic equipment and storage medium |
CN111652007A (en) * | 2020-06-09 | 2020-09-11 | 北京中科凡语科技有限公司 | Translation method and device for multi-language mixed file |
CN112287652A (en) * | 2020-06-29 | 2021-01-29 | 南京易杰智信息科技有限公司 | Method, system and device for translating formatted pictures and texts |
CN112487791A (en) * | 2020-11-27 | 2021-03-12 | 江苏省舜禹信息技术有限公司 | Multi-language hybrid intelligent translation method |
CN112765999A (en) * | 2020-12-24 | 2021-05-07 | 中国人民解放军战略支援部队信息工程大学 | Machine translation bilingual comparison method and system |
CN112949324A (en) * | 2021-01-28 | 2021-06-11 | 中国南方航空股份有限公司 | Method, system and terminal for translating and managing aircraft maintenance technical manual data |
CN113298912A (en) * | 2020-04-26 | 2021-08-24 | 阿里巴巴集团控股有限公司 | Commodity picture processing method and device and server |
CN113378585A (en) * | 2021-06-01 | 2021-09-10 | 珠海金山办公软件有限公司 | XML text data translation method and device, electronic equipment and storage medium |
CN114035760A (en) * | 2021-11-02 | 2022-02-11 | 珠海读书郎软件科技有限公司 | Translation display system and method based on double-screen telephone watch |
CN116069901A (en) * | 2023-02-03 | 2023-05-05 | 上海一者信息科技有限公司 | Non-translated element identification method based on editing behavior and rule |
CN110532526B (en) * | 2019-07-23 | 2024-04-26 | 平安科技(深圳)有限公司 | Data conversion method, electronic device and computer readable storage medium |
Citations (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6385568B1 (en) * | 1997-05-28 | 2002-05-07 | Marek Brandon | Operator-assisted translation system and method for unconstrained source text |
US20070150259A1 (en) * | 2005-12-08 | 2007-06-28 | Eiju Akahane | Apparatus, Method, and Program Product for Translation and Method of Providing Translation Support Service |
CN101034395A (en) * | 2007-03-30 | 2007-09-12 | 传神联合(北京)信息技术有限公司 | Document waiting for translating processing system and document processing method using same |
CN102262621A (en) * | 2010-05-26 | 2011-11-30 | 钟长林 | Device and method for checking translated text |
CN102662933A (en) * | 2012-03-28 | 2012-09-12 | 成都优译信息技术有限公司 | Distributive intelligent translation method |
CN103020044A (en) * | 2012-12-03 | 2013-04-03 | 江苏乐买到网络科技有限公司 | Machine-aided webpage translation method and system thereof |
CN103106195A (en) * | 2013-01-21 | 2013-05-15 | 刘树根 | Ideographical member identification and extraction method and machine-translation and manual-correction interactive translation method based on ideographical members |
CN104933041A (en) * | 2015-06-25 | 2015-09-23 | 武汉传神信息技术有限公司 | File extraction and reduction method favorable for translation work |
CN105302782A (en) * | 2015-11-23 | 2016-02-03 | 魅族科技(中国)有限公司 | Method and device for converting information |
CN105760368A (en) * | 2016-03-11 | 2016-07-13 | 张广睿 | Deep processing method for characters of document |
CN105808528A (en) * | 2016-03-04 | 2016-07-27 | 张广睿 | Document character processing method |
CN106453887A (en) * | 2016-09-30 | 2017-02-22 | 维沃移动通信有限公司 | Information processing method and mobile terminal |
CN107967243A (en) * | 2017-11-22 | 2018-04-27 | 语联网(武汉)信息技术有限公司 | A kind of processing method for supporting that user independently makes pauses in reading unpunctuated ancient writings |
EP3392782A1 (en) * | 2017-04-18 | 2018-10-24 | Salesforce.com, Inc. | Natural language translation and localization |
CN108845993A (en) * | 2018-06-06 | 2018-11-20 | 中国科学技术信息研究所 | Interpretation method, device and the terminal device of text information |
CN108961013A (en) * | 2018-07-25 | 2018-12-07 | 西安有言在前网络科技有限公司 | A kind of translation Matching Platform and method |
-
2019
- 2019-01-15 CN CN201910036423.1A patent/CN109783826B/en active Active
Patent Citations (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6385568B1 (en) * | 1997-05-28 | 2002-05-07 | Marek Brandon | Operator-assisted translation system and method for unconstrained source text |
US20070150259A1 (en) * | 2005-12-08 | 2007-06-28 | Eiju Akahane | Apparatus, Method, and Program Product for Translation and Method of Providing Translation Support Service |
CN101034395A (en) * | 2007-03-30 | 2007-09-12 | 传神联合(北京)信息技术有限公司 | Document waiting for translating processing system and document processing method using same |
CN102262621A (en) * | 2010-05-26 | 2011-11-30 | 钟长林 | Device and method for checking translated text |
CN102662933A (en) * | 2012-03-28 | 2012-09-12 | 成都优译信息技术有限公司 | Distributive intelligent translation method |
CN103020044A (en) * | 2012-12-03 | 2013-04-03 | 江苏乐买到网络科技有限公司 | Machine-aided webpage translation method and system thereof |
CN103106195A (en) * | 2013-01-21 | 2013-05-15 | 刘树根 | Ideographical member identification and extraction method and machine-translation and manual-correction interactive translation method based on ideographical members |
CN104933041A (en) * | 2015-06-25 | 2015-09-23 | 武汉传神信息技术有限公司 | File extraction and reduction method favorable for translation work |
CN105302782A (en) * | 2015-11-23 | 2016-02-03 | 魅族科技(中国)有限公司 | Method and device for converting information |
CN105808528A (en) * | 2016-03-04 | 2016-07-27 | 张广睿 | Document character processing method |
CN105760368A (en) * | 2016-03-11 | 2016-07-13 | 张广睿 | Deep processing method for characters of document |
CN106453887A (en) * | 2016-09-30 | 2017-02-22 | 维沃移动通信有限公司 | Information processing method and mobile terminal |
EP3392782A1 (en) * | 2017-04-18 | 2018-10-24 | Salesforce.com, Inc. | Natural language translation and localization |
CN107967243A (en) * | 2017-11-22 | 2018-04-27 | 语联网(武汉)信息技术有限公司 | A kind of processing method for supporting that user independently makes pauses in reading unpunctuated ancient writings |
CN108845993A (en) * | 2018-06-06 | 2018-11-20 | 中国科学技术信息研究所 | Interpretation method, device and the terminal device of text information |
CN108961013A (en) * | 2018-07-25 | 2018-12-07 | 西安有言在前网络科技有限公司 | A kind of translation Matching Platform and method |
Non-Patent Citations (4)
Title |
---|
HAO-WEI PENG等: "Translating OpenACC to LLVM IR with SPIR kernels", IEEE, pages 1 * |
张新玲等: "语言交际能力视角下的翻译能力再审视", 上海翻译, no. 5, pages 15 * |
罗刚峰等: "从交际翻译的角度论出口农产品包装文字说明的翻译技巧", 智库时代, no. 38, pages 287 * |
魏长宏;赵宇;: "Word文件的两种译法", 鞍山师范学院学报, vol. 11, no. 06, pages 57 * |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110362834A (en) * | 2019-06-27 | 2019-10-22 | 右江民族医学院 | English courseware auxiliary generating method based on public translation engine |
CN110532526A (en) * | 2019-07-23 | 2019-12-03 | 平安科技(深圳)有限公司 | Data method for transformation, electronic device and computer readable storage medium |
CN110532526B (en) * | 2019-07-23 | 2024-04-26 | 平安科技(深圳)有限公司 | Data conversion method, electronic device and computer readable storage medium |
CN110688863A (en) * | 2019-09-25 | 2020-01-14 | 六维联合信息科技(北京)有限公司 | Document translation system and document translation method |
CN110688863B (en) * | 2019-09-25 | 2023-04-07 | 六维联合信息科技(北京)有限公司 | Document translation system and document translation method |
CN110705319A (en) * | 2019-09-26 | 2020-01-17 | 张广睿 | Translation method |
CN110852114A (en) * | 2019-10-29 | 2020-02-28 | 杭州轻寻科技有限公司 | Intelligent translation system with full-version translation function |
CN111159981A (en) * | 2019-12-31 | 2020-05-15 | 北京迈迪培尔信息技术有限公司 | Method and device for analyzing and translating Excel document |
CN111144070A (en) * | 2019-12-31 | 2020-05-12 | 北京迈迪培尔信息技术有限公司 | Document parsing translation method and device |
CN111159981B (en) * | 2019-12-31 | 2023-08-08 | 北京迈迪培尔信息技术有限公司 | Method and device for analyzing and translating Excel document |
CN111339789A (en) * | 2020-02-20 | 2020-06-26 | 北京字节跳动网络技术有限公司 | Translation model training method and device, electronic equipment and storage medium |
CN111339789B (en) * | 2020-02-20 | 2023-08-01 | 北京字节跳动网络技术有限公司 | Translation model training method and device, electronic equipment and storage medium |
CN113298912A (en) * | 2020-04-26 | 2021-08-24 | 阿里巴巴集团控股有限公司 | Commodity picture processing method and device and server |
CN111652007A (en) * | 2020-06-09 | 2020-09-11 | 北京中科凡语科技有限公司 | Translation method and device for multi-language mixed file |
CN112287652A (en) * | 2020-06-29 | 2021-01-29 | 南京易杰智信息科技有限公司 | Method, system and device for translating formatted pictures and texts |
CN112487791A (en) * | 2020-11-27 | 2021-03-12 | 江苏省舜禹信息技术有限公司 | Multi-language hybrid intelligent translation method |
CN112765999A (en) * | 2020-12-24 | 2021-05-07 | 中国人民解放军战略支援部队信息工程大学 | Machine translation bilingual comparison method and system |
CN112949324A (en) * | 2021-01-28 | 2021-06-11 | 中国南方航空股份有限公司 | Method, system and terminal for translating and managing aircraft maintenance technical manual data |
CN113378585A (en) * | 2021-06-01 | 2021-09-10 | 珠海金山办公软件有限公司 | XML text data translation method and device, electronic equipment and storage medium |
CN114035760A (en) * | 2021-11-02 | 2022-02-11 | 珠海读书郎软件科技有限公司 | Translation display system and method based on double-screen telephone watch |
CN116069901A (en) * | 2023-02-03 | 2023-05-05 | 上海一者信息科技有限公司 | Non-translated element identification method based on editing behavior and rule |
CN116069901B (en) * | 2023-02-03 | 2023-08-11 | 上海一者信息科技有限公司 | Non-translated element identification method based on editing behavior and rule |
Also Published As
Publication number | Publication date |
---|---|
CN109783826B (en) | 2023-11-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109783826A (en) | A kind of document automatic translating method | |
CN106844351B (en) | Medical institution organization entity identification method and device oriented to multiple data sources | |
Anastasiou | Idiom treatment experiments in machine translation | |
Scherrer et al. | Digitising Swiss German: how to process and study a polycentric spoken language | |
US20190317986A1 (en) | Annotated text data expanding method, annotated text data expanding computer-readable storage medium, annotated text data expanding device, and text classification model training method | |
Antunović et al. | Here and now: Self-revision in student translation processes from L2 and L3 | |
Novák et al. | Creation of an annotated corpus of Old and Middle Hungarian court records and private correspondence | |
CN109241540A (en) | A kind of blind automatic switching method of Chinese based on deep neural network and system | |
CN108665141B (en) | Method for automatically extracting emergency response process model from emergency plan | |
US11593557B2 (en) | Domain-specific grammar correction system, server and method for academic text | |
CN109933796A (en) | A kind of bulletin text key message extracting method and equipment | |
CN111143531A (en) | Question-answer pair construction method, system, device and computer readable storage medium | |
CN111553159B (en) | Question generation method and system | |
Killman | Context as Achilles’ heel of translation technologies: Major implications for end-users | |
US20230123328A1 (en) | Generating cascaded text formatting for electronic documents and displays | |
Pichel et al. | A methodology to measure the diachronic language distance between three languages based on perplexity | |
De Paiva et al. | Exploratory information extraction from a historical dictionary | |
CN106339367A (en) | Method for automatically correcting Mongolian | |
CN114528459A (en) | Semantic-based webpage information extraction method and system | |
Shao | Human-Computer Interaction Environment Monitoring and Collaborative Translation Mode Exploration Using Artificial Intelligence Technology | |
CN113688606A (en) | Method for automatically writing document report | |
CN113408290A (en) | Intelligent marking method and system for Chinese text | |
CN112115722A (en) | Human brain-simulated Chinese analysis method and intelligent interaction system | |
CN109492100A (en) | A kind of non-structural information processing and resource management system | |
Meelen et al. | Old Catalan Morphosyntax: Developing an Annotated Corpus |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |