CN101916247A - Internet multilingual simultaneous translation method based on single kernel language - Google Patents
Internet multilingual simultaneous translation method based on single kernel language Download PDFInfo
- Publication number
- CN101916247A CN101916247A CN 201010271758 CN201010271758A CN101916247A CN 101916247 A CN101916247 A CN 101916247A CN 201010271758 CN201010271758 CN 201010271758 CN 201010271758 A CN201010271758 A CN 201010271758A CN 101916247 A CN101916247 A CN 101916247A
- Authority
- CN
- China
- Prior art keywords
- kernel language
- text
- language
- kernel
- internet
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Abstract
The invention discloses an Internet multilingual simultaneous translation method which comprises the following steps of: selecting one kind of language of website texts as kernel language, and translating non-kernel language of the website texts from the kernel language; and for newly added website texts, if the newly added website texts are the kernel language, directly translating the newly added website texts into the non-kernel language, and if not, firstly translating into the kernel language and then translating into the non-kernel language. The translation method provided by the invention avoids the defect of translation meaning distortion caused by different word orders of different languages when long sentences are translated; meanwhile, the framework part is only translated once, thereby saving partial translation processing time and improving the efficiency. Importantly, one kernel language is taken as a reference for multiple languages, thereby avoiding the generation of a new error on the basis of an error generated in the translation transferring process.
Description
Technical field
The present invention relates to a kind of method for translating internet webpage, be applied to Internet of Things Information technology field, internet.
Background technology
Existing translating internet webpage technology, such as the multilingual translation system of Google, this system can to most language in the world in twos between intertranslation.The web page translation system of Google has obtained using widely in current Internet circles, is mainly used in to paste character translation and the translation of webpage entire chapter.The annexation of its each several part is more directly perceived, and essential part has three parts.First is a blank box, and pasting to write for the user needs the translation content, or web page address; Second portion is a background process, perhaps network address webpage full text translation in the user is pasted; Third part is the display part, shows the translation result that is transferred by second portion.Google's translation system is simple and practical, and its weak point is content not to be added the translation of differentiation monoblock return, and to the translation than the long article field, because each languages word order custom is different, looks like through mixed and disorderly stack of regular meeting or meaning distortion.Particularly for the webpage that the standardization content structure is arranged, as the introduction of e-commerce website to product, many webpages all are a kind of framed structures, and the content of framework (Frame) is constant usually, and concrete content (Content) is becoming, and at this moment can produce to repeat translation.And the A language is translated back again through translating the language to B by the A language by Google's web page translation system after tested, changes, even ambiguity is arranged.
Summary of the invention
The objective of the invention is to overcome the deficiencies in the prior art, solve in the internet web page multilingual translation process from being in harmony conforming problem, a kind of internet multilingual simultaneous translation method based on single kernel language is provided, even if, also can guarantee and the original meaning consistance multilingual transmission translation through polyglot.
According to technical scheme provided by the invention, described internet multilingual simultaneous translation method based on single kernel language is: a kind of language of selecting the website text is as kernel language, and the translation of the non-kernel language of website text is obtained by the kernel language translation; To the newly-increased text in website, kernel language then directly is translated into non-kernel language in this way, as not being kernel language, then is translated into kernel language earlier, kernel language is translated into non-kernel language again.
Concrete steps are: the Webpage to the website is analyzed, and page text is separated into framework text and content text, and the framework text storage of kernel language is in first text, and the content text of kernel language is stored in database first table; Equally, the framework text storage that will plant non-kernel language to every kind of non-kernel language respectively is in non-kernel language text, and the content text of the non-kernel language of this kind is stored in the database in the non-kernel language table; Described non-kernel language text is formed by the disposable translation of first text, and the record in the described non-kernel language table is all independently formed by the kernel language translation in first table; First text connects from first table by database gets a record, combines with the webpage format framework, forms the webpage of kernel language; Equally, non-kernel language text connects by database gets a record from non-kernel language table, combine with the webpage format framework, forms the webpage of this non-kernel language.
To website new content text, kernel language then deposits the new content text in first table in this way, as not being kernel language, then is to deposit in behind the kernel language in first table the new content text translation; Again new content in first table is translated as non-kernel language respectively, deposits in the corresponding non-kernel language table.
The corresponding record of first table and each non-kernel language table has same sequence number, is connected correspondence with non-kernel language text by database for first text and obtains record.
Corresponding one by one on described first text and the non-kernel language text structure.
Described first text and non-kernel language text all have the database connection mechanism, can get the record of same sequence number from described first table and non-kernel language table.
Described first table is the pre-designed table of lane database with non-kernel language table, has same structure, and each is recorded unique sequences number.Described sequence number is the generation of progressively increasing automatically in first table, in the non-kernel language table be to duplicate gained when content is done its appropriate translation in according to first table.
There is link to go to the corresponding page or leaf of other language in the webpage by table generation with same sequence number.
When split-frame text and content text, as the framework text, the particular content of content element is as content text with the Property Name of content element in the page text.
Advantage of the present invention is: owing to be the normalized form content of blocking in the standard text, shorter than full text translation, also more easily keep original meaning, again in conjunction with the pretranslation framework, the whole web results of gained is littler than the distortion of the whole paginal translation meaning, what no less important was that the present invention proposes is the scheme in axle center with a kind of language (as Chinese), has avoided bigger distortion after the multilingual chain type translation.
Description of drawings
Fig. 1 is a principle of the invention synoptic diagram.
Embodiment
The invention will be further described below in conjunction with drawings and Examples.The present invention is used for the translation of internet information, especially the multilingual simultaneous translation of standard text type information unit.Excessive in order to solve in the internet web page multilingual translation process the unlikely variation of meaning, the present invention proposes one is core with a kind of language, simultaneously the standardization text structure is separated into the system that framework and content are translated respectively.Framework separates with content respectively that translation method describes in detail in another related invention, and the present invention is primarily aimed at the translation system when surpassing two languages, and major programme is to be core with a kind of language, such as Chinese.The translation of other Languages all is the axle center with Chinese.On system architecture, divide following two parts to each language: a) frame part, can be stored in text, b) content part is stored in database.Such as in the e-commerce system to the introduction of product, with Property Name and content separate processes, such as, " commodity object ", " purchase mode " or the like keyword belongs to Property Name, is framework, and its property content, as " whole world ", " direct deal " etc. is variable content, in case in have change, replaced, promptly can produce a new product introduction.
The frame part of other language is with the translation that is as the criterion of Chinese frame part, content part is dynamic, the one, in the database of Chinese information processing system correspondence form in have increase and decrease change, the translation subsystem that all the other each language is had a correspondence, access information in the Chinese contents table, its appropriate translation one by one places the contents table of this language correspondence of database again.
As shown in Figure 1: the solution of the present invention is with a kind of language, as Chinese, is the axle center, and other language are benchmark, framework radially with it.The present invention is a kind of internet multilingual simultaneous translation method based on single kernel, to internet web page from a kind of language to multilingual translation, especially for the webpage that the structural normalization content structure is arranged, as the introduction of electronic information website to product and member, concrete steps are as follows:
1, selecting the language of information main source is kernel language.Such as source place (being generally producer, the dealer etc.) language of main information in the website, perhaps the employed language of crowd is mainly browsed in the website.
2, information source structural normalization content is analyzed, be separated into frame part and content part, the Property Name of frame part such as web page contents unit, the frame content part is as the content of web page contents unit;
For example the text of the webpage of one group of norm structure wherein one page comprise following content:
Trade name knitted underwear crew neck lovers suit
The whole world, North America, area, Hong Kong, Macao and Taiwan, commodity object continent
The direct deal of purchase mode
So-and-so company of supply of material businessman
Commercial specification
Shandong, area, Zibo
So-and-so road of place Shandong Zibo so-and-so number
Issuing time 2009-11-9
Then the framework text comprises: " trade name ", " commodity object ", " purchase mode ", " supply of material businessman ", " commercial specification ", " area ", " place ", " issuing time ".Content text comprises: " knitted underwear crew neck lovers suit ", " Hong Kong, Macao and Taiwan, continent area North America the whole world ", " direct deal ", " so-and-so company ", " Shandong, Zibo ", " Shandong Zibo so-and-so road so-and-so number ", " 2009-11-9 ".
3, to kernel language, frame part is stored in text A1, and content part is stored in and shows B1 in the database; Then A1 preserves following content:
" trade name
The commodity object
The purchase mode
Supply of material businessman
Commercial specification
The area
The place
Issuing time "
Table B1 comprises the item that is used to fill in the content text of underframe: trade name, commodity object, purchase mode, supply of material businessman, commercial specification, area, place, issuing time.The form of text A1 can define voluntarily, can enough written in software goes into and reads wherein certain content.
4, to other language, set up text and table with spline structure, in text A2, content text is stored in and shows B2 in the database as second kind of language framework text storage, and surplus person analogizes, and sets up A3, B3 etc.The corresponding record of B1, B2, B3 has same sequence number, connects correspondence for A1, A2, A3 by database and obtains record.
5, A2, A3 etc. are formed by the disposable translation of A1.
6, to website newly-added information record (being content text), this content text is stored in the table B1 of kernel language correspondence earlier, then, with the content of table among the B1 through corresponding automatic translation system or human expert by can connection table B1 and its appropriate translation result can be write the user interface tool of table B2, B3 etc., finish the translation of each respective column in the database table.
The content of table B2, B3, be to produce according to the record among the B1 is in time corresponding, can be mechanical translation, such as the common used method of looking up the dictionary, in advance dictionary for translation is placed database or other attachable electronical records, or the user interface of connection table B1 and B2, B3, value from B1 are arranged, by human translation, be stored in again among B2, the B3.B1, B2, B3 record unique sequences number to each.Sequence number is the generation of progressively increasing automatically in B1, among B2, the B3 be to duplicate gained when content is done its appropriate translation in according to first table.
7, text A1, A2, A3 have the database connection mechanism, can get the record of same sequence number from described table B1, B2, B3.Database connection mechanism among the text A1 is got a record from B1, combine with the webpage format framework, just forms former web page text; Database connection mechanism among the text A2 is got a record from B2, combine with the webpage format framework, just forms the translation webpage, equally, database connection mechanism among the text A3 is got a record from B3, combine with the webpage format framework, just forms corresponding translation webpage.
B2, B3 ... in record and the identical recordings among the B1 same sequence number is arranged, for A2, A3 ... corresponding value with A1, but the corresponding conversion corresponding web page formed.In each webpage, be provided with the webpage that hyperlink is used to go to corresponding other language.
Claims (10)
1. based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: a kind of language of selecting the website text is as kernel language, and the translation of the non-kernel language of website text is obtained by the kernel language translation; To the newly-increased text in website, kernel language then directly is translated into non-kernel language in this way, as not being kernel language, then is translated into kernel language earlier, kernel language is translated into non-kernel language again.
2. according to claim 1 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: the Webpage to the website is analyzed, page text is separated into framework text and content text, the framework text storage of kernel language is in first text, and the content text of kernel language is stored in database first table; Equally, the framework text storage that will plant non-kernel language to every kind of non-kernel language respectively is in non-kernel language text, and the content text of the non-kernel language of this kind is stored in the database in the non-kernel language table; Described non-kernel language text is formed by the disposable translation of first text, and the record in the described non-kernel language table is all independently formed by the kernel language translation in first table; First text connects from first table by database gets a record, combines with the webpage format framework, forms the webpage of kernel language; Equally, non-kernel language text connects by database gets a record from non-kernel language table, combine with the webpage format framework, forms the webpage of this non-kernel language.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: to website new content text, kernel language in this way, then the new content text is deposited in first table, as not being kernel language, then be to deposit in behind the kernel language in first table with the new content text translation; Again new content in first table is translated as non-kernel language respectively, deposits in the corresponding non-kernel language table.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: the corresponding record of first table and each non-kernel language table has same sequence number, is connected correspondence with non-kernel language text by database for first text and obtains record.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: corresponding one by one on described first text and the non-kernel language text structure.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: described first text and non-kernel language text all have the database connection mechanism, can get the record of same sequence number from described first table and non-kernel language table.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: described first table is the pre-designed table of lane database with non-kernel language table, has same structure, and each is recorded unique sequences number.
As described in the claim 7 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: described sequence number is the generation of progressively increasing automatically in first table, in the non-kernel language table is to duplicate gained when content is done its appropriate translation in according to first table.
As described in the claim 7 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: have link to go to the corresponding page or leaf of other language in the webpage that generates by table with same sequence number.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: as the framework text, the particular content of content element is as content text with the Property Name of content element in the page text.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010271758 CN101916247A (en) | 2010-09-03 | 2010-09-03 | Internet multilingual simultaneous translation method based on single kernel language |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010271758 CN101916247A (en) | 2010-09-03 | 2010-09-03 | Internet multilingual simultaneous translation method based on single kernel language |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101916247A true CN101916247A (en) | 2010-12-15 |
Family
ID=43323761
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 201010271758 Pending CN101916247A (en) | 2010-09-03 | 2010-09-03 | Internet multilingual simultaneous translation method based on single kernel language |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101916247A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102364437A (en) * | 2011-09-22 | 2012-02-29 | 厦门游家网络有限公司 | Multi-language site development system and implementation method thereof |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1295292A (en) * | 1999-11-05 | 2001-05-16 | 国际商业机器公司 | Method and system for multi-language wide world web service device thereof |
CN101023425A (en) * | 2004-06-07 | 2007-08-22 | 株式会社日本英柏斯 | WEB page translation device and WEB page translation method |
CN101615181A (en) * | 2008-06-27 | 2009-12-30 | 国际商业机器公司 | Create the system and method for internationalization network application |
-
2010
- 2010-09-03 CN CN 201010271758 patent/CN101916247A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1295292A (en) * | 1999-11-05 | 2001-05-16 | 国际商业机器公司 | Method and system for multi-language wide world web service device thereof |
CN101023425A (en) * | 2004-06-07 | 2007-08-22 | 株式会社日本英柏斯 | WEB page translation device and WEB page translation method |
CN101615181A (en) * | 2008-06-27 | 2009-12-30 | 国际商业机器公司 | Create the system and method for internationalization network application |
Non-Patent Citations (1)
Title |
---|
《语言文字应用》 19970515 鲁川 中国实施UNL工程 第42页第14、18-20、25-33页 1-10 , 第2期 2 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102364437A (en) * | 2011-09-22 | 2012-02-29 | 厦门游家网络有限公司 | Multi-language site development system and implementation method thereof |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102640151B (en) | Transformed data recording method and system | |
CN101833581B (en) | A kind of SEO Web Hosting implementation method of search engine optimization and system | |
US20130013616A1 (en) | Systems and Methods for Natural Language Searching of Structured Data | |
Graner et al. | Arab ko lahure: Looking at Nepali labour migrants to Arabian countries | |
CN103268336A (en) | Fast data and big data combined data processing method and system | |
CN110188107A (en) | A kind of method and device of the Extracting Information from table | |
CN102207948A (en) | Method for generating incident statement sentence material base | |
CN107122434A (en) | A kind of method and system that reconciliation file is imported to database | |
CN106202061A (en) | Chinese-English illiteracy towards cross-border e-commerce platform is hidden and is tieed up multi-lingual machine translation system and method | |
CN108563730A (en) | A kind of cold and hot data automatic switching method, device, electronic equipment and storage medium | |
CN105677915A (en) | Distributed service data access method based on engine | |
CN101916248A (en) | Method for translating internet webpage | |
CN104123376B (en) | A kind of intelligent text collecting method and system based on row template | |
CN101916247A (en) | Internet multilingual simultaneous translation method based on single kernel language | |
CN107480235A (en) | A kind of database framework of data platform | |
TW201227587A (en) | System and method for automatically and synchronously updating transaction interest point data | |
Dody Firmansyah et al. | Sales Information System Using Web for Small Business (Case Study: CV. Tanaka Service) | |
CN102135954A (en) | Method for describing bill conversion relationships in service management software based on extensive markup language (XML) | |
CN102446206B (en) | A kind of cross-platform switch and method of three-dimensional data | |
CN110389988A (en) | A kind of the user data processing method and system of real-time high-efficiency | |
CN103455334A (en) | Invoicing system based on PHP-MySQL technology | |
Byrum | IFLA’s ISBD Progamme: Purpose, Process, and Prospects | |
CN109783616A (en) | A kind of text subject extracting method, system and storage medium | |
Xu et al. | Observed probability measurement for urbanization development level with errrors-in-variables observation | |
Van Der Lans | Data replication for enabling operational bi |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20101215 |