CN101916247A - Internet multilingual simultaneous translation method based on single kernel language - Google Patents

Internet multilingual simultaneous translation method based on single kernel language Download PDF

Info

Publication number
CN101916247A
CN101916247A CN 201010271758 CN201010271758A CN101916247A CN 101916247 A CN101916247 A CN 101916247A CN 201010271758 CN201010271758 CN 201010271758 CN 201010271758 A CN201010271758 A CN 201010271758A CN 101916247 A CN101916247 A CN 101916247A
Authority
CN
China
Prior art keywords
kernel language
text
language
kernel
internet
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 201010271758
Other languages
Chinese (zh)
Inventor
俞晓华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WUXI NUOBAO TECHNOLOGY DEVELOPMENT Co Ltd
Original Assignee
WUXI NUOBAO TECHNOLOGY DEVELOPMENT Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WUXI NUOBAO TECHNOLOGY DEVELOPMENT Co Ltd filed Critical WUXI NUOBAO TECHNOLOGY DEVELOPMENT Co Ltd
Priority to CN 201010271758 priority Critical patent/CN101916247A/en
Publication of CN101916247A publication Critical patent/CN101916247A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention discloses an Internet multilingual simultaneous translation method which comprises the following steps of: selecting one kind of language of website texts as kernel language, and translating non-kernel language of the website texts from the kernel language; and for newly added website texts, if the newly added website texts are the kernel language, directly translating the newly added website texts into the non-kernel language, and if not, firstly translating into the kernel language and then translating into the non-kernel language. The translation method provided by the invention avoids the defect of translation meaning distortion caused by different word orders of different languages when long sentences are translated; meanwhile, the framework part is only translated once, thereby saving partial translation processing time and improving the efficiency. Importantly, one kernel language is taken as a reference for multiple languages, thereby avoiding the generation of a new error on the basis of an error generated in the translation transferring process.

Description

Internet multilingual simultaneous translation method based on single kernel language
Technical field
The present invention relates to a kind of method for translating internet webpage, be applied to Internet of Things Information technology field, internet.
Background technology
Existing translating internet webpage technology, such as the multilingual translation system of Google, this system can to most language in the world in twos between intertranslation.The web page translation system of Google has obtained using widely in current Internet circles, is mainly used in to paste character translation and the translation of webpage entire chapter.The annexation of its each several part is more directly perceived, and essential part has three parts.First is a blank box, and pasting to write for the user needs the translation content, or web page address; Second portion is a background process, perhaps network address webpage full text translation in the user is pasted; Third part is the display part, shows the translation result that is transferred by second portion.Google's translation system is simple and practical, and its weak point is content not to be added the translation of differentiation monoblock return, and to the translation than the long article field, because each languages word order custom is different, looks like through mixed and disorderly stack of regular meeting or meaning distortion.Particularly for the webpage that the standardization content structure is arranged, as the introduction of e-commerce website to product, many webpages all are a kind of framed structures, and the content of framework (Frame) is constant usually, and concrete content (Content) is becoming, and at this moment can produce to repeat translation.And the A language is translated back again through translating the language to B by the A language by Google's web page translation system after tested, changes, even ambiguity is arranged.
Summary of the invention
The objective of the invention is to overcome the deficiencies in the prior art, solve in the internet web page multilingual translation process from being in harmony conforming problem, a kind of internet multilingual simultaneous translation method based on single kernel language is provided, even if, also can guarantee and the original meaning consistance multilingual transmission translation through polyglot.
According to technical scheme provided by the invention, described internet multilingual simultaneous translation method based on single kernel language is: a kind of language of selecting the website text is as kernel language, and the translation of the non-kernel language of website text is obtained by the kernel language translation; To the newly-increased text in website, kernel language then directly is translated into non-kernel language in this way, as not being kernel language, then is translated into kernel language earlier, kernel language is translated into non-kernel language again.
Concrete steps are: the Webpage to the website is analyzed, and page text is separated into framework text and content text, and the framework text storage of kernel language is in first text, and the content text of kernel language is stored in database first table; Equally, the framework text storage that will plant non-kernel language to every kind of non-kernel language respectively is in non-kernel language text, and the content text of the non-kernel language of this kind is stored in the database in the non-kernel language table; Described non-kernel language text is formed by the disposable translation of first text, and the record in the described non-kernel language table is all independently formed by the kernel language translation in first table; First text connects from first table by database gets a record, combines with the webpage format framework, forms the webpage of kernel language; Equally, non-kernel language text connects by database gets a record from non-kernel language table, combine with the webpage format framework, forms the webpage of this non-kernel language.
To website new content text, kernel language then deposits the new content text in first table in this way, as not being kernel language, then is to deposit in behind the kernel language in first table the new content text translation; Again new content in first table is translated as non-kernel language respectively, deposits in the corresponding non-kernel language table.
The corresponding record of first table and each non-kernel language table has same sequence number, is connected correspondence with non-kernel language text by database for first text and obtains record.
Corresponding one by one on described first text and the non-kernel language text structure.
Described first text and non-kernel language text all have the database connection mechanism, can get the record of same sequence number from described first table and non-kernel language table.
Described first table is the pre-designed table of lane database with non-kernel language table, has same structure, and each is recorded unique sequences number.Described sequence number is the generation of progressively increasing automatically in first table, in the non-kernel language table be to duplicate gained when content is done its appropriate translation in according to first table.
There is link to go to the corresponding page or leaf of other language in the webpage by table generation with same sequence number.
When split-frame text and content text, as the framework text, the particular content of content element is as content text with the Property Name of content element in the page text.
Advantage of the present invention is: owing to be the normalized form content of blocking in the standard text, shorter than full text translation, also more easily keep original meaning, again in conjunction with the pretranslation framework, the whole web results of gained is littler than the distortion of the whole paginal translation meaning, what no less important was that the present invention proposes is the scheme in axle center with a kind of language (as Chinese), has avoided bigger distortion after the multilingual chain type translation.
Description of drawings
Fig. 1 is a principle of the invention synoptic diagram.
Embodiment
The invention will be further described below in conjunction with drawings and Examples.The present invention is used for the translation of internet information, especially the multilingual simultaneous translation of standard text type information unit.Excessive in order to solve in the internet web page multilingual translation process the unlikely variation of meaning, the present invention proposes one is core with a kind of language, simultaneously the standardization text structure is separated into the system that framework and content are translated respectively.Framework separates with content respectively that translation method describes in detail in another related invention, and the present invention is primarily aimed at the translation system when surpassing two languages, and major programme is to be core with a kind of language, such as Chinese.The translation of other Languages all is the axle center with Chinese.On system architecture, divide following two parts to each language: a) frame part, can be stored in text, b) content part is stored in database.Such as in the e-commerce system to the introduction of product, with Property Name and content separate processes, such as, " commodity object ", " purchase mode " or the like keyword belongs to Property Name, is framework, and its property content, as " whole world ", " direct deal " etc. is variable content, in case in have change, replaced, promptly can produce a new product introduction.
The frame part of other language is with the translation that is as the criterion of Chinese frame part, content part is dynamic, the one, in the database of Chinese information processing system correspondence form in have increase and decrease change, the translation subsystem that all the other each language is had a correspondence, access information in the Chinese contents table, its appropriate translation one by one places the contents table of this language correspondence of database again.
As shown in Figure 1: the solution of the present invention is with a kind of language, as Chinese, is the axle center, and other language are benchmark, framework radially with it.The present invention is a kind of internet multilingual simultaneous translation method based on single kernel, to internet web page from a kind of language to multilingual translation, especially for the webpage that the structural normalization content structure is arranged, as the introduction of electronic information website to product and member, concrete steps are as follows:
1, selecting the language of information main source is kernel language.Such as source place (being generally producer, the dealer etc.) language of main information in the website, perhaps the employed language of crowd is mainly browsed in the website.
2, information source structural normalization content is analyzed, be separated into frame part and content part, the Property Name of frame part such as web page contents unit, the frame content part is as the content of web page contents unit;
For example the text of the webpage of one group of norm structure wherein one page comprise following content:
Trade name knitted underwear crew neck lovers suit
The whole world, North America, area, Hong Kong, Macao and Taiwan, commodity object continent
The direct deal of purchase mode
So-and-so company of supply of material businessman
Commercial specification
Shandong, area, Zibo
So-and-so road of place Shandong Zibo so-and-so number
Issuing time 2009-11-9
Then the framework text comprises: " trade name ", " commodity object ", " purchase mode ", " supply of material businessman ", " commercial specification ", " area ", " place ", " issuing time ".Content text comprises: " knitted underwear crew neck lovers suit ", " Hong Kong, Macao and Taiwan, continent area North America the whole world ", " direct deal ", " so-and-so company ", " Shandong, Zibo ", " Shandong Zibo so-and-so road so-and-so number ", " 2009-11-9 ".
3, to kernel language, frame part is stored in text A1, and content part is stored in and shows B1 in the database; Then A1 preserves following content:
" trade name
The commodity object
The purchase mode
Supply of material businessman
Commercial specification
The area
The place
Issuing time "
Table B1 comprises the item that is used to fill in the content text of underframe: trade name, commodity object, purchase mode, supply of material businessman, commercial specification, area, place, issuing time.The form of text A1 can define voluntarily, can enough written in software goes into and reads wherein certain content.
4, to other language, set up text and table with spline structure, in text A2, content text is stored in and shows B2 in the database as second kind of language framework text storage, and surplus person analogizes, and sets up A3, B3 etc.The corresponding record of B1, B2, B3 has same sequence number, connects correspondence for A1, A2, A3 by database and obtains record.
5, A2, A3 etc. are formed by the disposable translation of A1.
6, to website newly-added information record (being content text), this content text is stored in the table B1 of kernel language correspondence earlier, then, with the content of table among the B1 through corresponding automatic translation system or human expert by can connection table B1 and its appropriate translation result can be write the user interface tool of table B2, B3 etc., finish the translation of each respective column in the database table.
The content of table B2, B3, be to produce according to the record among the B1 is in time corresponding, can be mechanical translation, such as the common used method of looking up the dictionary, in advance dictionary for translation is placed database or other attachable electronical records, or the user interface of connection table B1 and B2, B3, value from B1 are arranged, by human translation, be stored in again among B2, the B3.B1, B2, B3 record unique sequences number to each.Sequence number is the generation of progressively increasing automatically in B1, among B2, the B3 be to duplicate gained when content is done its appropriate translation in according to first table.
7, text A1, A2, A3 have the database connection mechanism, can get the record of same sequence number from described table B1, B2, B3.Database connection mechanism among the text A1 is got a record from B1, combine with the webpage format framework, just forms former web page text; Database connection mechanism among the text A2 is got a record from B2, combine with the webpage format framework, just forms the translation webpage, equally, database connection mechanism among the text A3 is got a record from B3, combine with the webpage format framework, just forms corresponding translation webpage.
B2, B3 ... in record and the identical recordings among the B1 same sequence number is arranged, for A2, A3 ... corresponding value with A1, but the corresponding conversion corresponding web page formed.In each webpage, be provided with the webpage that hyperlink is used to go to corresponding other language.

Claims (10)

1. based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: a kind of language of selecting the website text is as kernel language, and the translation of the non-kernel language of website text is obtained by the kernel language translation; To the newly-increased text in website, kernel language then directly is translated into non-kernel language in this way, as not being kernel language, then is translated into kernel language earlier, kernel language is translated into non-kernel language again.
2. according to claim 1 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: the Webpage to the website is analyzed, page text is separated into framework text and content text, the framework text storage of kernel language is in first text, and the content text of kernel language is stored in database first table; Equally, the framework text storage that will plant non-kernel language to every kind of non-kernel language respectively is in non-kernel language text, and the content text of the non-kernel language of this kind is stored in the database in the non-kernel language table; Described non-kernel language text is formed by the disposable translation of first text, and the record in the described non-kernel language table is all independently formed by the kernel language translation in first table; First text connects from first table by database gets a record, combines with the webpage format framework, forms the webpage of kernel language; Equally, non-kernel language text connects by database gets a record from non-kernel language table, combine with the webpage format framework, forms the webpage of this non-kernel language.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: to website new content text, kernel language in this way, then the new content text is deposited in first table, as not being kernel language, then be to deposit in behind the kernel language in first table with the new content text translation; Again new content in first table is translated as non-kernel language respectively, deposits in the corresponding non-kernel language table.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: the corresponding record of first table and each non-kernel language table has same sequence number, is connected correspondence with non-kernel language text by database for first text and obtains record.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: corresponding one by one on described first text and the non-kernel language text structure.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: described first text and non-kernel language text all have the database connection mechanism, can get the record of same sequence number from described first table and non-kernel language table.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: described first table is the pre-designed table of lane database with non-kernel language table, has same structure, and each is recorded unique sequences number.
As described in the claim 7 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: described sequence number is the generation of progressively increasing automatically in first table, in the non-kernel language table is to duplicate gained when content is done its appropriate translation in according to first table.
As described in the claim 7 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: have link to go to the corresponding page or leaf of other language in the webpage that generates by table with same sequence number.
As described in the claim 2 based on the internet multilingual simultaneous translation method of single kernel language, it is characterized in that: as the framework text, the particular content of content element is as content text with the Property Name of content element in the page text.
CN 201010271758 2010-09-03 2010-09-03 Internet multilingual simultaneous translation method based on single kernel language Pending CN101916247A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201010271758 CN101916247A (en) 2010-09-03 2010-09-03 Internet multilingual simultaneous translation method based on single kernel language

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201010271758 CN101916247A (en) 2010-09-03 2010-09-03 Internet multilingual simultaneous translation method based on single kernel language

Publications (1)

Publication Number Publication Date
CN101916247A true CN101916247A (en) 2010-12-15

Family

ID=43323761

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201010271758 Pending CN101916247A (en) 2010-09-03 2010-09-03 Internet multilingual simultaneous translation method based on single kernel language

Country Status (1)

Country Link
CN (1) CN101916247A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102364437A (en) * 2011-09-22 2012-02-29 厦门游家网络有限公司 Multi-language site development system and implementation method thereof

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1295292A (en) * 1999-11-05 2001-05-16 国际商业机器公司 Method and system for multi-language wide world web service device thereof
CN101023425A (en) * 2004-06-07 2007-08-22 株式会社日本英柏斯 WEB page translation device and WEB page translation method
CN101615181A (en) * 2008-06-27 2009-12-30 国际商业机器公司 Create the system and method for internationalization network application

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1295292A (en) * 1999-11-05 2001-05-16 国际商业机器公司 Method and system for multi-language wide world web service device thereof
CN101023425A (en) * 2004-06-07 2007-08-22 株式会社日本英柏斯 WEB page translation device and WEB page translation method
CN101615181A (en) * 2008-06-27 2009-12-30 国际商业机器公司 Create the system and method for internationalization network application

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
《语言文字应用》 19970515 鲁川 中国实施UNL工程 第42页第14、18-20、25-33页 1-10 , 第2期 2 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102364437A (en) * 2011-09-22 2012-02-29 厦门游家网络有限公司 Multi-language site development system and implementation method thereof

Similar Documents

Publication Publication Date Title
CN102640151B (en) Transformed data recording method and system
CN101833581B (en) A kind of SEO Web Hosting implementation method of search engine optimization and system
US20130013616A1 (en) Systems and Methods for Natural Language Searching of Structured Data
Graner et al. Arab ko lahure: Looking at Nepali labour migrants to Arabian countries
CN103268336A (en) Fast data and big data combined data processing method and system
CN110188107A (en) A kind of method and device of the Extracting Information from table
CN102207948A (en) Method for generating incident statement sentence material base
CN107122434A (en) A kind of method and system that reconciliation file is imported to database
CN106202061A (en) Chinese-English illiteracy towards cross-border e-commerce platform is hidden and is tieed up multi-lingual machine translation system and method
CN108563730A (en) A kind of cold and hot data automatic switching method, device, electronic equipment and storage medium
CN105677915A (en) Distributed service data access method based on engine
CN101916248A (en) Method for translating internet webpage
CN104123376B (en) A kind of intelligent text collecting method and system based on row template
CN101916247A (en) Internet multilingual simultaneous translation method based on single kernel language
CN107480235A (en) A kind of database framework of data platform
TW201227587A (en) System and method for automatically and synchronously updating transaction interest point data
Dody Firmansyah et al. Sales Information System Using Web for Small Business (Case Study: CV. Tanaka Service)
CN102135954A (en) Method for describing bill conversion relationships in service management software based on extensive markup language (XML)
CN102446206B (en) A kind of cross-platform switch and method of three-dimensional data
CN110389988A (en) A kind of the user data processing method and system of real-time high-efficiency
CN103455334A (en) Invoicing system based on PHP-MySQL technology
Byrum IFLA’s ISBD Progamme: Purpose, Process, and Prospects
CN109783616A (en) A kind of text subject extracting method, system and storage medium
Xu et al. Observed probability measurement for urbanization development level with errrors-in-variables observation
Van Der Lans Data replication for enabling operational bi

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20101215