CN103885940A - Multilingual dictionary translation method based on network services - Google Patents

Multilingual dictionary translation method based on network services Download PDF

Info

Publication number
CN103885940A
CN103885940A CN201210553918.XA CN201210553918A CN103885940A CN 103885940 A CN103885940 A CN 103885940A CN 201210553918 A CN201210553918 A CN 201210553918A CN 103885940 A CN103885940 A CN 103885940A
Authority
CN
China
Prior art keywords
language
word
dictionary
input
strategy
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210553918.XA
Other languages
Chinese (zh)
Inventor
塔拉甫·加盘
王天军
邹帅
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
XINJIANG INFORMATION INDUSTRY Co Ltd
Original Assignee
XINJIANG INFORMATION INDUSTRY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by XINJIANG INFORMATION INDUSTRY Co Ltd filed Critical XINJIANG INFORMATION INDUSTRY Co Ltd
Priority to CN201210553918.XA priority Critical patent/CN103885940A/en
Publication of CN103885940A publication Critical patent/CN103885940A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Machine Translation (AREA)

Abstract

The invention discloses a multilingual dictionary translation method based on network services. The method includes a first strategy and a second strategy, wherein the first strategy includes determining a linguistic type of an input word through Unicode, looking up all dictionaries which use the linguistic type as input language and combining numerous searched results to give a result, and the second strategy includes combining all successful searched results without determining the linguistic type of the word to give a result. The multilingual dictionary translation method based on the network services is high in translation efficiency and intelligence degree.

Description

The multilingual dictionary interpretation method of service Network Based
Technical field
The present invention relates to language information processing technology, particularly the multilingual dictionary interpretation method of service Network Based.
Background technology
In recent years, along with the increasing information resources of development of IT technology are shared on network, greatly promote the service efficiency of resource, provided convenience for more vast user simultaneously.At the also Rapid Accumulation of information resources of the minority language in Xinjiang, but the application on network, and shared resource also needs to carry forward vigorously.Such as: several e-dictionary software listings so far, translation direction substantially covers this area and uses in the world several language, but there is no network electronic dictionary.In order to supplement the blank of this respect, the multilingual e-dictionary based on WCF has proposed.For the word enquiring function that the network user provides convenience, need to design dictionary simple structure, inquiry velocity is fast, and operation is through fast e-dictionary.Main performance comprises: start soon, toggle speed, accounts for internal memory few, dictionary structure and searching not for certain specific language, takes general structure, when multilingual word enquiring, be only independently dictionary file of each language construct, but adopt identical dictionary file structure and querying method.Comprise abundant content as far as possible, only provide beyond simple word translation, example sentence is also provided, synonym etc. is information how as far as possible.In WCF service, embed easily, almost without any need for modification, realize on network and sharing.
Summary of the invention
The object of the present invention is to provide a kind of multilingual dictionary interpretation method of service Network Based, translation efficiency and intelligent degree are high.
The object of the present invention is achieved like this: a kind of multilingual dictionary interpretation method of service Network Based, pattern 1: input word simultaneously, gives the affiliated language of this word and the target language of expectation; Pattern 2: input word, but do not know language under this word and the target language of expectation; Mode 3: input word, only provide the affiliated language form of this word, but do not know the target language of expecting; The lookup method of pattern 1: store input language and the target language of this dictionary in each dictionary index part simultaneously, according to the input data that give, determine the dictionary of being searched in multiple dictionaries, carry out binary search, provide result; The lookup method of pattern 2: input word, but language and target language under this word are not all known, use following strategy: strategy 1: by the Unicode judgement of encoding, the language form of input word, then in all dictionaries using this language as input language, search, multiple lookup results are combined and provide result; Strategy 2: do not judge the language form of word, search in all dictionaries, combine all lookup results of succeeding, provide result; The lookup method of mode 3: the language form of input word gives, but target language is not known, at this time takes the similar method of strategy 1 of pattern 2, in all dictionaries that comprise input word language form, search, combine all successful lookup results, provide result.
Feature of the present invention is as follows:
1. dictionary design
E-dictionary design is made up of data field and operation domain, and wherein data field is as follows:
Word entities: refer to certain word, the meaning, type, form, example sentence, the sentence meaning, synonym etc.
Word entities collection: all word entities that comprised by dictionary form.
Operation domain, by setting up dictionary and query manipulation forms, operates as follows:
Obtaining of word entities: according to existing data, designed special data and extracted and collate program, do not introduced here.
Thesaurus structure is made up of index territory and entity set, obtains data later according to the thinking of design, and entity set is sorted, and computation index is also written in specific file structure.The definition of this file structure depends on the demand of design, such as safe coefficient, and capacity requirement etc.
Lookup method is many, but sets up substantially and do not revise according to dictionary one, the consideration of deletion, and it is more satisfactory carrying out binary search, while therefore setting up dictionary, the order sequence first entity set being increased progressively, wherein the word of entity is key word.
 
2. many dictionary managements
Very natural, may need to search multilingual inquiry, such as Uighur-Chinese, Chinese-English etc.Wherein each party is to as a dictionary, and all dictionaries are responsible for searching accordingly by administrative class.
Existing multiple dictionaries, at this time search three patterns of may classifying,
Pattern 1: input word simultaneously, gives the affiliated language of this word and the target language of expectation.
Pattern 2: input word, but do not know language under this word and the target language of expectation.
Mode 3: input word, only provide the affiliated language form of this word, but do not know the target language of expecting.
The lookup method of pattern 1: be the simplest, be easy to the situation of management, because store input language and the target language of this dictionary in each dictionary index part simultaneously.According to the input data that give, in multiple dictionaries, determine the dictionary of being searched, carry out binary search, provide result.
The lookup method of pattern 2: input word, the language under this word, and target language is not all known.At this time can select 2 kinds of strategies,
Strategy 1: by the Unicode judgement of encode, input the language form of word, then search in all dictionaries using this language as input language, multiple lookup results are combined and provide result.
Strategy 2: do not judge the language form of word, search in all dictionaries, combine all lookup results of succeeding, provide result.
A lot of in general dictionary quantity, when the physical quantities that each dictionary comprises is not a lot of after strategy 2 be feasible, but not as strategy 1.
The lookup method of mode 3: the language form of input word gives, but target language do not know, at this time takes the similar method of strategy 1 of pattern 2.In all dictionaries that comprise input word language form, search, combine all successful lookup results, provide result.
In 3.WCF, realize multilingual e-dictionary service
The object of saying simply the service of e-dictionary in WCF is sharing of resource, and resource is e-dictionary here, and shared side is the terminal that is connected to this service.Each terminal can only Connection Service just can be used and equally utilize Service Source in this locality, and client application is quite simple like this, and travelling speed is fast.Because wherein operation is slow, account for internal memory, the part that calculated amount is large is placed on server as service.
Main performance is as follows:
1. service is provided in LAN (Local Area Network).The client of supplementary translation software is the unit of the work of translating specially, company etc.Obviously only for personnel therein provide service.
2. for multiple users provide service simultaneously.
3. concerning word translation service, the reaction time must not exceed the receptible time interval of people.
Translation efficiency of the present invention and intelligent degree are high.
Accompanying drawing explanation
Below in conjunction with accompanying drawing, the invention will be further described.
Fig. 1 is e-dictionary thesaurus structure figure of the present invention.
Fig. 2 is the process flow diagram of the multiple dictionary management of the present invention.
Fig. 3 is the description figure that the translation service of many dictionaries is provided on WCF service framework of the present invention.
Embodiment
A multilingual dictionary interpretation method for service Network Based, pattern 1: input word simultaneously, gives the affiliated language of this word and the target language of expectation; Pattern 2: input word, but do not know language under this word and the target language of expectation; Mode 3: input word, only provide the affiliated language form of this word, but do not know the target language of expecting; The lookup method of pattern 1: store input language and the target language of this dictionary in each dictionary index part simultaneously, according to the input data that give, determine the dictionary of being searched in multiple dictionaries, carry out binary search, provide result; The lookup method of pattern 2: input word, but language and target language under this word are not all known, use following strategy: strategy 1: by the Unicode judgement of encoding, the language form of input word, then in all dictionaries using this language as input language, search, multiple lookup results are combined and provide result; Strategy 2: do not judge the language form of word, search in all dictionaries, combine all lookup results of succeeding, provide result; The lookup method of mode 3: the language form of input word gives, but target language is not known, at this time takes the similar method of strategy 1 of pattern 2, in all dictionaries that comprise input word language form, search, combine all successful lookup results, provide result.
As shown in Figure 1, dictionary is mainly made up of index area and entity set district.Wherein index area comprises that two think content.Section 1 is dictionary fileinfo, comprises language message, dictionary version etc.Second wants to comprise each entity in entity set beginning and end position in dictionary file, can read any one entity like this according to this information.Entity set district is an entity sets, and wherein each entity is word, the translation of words of this word, synonym, example sentence etc. information.
As shown in Figure 2, dictionary is multiple, but input is not for any specific dictionary.First the language that query pattern and word belong to considered in an input word simultaneously, selects one or more corresponding dictionary, then progressively in many dictionaries, obtains Query Result Output rusults.
As shown in Figure 3, multiple clients are carried out communication by the network media service of following.Such as: when wherein certain client sends certain word enquiring request, on server, WCF service structure is accepted this request from service point, then calls many dictionaries supervisory routine and obtains Query Result and send to the client who sends request.

Claims (1)

1. a multilingual dictionary interpretation method for service Network Based, its method is: this pattern 1: input word simultaneously, gives the affiliated language of this word and the target language of expectation; Pattern 2: input word, but do not know language under this word and the target language of expectation; Mode 3: input word, only provide the affiliated language form of this word, but do not know the target language of expecting; The lookup method of pattern 1: store input language and the target language of this dictionary in each dictionary index part simultaneously, according to the input data that give, determine the dictionary of being searched in multiple dictionaries, carry out binary search, provide result; The lookup method of pattern 2: input word, but language and target language under this word are not all known, use following strategy: strategy 1: by the Unicode judgement of encoding, the language form of input word, then in all dictionaries using this language as input language, search, multiple lookup results are combined and provide result; Strategy 2: do not judge the language form of word, search in all dictionaries, combine all lookup results of succeeding, provide result; The lookup method of mode 3: the language form of input word gives, but target language is not known, at this time takes the similar method of strategy 1 of pattern 2, in all dictionaries that comprise input word language form, search, combine all successful lookup results, provide result.
CN201210553918.XA 2012-12-19 2012-12-19 Multilingual dictionary translation method based on network services Pending CN103885940A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210553918.XA CN103885940A (en) 2012-12-19 2012-12-19 Multilingual dictionary translation method based on network services

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210553918.XA CN103885940A (en) 2012-12-19 2012-12-19 Multilingual dictionary translation method based on network services

Publications (1)

Publication Number Publication Date
CN103885940A true CN103885940A (en) 2014-06-25

Family

ID=50954835

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210553918.XA Pending CN103885940A (en) 2012-12-19 2012-12-19 Multilingual dictionary translation method based on network services

Country Status (1)

Country Link
CN (1) CN103885940A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104537066A (en) * 2014-12-30 2015-04-22 郑州市中业科技有限公司 Near-synonym correlation method based on multi-language translation

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3356536B2 (en) * 1994-04-13 2002-12-16 松下電器産業株式会社 Machine translation equipment
CN1428721A (en) * 2001-12-27 2003-07-09 高庆狮 Machine translation system based on semanteme and its method
CN101339547A (en) * 2007-07-03 2009-01-07 株式会社东芝 Apparatus and method for machine translation
US20090024599A1 (en) * 2007-07-19 2009-01-22 Giovanni Tata Method for multi-lingual search and data mining
CN102214167A (en) * 2010-04-09 2011-10-12 倪劲松 System, terminal and method for instant translation

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3356536B2 (en) * 1994-04-13 2002-12-16 松下電器産業株式会社 Machine translation equipment
CN1428721A (en) * 2001-12-27 2003-07-09 高庆狮 Machine translation system based on semanteme and its method
CN101339547A (en) * 2007-07-03 2009-01-07 株式会社东芝 Apparatus and method for machine translation
US20090024599A1 (en) * 2007-07-19 2009-01-22 Giovanni Tata Method for multi-lingual search and data mining
CN102214167A (en) * 2010-04-09 2011-10-12 倪劲松 System, terminal and method for instant translation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
赛依旦·阿不力米提: "基于WCF的维汉词典及机器翻译系统的设计与实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104537066A (en) * 2014-12-30 2015-04-22 郑州市中业科技有限公司 Near-synonym correlation method based on multi-language translation
CN104537066B (en) * 2014-12-30 2017-10-03 郑州中业科技股份有限公司 Near synonym correlating method based on multilingual translation

Similar Documents

Publication Publication Date Title
CN102831253B (en) Distributed full-text retrieval system
CN104375992A (en) Address matching method and device
CN101576929B (en) Fast vocabulary entry prompting realization method
CN103902535A (en) Method, device and system for obtaining associational word
CN106407201A (en) Data processing method and apparatus
CN104820717A (en) Massive small file storage and management method and system
CN101150489A (en) Resource share method based on distributed hash table
CN105404693B (en) A kind of service clustering method based on demand semanteme
CN103092943A (en) Method of advertisement dispatch and advertisement dispatch server
CN103226597A (en) Keyword advertisement matching method based on natural semantics
CN102262670A (en) Cross-media information retrieval system and method based on mobile visual equipment
CN101329680B (en) Large scale rapid matching method of sentence surface
CN103064842A (en) Information subscription processing device and information subscription processing method
CN104915405A (en) Microblog query expansion method based on multiple layers
Lopez et al. Scaling up question-answering to linked data
CN109542930A (en) A kind of data efficient search method based on ElasticSearch
CN100456293C (en) Information fast searching device, client end, system and method
CN102156749B (en) Anatomic search and judgment method, system and distributed server system for map sites
CN106156262A (en) A kind of search information processing method and system
KR20120082620A (en) Method for aligning ontology and ontology alignment system thereof
CN101859318A (en) Method for establishment of service discovery tool based on service network
Poomagal et al. K-means for search results clustering using url and tag contents
CN103885940A (en) Multilingual dictionary translation method based on network services
Karan et al. Frequently asked questions retrieval for Croatian based on semantic textual similarity
CN103514214B (en) Data query method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20140625