CN103885940A - Multilingual dictionary translation method based on network services - Google Patents
- Publication number
- CN103885940A (application CN201210553918.XA)
- Authority
- CN
- China
- Prior art keywords
- language
- word
- dictionary
- input
- strategy
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Machine Translation (AREA)
Abstract
The invention discloses a multilingual dictionary translation method based on network services. The method includes a first strategy and a second strategy. The first strategy determines the language of an input word from its Unicode encoding, searches all dictionaries that use that language as their input language, and combines the multiple lookup results into one result. The second strategy does not determine the language of the word; it searches all dictionaries and combines all successful lookup results into one result. The method offers high translation efficiency and a high degree of intelligence.
Description
Technical field
The present invention relates to language information processing technology, and in particular to a multilingual dictionary translation method based on network services.
Background art
In recent years, with the development of information technology, more and more information resources have been shared over networks, which has greatly improved the efficiency of resource use and made those resources convenient for a much wider range of users. Information resources for the minority languages of Xinjiang have also accumulated rapidly, but their network applications and shared resources still need to be developed vigorously. For example, several electronic dictionary software products have been released so far, and their translation directions largely cover the languages used in the region and internationally, but there is no network-based electronic dictionary. To fill this gap, a multilingual electronic dictionary based on WCF (Windows Communication Foundation) is proposed.
To give network users a convenient word lookup function, the electronic dictionary needs a simple structure, fast queries, and simple operation. Its main characteristics are as follows. It starts quickly and uses little memory. The dictionary structure and lookup are not tied to any specific language but follow a general structure: for multilingual word lookup, a separate dictionary file is built for each language, but all files share the same dictionary file structure and query method. The content is as rich as possible: in addition to a plain word translation, example sentences, synonyms, and other information are provided. The dictionary can be embedded in a WCF service with almost no modification, so that it can be shared over the network.
Summary of the invention
The object of the present invention is to provide a multilingual dictionary translation method based on network services that offers high translation efficiency and a high degree of intelligence.
This object is achieved as follows. The method distinguishes three lookup modes. Mode 1: a word is input together with the language it belongs to and the desired target language. Mode 2: a word is input, but neither the language it belongs to nor the desired target language is known. Mode 3: a word is input together with the language it belongs to, but the desired target language is not known. Lookup in Mode 1: the index part of each dictionary stores that dictionary's input language and target language, so the dictionary to search is selected from the multiple dictionaries according to the given input data, a binary search is performed, and the result is returned. Lookup in Mode 2: because neither the language of the word nor the target language is known, one of the following strategies is used. Strategy 1: determine the language of the input word from its Unicode encoding, search all dictionaries that use this language as their input language, and combine the multiple lookup results into one result. Strategy 2: without determining the language of the word, search all dictionaries and combine all successful lookup results into one result. Lookup in Mode 3: the language of the input word is given but the target language is not known, so a method similar to Strategy 1 of Mode 2 is used: all dictionaries whose input language is the language of the input word are searched, and all successful lookup results are combined into one result.
The features of the present invention are as follows:
1. Dictionary design
The electronic dictionary design consists of a data domain and an operation domain. The data domain is as follows:
Word entity: a word together with its meaning, type, form, example sentences, the meanings of those sentences, synonyms, and so on.
Word entity set: all the word entities contained in the dictionary.
The operation domain consists of the dictionary-building and query operations, as follows:
Obtaining word entities: dedicated data extraction and collation programs were designed for the existing data; they are not described here.
The dictionary structure consists of an index domain and an entity set. Once the data have been obtained according to the design, the entity set is sorted, the index is computed, and both are written to a specific file structure. The definition of this file structure depends on the design requirements, such as the security level and capacity requirements.
Many lookup methods exist, but since a dictionary is essentially never modified or deleted from once it has been built, binary search is a good fit. Therefore, when the dictionary is built, the entity set is first sorted in ascending order, with the word of each entity as the key.
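The sort-then-binary-search idea can be sketched as follows. This is a minimal illustration in Python, not the patent's implementation: the `WordEntity` field names, the helper names, and the sample entries are all assumptions made for the example.

```python
from bisect import bisect_left
from dataclasses import dataclass, field


@dataclass
class WordEntity:
    # Field names are illustrative; the source only lists the kinds of data.
    word: str
    meaning: str
    synonyms: list = field(default_factory=list)
    examples: list = field(default_factory=list)


def build_entity_set(entities):
    """Sort entities in ascending order of their key (the word)."""
    return sorted(entities, key=lambda e: e.word)


def find(entity_set, word):
    """Binary-search a sorted entity set; return the entity or None."""
    # A real dictionary would build the key list once, when it is loaded.
    keys = [e.word for e in entity_set]
    i = bisect_left(keys, word)
    if i < len(keys) and keys[i] == word:
        return entity_set[i]
    return None


# Hypothetical sample data, given in unsorted order on purpose.
entries = build_entity_set([
    WordEntity("water", "水"),
    WordEntity("apple", "苹果", examples=["An apple a day."]),
    WordEntity("book", "书"),
])
```

Because the entity set never changes after it is built, the sort cost is paid once, and each query then needs only a logarithmic number of comparisons.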
2. Multi-dictionary management
Naturally, lookups in several language directions may be needed, such as Uyghur-Chinese and Chinese-English. Each direction is treated as one dictionary, and a manager class is responsible for directing lookups to the corresponding dictionaries.
When multiple dictionaries exist, lookups fall into three modes:
Mode 1: a word is input together with the language it belongs to and the desired target language.
Mode 2: a word is input, but neither the language it belongs to nor the desired target language is known.
Mode 3: a word is input together with the language it belongs to, but the desired target language is not known.
Lookup in Mode 1: this is the simplest and most manageable case, because the index part of each dictionary stores that dictionary's input language and target language. According to the given input data, the dictionary to search is selected from the multiple dictionaries, a binary search is performed, and the result is returned.
Lookup in Mode 2: a word is input, but neither the language it belongs to nor the target language is known. Two strategies are available:
Strategy 1: determine the language of the input word from its Unicode encoding, search all dictionaries that use this language as their input language, and combine the multiple lookup results into one result.
Strategy 2: without determining the language of the word, search all dictionaries and combine all successful lookup results into one result.
In general, Strategy 2 is feasible when the number of dictionaries and the number of entities in each dictionary are not large, but it is not as good as Strategy 1.
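The two Mode 2 strategies can be sketched as follows. The source does not say how the Unicode judgment is made, so this sketch assumes a simple majority vote over Unicode block ranges; the script-to-language mapping, the language codes, and the sample dictionaries are hypothetical. A script does not always identify a language (the Arabic block, for instance, serves many languages), so the mapping only holds for a known, limited set of dictionaries.

```python
def detect_script(word):
    """Guess the script of a word from its code points (majority vote)."""
    counts = {}
    for ch in word:
        cp = ord(ch)
        if 0x0600 <= cp <= 0x06FF:            # Arabic block (used for Uyghur)
            name = "arabic"
        elif 0x4E00 <= cp <= 0x9FFF:          # CJK Unified Ideographs
            name = "cjk"
        elif ch.isascii() and ch.isalpha():   # Basic Latin letters
            name = "latin"
        else:
            continue                          # digits, punctuation, other scripts
        counts[name] = counts.get(name, 0) + 1
    return max(counts, key=counts.get) if counts else None


# Assumed mapping for a deployment with only these three input languages.
SCRIPT_TO_LANGUAGE = {"arabic": "ug", "cjk": "zh", "latin": "en"}

KITAB = "\u0643\u0649\u062a\u0627\u0628"  # Uyghur for "book", in Arabic script

# Hypothetical dictionaries keyed by (input language, target language).
DICTIONARIES = {
    ("en", "zh"): {"book": "书"},
    ("en", "ug"): {"book": KITAB},
    ("zh", "en"): {"书": "book"},
}


def lookup_strategy_1(word):
    """Detect the language first, then search only matching dictionaries."""
    lang = SCRIPT_TO_LANGUAGE.get(detect_script(word))
    return {pair: d[word] for pair, d in DICTIONARIES.items()
            if pair[0] == lang and word in d}


def lookup_strategy_2(word):
    """Search every dictionary and keep the successful lookups."""
    return {pair: d[word] for pair, d in DICTIONARIES.items() if word in d}
```

Strategy 1 skips every dictionary whose input language cannot match, which is why it scales better than Strategy 2 as the number of dictionaries grows.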
Lookup in Mode 3: the language of the input word is given, but the target language is not known. A method similar to Strategy 1 of Mode 2 is used: all dictionaries whose input language is the language of the input word are searched, and all successful lookup results are combined into one result.
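The three modes can be handled by one manager class, sketched below under stated assumptions: the class and method names are hypothetical, each dictionary is reduced to a plain mapping instead of a sorted file, and Mode 2 is shown with Strategy 2 (search everything) to keep the example short.

```python
class DictionaryManager:
    """Routes a query to the dictionaries that match the known languages."""

    def __init__(self):
        # (input language, target language) -> {word: translation}
        self._dicts = {}

    def register(self, source, target, entries):
        self._dicts[(source, target)] = dict(entries)

    def lookup(self, word, source=None, target=None):
        # Mode 1: source and target given, so at most one dictionary matches.
        # Mode 3: only source given, so every dictionary with that input
        #         language is searched.
        # Mode 2 (Strategy 2): nothing given, so every dictionary is searched.
        candidates = [
            pair for pair in self._dicts
            if (source is None or pair[0] == source)
            and (target is None or pair[1] == target)
        ]
        results = {}
        for pair in candidates:
            translation = self._dicts[pair].get(word)
            if translation is not None:      # keep successful lookups only
                results[pair] = translation
        return results


# Hypothetical sample dictionaries.
manager = DictionaryManager()
manager.register("en", "zh", {"book": "书"})
manager.register("en", "fr", {"book": "livre"})
manager.register("zh", "en", {"书": "book"})
```

Calling `manager.lookup("book", "en", "zh")` exercises Mode 1, `manager.lookup("book", source="en")` exercises Mode 3, and `manager.lookup("书")` exercises Mode 2.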
3. Implementing the multilingual electronic dictionary service in WCF
Put simply, the purpose of the electronic dictionary service in WCF is resource sharing: the resource is the electronic dictionary, and the sharing parties are the terminals connected to the service. A terminal only needs to connect to the service to use the service's resources as if they were local. The client application is therefore quite simple and runs fast, because the parts that are slow, memory-intensive, or computationally heavy run on the server as a service.
The main requirements are as follows:
1. The service is provided within a local area network (LAN). The customers of the translation-aid software are organizations, companies, and similar bodies that specialize in translation work, so the service only needs to serve their internal staff.
2. The service serves multiple users simultaneously.
3. For a word translation service, the response time must not exceed an interval that users find acceptable.
The present invention offers high translation efficiency and a high degree of intelligence.
Brief description of the drawings
The invention is further described below with reference to the accompanying drawings.
Fig. 1 is a structural diagram of the electronic dictionary of the present invention.
Fig. 2 is a flowchart of multi-dictionary management in the present invention.
Fig. 3 is a diagram of how the multi-dictionary translation service is provided on the WCF service framework in the present invention.
Detailed description of the embodiments
A multilingual dictionary translation method based on network services distinguishes three lookup modes. Mode 1: a word is input together with the language it belongs to and the desired target language. Mode 2: a word is input, but neither the language it belongs to nor the desired target language is known. Mode 3: a word is input together with the language it belongs to, but the desired target language is not known. Lookup in Mode 1: the index part of each dictionary stores that dictionary's input language and target language, so the dictionary to search is selected from the multiple dictionaries according to the given input data, a binary search is performed, and the result is returned. Lookup in Mode 2: because neither the language of the word nor the target language is known, one of the following strategies is used. Strategy 1: determine the language of the input word from its Unicode encoding, search all dictionaries that use this language as their input language, and combine the multiple lookup results into one result. Strategy 2: without determining the language of the word, search all dictionaries and combine all successful lookup results into one result. Lookup in Mode 3: the language of the input word is given but the target language is not known, so a method similar to Strategy 1 of Mode 2 is used: all dictionaries whose input language is the language of the input word are searched, and all successful lookup results are combined into one result.
As shown in Fig. 1, the dictionary consists mainly of an index area and an entity set area. The index area contains two items. The first is the dictionary file information, including the language information, the dictionary version, and so on. The second holds the start and end positions in the dictionary file of each entity in the entity set, so that any entity can be read using this information. The entity set area is a collection of entities, each of which holds a word together with its translation, synonyms, example sentences, and other information.
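One way to realize such a layout is sketched below. The source does not define the byte format, so everything concrete here is an assumption: a 4-byte length prefix, a JSON-encoded index area, JSON-encoded entity records, and positions stored relative to the start of the entity set area.

```python
import json
import struct


def write_dictionary(entities, source, target, version="1.0"):
    """Serialize entities into bytes: index area first, then entity area."""
    records = [json.dumps(e, ensure_ascii=False).encode("utf-8")
               for e in sorted(entities, key=lambda e: e["word"])]
    positions, offset = [], 0
    for rec in records:
        positions.append((offset, offset + len(rec)))  # (start, end)
        offset += len(rec)
    index = json.dumps({
        "source": source, "target": target, "version": version,
        "positions": positions,
    }).encode("utf-8")
    # The 4-byte little-endian prefix tells the reader where the index ends.
    return struct.pack("<I", len(index)) + index + b"".join(records)


def read_index(blob):
    """Return the parsed index and the offset where the entity area starts."""
    (index_len,) = struct.unpack_from("<I", blob, 0)
    index = json.loads(blob[4:4 + index_len].decode("utf-8"))
    return index, 4 + index_len


def read_entity(blob, index, base, i):
    """Read the i-th entity directly, using its stored start and end."""
    start, end = index["positions"][i]
    return json.loads(blob[base + start:base + end].decode("utf-8"))
```

Since the entities are written in sorted order, a binary search can run over entity numbers and call `read_entity` only for the records it actually probes, without loading the whole entity set area.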
As shown in Fig. 2, there are multiple dictionaries, but an input is not directed at any specific dictionary. When a word is input, the query mode and the language the word belongs to are considered first, and one or more corresponding dictionaries are selected. The query results are then obtained from the selected dictionaries one by one and output.
As shown in Fig. 3, multiple clients communicate with the service over the network. For example, when a client sends a word query request, the WCF service on the server accepts the request at its service endpoint, calls the multi-dictionary manager to obtain the query result, and sends the result back to the client that made the request.
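The request flow can be illustrated with a small, transport-independent handler. This is only a stand-in written in Python: it does not use WCF, and the JSON message format, the function names, and the sample data are assumptions. In a WCF deployment the equivalent logic would sit behind a service operation.

```python
import json


def lookup_all(word, source=None, target=None):
    """Stand-in for the multi-dictionary manager (hypothetical data)."""
    dictionaries = {
        ("en", "zh"): {"book": "书"},
        ("zh", "en"): {"书": "book"},
    }
    return [
        {"source": s, "target": t, "translation": d[word]}
        for (s, t), d in dictionaries.items()
        if word in d and source in (None, s) and target in (None, t)
    ]


def handle_request(request_json):
    """Parse one client request, query the manager, and build the reply."""
    try:
        request = json.loads(request_json)
        word = request["word"]
    except (ValueError, KeyError, TypeError):
        return json.dumps({"ok": False, "error": "bad request"})
    results = lookup_all(word, request.get("source"), request.get("target"))
    return json.dumps({"ok": True, "word": word, "results": results},
                      ensure_ascii=False)
```

Keeping the handler free of shared mutable state means several requests can be processed at the same time, which matches the requirement to serve multiple users simultaneously.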
Claims (1)
1. A multilingual dictionary translation method based on network services, wherein the method is as follows. Mode 1: a word is input together with the language it belongs to and the desired target language. Mode 2: a word is input, but neither the language it belongs to nor the desired target language is known. Mode 3: a word is input together with the language it belongs to, but the desired target language is not known. Lookup in Mode 1: the index part of each dictionary stores that dictionary's input language and target language, so the dictionary to search is selected from the multiple dictionaries according to the given input data, a binary search is performed, and the result is returned. Lookup in Mode 2: because neither the language of the word nor the target language is known, one of the following strategies is used. Strategy 1: determine the language of the input word from its Unicode encoding, search all dictionaries that use this language as their input language, and combine the multiple lookup results into one result. Strategy 2: without determining the language of the word, search all dictionaries and combine all successful lookup results into one result. Lookup in Mode 3: the language of the input word is given but the target language is not known, so a method similar to Strategy 1 of Mode 2 is used: all dictionaries whose input language is the language of the input word are searched, and all successful lookup results are combined into one result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210553918.XA CN103885940A (en) | 2012-12-19 | 2012-12-19 | Multilingual dictionary translation method based on network services |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210553918.XA CN103885940A (en) | 2012-12-19 | 2012-12-19 | Multilingual dictionary translation method based on network services |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103885940A (en) | 2014-06-25 |
Family
ID=50954835
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210553918.XA Pending CN103885940A (en) | 2012-12-19 | 2012-12-19 | Multilingual dictionary translation method based on network services |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103885940A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104537066A (en) * | 2014-12-30 | 2015-04-22 | 郑州市中业科技有限公司 | Near-synonym correlation method based on multi-language translation |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3356536B2 (en) * | 1994-04-13 | 2002-12-16 | 松下電器産業株式会社 | Machine translation equipment |
CN1428721A (en) * | 2001-12-27 | 2003-07-09 | 高庆狮 | Machine translation system based on semanteme and its method |
CN101339547A (en) * | 2007-07-03 | 2009-01-07 | 株式会社东芝 | Apparatus and method for machine translation |
US20090024599A1 (en) * | 2007-07-19 | 2009-01-22 | Giovanni Tata | Method for multi-lingual search and data mining |
CN102214167A (en) * | 2010-04-09 | 2011-10-12 | 倪劲松 | System, terminal and method for instant translation |
Non-Patent Citations (1)
Title |
---|
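赛依旦·阿不力米提: "基于WCF的维汉词典及机器翻译系统的设计与实现" (Design and Implementation of a WCF-Based Uyghur-Chinese Dictionary and Machine Translation System), 《中国优秀硕士学位论文全文数据库 信息科技辑》 (China Master's Theses Full-text Database, Information Science and Technology) * |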
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104537066A (en) * | 2014-12-30 | 2015-04-22 | 郑州市中业科技有限公司 | Near-synonym correlation method based on multi-language translation |
CN104537066B (en) * | 2014-12-30 | 2017-10-03 | 郑州中业科技股份有限公司 | Near synonym correlating method based on multilingual translation |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102831253B (en) | Distributed full-text retrieval system | |
CN104375992A (en) | Address matching method and device | |
CN101576929B (en) | Fast vocabulary entry prompting realization method | |
CN103902535A (en) | Method, device and system for obtaining associational word | |
CN106407201A (en) | Data processing method and apparatus | |
CN104820717A (en) | Massive small file storage and management method and system | |
CN101150489A (en) | Resource share method based on distributed hash table | |
CN105404693B (en) | A kind of service clustering method based on demand semanteme | |
CN103092943A (en) | Method of advertisement dispatch and advertisement dispatch server | |
CN103226597A (en) | Keyword advertisement matching method based on natural semantics | |
CN102262670A (en) | Cross-media information retrieval system and method based on mobile visual equipment | |
CN101329680B (en) | Large scale rapid matching method of sentence surface | |
CN103064842A (en) | Information subscription processing device and information subscription processing method | |
CN104915405A (en) | Microblog query expansion method based on multiple layers | |
Lopez et al. | Scaling up question-answering to linked data | |
CN109542930A (en) | A kind of data efficient search method based on ElasticSearch | |
CN100456293C (en) | Information fast searching device, client end, system and method | |
CN102156749B (en) | Anatomic search and judgment method, system and distributed server system for map sites | |
CN106156262A (en) | A kind of search information processing method and system | |
KR20120082620A (en) | Method for aligning ontology and ontology alignment system thereof | |
CN101859318A (en) | Method for establishment of service discovery tool based on service network | |
Poomagal et al. | K-means for search results clustering using url and tag contents | |
CN103885940A (en) | Multilingual dictionary translation method based on network services | |
Karan et al. | Frequently asked questions retrieval for Croatian based on semantic textual similarity | |
CN103514214B (en) | Data query method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
Application publication date: 20140625 |