CN101425086A - Dictionary enquiry method and dictionary enquiry system based on network - Google Patents

Dictionary enquiry method and dictionary enquiry system based on network Download PDF

Info

Publication number
CN101425086A
CN101425086A CNA2008102224251A CN200810222425A CN101425086A CN 101425086 A CN101425086 A CN 101425086A CN A2008102224251 A CNA2008102224251 A CN A2008102224251A CN 200810222425 A CN200810222425 A CN 200810222425A CN 101425086 A CN101425086 A CN 101425086A
Authority
CN
China
Prior art keywords
word
translation
lexical
textual analysis
webserver
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2008102224251A
Other languages
Chinese (zh)
Inventor
周杨
李志恒
詹晓文
包塔
周枫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NET EASE YOUDAO INFORMATION TECHNOLOGY (BEIJING) Co Ltd
Netease Youdao Information Technology Beijing Co Ltd
Original Assignee
NET EASE YOUDAO INFORMATION TECHNOLOGY (BEIJING) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NET EASE YOUDAO INFORMATION TECHNOLOGY (BEIJING) Co Ltd filed Critical NET EASE YOUDAO INFORMATION TECHNOLOGY (BEIJING) Co Ltd
Priority to CNA2008102224251A priority Critical patent/CN101425086A/en
Publication of CN101425086A publication Critical patent/CN101425086A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention relates to a method for using a dictionary based on networks, comprising the following steps: acquiring a word at a position pointed by a mouse and a sentence containing the word by clients, sending the word and the sentence to a network server, splitting the word and the sentence by a network server, spreading a predetermined number of words to form optional inquiry words with the words at the position pointed the mouse as center, inquiring the version of each optional inquiry word in a built-in database and selecting the version of a long phrase in the optional inquiry words by the network server, returning the version to the client and displaying the version on the client. At the same time, the invention also relates to a system for using a dictionary based on networks. The invention can provide relatively accurate versions by only occupying a few hardware sources of a user computer.

Description

A kind of based on network dictionaries query method and dictionary enquiry system
Technical field
The present invention relates to the Web-Based Dictionary field, particularly relate to a kind of based on network dictionaries query method and dictionary enquiry system.
Background technology
Web-Based Dictionary is based on a kind of network application of Webpage search technology and machine learning techniques, and the built-in huge database of Web-Based Dictionary is translated, explained the word that obtains, and is convenient to user's reading comprehension.Existing Web-Based Dictionary is installed in client, during application, clicks the desktop sign of Web-Based Dictionary, the activating network dictionary.As user during certain word of mouse-pointing document or webpage, Web-Based Dictionary obtains the word of mouse-pointing position, travels through local database, searches the translation or the explanation of this word correspondence, and lookup result is presented at client, the user understands this word according to display result.
Consult Fig. 1, the built-in system structure of having showed existing Web-Based Dictionary, Web-Based Dictionary comprise get speech module 11, search module 12, display module 13 and local data base 14, get the word that speech module 11 is obtained mouse-pointing, and the word that obtains sent to search module 12; Search module 12 is searched this word in local data base 14 translation, and this translation is sent to display module 13; Display module 13 shows this translation.
In the existing network dictionary, there is local client in local data base 14, be in the employed computing machine of user, because database will be stored mass data information, data volume is huge, take subscriber computer more hard disk resource and memory source, the hardware resource of subscriber computer is had higher requirement.The subscriber computer factor receptor amasss the restriction with price, its hardware resource is limited after all, the database that is installed in subscriber computer can't be stored the multiple translation of a large amount of vocabulary as required, otherwise will cause the data of database amount too huge, is not suitable for subscriber computer.And use local data base also will cause can't the immediate updating up-to-date vocabulary or the up-to-date translation of old vocabulary.
Existing Web-Based Dictionary mostly is Chinese and English dictionary for translation, and the built-in database of dictionary often only comprises the data message of Chinese and English dictionary, and therefore, the translation result of demonstration also is basic, the direct implication of this word itself.For example, mouse-pointing word " the matrix ", Web-Based Dictionary can show that the Chinese translation of this word is " matrix ".But, sometimes its implication can be different under different linguistic context for same word, for example, " the matrix " should be translated into " hacker kingdom " in the name of having to certain film, thereby only be shown to the basic lexical or textual analysis of this word of user, may not this word best translation in the text.
Summary of the invention
Technical matters to be solved by this invention provides a kind of based on network dictionaries query method, the hardware resource to subscriber computer requires too high in the prior art to solve, and the not accurate enough problem of translation, this method only need take user's less hardware resource, and translation comparatively accurately can be provided.
The invention provides a kind ofly at based on network dictionaries query method, this method comprises: client is obtained the word of mouse-pointing position and is comprised the statement of this word, sends to the webserver; The webserver carries out participle with above-mentioned statement, is the center with the word of mouse-pointing position, and front and back are extended the word of default number respectively, form alternative query word; The webserver is inquired about the translation of each alternative query word in built-in database, and the translation of selecting in the above-mentioned alternative query word long phrase returns client as main Query Result, and client shows this translation.
Preferably, the network lexical or textual analysis translation of the basic lexical or textual analysis translation of the basic meaning correspondence that comprises of described translation and word amplification implication correspondence.
Preferably, webserver selection portion divides the network lexical or textual analysis translation of word to show in client.
Preferably, the webserver is inquired about before the translation of each alternative query word in built-in database, also comprises: the word that the webserver adopts the Hash table mode will have basic lexical or textual analysis is written into internal memory.
Preferably, the webserver is inquired about before the translation of each alternative query word in built-in database, also comprises: the word that the webserver adopts the mode of Hash table will have the network lexical or textual analysis is written into internal memory.
Preferably, also comprise: abandon word with basic lexical or textual analysis and network lexical or textual analysis.
The present invention also comprises a kind of based on network dictionary enquiry system, comprise client, described client comprises gets speech module and display module, and described system also comprises the webserver, and the described webserver comprises database and word-dividing mode, enquiry module, reaches and select module; The described speech module of getting, the statement that is used to obtain the word of mouse-pointing position and comprises this word sends to described word-dividing mode; Described word-dividing mode is used for above-mentioned statement is carried out participle, is the center with the word of mouse-pointing position, and front and back are extended the word of default number respectively, form alternative query word; Described enquiry module is used for the translation at described each alternative query word of database inquiry; Described selection module, be used for selecting above-mentioned alternative query word the translation of long phrase return described display module as main Query Result; Described display module is used to show this translation.
Preferably, described database comprises the basic lexical or textual analysis database of the basic lexical or textual analysis information of storage word and the network lexical or textual analysis database of storage word network lexical or textual analysis information.
Compared with prior art, the present invention has the following advantages:
The present invention is arranged on database in the webserver, and webserver hardware resource is powerful, can not be subjected to the restriction of client hardware resource, and the database abundant in content, that data volume is huge can be set as required, makes Query Result abundanter.At the webserver database is set, finish main query function by the webserver, client only need realize that word obtains the Presentation Function with Query Result, and this partial function only needs the less function system of data volume to finish, reducing to minimum to the hardware resource consumption of client.
The present invention chooses the corresponding translation of the phrase that comprises this word and grow as main Query Result, because the real meaning of word often depends on context of co-text, so this Query Result more approaches the legitimate reading that the user needs.
Description of drawings
The built-in system structural representation of Fig. 1 existing network dictionary;
Fig. 2 is a dictionaries query method process flow diagram of the present invention;
Fig. 3 is a dictionary enquiry system schematic of the present invention.
Embodiment
For above-mentioned purpose of the present invention, feature and advantage can be become apparent more, the present invention is further detailed explanation below in conjunction with the drawings and specific embodiments.
The present invention is arranged on database on the webserver, and near the word that client will be obtained mouse sends to the webserver, searches the translation of this word by the webserver, and translation is turned back to client shows.Like this, just can make full use of the powerful advantage of webserver hardware resource, the database abundant in content, that quantity of information is huge is set, and to the multiple translation that obtains word compare, relatively after, select best translation to return client, improve the precision of lexical translation.
The present invention can be applicable in any type of Web-Based Dictionary, comprises the Web-Based Dictionary of Chinese and English intertranslation, the Web-Based Dictionary of Sino-Russian intertranslation, and the Web-Based Dictionary of Great Britain and France's mutual translation, and single language dictionary are as Chinese dictionary, encyclopaedia dictionary etc.
Referring to Fig. 2, dictionaries query method of the present invention is shown, concrete steps are as follows.
Step S201, startup client, client is obtained the word of preparing translation.The user when foreign language document or webpage, need check the translation of certain word in browsing, can be with this word of mouse-pointing.Client is obtained the word of mouse-pointing position, as the mouse position character string, obtain simultaneously comprise this word statement as the original query character string.For example: mouse-pointing " I am Chinese " " in " on, the mouse position character string be " in ", the original query character string is " I am Chinese ".
In the word lexical or textual analysis, a word often has multiple interpretation method, and different translations is arranged, which kind of translation is the most suitable, depends on the context of co-text of this word, therefore, the statement that the present invention will comprise this word together obtains, so that during follow-up translation, can select appropriate translation according to this statement.
Step S202, client send to the webserver with original character string and the mouse position character string of obtaining, and the request webserver is inquired about the accurate translation of above-mentioned word.
Step S203, the webserver carry out pre-service to query requests.The webserver carries out participle to the original query character string that receives, each word behind the participle is organized into the participle tabulation, for example, be divided into behind " I am Chinese " participle " I, be, China, the people ", wherein " China " be comprise the mouse position character string " in " word, with " China " core word as the tabulation of this participle.
Step S204, be the center, in the participle tabulation, extend a certain number of word respectively before and after each participle, form alternative query terms, and will above-mentioned alternative query terms press the series arrangement from growing to lacking, form alternative Query List with the core word.For example, from core word " China ", extend 2 words forward, form " being China ", " I am a China " two alternative query terms, 2 words extend back, can only form " Chinese " alternative query terms, with above-mentioned each alternative query terms by series arrangement from grow lack, the alternative Query List of composition be " I am Chinese, I be Chinese, be Chinese, be China, Chinese, China ".
With the core word is the center, and the alternative query terms of composition links together the word up and down in core word and the statement, and this alternative query terms can better be represented the context of co-text of this core word, for the appropriate translation of follow-up selection is provided convenience.By each the alternative query terms of series arrangement from growing to lacking, be because relatively long alternative query terms can more comprehensively, better reflect the linguistic context of core word, before will being arranged in than long alternative query terms, make things convenient for the subsequent process translation of this alternative query terms of inquiry earlier.
Step S205, the webserver take out alternative query word successively from alternative Query List, in built-in database, inquire about the various translations of each alternative query word respectively, as main Query Result, the translation of each alternative query terms is as alternative Query Result with other with the translation of the longest alternative query terms correspondence.For example, with the translation " I am Chinese " of " I am Chinese " as main Query Result,
Step 206, the webserver return main Query Result and alternative Query Result to client, and client shows main Query Result on the top of Mouse Dialog Box, shows alternative Query Result in the bottom of dialog box.
Because the real meaning of word often depends on context of co-text, therefore the translation that will choose the phrase correspondence that comprises this word and grow makes Query Result more approach the legitimate reading that the user needs as main Query Result.Simultaneously, as alternative Query Result,, make things convenient for other translations of other phrase correspondences that comprise this word the user to find the result who oneself wants for the user provides more selection.
The present invention is arranged on database in the webserver, and webserver hardware resource is powerful, can not be subjected to the restriction of client hardware resource, the database abundant in content, that data volume is huge can be set as required, thereby make Query Result abundanter.Owing to database is set, finished main query function by the webserver, thereby client only need realize that word obtains the Presentation Function with Query Result at the webserver.Because these functions only need to handle the function system than small data quantity, therefore reducing to minimum to the hardware resource consumption of client.
The present invention finishes the inquiry of word by the webserver, and also because the webserver makes things convenient for person skilled to safeguard and upgrade, the convenient realization upgraded and querying method is improved database, keeps the ageing of word lexical or textual analysis.
In above-mentioned steps S205, the webserver is according to the translation of alternative Query List inquiry correlation word, but a lot of words not only have basic meaning, also have many amplification implications, existing Web-Based Dictionary can only be inquired about its translation under the basic meaning of word, can't inquire about the translation of its amplification under implication, be not easy to the user more deep understand this word.Database of the present invention is provided with basic lexical or textual analysis database and network lexical or textual analysis database respectively, in the basic lexical or textual analysis of basic lexical or textual analysis data base querying word, in the network lexical or textual analysis of network lexical or textual analysis data base querying word.Basic lexical or textual analysis of the present invention refers to alphabet and the corresponding Chinese lexical or textual analysis of having taken in the dictionary of the artificial writing of tradition thereof.Network lexical or textual analysis of the present invention is by carrying out data mining and text analyzing to billions of webpages, and the Chinese translation that obtains the alphabet that is present in network in a large number but does not take in as yet in the artificial writing dictionary is for user inquiring.
The present invention takes out wherein word successively at alternative Query List, speech with each taking-up, be called current speech, inquire about the basic lexical or textual analysis and the network lexical or textual analysis of this current speech respectively, if this word has basic lexical or textual analysis, this word is joined the basic lexical or textual analysis part of brief data, further inquire about the network lexical or textual analysis of this word, if this word has the network lexical or textual analysis, this word is also joined the network lexical or textual analysis part of brief data; If this word does not have basic lexical or textual analysis, the network lexical or textual analysis of progressive this word of inquiry has the network lexical or textual analysis as this word, and this word is joined the network lexical or textual analysis part of brief data, if this speech does not also have the network lexical or textual analysis, abandons this word.
The present invention will be the longest the basic lexical or textual analysis translation of alternative query terms correspondence as main Query Result, the selected part translation together is presented in the Mouse Dialog Box of client in the network lexical or textual analysis, and in Mouse Dialog Box, provide detailed query to identify, activate this sign, can obtain whole Query Results of this word.
For further improving inquiry velocity, the present invention is written into brief data when webserver initialization, adopts the basic lexical or textual analysis part graftabl of the mode of Hash table with brief data, is called basic hash; Adopt the network lexical or textual analysis part graftabl of Hash table mode, be referred to as the network hash brief data.During inquiry, the webserver is transferred basic hash and network hash fast in internal memory, inquire about, and this makes full use of the memory source of the webserver, improves inquiry velocity.
Based on the querying method of above-mentioned dictionary, the present invention also provides a kind of inquiry system of dictionary.Referring to Fig. 3, the Hermeneutical system of the meaning of a word is shown, comprise the client 31 and the webserver 32, client 31 comprises gets speech module 311 and display module 312, and the webserver 32 comprises database 321 and word-dividing mode 322, enquiry module 323, reaches and select module 324.
Get the statement that speech module 311 is obtained the word of mouse-pointing position and comprised this word, send to word-dividing mode 322.Get speech module 311 and the word of mouse-pointing position and the statement that comprises this word are sent to word-dividing mode 322 by network service.
Word-dividing mode 322 is extended a certain number of word respectively before and after each participle in the participle tabulation, form alternative query terms, and with above-mentioned alternative query terms by series arrangement from growing to lacking, form alternative Query List, word-dividing mode 322 should send to enquiry module 323 by alternative Query List.
Enquiry module 323 is inquired about the translation of each alternative query word in database 321.Database 321 comprises the basic lexical or textual analysis database of the basic lexical or textual analysis information of storage word and the network lexical or textual analysis database of storage word network lexical or textual analysis information.Enquiry module 323 is in the basic lexical or textual analysis of basic lexical or textual analysis data base querying word, in the network lexical or textual analysis of network lexical or textual analysis data base querying word.Enquiry module 323 sends to Query Result and selects module 324.
The basic lexical or textual analysis translation of selecting the longest alternative query terms correspondence of module 324 selections is as main Query Result, and the selected part translation together is presented in the Mouse Dialog Box of display module 312 in the network lexical or textual analysis.The detailed query sign is provided in the Mouse Dialog Box, activates this sign, can obtain whole Query Results of this word.
More than to a kind of based on network dictionaries query method provided by the present invention and dictionary enquiry system, be described in detail, used specific case herein principle of the present invention and embodiment are set forth, the explanation of above embodiment just is used for helping to understand method of the present invention and core concept thereof; Simultaneously, for one of ordinary skill in the art, according to thought of the present invention, the part that all can change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention.

Claims (10)

1, a kind of based on network dictionaries query method is characterized in that, this method comprises:
Client is obtained the word of mouse-pointing position and is comprised the statement of this word, sends to the webserver;
The webserver carries out participle with above-mentioned statement, is the center with the word of mouse-pointing position, and front and back are extended the word of default number respectively, form alternative query word;
The webserver is inquired about the translation of each alternative query word in built-in database, and the translation of selecting in the above-mentioned alternative query word long phrase returns client as main Query Result, and client shows this translation.
2, the method for claim 1 is characterized in that, the network lexical or textual analysis translation of the basic lexical or textual analysis translation of the basic meaning correspondence that described translation comprises and word amplification implication correspondence.
3, method as claimed in claim 2 is characterized in that, webserver selection portion divides the network lexical or textual analysis translation of word to show in client.
4, the method for claim 1 is characterized in that, the webserver is inquired about before the translation of each alternative query word in built-in database, also comprises:
The word that the webserver adopts the Hash table mode will have basic lexical or textual analysis is written into internal memory.
5, the method for claim 1 is characterized in that, the webserver is inquired about before the translation of each alternative query word in built-in database, also comprises:
The word that the webserver adopts the mode of Hash table will have the network lexical or textual analysis is written into internal memory.
6, as claim 4 or 5 described methods, it is characterized in that, also comprise: abandon word with basic lexical or textual analysis and network lexical or textual analysis.
7, a kind of based on network dictionary enquiry system, comprise client, described client comprises gets speech module and display module, it is characterized in that, described system also comprises the webserver, and the described webserver comprises database and word-dividing mode, enquiry module, reaches and select module;
The described speech module of getting, the statement that is used to obtain the word of mouse-pointing position and comprises this word sends to described word-dividing mode;
Described word-dividing mode is used for above-mentioned statement is carried out participle, is the center with the word of mouse-pointing position, and front and back are extended the word of default number respectively, form alternative query word;
Described enquiry module is used for the translation at described each alternative query word of database inquiry;
Described selection module, be used for selecting above-mentioned alternative query word the translation of long phrase return described display module as main Query Result;
Described display module is used to show this translation.
8, system as claimed in claim 7 is characterized in that, described database comprises the basic lexical or textual analysis database of the basic lexical or textual analysis information of storage word and the network lexical or textual analysis database of storage word network lexical or textual analysis information.
9, as claim 7 and or 8 described systems, it is characterized in that the described speech module of getting adopts network communication mode that the word of mouse-pointing position and the statement that comprises this word are sent to described word-dividing mode.
10, system as claimed in claim 7 is characterized in that, described display module also selection portion divides the network lexical or textual analysis translation of word to show in client.
CNA2008102224251A 2008-09-16 2008-09-16 Dictionary enquiry method and dictionary enquiry system based on network Pending CN101425086A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2008102224251A CN101425086A (en) 2008-09-16 2008-09-16 Dictionary enquiry method and dictionary enquiry system based on network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2008102224251A CN101425086A (en) 2008-09-16 2008-09-16 Dictionary enquiry method and dictionary enquiry system based on network

Publications (1)

Publication Number Publication Date
CN101425086A true CN101425086A (en) 2009-05-06

Family

ID=40615699

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2008102224251A Pending CN101425086A (en) 2008-09-16 2008-09-16 Dictionary enquiry method and dictionary enquiry system based on network

Country Status (1)

Country Link
CN (1) CN101425086A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101833571A (en) * 2010-04-13 2010-09-15 清华大学 Method for automatically extracting bilingual translation dictionary from internet
CN102004721A (en) * 2010-11-10 2011-04-06 无敌科技(西安)有限公司 Device and method for marking vocabularies and idioms
CN101826096B (en) * 2009-12-09 2012-10-10 网易有道信息技术(北京)有限公司 Information display method, device and system based on mouse pointing
CN104267878A (en) * 2014-10-23 2015-01-07 成都卓微科技有限公司 Reading equipment
CN106095270A (en) * 2016-06-06 2016-11-09 北京京东尚科信息技术有限公司 Exhibition points statement and determine the method for label range and termination and server
CN107679043A (en) * 2017-09-22 2018-02-09 广州阿里巴巴文学信息技术有限公司 Data processing method, device and terminal device
CN107766334A (en) * 2016-08-23 2018-03-06 耿诚 A kind of interpretation method and device of software to be translated
CN109190128A (en) * 2018-09-28 2019-01-11 郭派 A kind of method and system constructing English paraphrase group

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101826096B (en) * 2009-12-09 2012-10-10 网易有道信息技术(北京)有限公司 Information display method, device and system based on mouse pointing
CN101833571A (en) * 2010-04-13 2010-09-15 清华大学 Method for automatically extracting bilingual translation dictionary from internet
CN101833571B (en) * 2010-04-13 2011-12-28 清华大学 Method for automatically extracting bilingual translation dictionary from internet
CN102004721A (en) * 2010-11-10 2011-04-06 无敌科技(西安)有限公司 Device and method for marking vocabularies and idioms
CN104267878A (en) * 2014-10-23 2015-01-07 成都卓微科技有限公司 Reading equipment
CN106095270A (en) * 2016-06-06 2016-11-09 北京京东尚科信息技术有限公司 Exhibition points statement and determine the method for label range and termination and server
CN106095270B (en) * 2016-06-06 2020-05-01 北京京东尚科信息技术有限公司 Method for displaying key sentences and determining mark range, terminal device and server
CN107766334A (en) * 2016-08-23 2018-03-06 耿诚 A kind of interpretation method and device of software to be translated
CN107679043A (en) * 2017-09-22 2018-02-09 广州阿里巴巴文学信息技术有限公司 Data processing method, device and terminal device
CN109190128A (en) * 2018-09-28 2019-01-11 郭派 A kind of method and system constructing English paraphrase group

Similar Documents

Publication Publication Date Title
US7984034B1 (en) Providing parallel resources in search results
US10140371B2 (en) Providing multi-lingual searching of mono-lingual content
US8745051B2 (en) Resource locator suggestions from input character sequence
CN101425086A (en) Dictionary enquiry method and dictionary enquiry system based on network
CN101520786B (en) Method for realizing input method dictionary and input method system
JP5264892B2 (en) Multilingual information search
US9594850B2 (en) Method and system utilizing a personalized user model to develop a search request
CN102236702B (en) Computer executing method and systems and devices for searching using queries
US8825694B2 (en) Mobile device retrieval and navigation
JP5064388B2 (en) Location identification method
US7853555B2 (en) Enhancing multilingual data querying
US20090125497A1 (en) System and method for multi-lingual information retrieval
US20070250493A1 (en) Multilingual data querying
US20080114747A1 (en) Speech interface for search engines
CN101137983A (en) Embedded translation-enhanced search
CN1815477A (en) Method and system for providing semantic subjects based on mark language
CN101751434A (en) Meta search engine ranking method and Meta search engine
US20150161279A1 (en) Displaying Local Site Name Information with Search Results
US8892596B1 (en) Identifying related documents based on links in documents
CN1570922A (en) A mode-parameter language translation method and translating system
Hanumanthappa et al. A detailed study on Indian languages text mining
Samanta et al. Development of multimodal user interfaces to Internet for common people
Zou et al. Chinese localisation of Evergreen: an open source integrated library system
JP2015184998A (en) Translation device, translation method, and translation program
Xiangzhen et al. Structural Design and Implementation of Tibetan-English-Chinese Electronic Dictionary

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Open date: 20090506