CN1172993A - Processing technology for special language phenomenon - Google Patents

Processing technology for special language phenomenon Download PDF

Info

Publication number
CN1172993A
CN1172993A CN 97112502 CN97112502A CN1172993A CN 1172993 A CN1172993 A CN 1172993A CN 97112502 CN97112502 CN 97112502 CN 97112502 A CN97112502 A CN 97112502A CN 1172993 A CN1172993 A CN 1172993A
Authority
CN
China
Prior art keywords
rule
special
word
sentence
linguistic phenomenon
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 97112502
Other languages
Chinese (zh)
Other versions
CN1067784C (en
Inventor
陈肇雄
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Huajian long Technology Co. Ltd.
Original Assignee
陈肇雄
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 陈肇雄 filed Critical 陈肇雄
Priority to CN97112502A priority Critical patent/CN1067784C/en
Publication of CN1172993A publication Critical patent/CN1172993A/en
Application granted granted Critical
Publication of CN1067784C publication Critical patent/CN1067784C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Abstract

A technique for processing special language phenomena includes such steps as creating special language phenomena rules in dictionary, analysis process including searching each single word in sentence from dictionary an indexing all the special language phenomena rules for each single word, and reduction process featuring that when sentence is redecced, if current single word has special language phenomena rule, the special language phenomena associated with the single word are first reduced. Said technique conforms to human thinking habit, facilitates search of translation processing mechanism and distributed processing, and can increase the power to discriminate polysements and processing efficiency.

Description

Processing technology for special language phenomenon
The present invention relates to the processing technology for special language phenomenon in the mechanical translation, belong to the machine translation mothod field.
Special linguistic phenomenon refers to Chinese idiom, fixed sturcture etc.Machine translation system is one of sign of weighing its performance quality to the power of special linguistic phenomenon processing power.Some machine translation system takes to set up separately the method for special linguistic phenomenon dictionary, consequently not only is difficult to search, and concludes nor be convenient to sum up.
Purpose of the present invention aims to provide a kind of processing technology for special language phenomenon, and this technology meets people's thinking habit, is convenient to Translation Processing mechanism and searches, and can improve the treatment effeciency of special linguistic phenomenon.
The present invention realizes by the following method:
A kind of processing technology for special language phenomenon that uses a computer and carry out the steps include:
(1) in dictionary, sets up the special linguistic phenomenon rule
The form of rule is:<head 〉-<the context dependent function 〉,<right part〉and,<conversion body 〉
Wherein<and head〉form by word that constitutes this special linguistic phenomenon or sentence element,<context dependent function〉part indicates the context of co-text requirement that this special linguistic phenomenon is used,<right part〉indicate characteristic informations such as the grammer of special linguistic phenomenon and semanteme,<conversion body〉corresponding to the translation of special linguistic phenomenon;
(2) Translation Processing mechanism is at first searched each word in the sentence in dictionary in analytic process, and all the special linguistic phenomenon rule searches under each word are come out;
When (three) sentence being carried out reduction,, then at first the special linguistic phenomenon relevant with this word carried out reduction if under the current reduction word special linguistic phenomenon rule is arranged;
When (four) the Translation Processing algorithm received a sentence, special linguistic phenomenon relevant with each word in the distich carried out following processing:
(1) rule of each the bar special linguistic phenomenon under the current word and former sentence are mated, this algorithm finishes when a rule being arranged the match is successful or all rule match finishes.
(2), if its head is word, then only need directly mate, and judge whether the context dependent condition is set up and get final product with former sentence with this rule to each bar special linguistic phenomenon rule; If current special linguistic phenomenon rule head contains sentence element, then need at first word partly directly to be mated, call the Translation Processing algorithm after the match is successful again counterpart in the sentence is carried out reduction, whether the composition of seeing reduction result and special linguistic phenomenon rule head can the match is successful.Head judges whether the context dependent condition is set up after the match is successful again.
The present invention adds special linguistic phenomenon information in dictionary, promptly directly be stored under the entry of the main word in the special linguistic phenomenon with rule format the special linguistic phenomenon relevant with concrete word, the thinking habit that so not only meets people, and be convenient to Translation Processing mechanism and search, be convenient to dispersion treatment and strengthen the ability of distinguishing ambiguity, also can improve the treatment effeciency of special linguistic phenomenon.
Below in conjunction with accompanying drawing and invention example the present invention is described in detail.
Fig. 1 is an algorithm flow chart of the present invention.
The present invention is to use common computer to realize, the steps include:
One, in dictionary, sets up the special linguistic phenomenon rule
Special linguistic phenomenon in the dictionary is equivalent to a rule.The form of rule is:
<head 〉-<the context dependent function 〉,<right part〉and,<conversion body 〉
Wherein<and head〉form by word that constitutes this special linguistic phenomenon or sentence element,<context dependent function〉part indicates the context of co-text requirement that this special linguistic phenomenon is used,<right part〉indicate characteristic informations such as the grammer of special linguistic phenomenon and semanteme,<conversion body〉corresponding to the translation of special linguistic phenomenon.
If same special linguistic phenomenon corresponding to several different translations, just is expressed as several different special ruless in dictionary.
Two, Translation Processing mechanism is at first searched each word in the sentence in dictionary in analytic process, and all the special linguistic phenomenon rule searches under each word are come out.
When three, sentence being carried out reduction,, then at first the special linguistic phenomenon relevant with this word carried out reduction if under the current word special linguistic phenomenon rule is arranged.
When four, the Translation Processing algorithm receives a sentence, the specific algorithm that the special linguistic phenomenon relevant with each word handled in the distich following (, abbreviating the special linguistic phenomenon rule as rule among the figure) referring to Fig. 1:
1. put under the current special linguistic phenomenon rule and be designated as 0.
2. current special linguistic phenomenon subscript adds 1, promptly puts current special linguistic phenomenon rule and is next bar special linguistic phenomenon rule.If the corresponding special linguistic phenomenon rule of current special linguistic phenomenon rule subscript is empty, the corresponding special linguistic phenomenon rule match failure of then current word finishes.
3. if current special linguistic phenomenon rule head is word, then directly mate with this rule, it fails to match as if head, and step 2 is changeed in then current rule failure; The match is successful as if head, then calls the context dependent Processing Algorithm and judge the context dependent test condition, if condition is set up, then directly this special linguistic phenomenon carried out reduction, and this process successfully finishes; If condition is false, change step 2.If current special linguistic phenomenon rule head contains sentence element (being non-word), then change step 4.
4. if current special linguistic phenomenon rule head contains sentence element, then earlier word is partly mated, it fails to match, change step 2, otherwise counterpart in the sentence is carried out reduction, if the sentence element of reduction result and special linguistic phenomenon rule head is inconsistent, step 2 is changeed in then current rule match failure; Otherwise, call the context dependent Processing Algorithm and judge the context test condition, if condition is set up, then can carry out reduction to this special linguistic phenomenon, this process successfully finishes; If condition is false, change step 2.
Illustrate the present invention below.
" They put that question into account. " is translated as Chinese with sentence.
Existing dictionary:
Entry 1:they NP they
Entry 2:put VP is put
Entry 3:put NP into account-〉, VP takes into account NP.
Entry 4:that Q that
Entry 5:question NP problem
Existing rule:
Rule 1:Q NP-〉, NP, Q NP.
Rule 2:NP VP-〉, S, NP VP.
Wherein, entry 3 is a Chinese idiom.
To the sentence reduction procedure:
(1) be NP with entry 1 with the they reduction;
When (2) put being carried out reduction,, at first mate this Chinese idiom owing under the put Chinese idiom is arranged.Put...into account in the Chinese idiom can mate with corresponding word in the sentence.See that then that qustion whether can reduction be the NP in the phrase.
(3) be Q with entry 4 with that reduction.
(4) be NP with entry 5 with the question reduction.
(5) be NP with rule 1 with Q NP reduction, consistent with the sentence element in the phrase.Therefore, the match is successful for put NPinto account Chinese idiom.Is VP with entry 3 with its reduction.The reduction result first time to sentence is NP VP.
(6) be S with rule 2 with NP VP reduction, the success of sentence reduction.
Final translation is " they have taken into account that problem ".

Claims (1)

1. a processing technology for special language phenomenon that uses a computer and carry out the steps include:
(1) in dictionary, sets up the special linguistic phenomenon rule
The form of rule is:<head 〉-<the context dependent function 〉,<right part〉and,<conversion body 〉
Wherein<and head〉form by word that constitutes this special linguistic phenomenon or sentence element,<context dependent function〉part indicates the context of co-text requirement that this special linguistic phenomenon is used,<right part〉indicate characteristic informations such as the grammer of special linguistic phenomenon and semanteme,<conversion body〉corresponding to the translation of special linguistic phenomenon;
(2) Translation Processing mechanism is at first searched each word in the sentence in dictionary in analytic process, and all the special linguistic phenomenon rule searches under each word are come out;
When (three) sentence being carried out reduction,, then at first the special linguistic phenomenon relevant with this word carried out reduction if under the current word special linguistic phenomenon rule is arranged;
When (four) the Translation Processing algorithm received a sentence, the specific algorithm that the special linguistic phenomenon relevant with each word handled in the distich was as follows:
(1) rule of each the bar special linguistic phenomenon under the current word and former sentence are mated, this algorithm finishes when a rule being arranged the match is successful or all rule match finishes;
(2), if its head is word, then only need directly mate, and judge whether the context dependent condition is set up and get final product with former sentence with this rule to each bar special linguistic phenomenon rule; If current special linguistic phenomenon rule head contains sentence element, then need at first word partly directly to be mated, call the Translation Processing algorithm after the match is successful again counterpart in the sentence is carried out reduction, whether the composition of seeing reduction result and special linguistic phenomenon rule head can the match is successful.Head judges whether the context dependent condition is set up after the match is successful again.
CN97112502A 1997-07-02 1997-07-02 Processing technology for special language phenomenon Expired - Fee Related CN1067784C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN97112502A CN1067784C (en) 1997-07-02 1997-07-02 Processing technology for special language phenomenon

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN97112502A CN1067784C (en) 1997-07-02 1997-07-02 Processing technology for special language phenomenon

Publications (2)

Publication Number Publication Date
CN1172993A true CN1172993A (en) 1998-02-11
CN1067784C CN1067784C (en) 2001-06-27

Family

ID=5172302

Family Applications (1)

Application Number Title Priority Date Filing Date
CN97112502A Expired - Fee Related CN1067784C (en) 1997-07-02 1997-07-02 Processing technology for special language phenomenon

Country Status (1)

Country Link
CN (1) CN1067784C (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1302415C (en) * 2000-06-19 2007-02-28 李玉鑑 English-Chinese translation machine
CN101390091B (en) * 2006-02-27 2011-02-09 日本电气株式会社 Language processing device, language processing method
WO2017012327A1 (en) * 2015-07-22 2017-01-26 华为技术有限公司 Syntax analysis method and device

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4473702B2 (en) * 2004-11-02 2010-06-02 株式会社東芝 Machine translation system, machine translation method and program

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5510981A (en) * 1993-10-28 1996-04-23 International Business Machines Corporation Language translation apparatus and method using context-based translation models

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1302415C (en) * 2000-06-19 2007-02-28 李玉鑑 English-Chinese translation machine
CN101390091B (en) * 2006-02-27 2011-02-09 日本电气株式会社 Language processing device, language processing method
WO2017012327A1 (en) * 2015-07-22 2017-01-26 华为技术有限公司 Syntax analysis method and device
US10909315B2 (en) 2015-07-22 2021-02-02 Huawei Technologies Co., Ltd. Syntax analysis method and apparatus

Also Published As

Publication number Publication date
CN1067784C (en) 2001-06-27

Similar Documents

Publication Publication Date Title
CN1204514C (en) Method and apparatus for translating between languages
US7269547B2 (en) Tokenizer for a natural language processing system
US20010014852A1 (en) Document semantic analysis/selection with knowledge creativity capability
JP3300866B2 (en) Method and apparatus for preparing text for use by a text processing system
EP0805403A3 (en) Translating apparatus and translating method
CN101079031A (en) Web page subject extraction system and method
CN1950831A (en) Apparatus and method for handwriting recognition
BR9902574A (en) Process and apparatus for processing documents in an image-based document processing system
Hobbs et al. Fastus: A system for extracting information from text
EP1291790A3 (en) Text-based automatic content classification and grouping
US20030074190A1 (en) Method and apparatus for dynamic configuration of a lexical analysis parser
CN1067784C (en) Processing technology for special language phenomenon
WO2018101506A1 (en) Document multi-classification device and document multi-classification method for classifying one document into plurality of categories by using lexico-semantic pattern obtained by reconfiguring semantic category of words constituting sentence
CN101075260A (en) Method and module for extracting summary
CN1342942A (en) Computer recognizing and indexing method of Chinese names
CN1114165C (en) Segmentation of Chinese text into words
Paik et al. Interpretation of proper nouns for information retrieval
CN1369833A (en) Lexial system and method for conversion between unsimplified and simplified Chinese characters
CN1242353C (en) System and method for exactly explaining literal meaning in a sentence
WO2021040101A1 (en) Real-time distributed indexing system and method for high-performance query and response
CN1173674A (en) Transfering generation tech. based on SC grammar
CN1055553C (en) Lancuage identification system and method for a peripheral unit
CN1294362A (en) Method for processing duplicate kay words in dual-language dictionary
Sutcliffe et al. Using Distributed Patterns as Language Independent Lexical Representations
CN1067782C (en) Inference technology on imperfect knowledge

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C53 Correction of patent for invention or patent application
COR Change of bibliographic data

Free format text: CORRECT: APPLICANT; FROM: CHEN ZHAOXIONG TO: HUAJIAN MACHINE TRANSLATION CO., LTD

CP03 Change of name, title or address

Address after: 100083 Beijing Haidian District Xueyuan Road No. 30, West Building Huajian Corporation Li Hua

Applicant after: Huajian Machine Translation Co., Ltd.

Applicant before: Chen Zhaoxiong

C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: BEIJING HUAJIAN CHANGHE SCIENCE CO., LTD.

Free format text: FORMER OWNER: HUAJIAN MACHINE TRANSLATION CO., LTD

Effective date: 20090417

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20090417

Address after: Room 207, West Building, Kequn Building, 30 College Road, Haidian District, Beijing: 100083

Patentee after: Beijing Huajian long Technology Co. Ltd.

Address before: Li Hua Zip Code of West Building Huajian Group Company, Kequn Building, 30 College Road, Haidian District, Beijing: 100083

Patentee before: Huajian Machine Translation Co., Ltd.

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20010627

Termination date: 20160702

CF01 Termination of patent right due to non-payment of annual fee