CN103514145B - Punctuation mark conversion method and device - Google Patents

Punctuation mark conversion method and device Download PDF

Info

Publication number
CN103514145B
CN103514145B CN201210200998.0A CN201210200998A CN103514145B CN 103514145 B CN103514145 B CN 103514145B CN 201210200998 A CN201210200998 A CN 201210200998A CN 103514145 B CN103514145 B CN 103514145B
Authority
CN
China
Prior art keywords
punctuation mark
language
chinese
text
conversion
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210200998.0A
Other languages
Chinese (zh)
Other versions
CN103514145A (en
Inventor
朱纯深
郝天永
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
City University of Hong Kong CityU
Original Assignee
City University of Hong Kong CityU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by City University of Hong Kong CityU filed Critical City University of Hong Kong CityU
Priority to CN201210200998.0A priority Critical patent/CN103514145B/en
Publication of CN103514145A publication Critical patent/CN103514145A/en
Application granted granted Critical
Publication of CN103514145B publication Critical patent/CN103514145B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The present invention relates to a kind of punctuation mark conversion method, in the transfer process applied to the punctuation mark for being converted into exporting with the 3rd language by the punctuation mark inputted with first language and/or second language in text, it the described method comprises the following steps:S1. the punctuation mark inputted in the text with first language and/or second language is searched;S2. judge whether the punctuation mark found needs conversion;And S3. is according to default punctuation mark mapping table, it would be desirable to which the punctuation mark of conversion is converted into the punctuation mark exported with the 3rd language, and keep the punctuation mark that need not be changed constant.The present invention can realize the conversion between simplified and traditional Chinese to punctuation mark in the case of multilingual mixing, solve the problem of punctuation mark is misrouted in mixing text, improve conversion efficiency, while facilitating user's contrast conversion and result.

Description

Punctuation mark conversion method and device
Technical field
The present invention relates to punctuation mark treatment technology, more particularly to a kind of punctuation mark conversion method and device.
Background technology
It is well known that the Chinese font used at present has two kinds, one kind is used in the area such as China's Mainland and Singapore Simplified form of Chinese Character, another is in the Chinese-traditional that uses of the area such as Taiwan, Hong Kong and Macao.However, due to simplified form of Chinese Character and It is widely different between Chinese-traditional, therefore, cause the estrangement in information interchange using the user of both words.In order to This estrangement is eliminated, some conversion between simplified and traditional Chinese instruments have been developed., can be easily by these conversion between simplified and traditional Chinese instruments Realize the conversion between simplified form of Chinese Character and Chinese-traditional.In addition, with the development of computer technology, in number of site(For example, Google)Or copy editor's software(For example, the Microsoft Word of Microsoft)In can all have this kind of conversion between simplified and traditional Chinese instrument.
Although existing conversion between simplified and traditional Chinese instrument has been able to relatively accurately realize the conversion between simplified and traditional Chinese between word, at present also There is no a software to realize to punctuation mark more complicated in text(For example, double quotation marks)Conversion between simplified and traditional Chinese.Further, since Simplified form of Chinese Character is different to the setting of punctuation mark with Chinese-traditional, result in when carrying out conversion between simplified and traditional Chinese, there is some punctuates symbol Number the situation of mess code can not be changed or be converted into, therefore, it is still necessary to which user is again to the punctuate in the text after conversion Symbol is searched and is judged and make correct conversion.
The punctuation mark in Chinese-non-Chinese mixing text is entered in addition, existing conversion between simplified and traditional Chinese instrument can not also be realized Row conversion.Since in Chinese-non-Chinese mixing text, non-Chinese text(For example, English text)Usually using literary with Chinese This(Simplified form of Chinese Character or Chinese-traditional)Different punctuation mark codings, therefore, similarly occurs as above institute in transfer process That states can not change or be converted into the situation of mess code, therefore, in order to ensure overall uniformity, need also exist for user again to turning The punctuation mark in file after changing is searched and is judged and make correct conversion.
In summary, because prior art can not also realize the conversion between simplified and traditional Chinese to punctuation mark, therefore, user still needs to spend The substantial amounts of time is searched the punctuation mark in the file after conversion and is judged and make correct conversion, and therefore, having must Improved technological means is proposed to solve this problem.
The content of the invention
In view of the defect of above-mentioned prior art, it is a primary object of the present invention to provide a kind of punctuation mark conversion method and Device, to solve the above mentioned problem of prior art presence.
There is provided a kind of punctuation mark conversion method and device, Neng Gou for problems of the prior art by the present invention The conversion between simplified and traditional Chinese to punctuation mark is realized in Chinese text and Chinese-non-Chinese mixing text, so as to improve conversion efficiency.
To achieve the above object, The embodiment provides a kind of punctuation mark conversion method, applied to by text In the punctuation mark that is inputted with first language and/or second language be converted into the converted of the punctuation mark exported with the 3rd language Cheng Zhong, wherein the text is one kind in Chinese text and Chinese-non-Chinese mixing text, the first language and institute It is respectively one kind in simplified form of Chinese Character and Chinese-traditional to state the 3rd language, and the second language is in non-Chinese language One kind, the described method comprises the following steps:S1. the mark inputted in the text with first language and/or second language is searched Point symbol;S2. judge whether the punctuation mark found needs conversion;And S3. maps according to default punctuation mark Table, it would be desirable to which the punctuation mark of conversion is converted into the punctuation mark exported with the 3rd language, and keeps what need not be changed Punctuation mark is constant.
Embodiments of the invention additionally provide a kind of punctuation mark conversion equipment, applied to by text with first language And/or the punctuation mark of second language input is converted into the transfer process of punctuation mark that is exported with the 3rd language, wherein institute It is one kind in Chinese text and Chinese-non-Chinese mixing text, the first language and the 3rd language to state text One kind respectively in simplified form of Chinese Character and Chinese-traditional, the second language is one kind in non-Chinese language, institute Stating device includes:Searching modul, for searching the punctuation mark inputted in the text with first language and/or second language; Whether judge module, the punctuation mark for judging to find needs conversion;And modular converter, for according to default Punctuation mark mapping table, it would be desirable to which the punctuation mark of conversion is converted into the punctuation mark exported with the 3rd language, and keeps The punctuation mark that need not be changed is constant.
In summary, punctuation mark conversion method of the invention and device can be realized in the case of the mixing of multilingual To the conversion between simplified and traditional Chinese of punctuation mark, so as to solve the problem of punctuation mark is misrouted in mixing text, and conversion effect is improved Rate;Additionally it is possible to be highlighted to the punctuation mark in text, consequently facilitating user carries out contrast conversion and verification Result after conversion.
Brief description of the drawings
Fig. 1 is the embodiment flow chart of punctuation mark conversion method of the present invention;
Fig. 2 is the example structure block diagram of punctuation mark conversion equipment of the present invention.
Embodiment
Embodiments of the invention are described more fully below.It should be noted that the embodiments described herein be served only for for example, It is not intended to limit the invention.
Fig. 1 is the embodiment flow chart of punctuation mark conversion method of the present invention.The punctuation mark conversion method of the present embodiment Applied to the punctuate for being converted into exporting with the 3rd language by the punctuation mark inputted with first language and/or second language in text In the transfer process of symbol, wherein the text is one kind in Chinese text and Chinese-non-Chinese mixing text, it is described First language and the 3rd language are respectively one kind in simplified form of Chinese Character and Chinese-traditional, the second language be selected from One kind in non-Chinese language.As illustrated, the described method comprises the following steps S1, S2 and S3.
S1. the punctuation mark inputted in text with first language and/or second language is searched.
In one embodiment, according to default rule, in the text to first language and/or second language input Punctuation mark is searched.Here, default rule refers in first language and second language that all punctuation marks are corresponding and compiled Code table, text can be Chinese text(For example, simplified form of Chinese Character text or Chinese-traditional text)Or Chinese-non-Chinese Mix text(For example, simplified form of Chinese Character-English, simplified form of Chinese Character-German, Chinese-traditional-French etc.);First language is selected from simplified One kind in Chinese and Chinese-traditional, and second language is selected from non-Chinese language(For example, English, German, French etc.)In It is a kind of.
In one embodiment, step S11. is also included after the step S 1, before the step S2 to finding The punctuation mark highlighted.Specifically, in one embodiment, when finding in text with first language And/or second language input punctuation mark when, can be prompted the user whether for example, by the mode of pop-up dialogue box to finding Punctuation mark highlighted, if user selection be that the punctuation mark found is identified into color automatically, it is described Color can be such as red, yellow, blueness.It should be noted that being highlighted to the punctuation mark found Process be not limited only to " color " mark, " annotation " pattern in the Microsoft Word of similar Microsoft can also be used to enter OK.Highlighted, easily can made a distinction the punctuation mark in text and alphabetic character so that punctuate by above-mentioned Symbol can be clear from being shown, consequently facilitating follow-up judgement, conversion and check and correction.It is not limited only to pair in addition, highlighting The punctuation mark found is carried out, with reference to the above, in one embodiment, in output text that can also be after conversion Punctuation mark is highlighted.
S2. judge whether the punctuation mark found needs conversion.
In one embodiment, step S2 specifically includes the mark that S21. judges to input with first language and/or second language The whether all Chinese punctuation marks of point symbol, if it is go to step S3, otherwise continue step S22;S22. non-Chinese is judged Whether Chinese character or Chinese-non-Chinese mixed characters are included between punctuation mark, if it is go to step S3, in otherwise only changing Literary punctuation mark, and keep non-Chinese punctuation mark constant.
With reference to the above, in one embodiment, the punctuation mark found can be sentenced by following steps It is disconnected, to determine the need for being changed.First, it is determined that the whether all Chinese punctuation marks of the punctuation mark found (That is, simplified punctuation mark or traditional font punctuation mark), if it is, according to default rule by it is above-mentioned with first language and/or The Chinese punctuation mark of second language input is converted into the Chinese punctuation mark exported with the 3rd language;If simultaneously comprising Chinese Punctuation mark and non-Chinese punctuation mark, then keep non-Chinese punctuation mark therein constant, and Chinese punctuation mark is changed The Chinese punctuation mark of the language of Cheng Yi tri- output.Here, the 3rd language is one kind in simplified form of Chinese Character and Chinese-traditional. Continue, the above-mentioned non-Chinese punctuation mark do not changed is judged, if only including middle word in non-Chinese punctuation mark Symbol(For example, simplified or Chinese-traditional individual character, phrase or sentence etc.), then non-Chinese punctuation mark is converted into the 3rd language The Chinese punctuation mark of output;If only including non-Chinese character in non-Chinese punctuation mark(For example, English word, phrase or Sentence etc.), then it need not be changed, that is, keeps non-Chinese punctuation mark constant;If same in non-Chinese punctuation mark When include Chinese character and non-Chinese character, then as the above-mentioned only situation comprising Chinese character, non-Chinese punctuate is accorded with Number it is converted into the Chinese punctuation mark exported with the 3rd language.That is, only working as in non-Chinese punctuation mark only comprising non- Just keep the non-Chinese punctuation mark constant during Chinese character, be required for non-Chinese punctuate in the case of in addition other Symbol is converted into Chinese punctuation mark.In addition, in one embodiment, user can also directly select all punctuate symbols of conversion Number.
S3. according to default punctuation mark mapping table, punctuation mark to be converted is converted into what is exported with the 3rd language Punctuation mark, and keep the punctuation mark that need not be changed constant.
In one embodiment, according to default punctuation mark mapping table, punctuation mark to be converted is changed.This In described punctuation mark mapping table be the punctuation mark inputted in text with first language and/or second language that is stored with With the table of the one-to-one mapping relations of punctuation mark exported with the 3rd language so that with first language and/or second language The punctuation mark of input is associated with the punctuation mark exported with the 3rd language;So as to according to mapping relations realization pair The conversion of the punctuation mark.Specifically, in one embodiment, simplified form of Chinese Character and/or non-Chinese and Chinese-traditional punctuate The one-to-one mapping relations of symbol can be with as shown in table 1 below(Which includes single quotation marks in the conversion of simplified Chinese-traditional with it is double The mutual phase transformation of quotation marks), the content in table 1 is only used for for example, and being not intended to limit the present invention.
Table 1
It should be noted that:The general rule of simplified form of Chinese Character text be double quotation marks in outer layer, single quotation marks is in internal layer;And Chinese-traditional The general rule of text be single quotation marks in outer layer, double quotation marks is in internal layer.The system is taken this as a foundation, all in outer layer in simplified form of Chinese Character text Either single quotation marks or double quotation marks, switch to single quotation marks by traditional font general rule without exception, and switch to double draw without exception in the quotation marks of internal layer Number.Vice versa.In table 1,For vertical setting of types double quotation marks, used in vertical setting of types text;For vertical setting of types single quotation marks, Used in vertical setting of types text.
With reference to the above, in one embodiment, can according to above-mentioned mapping relations by punctuation mark mapping table be divided into Lower two kinds:A kind of is for storing with first language and/or second language input " monocase "(Such as comma, fullstop, colon Deng)Punctuation mark and the punctuation mark exported with the 3rd language(For example, with, or, with,)" the list of one-to-one mapping relations Character " punctuation mark mapping table, another is for storing with first language and/or second language input " double word symbol "(That is, The character used in pairs, such as single quotation marks, double quotation marks)Punctuation mark and the punctuation mark exported with the 3rd language(Such as " " With " " or ' ' and " ")" double word symbol " punctuation mark mapping table of one-to-one mapping relations.
In addition, the number of site of prior art(For example, Google)Or copy editor's software(For example, Microsoft Microsoft Word)In subsidiary conversion between simplified and traditional Chinese instrument have been able to directly utilize punctuation mark mapping table automatically to realize " monocase " punctuation mark is from simplified to traditional font or from traditional font to simplified conversion, i.e., " silence " is changed, the accuracy rate of the conversion It is higher, and it is not usually required to follow-up check and correction.
However, for " double word symbol " punctuation mark, can there is presently no any software due to the complexity of itself The complicated and simple conversion to it is realized, and the present invention can then include by the position of " double word symbol " punctuation mark in step S2 with it The judgement of content realize " double word symbol " punctuation mark from simplified to traditional font or from traditional font to simplified conversion.Specifically, For example, being converted into simplified form of Chinese Character-English to be mixed to text in the transfer process that Chinese-traditional-English mixes text, with English Double quotation marks(″″)Exemplified by, it is first, it is associated with content of text to determine it when finding the Part I of double quotation marks For left quotation marks or right quotation marks;Secondly, being searched according to content of text should be with the immediate Part II of double quotation marks same layer(That is, Corresponding to the right quotation marks or left quotation marks of above-mentioned Part I), the position of the Part II of double quotation marks is may determine that by this method Put, so that left quotation marks and right quotation marks are associated;Continue, judge whether the content of text between left quotation marks and right quotation marks is all Chinese character or Sino-British mixed characters, if the double quotation marks of English, then be converted into the list of Chinese-traditional by Sino-British mixed characters Quotation marks(「」)Or the double quotation marks of simplified form of Chinese Character(“”), otherwise keep the double quotation marks of English constant.It should be noted that when English with When single quotation marks includes double quotation marks, it is converted into simplified form of Chinese Character strain to include single quotation marks with double quotation marks;And singly draw when English is included with double quotation marks Number when, be converted into Chinese-traditional strain to include double quotation marks with single quotation marks.
In one embodiment, it can also include after step S3:
S4. after transfer process completion, judge whether to carry out the punctuation mark exported with the 3rd language Modification and whether modification correct, if it is determine whether to remove described in highlight.
In one embodiment, after transfer process is completed, system can be according to the conversion of default rule automatic decision It is no to have completed and whether changed correct, that is to say, that to judge whether to have carried out all punctuation marks highlighted just Really conversion, if it is reminds user to determine whether that removing is highlighted;Or, user can also according to highlight to conversion Punctuation mark afterwards is proofreaded or changed, if it is determined that all punctuation marks have all been changed, then can be by clicking on The removing button that system is provided(For example, receiving to change all formats of text)Highlighted to remove, so that after conversion Text and conversion before text on form(For example, font, textcolor etc.)It is consistent;Removing button described here is not The disposable mode for removing all formats change is only limitted to, can also come clear for example, by receiving and being moved to the mode of next Except highlighting, so contribute to user to carry out in a certain order, will not occur mistake and change or leak the phenomenon changed, so that school Text to after is more accurate, complete.In addition, the method for being not limited to the above for the removing highlighted, for example, user also may be used Highlighted with manual removing, or can also directly select and do not highlight in the transfer etc..
However, the present invention be not limited in from simplified form of Chinese Character or simplified form of Chinese Character-non-Chinese mixing text to Chinese-traditional or The transfer process of Chinese-traditional-non-Chinese mixing text, the method according to examples detailed above can be equally realized from traditional font The transfer process of text or Chinese-traditional-non-Chinese mixing text to simplified form of Chinese Character or simplified form of Chinese Character-non-Chinese mixing text, therefore This is no longer repeated.
Fig. 2 is the example structure block diagram of punctuation mark conversion equipment of the present invention.The punctuation mark converting means of the present embodiment Put and apply equally to be converted into the punctuation mark inputted with first language and/or second language in text to export with the 3rd language Punctuation mark transfer process in, wherein the text be selected from Chinese text and Chinese-it is non-Chinese mixing text in one Kind, the first language and the 3rd language are respectively one kind in simplified form of Chinese Character and Chinese-traditional, second language Say as one kind in non-Chinese language.As illustrated, described device includes:Searching modul 11, for search in text with First language and/or the punctuation mark of second language input;Whether judge module 12, the punctuation mark for judging to find needs Change;And modular converter 13, for according to default punctuation mark mapping table, by punctuation mark to be converted be converted into The punctuation mark of 3rd language output, and keep the punctuation mark that need not be changed constant.
With reference to the above, in one embodiment, according to default rule, searching modul 11 is in the text to first Language and/or the punctuation mark of second language input are searched.Preset rules mentioned here and text are accorded with above-mentioned punctuate Text described in number conversion method is identical, therefore is no longer repeated herein.In one embodiment, described device also includes mark Module is known, for being highlighted to the punctuation mark found.Specifically, in one embodiment, searching modul is worked as 11 when finding the punctuation mark inputted in text with first language and/or second language, can be for example, by pop-up dialogue box Mode prompts the user whether to highlight the punctuation mark found, if user's selection is will to find automatically Punctuation mark identify color, the color can be such as red, yellow, blueness.It should be noted that to being searched To the process that is highlighted of punctuation mark be not limited only to " color " mark, similar Microsoft can also be used " annotation " pattern in Microsoft Word is carried out.Highlighted by above-mentioned, can be easily by the mark in text Point symbol makes a distinction with alphabetic character so that punctuation mark can be clear from being shown, consequently facilitating follow-up judgement, turn Change and proofread.
Continue, in one embodiment, determine the need for what searching modul 11 was found by judge module 12 Punctuation mark is changed.Specifically, in one embodiment, judge module 12 is specifically included:Punctuate judging unit 121, For judging the whether all Chinese punctuation marks of punctuation mark inputted with first language and/or second language, if It is to be changed by modular converter 13;Character judging unit 123, for judging in whether including between non-Chinese punctuation mark Chinese character or Chinese-non-Chinese mixed characters, are if it is changed by modular converter 13, otherwise only pass through modular converter 13 change Chinese punctuation mark, and keep non-Chinese punctuation mark constant.
Continue, in one embodiment, modular converter 13 is according to default punctuation mark mapping table, to punctuate to be converted Symbol is changed.Punctuation mark mapping table mentioned here is one and is stored with text with first language and/or the second language Say the punctuation mark of input and the table of the one-to-one mapping relations of punctuation mark exported with the 3rd language so that with the first language Speech and/or the punctuation mark of second language input are associated with the punctuation mark exported with the 3rd language;So as to according to institute State conversion of the mapping relations realization to the punctuation mark.Specifically, in one embodiment, simplified form of Chinese Character and/or it is non-in Text can be as listed in Table 1 with the one-to-one mapping relations of Chinese-traditional punctuation mark, and the content in table 1 is only used for illustrating It is bright, and it is not intended to limit the present invention.
In addition, in one embodiment, the punctuation mark conversion equipment of the present embodiment also includes removing module, in institute State transfer process complete after, judge whether to it is described with the 3rd language export punctuation mark be modified and change whether Correctly, if it is determine whether to highlight described in removing.
Specifically, in one embodiment, after transfer process is completed, judge module 12 can be according to default rule Whether whether automatic decision conversion completed and changed correct, that is to say, that judges whether all marks highlighted Point symbol has carried out correct conversion, if it is reminds user to determine whether to highlight to remove by removing module;Or, User can also be proofreaded or be changed to the punctuation mark after conversion according to highlighting, if it is determined that all punctuation marks All changed, then can be by clicking on the removing button that system is provided(For example, receiving to change all formats of text) Highlighted to remove, so that the text after conversion and the text before conversion are on form(For example, font, textcolor etc.) It is consistent;Button described here of removing is not limited only to the disposable mode for removing all formats change, can also be for example, by Receive and be moved to the mode of next to highlight to remove, so contribute to user to carry out in a certain order, will not Occur mistake and change or leak the phenomenon changed, so that the text after check and correction is more accurate, complete.In addition, for the removing highlighted The method of the above is not limited to, is highlighted for example, user can also remove manually, or can also be directly selected in the transfer not Highlighted using mark module etc..
In summary, punctuation mark conversion method of the invention and device can realize the conversion between simplified and traditional Chinese to punctuation mark, Particularly to the complicated and simple conversion of " double word symbol " punctuation mark, so as to improve conversion efficiency, save user and text is looked into the whole text The time looked for and changed;Additionally it is possible to be highlighted to the punctuation mark in text, consequently facilitating the follow-up school of user It is right.
Although with reference to exemplary embodiment describing the present invention, it is to be understood that, term used be explanation and it is exemplary, And nonrestrictive term.Because the present invention can be embodied without departing from the spiritual or substantive of invention, institute in a variety of forms Should be appreciated that above-described embodiment is not limited to any foregoing details, and the spirit and model that should be limited in appended claims Widely explained in enclosing, therefore the whole changes fallen into claim or its equivalent scope and remodeling all should be right of enclosing and wants Ask and covered.

Claims (8)

1. a kind of punctuation mark conversion method, is accorded with applied to by the punctuate inputted in text with first language and/or second language Number be converted into the transfer process of punctuation mark that is exported with the 3rd language, wherein the text be selected from Chinese text and in One kind in Wen-non-Chinese mixing text, the first language and the 3rd language are respectively to be selected from simplified form of Chinese Character and traditional font One kind in Chinese, the second language is one kind in non-Chinese language, be the described method comprises the following steps:
S1. the punctuation mark inputted in the text with first language and/or second language is searched;
S2. judge whether the punctuation mark found needs conversion;And
S3. according to default punctuation mark mapping table, it would be desirable to which the punctuation mark of conversion is converted into exporting with the 3rd language Punctuation mark, and keep the punctuation mark that need not change constant;
Wherein, the step S2 includes:
S21. the whether all Chinese punctuation marks of punctuation mark inputted with first language and/or second language are judged, If it is S3 is gone to step, otherwise continues step S22;
S22. judge whether include Chinese character or Chinese-non-Chinese mixed characters between non-Chinese punctuation mark, if it is turn Step S3, otherwise only changes the Chinese punctuation mark, and keeps the non-Chinese punctuation mark constant.
2. punctuation mark conversion method according to claim 1, wherein, after the step S1, before the step S2 Also include:
S11. the punctuation mark found is highlighted.
3. method according to claim 2, wherein, also include after the step S3:
S4. after transfer process completion, judge whether to repair the punctuation mark exported with the 3rd language Change and whether correct change, if it is determine whether to highlight described in removing.
4. according to the method described in claim 1, wherein, the default punctuation mark mapping table be used for store in the text Punctuation mark of the punctuation mark inputted with first language and/or second language with being exported with the 3rd language maps correspondingly Relation, makes the punctuation mark inputted with first language and/or second language be accorded with the punctuate exported with the 3rd language It is number associated;The step S3 realizes the conversion to the punctuation mark according to the mapping relations.
5. a kind of punctuation mark conversion equipment, is accorded with applied to by the punctuate inputted in text with first language and/or second language Number be converted into the transfer process of punctuation mark that is exported with the 3rd language, wherein the text be selected from Chinese text and in One kind in Wen-non-Chinese mixing text, the first language and the 3rd language are respectively to be selected from simplified form of Chinese Character and traditional font One kind in Chinese, the second language is one kind in non-Chinese language, and described device includes:
Searching modul, for searching the punctuation mark inputted in the text with first language and/or second language;
Whether judge module, the punctuation mark for judging to find needs conversion;And
Modular converter, for according to default punctuation mark mapping table, it would be desirable to which the punctuation mark of conversion is converted into The punctuation mark of three language output, and keep the punctuation mark that need not be changed constant;
Wherein, the judge module includes:
Punctuate judging unit, it is whether all for judging the punctuation mark with first language and/or second language input Chinese punctuation mark, is if it is changed by the modular converter;
Character judging unit, for judging whether include Chinese character or Chinese-non-Chinese mixing word between non-Chinese punctuation mark Symbol, is if it is changed by the modular converter, otherwise only changes the Chinese punctuation mark, and keep it is described it is non-in Literary punctuation mark is constant.
6. punctuation mark conversion equipment according to claim 5, wherein, described device also includes:
Mark module, for being highlighted to the punctuation mark found.
7. punctuation mark conversion equipment according to claim 6, wherein, described device also includes:
Module is removed, for after transfer process completion, judging whether to accord with the punctuate exported with the 3rd language Number be modified and whether correct change, if it is determine whether to remove described in highlight.
8. punctuation mark conversion equipment according to claim 5, wherein, the default punctuation mark mapping table is used to deposit Store up the punctuation mark inputted in the text with first language and/or second language and the punctuation mark one exported with the 3rd language One corresponding mapping relations, make the punctuation mark inputted with first language and/or second language and described with the 3rd language The punctuation mark of output is associated;The modular converter realizes the conversion to the punctuation mark according to the mapping relations.
CN201210200998.0A 2012-06-18 2012-06-18 Punctuation mark conversion method and device Active CN103514145B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210200998.0A CN103514145B (en) 2012-06-18 2012-06-18 Punctuation mark conversion method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210200998.0A CN103514145B (en) 2012-06-18 2012-06-18 Punctuation mark conversion method and device

Publications (2)

Publication Number Publication Date
CN103514145A CN103514145A (en) 2014-01-15
CN103514145B true CN103514145B (en) 2017-07-18

Family

ID=49896891

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210200998.0A Active CN103514145B (en) 2012-06-18 2012-06-18 Punctuation mark conversion method and device

Country Status (1)

Country Link
CN (1) CN103514145B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108241612B (en) * 2016-12-27 2021-11-05 北京国双科技有限公司 Punctuation processing method and device
CN110390080A (en) * 2018-04-19 2019-10-29 广东电网有限责任公司 A kind of progressive report method for sorting

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1110421A (en) * 1994-12-30 1995-10-18 黄思源 New braille publishing and reading method and reading device
CN101859295A (en) * 2009-04-07 2010-10-13 英业达股份有限公司 System and method for converting simplified Chinese character/word and traditional Chinese character/word with labels and prompts

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH032960A (en) * 1989-05-30 1991-01-09 Matsushita Electric Ind Co Ltd Kana/kanji converting device
JPH06236357A (en) * 1993-02-09 1994-08-23 Toshiba Corp Word processor and punctuation mark inputting method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1110421A (en) * 1994-12-30 1995-10-18 黄思源 New braille publishing and reading method and reading device
CN101859295A (en) * 2009-04-07 2010-10-13 英业达股份有限公司 System and method for converting simplified Chinese character/word and traditional Chinese character/word with labels and prompts

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
英汉互译中标点符号的转换;刘扬;《牡丹江大学学报》;20070331;第16卷(第3期);第41-44页 *

Also Published As

Publication number Publication date
CN103514145A (en) 2014-01-15

Similar Documents

Publication Publication Date Title
Xia et al. Developing Guidelines and Ensuring Consistency for Chinese Text Annotation.
EP0686286B1 (en) Text input transliteration system
JP5666307B2 (en) System and method for classification and reading of Kanji characters and character components
CN106777275A (en) Entity attribute and property value extracting method based on many granularity semantic chunks
US8660835B2 (en) System and a method for automatically detecting text type and text orientation of a bidirectional (BIDI) text
CN106648819B (en) A kind of internationalization code conversion method based on editing machine
CN105243056B (en) A kind of Chinese parsing method and device based on punctuation mark processing
WO2023284108A1 (en) Demand conformity analysis method and system, and electronic device and storage medium
WO2016197265A1 (en) Method for inputting rarely-used characters
CN103514145B (en) Punctuation mark conversion method and device
CN101533388A (en) Original text note display method
CN102609410B (en) Authority file auxiliary writing system and authority file generating method
US10747794B2 (en) Smart search for annotations and inking
KR20190090636A (en) Method for automatically editing pattern of document
US8127279B2 (en) Systems and methods for graphical indexer operation on documents with SOSI characters
CN106156006A (en) Tibetan character component analysis method, Tibetan character sorting method and corresponding device
CN104794140B (en) text highlight display method and device
CN103514152B (en) Identification tracking method and device used in conversion between simplified Chinese and traditional Chinese
CN107451105B (en) Bright braille conversion system based on novel Chinese character holographic coding rule
CN110163975A (en) Method for drafting, device, equipment and the storage medium of space line
CN110287496A (en) A kind of English to Chinese Word sense disambiguation method neural network based
CN101710260A (en) Input method and system for improving success rate of keyboard input
Jakubıcek et al. Punctuation detection with full syntactic parsing
CN102346733A (en) Translation system based on intermediate languages and method thereof
TW201310262A (en) E-book editing system and method thereof

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant