CN102298577A - Method and device for detecting spelling of document edition - Google Patents

Method and device for detecting spelling of document edition Download PDF

Info

Publication number
CN102298577A
CN102298577A CN2011102813722A CN201110281372A CN102298577A CN 102298577 A CN102298577 A CN 102298577A CN 2011102813722 A CN2011102813722 A CN 2011102813722A CN 201110281372 A CN201110281372 A CN 201110281372A CN 102298577 A CN102298577 A CN 102298577A
Authority
CN
China
Prior art keywords
word
spelling
input
storehouse
text editing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011102813722A
Other languages
Chinese (zh)
Inventor
吴思然
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Wondershare Software Co Ltd
Original Assignee
Shenzhen Wondershare Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Wondershare Software Co Ltd filed Critical Shenzhen Wondershare Software Co Ltd
Priority to CN2011102813722A priority Critical patent/CN102298577A/en
Publication of CN102298577A publication Critical patent/CN102298577A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Document Processing Apparatus (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses a method and device for detecting the spelling of document edition. The method comprises the following steps of: creating a word stock, wherein the word stock is provided with a plurality of branch word stocks, and each branch word stock is provided with standard words; creating a branch word stock index table and a spelling checking scheme table; making different spelling checking schemes corresponding to different branch word stock indexes; acquiring document data of an input region or a selected region, comparing the document data with standard words of a corresponding branch word stock one by one, and recording wrongly-spelled input words; and specially marking the wrongly-spelled input words. The invention further discloses a device for detecting the spelling of document edition. In the invention, multiple branch word stock forms are adopted, multiple languages can be loaded, different checking schemes are distinguished by using different application fields, so that the spelling detection efficiency and speed can be increased; and specific to be practical problem of wrong words caused by error input of multiple users, the word spelling detection function can help the users find and eliminate errors as recommended.

Description

A kind of spelling detection method and device thereof of text editing
Technical field
The present invention relates to a kind of spelling detection method and device thereof of text editing, refer in particular at board or other and have in the text editing functional programs, can play a kind of method and the device thereof of the automatic measuring ability of text editing.
Background technology
RTF is the abbreviation of Rich Text Format, and meaning is many text formattings.You can use " notepad " to edit the rich text format file as editor's html file fully.This is the file of a kind of similar DOC form (Word document), and good compatibility is arranged.Rich text format is the file layout that many softwares can both be discerned.Can open the file of rich text format such as Word, WPSOffice, Excel etc., this illustrates that this form is comparatively general.This patent is realized the function of word spelling by the rich text editor.
The word that a lot of users make the mistake because of the mistake input in reality, and the function of word spelling can be helped the user to find and recommend to solve mistake.To the corresponding word list of recommending of the word generation of mistake, the user can simply replace word, increases work efficiency by this function.Avoid spending a large amount of energy to come verification.
WORD has word spelling function as you know, and is also very practical.But must installing " WORD ", the user could use this function.And common " board " do not provide word spelling measuring ability though can carry out the rich text editor.And the function that this programme can allow existing editing machine supports such as " boards " spell measuring ability with the form of plug-in unit is perhaps directly revised the function that editor code realizes the spelling measuring ability.This patent allows the user break away from the effect that WORD also can arrive the word spelling, increases work efficiency.
Based on above-mentioned defective of the prior art, the inventor develops a kind of spelling detection method and device thereof of text editing.When enabling this device, computing machine can to operating personnel's input in real time or the word of selection area detect, when occurring reminding operating personnel that it is carried out corresponding modification by specific markers when wrong.
Summary of the invention
The objective of the invention is to provides a kind of spelling detection method and device thereof of text editing for overcoming the deficiencies in the prior art.When enabling this device, computing machine can to operating personnel's input in real time or the word of selection area detect, when occurring reminding operating personnel that it is carried out corresponding modification by specific markers when wrong.
For achieving the above object, technical scheme of the present invention is:
A kind of spelling detection method of text editing, this method may further comprise the steps: 1) create word library, described word library is provided with a plurality of word branches storehouse according to different languages and/or language class, and each word branch storehouse is provided with standard word; 2) create word and divide storehouse concordance list and spell check scheme table; The corresponding different word of different spell check schemes divides the storehouse index; 3) obtain the text data of input area or selection area, split into a plurality of input words automatically, its standard word with corresponding word branch storehouse is contrasted one by one, note the input word of misspelling according to the monogram in the text data; 4) the input word to misspelling carries out the singularity mark.
Its further technical scheme is: described word library comprises one or two or more kinds in English word branch storehouse, German word branch storehouse, French word branch storehouse and Russian word branch storehouse according to the difference of languages.
Its further technical scheme is: described spell check scheme table is combined into several different inspection schemes according to the difference of languages and/or language class, and described language class comprises a kind of in works and expressions for everyday use, scientific and technological term or the commercial term at least.
Its further technical scheme is: described singularity is labeled as wave, double underline, single underscore and following punctuate.
Its further technical scheme is: described word branch storehouse also comprises self-defined word.
Its further technical scheme is: described step 3 includes the regular expression detection mode, comprises following at least a irregular spelling word in the described regular expression: network address, email address, programming code etc.
A kind of spelling pick-up unit of text editing is characterized in that comprising the storer that is provided with several word branch storehouses, and is provided with the processor of word contrast device, input word monitor, testing result calibration device, also comprises the detection control knob that connects with processor; Described detection control knob comprises detection key and close key.
Its further technical scheme is: when the detection key is enabled, the work of word contrast device, input word monitor is delivered to word contrast device to the word of real-time input or to the input word of selection area, with these words one by one with storer in the standard word in word branch storehouse compare, contrasting staggers the time demarcates this word of makeing mistakes by the testing result calibration device.
Its further technical scheme is: one or two or more kinds during described testing result calibration device comprises wave demarcation, the demarcation of single underscore, double underline demarcation, punctuate is demarcated down.
The present invention's beneficial effect compared with prior art is: the present invention adopts multiple word branch storehouse form, can be written into multilingual, and utilizes different use fields to tell different inspection schemes, is beneficial to improve efficient and the speed that spelling detects; The word that a lot of users make the mistake because of the mistake input in reality, and the function that word of the present invention spelling detects can be helped the user to find and recommend to solve mistake.To the corresponding word list of recommending of the word generation of mistake, the user can simply replace word, increases work efficiency by the present invention.Avoid spending a large amount of energy to come verification.
Below in conjunction with the drawings and specific embodiments the present invention is further described.
Description of drawings
Fig. 1 is the schematic flow sheet of the spelling detection method specific embodiment one of a kind of text editing of the present invention;
Fig. 2 A is the schematic flow sheet that the spelling detection method specific embodiment two of a kind of text editing of the present invention detects at real-time input state word;
Fig. 2 B is the schematic flow sheet that the spelling detection method specific embodiment two of a kind of text editing of the present invention detects at the selection area word;
Fig. 3 is the frame assumption diagram of the spelling pick-up unit specific embodiment of a kind of text editing of the present invention.
Reference numeral
1 storer, 10 word libraries
11 words divide the storehouse 12 shielding storehouses
2 processors, 21 words contrast device
22 input word monitors 23 are calibration device as a result
3 detect control knob
31 detect key 32 close keys
4 display screens, 5 detected objects
51 import 52 selection areas in real time
Embodiment
In order to more fully understand technology contents of the present invention, below in conjunction with specific embodiment technical scheme of the present invention is further introduced and explanation, but be not limited to this.
The spelling detection method of a kind of text editing of the present invention, this method may further comprise the steps: 1) create word library, described word library is provided with a plurality of word branches storehouse according to different languages and/or language class, and each word branch storehouse is provided with standard word; 2) create word and divide storehouse concordance list and spell check scheme table; The corresponding different word of different spell check schemes divides the storehouse index; 3) obtain the text data of input area or selection area, split into a plurality of input words automatically, its standard word with corresponding word branch storehouse is contrasted one by one, note the input word of misspelling according to the monogram in the text data; 4) the input word to misspelling carries out the singularity mark.
Wherein, word library comprises one or two or more kinds in English word branch storehouse, German word branch storehouse, French word branch storehouse and Russian word branch storehouse according to the difference of languages.Spell check scheme table is combined into several different inspection schemes according to the difference of languages and/or language class, and the language class comprises a kind of in works and expressions for everyday use, scientific and technological term or the commercial term at least.Singularity is labeled as wave, double underline, single underscore and following punctuate.Word branch storehouse also comprises self-defined word, and self-defined word is the word by operating personnel's self-defined adding in use.
The flow process of the spelling detection method of a kind of text editing of the present invention is shown in Fig. 1, Fig. 2 A, Fig. 2 B, and Fig. 1 is the flow process of embodiment one, can detect at the data (text data) of viewing area, detects wrong word and points out with wave afterwards.
The flow process that Fig. 2 A detects at the text data (cursor tracking) of input state for embodiment two, as occur non-detected object (as phone, network address or abbreviation etc.) then shield (promptly do not detect, the shielding speech also can by self-defining mode constantly be added into the shielding storehouse in).When detecting, utilize regular expression to detect, when detecting an effective network address, email address or one section programming code (such as C language codes, JAVA language codes) etc., judge and meet regular expression, so just can shield, not spell detection.For example, user's input is xml code<XMLDATA〉DATA</XMLDATA 〉, can shield these contents so, because XMLDATA is not an effective word, but it is legal xml code (quite a kind of grammer of regular expression needs what content of shielding just to write what content).Need to prove, can comprise multiple regular expression in the embodiment of the invention, and be not limited to network address, email address or programming code.
And then according to languages (English word divides the storehouse, the German word divides the storehouse, French word branch storehouse and Russian word divide storehouse or the like), the language class is (such as works and expressions for everyday use, science and technology term or commercial term or the like) selection inspection scheme, word to real-time input state detects (adopting the mode of linear Hash table to detect) again, there is not prompting (without any operation) during inerrancy, can eject drop-down menu when wrong or eject wicket, operating personnel can be with in the self-defined adding word library of this word, also can demonstrate spelling for the correct word of operating personnel's reference, also can carry out miscue, can adopt the singularity mark (such as wave to this word, double underline, single underscore and following punctuate).When detection is enabled, can select the function of self-defined word.
Fig. 2 B spells the flow process that detection method specific embodiment two detects at selection area (mouse is selected) word for the present invention, as occur non-detected object (as phone, network address or abbreviation etc.) then shield (promptly do not detect, the shielding speech also can by self-defining mode constantly be added into the shielding storehouse in).And then according to languages (English word divides the storehouse, the German word divides the storehouse, French word branch storehouse and Russian word divide storehouse or the like), the language class is (such as works and expressions for everyday use, science and technology term or commercial term or the like) selection inspection scheme, whole words to selection area detect (adopting the mode of linear Hash table to detect) again, detection of end during inerrancy, then vicious word is noted when wrong, treat all to detect when finishing, vicious word is carried out miscue one by one, can adopt the singularity mark (such as wave, double underline, single underscore and following punctuate).When detection is enabled, can select the function of self-defined word.
As shown in Figure 3, the spelling pick-up unit of a kind of text editing of the present invention, comprise the storer 1 that is provided with several word branch storehouses 11, and be provided with the processor 2 of word contrast device 21, input word monitor 22, testing result calibration device 23, also comprise the detection control knob 3 that connects with processor 2; Detect control knob 3 and comprise detection key 31 and close key 32.When detection key 31 is enabled, 21 work of word contrast device, 22 pairs of input word monitors are imported 51 word in real time or word are delivered in the input word of selection area 52 and contrast device 21, with these words one by one with storer 1 in the standard word in word branch storehouse 11 compare, contrasting staggers the time demarcates (showing by display screen 4) by 23 pairs of these words of makeing mistakes of testing result calibration device.Wherein, one or two or more kinds during testing result calibration device 23 comprises wave demarcation, the demarcation of single underscore, double underline demarcation, punctuate is demarcated down.Wherein, detected object 5 comprises two kinds of real-time input 51 (promptly in real time the word of input, by cursor tracking) and selection areas 52 (being the word of selection area, by the mouse selection area).Also comprise shielding storehouse 12 in the storer 1.
The word library of this patent (being each word branch storehouse) (promptly shields scheme or shielding object with the shielding storehouse, can comprise different shielding objects, can also when operation, carry out self-defined to the shielding object, at any time can revise the shielding storehouse, or increase the shielding object) array mode generates corresponding spell check scheme.Also provide instrument (by self-defining mode make amendment or again editor) can make amendment to existing scheme.Also can provide instrument to generate the scheme of wanting to the user.Can also copy to the scheme of appointment under the designated directory, scheme can be loaded in the database of the present invention automatically, the user can be provided with by preference and specify a kind of spell check scheme should arrive among the current editor, and need not recompilate should program (these modifications can self-defining mode carry out the preservation that program is provided with).The present invention is far superior to the monistic defective (be the text detection of prior art can only be bundled on the single software use) of traditional text spelling detection method.
In sum, the present invention adopts multiple word branch storehouse form, can be written into multilingual, and utilizes different use fields to tell different inspection schemes, is beneficial to improve efficient and the speed that spelling detects; The word that a lot of users make the mistake because of the mistake input in reality, and the function that word of the present invention spelling detects can be helped the user to find and recommend to solve mistake.To the corresponding word list of recommending of the word generation of mistake, the user can simply replace word, increases work efficiency by the present invention.Avoid spending a large amount of energy to come verification.
The above only further specifies technology contents of the present invention with embodiment, so that the reader is more readily understood, but does not represent embodiments of the present invention to only limit to this, anyly extends or recreation according to the technology that the present invention did, and all is subjected to protection of the present invention.

Claims (10)

1. the spelling detection method of a text editing, this method may further comprise the steps:
1) create word library, described word library is provided with a plurality of word branches storehouse according to different languages and/or language class, and each word branch storehouse is provided with standard word;
2) create word and divide storehouse concordance list and spell check scheme table; The corresponding different word of different spell check schemes divides the storehouse index;
3) obtain the text data of input area or selection area, split into a plurality of input words automatically, its standard word with corresponding word branch storehouse is contrasted one by one, note the input word of misspelling according to the monogram in the text data;
4) the input word to misspelling carries out the singularity mark.
2. the spelling detection method of a kind of text editing according to claim 1, it is characterized in that the difference of described word library, comprise in English word branch storehouse, German word branch storehouse, French word branch storehouse and Russian word branch storehouse one or two or more kinds according to languages.
3. the spelling detection method of a kind of text editing according to claim 2, it is characterized in that described spell check scheme table is combined into several different inspection schemes according to the difference of languages and/or language class, described language class comprises a kind of in works and expressions for everyday use, scientific and technological term or the commercial term at least.
4. the spelling detection method of a kind of text editing according to claim 3 is characterized in that described singularity is labeled as wave, double underline, single underscore and following punctuate.
5. the spelling detection method of a kind of text editing according to claim 4 is characterized in that described word branch storehouse also comprises self-defined word.
6. the spelling detection method of a kind of text editing according to claim 1, it is characterized in that described step 3 includes the regular expression detection mode, comprise following at least a irregular spelling word in the described regular expression: network address, email address and programming code.
7. the spelling pick-up unit of a text editing, it is characterized in that comprising the storer that is provided with several word branch storehouses, and be provided with the processor that word contrasts device, input word monitor, testing result calibration device, also comprise the detection control knob that connects with processor.
8. the spelling pick-up unit of a kind of text editing according to claim 7 is characterized in that described detection control knob comprises detection key and close key.
9. the spelling pick-up unit of a kind of text editing according to claim 8, when it is characterized in that detecting key and enabling, the work of word contrast device, input word monitor is delivered to word contrast device to the word of real-time input or to the input word of selection area, with these words one by one with storer in the standard word in word branch storehouse compare, contrasting staggers the time demarcates this word of makeing mistakes by the testing result calibration device.
10. the spelling pick-up unit of a kind of text editing according to claim 9 is characterized in that one or two or more kinds during described testing result calibration device comprises wave demarcation, the demarcation of single underscore, double underline demarcation, punctuate is demarcated down.
CN2011102813722A 2011-09-21 2011-09-21 Method and device for detecting spelling of document edition Pending CN102298577A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011102813722A CN102298577A (en) 2011-09-21 2011-09-21 Method and device for detecting spelling of document edition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011102813722A CN102298577A (en) 2011-09-21 2011-09-21 Method and device for detecting spelling of document edition

Publications (1)

Publication Number Publication Date
CN102298577A true CN102298577A (en) 2011-12-28

Family

ID=45359000

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011102813722A Pending CN102298577A (en) 2011-09-21 2011-09-21 Method and device for detecting spelling of document edition

Country Status (1)

Country Link
CN (1) CN102298577A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103885938A (en) * 2014-04-14 2014-06-25 东南大学 Industry spelling mistake checking method based on user feedback
CN104679736A (en) * 2015-02-02 2015-06-03 成都优译信息技术有限公司 Translation system allowing statistics of simple mistakes
CN105095184A (en) * 2015-06-11 2015-11-25 周连惠 Method for spelling and grammar proofreading of text document
CN106326205A (en) * 2015-06-19 2017-01-11 珠海金山办公软件有限公司 Spelling check method and device
CN107305542A (en) * 2016-04-21 2017-10-31 珠海金山办公软件有限公司 A kind of spell checking methods and device
CN110019667A (en) * 2017-10-20 2019-07-16 沪江教育科技(上海)股份有限公司 It is a kind of that word method and device is looked into based on voice input information
CN111859920A (en) * 2020-06-19 2020-10-30 北京国音红杉树教育科技有限公司 Method and system for identifying word spelling errors and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0145202A2 (en) * 1983-10-25 1985-06-19 Sharp Kabushiki Kaisha Word spelling checking system
CN1264478A (en) * 1997-06-17 2000-08-23 欧姆龙株式会社 Information processing apparatus and method, and recording medium containing information processing program stored therein
US20020143828A1 (en) * 2001-03-27 2002-10-03 Microsoft Corporation Automatically adding proper names to a database
CN101206641A (en) * 2006-12-21 2008-06-25 国际商业机器公司 System and method for adaptive spell checking
CN101281517A (en) * 2007-03-30 2008-10-08 捷讯研究有限公司 Spell check function and associated handheld electronic device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0145202A2 (en) * 1983-10-25 1985-06-19 Sharp Kabushiki Kaisha Word spelling checking system
CN1264478A (en) * 1997-06-17 2000-08-23 欧姆龙株式会社 Information processing apparatus and method, and recording medium containing information processing program stored therein
US20020143828A1 (en) * 2001-03-27 2002-10-03 Microsoft Corporation Automatically adding proper names to a database
CN101206641A (en) * 2006-12-21 2008-06-25 国际商业机器公司 System and method for adaptive spell checking
CN101281517A (en) * 2007-03-30 2008-10-08 捷讯研究有限公司 Spell check function and associated handheld electronic device

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103885938A (en) * 2014-04-14 2014-06-25 东南大学 Industry spelling mistake checking method based on user feedback
CN103885938B (en) * 2014-04-14 2015-04-22 东南大学 Industry spelling mistake checking method based on user feedback
CN104679736A (en) * 2015-02-02 2015-06-03 成都优译信息技术有限公司 Translation system allowing statistics of simple mistakes
CN105095184A (en) * 2015-06-11 2015-11-25 周连惠 Method for spelling and grammar proofreading of text document
WO2016197272A1 (en) * 2015-06-11 2016-12-15 周连惠 Method for checking spellings and grammars of text document
CN106326205A (en) * 2015-06-19 2017-01-11 珠海金山办公软件有限公司 Spelling check method and device
CN106326205B (en) * 2015-06-19 2019-05-31 珠海金山办公软件有限公司 A kind of spell checking methods and device
CN107305542A (en) * 2016-04-21 2017-10-31 珠海金山办公软件有限公司 A kind of spell checking methods and device
CN107305542B (en) * 2016-04-21 2018-11-16 珠海金山办公软件有限公司 A kind of spell checking methods and device
CN110019667A (en) * 2017-10-20 2019-07-16 沪江教育科技(上海)股份有限公司 It is a kind of that word method and device is looked into based on voice input information
CN111859920A (en) * 2020-06-19 2020-10-30 北京国音红杉树教育科技有限公司 Method and system for identifying word spelling errors and electronic equipment
CN111859920B (en) * 2020-06-19 2024-06-04 北京国音红杉树教育科技有限公司 Word misspelling recognition method, system and electronic equipment

Similar Documents

Publication Publication Date Title
CN102298577A (en) Method and device for detecting spelling of document edition
US9081769B2 (en) Providing translation assistance in application localization
Silveira et al. A Gold Standard Dependency Corpus for English.
US20190251142A1 (en) System and method for generating task-embedded documents
KR102257248B1 (en) Ink to text representation conversion
US20120110459A1 (en) Automated adjustment of input configuration
Ji et al. A source code linearization technique for detecting plagiarized programs
Pilgrim Dive into HTML5
JP6090850B2 (en) Source program analysis system, source program analysis method and program
CN104899010A (en) Multilingualization method and system of source code
CN102937949B (en) A kind of method and system realizing English spelling and check in editor
CN103488488A (en) Text input check method, device ad mobile terminal
US9298697B2 (en) Techniques for grammar rule composition and testing
CN104915774A (en) SVN log analysis and project management software combination-based method
CN107203500A (en) The automatic switching method of the excel formula object oriented languages of expansion backtracking is replaced based on recurrence
CN104516727A (en) Method and system for changing resource in resource file
CN106325596A (en) Automatic error correction method and system for writing handwriting
Marcinczuk et al. Inforex-a web-based tool for text corpus management and semantic annotation.
CN102707958A (en) Open-platform-based interface generation checking method and equipment
Wrenn et al. Error messages are classifiers: a process to design and evaluate error messages
CN101770388A (en) Method and device for obtaining chip code information
KR101797573B1 (en) Web based spreadsheets service providing apparatus and method
van Loggem Software documentation: A standard for the 21st century
CN106775914A (en) A kind of code method for internationalizing and device for automatically generating key assignments
KR101632951B1 (en) Computer readable medium recording program for converting to online learning data and method of converting to online learning data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C53 Correction of patent of invention or patent application
CB02 Change of applicant information

Address after: Room 9, block A901 building on the north side of a building 518000 North TCL A of Guangdong Province, Shenzhen city Nanshan District South Road West ten high new technology

Applicant after: Shenzhen Wondershare Information Technology Co., Ltd.

Address before: Room 9, block A901 building on the north side of a building 518000 North TCL A of Guangdong Province, Shenzhen city Nanshan District South Road West ten high new technology

Applicant before: Shenzhen Wondershare Software Co., Ltd.

COR Change of bibliographic data

Free format text: CORRECT: APPLICANT; FROM: SHENZHEN WONDERSHARE SOFTWARE CO., LTD. TO: SHENZHEN WONDERSHARE INFORMATION TECHNOLOGY CO., LTD.

AD01 Patent right deemed abandoned

Effective date of abandoning: 20111228

C20 Patent right or utility model deemed to be abandoned or is abandoned