CN101984436A - Inquiry device of similar-shaped Chinese characters and method thereof - Google Patents

Inquiry device of similar-shaped Chinese characters and method thereof

Info

Publication number
CN101984436A
Authority
CN
China
Prior art keywords
chinese character
module
information
shape
similar-shaped character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN 201010551856
Other languages
Chinese (zh)
Inventor
陈淮琰
李毅
程德玉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inventec Besta Xian Co Ltd
Original Assignee
Inventec Besta Xian Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inventec Besta Xian Co Ltd
Priority to CN 201010551856
Publication of CN101984436A

Landscapes

  • Document Processing Apparatus (AREA)

Abstract

The invention provides an inquiry device for similar-shaped Chinese characters, comprising an input module, a setting module, an inquiry module, a database module and a display unit. The input module is connected to the setting module, the inquiry module is connected to the setting module, the database module is connected to the inquiry module, and the display unit is connected to the inquiry module. By looking up the similar-shaped characters of the current Chinese character in a Chinese character information database and providing related learning information about them, such as character structure and stroke order, the inquiry device and its method help the user learn Chinese characters faster and better and help avoid misreading or miswriting similar-shaped characters.

Description

Inquiry device and method for similar-shaped Chinese characters
Technical field
The present invention relates to a query method, and in particular to an inquiry device and method for similar-shaped Chinese characters.
Background art
At present, people often misread or miswrite a character when reading or writing, frequently because two characters look alike. For example, 悼 (dào) in 悼念 ("to mourn") is read as 掉 (diào, "to fall"); 棘 (jí) in 棘手 ("thorny") is read as 辣 (là, "spicy"); and 舂 (chōng) in 舂米 ("to pound rice") is miswritten or misread as 春 (chūn, "spring"). All of these errors arise because the characters are close in shape. Reading or writing carelessly, or taking a half-familiar character for granted when using it, leads to such mistakes. Beginners learning Chinese characters are even more prone to them, because the similar-shaped characters resemble one another so closely. At present, however, electronic dictionaries and related electronic devices give beginners no way to look up and study similar-shaped characters, which slows the learning of these characters and lengthens the time needed to master them.
Summary of the invention
To solve the technical problem described in the background art, the present invention proposes an inquiry device and method for similar-shaped Chinese characters. The similar-shaped characters of a given Chinese character are looked up in a Chinese character information database, and related learning information about them, such as character structure and stroke order, is provided. This helps the user learn Chinese characters faster and better and helps avoid misreading or miswriting similar-shaped characters.
The technical solution of the present invention is an inquiry device for similar-shaped Chinese characters, characterized in that the inquiry device comprises an input module, a setting module, an inquiry module, a database module and a display unit; the input module is connected to the setting module, the inquiry module is connected to the setting module, the database module is connected to the inquiry module, and the display unit is connected to the inquiry module.
The database module contains the structure information, stroke-count information and stroke-order information of Chinese characters.
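As an illustration of what one entry of such a database might hold, the following is a minimal Python sketch. The class name `CharRecord`, the field names, the structure label "single", and the encoding of strokes as small integers (1 horizontal, 2 vertical, 3 left-falling, 4 dot or right-falling, 5 turning) are all assumptions made for this example; the patent does not specify a storage format. Using the Unicode code point as the code value is likewise an assumption.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Tuple


@dataclass(frozen=True)
class CharRecord:
    """One hypothetical database entry for a Chinese character."""
    char: str                     # the character itself
    structure: str                # structure label, e.g. "single" or "left-right" (assumed labels)
    stroke_count: int             # total number of strokes
    stroke_order: Tuple[int, ...] # stroke types in writing order (assumed 1-5 encoding)


def build_index(records: Iterable[CharRecord]) -> Dict[int, CharRecord]:
    """Index records by code value; here the Unicode code point is assumed."""
    return {ord(r.char): r for r in records}


# Toy data entered by hand for illustration only.
records = [
    CharRecord("土", "single", 3, (1, 2, 1)),
    CharRecord("士", "single", 3, (1, 2, 1)),
    CharRecord("大", "single", 3, (1, 3, 4)),
]
index = build_index(records)
record = index[ord("土")]
```

Keeping the stroke count as its own field mirrors the three kinds of information named above, even though it could be derived from the stroke order.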
A query method for similar-shaped Chinese characters, characterized in that the query method comprises the following steps:
1) enter the setting module, where the user sets, as needed, the rule for the similarity level of the similar-shaped characters to be searched for;
2) open the input method and input a Chinese character;
3) obtain the stroke order and related information of the input character from the Chinese character information database;
4) based on the stroke order and related information obtained in step 3), search the Chinese character information database for similar-shaped characters of the input character according to the rule set in step 1); if any are found, proceed to step 5);
5) display the similar-shaped characters found, and display the related learning information or obtain it through a cross-library search.
The specific steps for searching for similar-shaped characters in step 4) are:
4.1) locate the character in the Chinese character information database by its code-value index, and extract its structure information, stroke-count information and stroke-order information;
4.2) search for characters that have the same structure, according to the structure information of the character;
4.3) find the characters of the corresponding similarity level according to the rule in step 1).
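Steps 4.1) to 4.3) can be sketched roughly as follows. This is a simplified illustration, not the patented implementation: `Entry`, `find_similar`, the use of `ord()` as the code value, and the reading of step 4.3) as "keep candidates whose level equals the level the user selected" are assumptions. The similarity rule is passed in as a function so that the sketch stays independent of how the rule is defined; `exact_only` below is only a stand-in.

```python
from typing import Callable, Dict, List, NamedTuple, Optional, Tuple

Strokes = Tuple[int, ...]  # assumed encoding of a stroke order


class Entry(NamedTuple):
    char: str
    structure: str
    stroke_order: Strokes


def find_similar(
    char: str,
    db: Dict[int, Entry],
    wanted_level: str,
    classify: Callable[[Strokes, Strokes], Optional[str]],
) -> List[str]:
    # 4.1) locate the character by its code value and extract its information
    entry = db.get(ord(char))
    if entry is None:
        return []
    # 4.2) keep only characters with the same structure
    same_structure = [
        e for e in db.values()
        if e.structure == entry.structure and e.char != char
    ]
    # 4.3) keep candidates whose similarity level matches the user's setting
    return [
        e.char for e in same_structure
        if classify(entry.stroke_order, e.stroke_order) == wanted_level
    ]


def exact_only(a: Strokes, b: Strokes) -> Optional[str]:
    """Stand-in rule for the demo: identical stroke order counts as 'high'."""
    return "high" if a == b else None


# Toy data entered by hand for illustration only.
db = {
    ord(e.char): e
    for e in [
        Entry("土", "single", (1, 2, 1)),
        Entry("士", "single", (1, 2, 1)),
        Entry("什", "left-right", (3, 2, 1, 2)),
    ]
}
similar = find_similar("土", db, "high", exact_only)
```

Filtering by structure before comparing stroke orders follows the order of the steps and keeps the more expensive comparison to a smaller candidate set.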
The similarity-level rule in step 1) is as follows. When the stroke counts are identical, a stroke-order match rate of at least 90% is high, at least 70% is medium, and at least 60% is low. When the stroke counts differ by 1, a match rate of at least 80% is high, at least 60% is medium, and at least 50% is low. When the stroke counts differ by 2, a match rate of at least 70% is high, at least 60% is medium, and at least 50% is low.
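The rule above amounts to a small threshold table keyed by the stroke-count difference. The Python sketch below encodes that table directly. The patent does not define how the stroke-order match rate is computed, so `stroke_order_match_rate` uses an assumed definition (strokes that agree position by position, divided by the length of the longer sequence); the function names and the integer stroke encoding are also hypothetical.

```python
from typing import Optional, Sequence

# Minimum match rates (high, medium, low) for each stroke-count difference,
# taken from the rule stated in the text.
THRESHOLDS = {
    0: (0.90, 0.70, 0.60),
    1: (0.80, 0.60, 0.50),
    2: (0.70, 0.60, 0.50),
}


def stroke_order_match_rate(a: Sequence[int], b: Sequence[int]) -> float:
    """ASSUMED definition: position-wise agreement over the longer length."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 0.0
    same = sum(1 for x, y in zip(a, b) if x == y)
    return same / longest


def classify_similarity(a: Sequence[int], b: Sequence[int]) -> Optional[str]:
    """Return 'high', 'medium' or 'low', or None if the rule does not apply."""
    diff = abs(len(a) - len(b))
    if diff not in THRESHOLDS:
        return None  # the rule covers differences of 0, 1 and 2 only
    rate = stroke_order_match_rate(a, b)
    high, medium, low = THRESHOLDS[diff]
    if rate >= high:
        return "high"
    if rate >= medium:
        return "medium"
    if rate >= low:
        return "low"
    return None


# Identical stroke orders, same stroke count: rate 1.0
print(classify_similarity((1, 2, 1), (1, 2, 1)))     # high
# Stroke counts differ by 1, 3 of 4 positions agree: rate 0.75
print(classify_similarity((1, 3, 4), (1, 3, 4, 4)))  # medium
```

Other definitions of the match rate, such as one based on a longest common subsequence, would slot into the same table without changing the thresholds.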
The inquiry device and method of the present invention convert the Chinese character input by the user into its stroke order, search the stroke-order database for similar-shaped characters according to the character's stroke order, structure and related information, and provide related learning information about those similar-shaped characters. This helps the user learn the character and its similar-shaped characters faster and better, and so helps the user avoid misreading or miswriting them.
Description of the drawings
Fig. 1 is a schematic structural diagram of the present invention;
Fig. 2 is a flowchart of the method of the present invention;
Fig. 3 is a flowchart of the method for searching the Chinese character information database in the present invention;
Fig. 4 is a structural diagram of the Chinese character information database of the present invention;
Figs. 5.1 to 5.6 are schematic diagrams of a specific embodiment of searching for similar-shaped Chinese characters according to the present invention.
Detailed description of the embodiments
Referring to Fig. 1, the inquiry device of the present invention comprises an input module 1, a setting module 2, an inquiry module 3, a database module 4 and a display unit 5. The input module 1 is used to input a Chinese character and passes the input character to the setting module 2. The setting module 2 passes the character to the inquiry module 3, which performs the similar-shaped character search; the data searched comprise the structure information, stroke-count information and stroke-order information of the Chinese characters in the database module 4. The setting module 2 sets the similarity-level rule for the similar-shaped characters to be found; the inquiry module 3 then searches the database module 4 for similar-shaped characters according to that rule, and the search results are shown by the display unit 5.
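The data flow between the five modules might be wired up along the following lines. This is only a sketch of the connections described above: every class and method name is hypothetical, the database is reduced to a mapping from character to an assumed integer stroke-order encoding, and the similarity check is injected as a placeholder function rather than implementing the actual rule.

```python
from typing import Callable, Dict, List, Optional, Tuple

Strokes = Tuple[int, ...]  # assumed stroke-order encoding


class DisplayUnit:
    """Display unit 5: records what would be shown on screen."""

    def __init__(self) -> None:
        self.shown: List[List[str]] = []

    def show(self, results: List[str]) -> None:
        self.shown.append(results)


class DatabaseModule:
    """Database module 4: maps each character to its stroke order."""

    def __init__(self, records: Dict[str, Strokes]) -> None:
        self._records = dict(records)

    def get(self, char: str) -> Optional[Strokes]:
        return self._records.get(char)

    def chars(self) -> List[str]:
        return list(self._records)


class InquiryModule:
    """Inquiry module 3: connected to the database module and the display unit."""

    def __init__(
        self,
        database: DatabaseModule,
        display: DisplayUnit,
        is_similar: Callable[[Strokes, Strokes, str], bool],
    ) -> None:
        self._database = database
        self._display = display
        self._is_similar = is_similar

    def search(self, char: str, level: str) -> None:
        query = self._database.get(char)
        results: List[str] = []
        if query is not None:
            for other in self._database.chars():
                strokes = self._database.get(other)
                if other != char and strokes is not None and self._is_similar(query, strokes, level):
                    results.append(other)
        self._display.show(results)


class SettingModule:
    """Setting module 2: holds the similarity level and forwards the character."""

    def __init__(self, inquiry: InquiryModule) -> None:
        self._inquiry = inquiry
        self.level = "high"

    def set_level(self, level: str) -> None:
        self.level = level

    def forward(self, char: str) -> None:
        self._inquiry.search(char, self.level)


class InputModule:
    """Input module 1: passes the entered character to the setting module."""

    def __init__(self, setting: SettingModule) -> None:
        self._setting = setting

    def enter(self, char: str) -> None:
        self._setting.forward(char)


# Toy wiring; the lambda is a placeholder for the real similarity rule.
display = DisplayUnit()
database = DatabaseModule({"土": (1, 2, 1), "士": (1, 2, 1), "大": (1, 3, 4)})
inquiry = InquiryModule(database, display, is_similar=lambda a, b, level: a == b)
setting = SettingModule(inquiry)
input_module = InputModule(setting)

setting.set_level("high")
input_module.enter("土")
```

Each object holds a reference only to the module it is described as being connected to, so the object graph mirrors the connections in Fig. 1.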
Referring to Fig. 4, the Chinese character information data held in the database module 4 are organized as shown in Fig. 4.
Referring to Figs. 2 and 3, a query method for similar-shaped Chinese characters comprises the following steps:
1) enter the setting module 2, where the user sets, as needed, the rule for the similarity level of the similar-shaped characters to be searched for. The similarity-level rule is as follows. When the stroke counts are identical, a stroke-order match rate of at least 90% is high, at least 70% is medium, and at least 60% is low. When the stroke counts differ by 1, a match rate of at least 80% is high, at least 60% is medium, and at least 50% is low. When the stroke counts differ by 2, a match rate of at least 70% is high, at least 60% is medium, and at least 50% is low;
2) open the input method and input a Chinese character;
3) obtain the stroke order and related information of the input character from the Chinese character information database 4;
4) based on the stroke order and related information obtained in step 3), search the Chinese character information database for similar-shaped characters of the input character according to the rule set in step 1); if any are found, proceed to step 5); if none are found, prompt the user that no related similar-shaped characters were found and return to input again;
The specific steps for searching for similar-shaped characters are:
4.1) locate the character in the Chinese character information database by its code-value index, and extract its structure information, stroke-count information and stroke-order information;
4.2) search for characters that have the same structure, according to the structure information of the character;
4.3) find the characters of the corresponding similarity level according to the rule in step 1).
5) display the similar-shaped characters found, and display the related learning information or obtain it through a cross-library search.
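The branch in step 4), where a failed search prompts the user and returns to input, can be illustrated with a short control-flow sketch. The function `run_queries`, the message wording, and the idea of modelling repeated user input as a sequence of characters are assumptions for this example; the actual search is abstracted behind a `lookup` function.

```python
from typing import Callable, Iterable, List


def run_queries(
    inputs: Iterable[str],
    lookup: Callable[[str], List[str]],
) -> List[str]:
    """Process inputs in turn until one search succeeds.

    `inputs` stands in for the user re-entering characters;
    `lookup` stands in for the search of steps 3) and 4).
    """
    messages: List[str] = []
    for char in inputs:
        found = lookup(char)
        if found:
            # step 5): display the similar-shaped characters found
            messages.append(f"{char}: {' '.join(found)}")
            break
        # nothing found: prompt the user and go back to input
        messages.append(f"{char}: no similar-shaped characters found, please re-enter")
    return messages


# Toy lookup table entered by hand for illustration only.
table = {"土": ["士"]}
messages = run_queries(["一", "土"], lambda c: table.get(c, []))
```

In this run the first input has no entry in the toy table, so a prompt is recorded and the second input is processed.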
A specific embodiment of the query method for similar-shaped Chinese characters in the present invention is shown in Figs. 5.1 to 5.6. Taking the character meaning "examine" as an example: the user first sets the similarity level of the similar-shaped characters to be looked up (Fig. 5.2); the user then inputs the character through the input method (Fig. 5.3); the similar-shaped characters of the input character are searched for and the results are displayed (Fig. 5.4); based on the results, the user chooses to study the related knowledge of a similar-shaped character (Figs. 5.5 and 5.6), which helps the user grasp the character and its similar-shaped characters more quickly.

Claims (5)

1. An inquiry device for similar-shaped Chinese characters, characterized in that the inquiry device comprises an input module, a setting module, an inquiry module, a database module and a display unit; the input module is connected to the setting module, the inquiry module is connected to the setting module, the database module is connected to the inquiry module, and the display unit is connected to the inquiry module.
2. The inquiry device for similar-shaped Chinese characters according to claim 1, characterized in that the database module contains the structure information, stroke-count information and stroke-order information of Chinese characters.
3. A query method for similar-shaped Chinese characters, characterized in that the query method comprises the following steps:
1) enter the setting module, where the user sets, as needed, the rule for the similarity level of the similar-shaped characters to be searched for;
2) open the input method and input a Chinese character;
3) obtain the stroke order and related information of the input character from the Chinese character information database;
4) based on the stroke order and related information obtained in step 3), search the Chinese character information database for similar-shaped characters of the input character according to the rule set in step 1); if any are found, proceed to step 5);
5) display the similar-shaped characters found, and display the related learning information or obtain it through a cross-library search.
4. The query method for similar-shaped Chinese characters according to claim 3, characterized in that the specific steps for searching for similar-shaped characters in step 4) are:
4.1) locate the character in the Chinese character information database by its code-value index, and extract its structure information, stroke-count information and stroke-order information;
4.2) search for characters that have the same structure, according to the structure information of the character;
4.3) find the characters of the corresponding similarity level according to the rule in step 1).
5. The query method for similar-shaped Chinese characters according to claim 4, characterized in that the similarity-level rule in step 1) is as follows: when the stroke counts are identical, a stroke-order match rate of at least 90% is high, at least 70% is medium, and at least 60% is low; when the stroke counts differ by 1, a match rate of at least 80% is high, at least 60% is medium, and at least 50% is low; when the stroke counts differ by 2, a match rate of at least 70% is high, at least 60% is medium, and at least 50% is low.
CN 201010551856 2010-11-19 2010-11-19 Inquiry device of similar-shaped Chinese characters and method thereof Pending CN101984436A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201010551856 CN101984436A (en) 2010-11-19 2010-11-19 Inquiry device of similar-shaped Chinese characters and method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201010551856 CN101984436A (en) 2010-11-19 2010-11-19 Inquiry device of similar-shaped Chinese characters and method thereof

Publications (1)

Publication Number Publication Date
CN101984436A (en) 2011-03-09

Family

ID=43641605

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201010551856 Pending CN101984436A (en) 2010-11-19 2010-11-19 Inquiry device of similar-shaped Chinese characters and method thereof

Country Status (1)

Country Link
CN (1) CN101984436A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102750841A (en) * 2011-04-20 2012-10-24 英业达股份有限公司 System and method for providing homograph to study Chinese character
CN103678272A (en) * 2012-09-17 2014-03-26 北京信息科技大学 Method for processing unknown words in Chinese-language dependency tree banks
TWI494775B (en) * 2011-05-18 2015-08-01 Inventec Corp System for learning chinese word using likeness words and method thereof
CN106598920A (en) * 2016-11-28 2017-04-26 昆明理工大学 Similar Chinese character classification method combining stroke codes with Chinese character dot matrixes

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102750841A (en) * 2011-04-20 2012-10-24 英业达股份有限公司 System and method for providing homograph to study Chinese character
TWI494775B (en) * 2011-05-18 2015-08-01 Inventec Corp System for learning chinese word using likeness words and method thereof
CN103678272A (en) * 2012-09-17 2014-03-26 北京信息科技大学 Method for processing unknown words in Chinese-language dependency tree banks
CN103678272B (en) * 2012-09-17 2016-04-06 北京信息科技大学 The disposal route of unregistered word in the interdependent treebank of Chinese
CN106598920A (en) * 2016-11-28 2017-04-26 昆明理工大学 Similar Chinese character classification method combining stroke codes with Chinese character dot matrixes
CN106598920B (en) * 2016-11-28 2019-09-27 昆明理工大学 A kind of nearly word form classification method of stroke coding combination Chinese character dot matrix

Similar Documents

Publication Publication Date Title
CN110046350A (en) Grammatical bloopers recognition methods, device, computer equipment and storage medium
CN109446521B (en) Named entity recognition method, named entity recognition device, electronic equipment and machine-readable storage medium
CN101950285A (en) Utilize native language pronunciation string converting system and the method thereof of statistical method to Chinese character
JP2009110159A (en) Location expression detection device, program, and storage medium
CA2969593A1 (en) Method for text recognition and computer program product
CN107577663B (en) Key phrase extraction method and device
CN101639734A (en) Chinese input method and device thereof
JP5502814B2 (en) Method and system for assigning diacritical marks to Arabic text
CN103678362A (en) Search method and search system
CN101984436A (en) Inquiry device of similar-shaped Chinese characters and method thereof
CN101894160B (en) Intelligent search method
CN101986309A (en) Method and device for inquiring question bank
US20120109994A1 (en) Robust auto-correction for data retrieval
CN102243708B (en) Handwriting recognition method, handwriting recognition system and handwriting recognition terminal
CN103455527A (en) Handwritten document retrieval apparatus, handwritten document retrieval method and recording medium
Jain et al. BLSTM neural network based word retrieval for Hindi documents
CN115470307A (en) Address matching method and device
CN101539433A (en) Searching method with first letter of pinyin and intonation in navigation system and device thereof
Skylaki et al. Legal entity extraction using a pointer generator network
WO2012152039A1 (en) Method and device for determining candidate character in handwriting input
CN102222417A (en) Device and method for checking and studying characters with proximate forms
CN101770478B (en) Data retrieval method, data retrieval engine and embedded terminal
CN100501656C (en) Tone and shape combination method for inputting Chinese character into electronic apparatus
KR100629862B1 (en) The korean transcription apparatus and method for transcribing convert a english language into a korea language
KR102355731B1 (en) Analysis program, analysis method, and analysis device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20110309