CN110580348A - chinese character and Russian intercommunication mutual recognition technical method - Google Patents

chinese character and Russian intercommunication mutual recognition technical method Download PDF

Info

Publication number
CN110580348A
CN110580348A CN201710540896.6A CN201710540896A CN110580348A CN 110580348 A CN110580348 A CN 110580348A CN 201710540896 A CN201710540896 A CN 201710540896A CN 110580348 A CN110580348 A CN 110580348A
Authority
CN
China
Prior art keywords
russian
picture
chinese character
chinese
character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710540896.6A
Other languages
Chinese (zh)
Inventor
艾朝君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201710540896.6A priority Critical patent/CN110580348A/en
Publication of CN110580348A publication Critical patent/CN110580348A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/153Segmentation of character regions using recognition of characters or words

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)

Abstract

the invention relates to a system implementation method for mutually recognizing Russian by Chinese character intercommunication, which comprises a method for mutually recognizing Chinese character patterns according to Russian intercommunication and a method for inquiring corresponding Russian according to the Chinese character patterns: the method for mutually recognizing the character patterns according to the Russian intercommunication comprises two steps of establishing data mapping between the character patterns and corresponding Russian, realizing network service and providing a function of mutual intercommunication and recognition; the method for inquiring corresponding Russian characters according to the Chinese character patterns comprises two steps of establishing a Chinese character pattern characteristic character string database and a Chinese character pattern recognition implementation method and algorithm.

Description

chinese character and Russian intercommunication mutual recognition technical method
1. Field of the invention
The invention relates to the field of intercommunication and mutual recognition of Chinese characters and various languages, in particular to a method for realizing direct intercommunication and mutual recognition of Chinese and Russian applied to a intercommunication and mutual recognition system of Chinese and various languages.
2. background of the invention
The characters are tools for expressing ideas and developing life communication for human beings, have long history and huge characters, and are universal for human beings speaking Chinese. The number of Chinese characters is the first in the world at present, and 5000 years of culture inherits China to enhance the centripetal force and cohesion of each nation. The culture essence propagated by the Chinese characters strengthens the recognition among all nationalities, and is a binder for cultural communication of all nationalities in history. The Chinese spline is used as a long-term basis for the large country, and one of the important reasons is that the Chinese character-carrying culture is always recognized by each nation.
chinese characters are one of official characters of the united nations, all files must be translated into Chinese characters for storage, and foreigners learning Chinese \ Chinese characters are increasing along with increasing strength of Chinese nations. With the historical status of China in the world, the essence of Chinese famous family, Chinese character, should also become the universal language in the world.
however, in order to promote the Chinese character culture more quickly and more widely, a set of system for intercommunicating and mutually recognizing the Chinese characters and the languages and characters of all countries in the world needs to be established by means of the current high technology, each Chinese character is collected into a database of the system by the system, then a Chinese character learner accesses the database of the system by using a terminal capable of being connected with the internet, information in the database is fed back to the terminal, and the learner queries the character patterns of the Chinese characters according to the languages and characters in the world or queries the corresponding languages and characters in the world according to the character patterns of the Chinese characters.
3. summary of the invention
the invention aims to overcome the defects in the prior art and provides a method for realizing direct intercommunication and mutual recognition of Chinese and Russian in a intercommunication and mutual recognition system of Chinese and various languages.
a system implementation method for Chinese direct intercommunication mutual recognition Russian is characterized in that: the method comprises a method for mutually recognizing Chinese character patterns according to Russian intercommunication and a method for inquiring corresponding Russian according to the Chinese character patterns.
The method for mutually recognizing the font of the Chinese character book according to Russian intercommunication comprises the following steps:
step 1), establishing data mapping of Chinese character patterns corresponding to Russian: all Russian translations are translated and written into Chinese character patterns, the written Chinese character patterns are scanned into electronic versions, each character generates a picture, the picture is stored into a jpeg format picture file named by corresponding Russian, the jpeg format picture file and the corresponding Chinese characters are mapped in a one-to-one mode, and a database is established.
Step 2), realizing network service, and providing the intercommunication mutual recognition function of Chinese characters and Russian: installing a Chinese character text input box with a Chinese character intercommunication and mutual recognition function on hardware connected with the Internet for a user to input Chinese characters, wherein the Chinese character text input box is connected with a database storing a jpeg format picture file, the user selects the Russian intercommunication and mutual recognition function on the Chinese character text input box, and a background service inquires a picture corresponding to Russian from the database mapped by Russian and Chinese character font pictures according to the Chinese characters, transmits the picture to a client and displays the picture on the client for the user to check and use;
the method for identifying corresponding Russian according to Chinese character intercommunication comprises the following steps:
Step 1), establishing a Chinese character font characteristic character string database;
compiling and writing all Russian characters into corresponding Chinese characters, scanning the written Chinese characters into electronic versions, generating a picture for each character, storing the picture into a JPEG format picture file named by the corresponding Russian characters, processing each picture file respectively to generate corresponding characteristic strings, wherein the generation method of the characteristic strings comprises the following steps:
firstly, reading an image;
reading the generated original picture in the corresponding Russian named JPEG format into picture processing software;
secondly, color processing;
Processing the picture added in the first step in picture processing software to completely generate white black characters, wherein the white black characters are black fonts, and the background except the fonts is white;
thirdly, cutting;
processing the picture obtained in the second step through picture processing software, cutting off the vacant part outside the character horizontally and vertically, and overlapping the edge of the outermost side of the character with the edge of the picture;
fourthly, compressing;
Compressing the picture obtained after the third step to obtain a picture with a standard size;
Fifthly, generating a characteristic string;
Scanning each pixel point of the picture obtained after the fourth step, taking black as 1 and white as 0 to obtain a 64-bit string,
All Chinese character fonts are processed according to the method to obtain character strings, pictures and Russian translation which are mapped one to one, and a character database is established;
step 2), a Chinese character font identification realization method and an algorithm:
step one, imaging:
the method comprises the steps that Chinese character recognition software with a photographing function is installed on hardware equipment connected with the Internet, the equipment connected with the Internet has the photographing function, Chinese characters needing to be recognized are photographed and taken by the hardware equipment, the size of an image obtained by photographing is set to be a fixed size, and only a single Chinese character font needing to be recognized is required to be ensured in the image obtained by photographing during photographing.
and a second step of treatment:
processing the picture generated in the first step according to the picture processing method in the step 1) of establishing the Chinese character font characteristic character string to obtain the Chinese character font characteristic character string in the picture.
step three, comparison:
and uploading the character string of the Chinese character font to be identified, which is calculated in the second step, to a server, and comparing the character string with all character strings in the database to find the character string with the highest similarity to the character string of the Chinese character font to be identified.
And fourthly, displaying:
and after finding the character string with the highest similarity to the characteristic character string of the Chinese character font to be identified according to the third step, searching the picture and the Russian translation corresponding to the character string from the database according to the character string, transmitting the picture and the Russian translation to software installed on hardware connected with the Internet by a background through the Internet, and displaying the picture and the Russian translation through an interface of the software for comparison, study and use of a user.
in the step 1) of the method for mutually identifying corresponding russian characters according to Chinese character font intercommunication, the picture size of the standard size picture generated in the fourth step is 8 × 8, and the unit is millimeter.
The size of the picture obtained by photographing in the first step of the step 2) of the method for mutually identifying corresponding russian according to Chinese character font intercommunication is set to be 800 × 600 in unit of millimeter.
In the first step of the method step 2) corresponding to Russian according to Chinese character font intercommunication mutual identification, the hardware connected with the Internet is a computer with a photographing function.
In the first step of the method step 2) corresponding to Russian according to Chinese character font intercommunication mutual identification, hardware connected with the Internet is other intelligent equipment such as a smart phone, a smart watch and the like with a photographing function.
the invention has originality, the invention uses the invention to adopt the high-tech informatization of Chinese character font, every Chinese character uses the record of making a video recording, input in the Chinese character database, every Chinese character has Russian translation corresponding to it, Chinese intercommunication that uses this method to set up knows Russian system each other, for Russian users according to the writing method of the straight-through Russian of Chinese character, and after providing Russian learner to see a certain Chinese character, intercommunicate each other and recognize the corresponding Russian immediately, especially when intercommunicating and recognizing the corresponding Russian with the Chinese character font, the Chinese intercommunication that uses the invention to set up knows the recognition accuracy of the Russian system to the Chinese character font more than 99%, help people to know and learn Chinese, help the popularization, use of the Chinese character in the whole world, help to use, is more than 99%
4. Description of the drawings
the above and other aspects and advantages of the present invention will become more readily apparent by describing in more detail exemplary embodiments thereof with reference to the attached drawings, in which:
FIG. 1: the design method adopted by the invention is a flow chart;
fig. 2 is a picture in an embodiment of the invention.
fig. 3 is a picture in an embodiment of the invention.
fig. 4 is a picture in an embodiment of the invention.
5. Detailed description of the preferred embodiments
the present invention now will be described more fully hereinafter with reference to the accompanying drawings, in which various embodiments are shown. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.
hereinafter, exemplary embodiments of the present invention will be described in more detail with reference to the accompanying drawings.
Example (b): as shown in fig. 1, a method for implementing a chinese intercommunication mutual recognition russian system includes a method for mutually recognizing russian according to chinese intercommunication and a method for mutually recognizing corresponding chinese according to russian intercommunication;
The method for mutually recognizing Russian characters according to Chinese character intercommunication comprises the following steps:
Step 1) establishing data mapping between Chinese character patterns and corresponding Russian: writing all Russian characters into Chinese character patterns, scanning the written Chinese character patterns into electronic versions, generating a picture for each character, storing the picture into a Peg format picture file named by corresponding Russian characters, mapping the Peg format picture file and the corresponding Russian characters one to one, and storing the Peg format picture file and the corresponding Russian characters into a database;
the following examples of the chinese characters and the corresponding russian named Peg format picture file:
chinese characters Russian named Chinese character pattern picture
han dynasty style toy Han, JP
Character (Chinese character) word, JP
Character (Chinese character) Word, JP
Shape of Form, JP
Step 2), realizing network service, and providing a query function: the method comprises the steps that mobile phone software with a Chinese character text input box with a query function is installed on a mobile phone connected with the Internet, a user can input Chinese characters, the Chinese character text input box is connected with a database storing a Peg format picture file, the user selects a Russian intercommunication mutual identification function on the Chinese character text input box, a background service queries pictures corresponding to Russian from the database mapped by Chinese characters and Chinese character font pictures and transmits the pictures to a client, and the pictures are displayed on the client and can be checked and learned by the user;
The method for inquiring corresponding Russian language according to the Chinese character patterns comprises the following steps:
step 1), establishing a font character string database;
Writing all Russian characters into Chinese character patterns, scanning the written Chinese character patterns into electronic versions, generating a picture for each character, storing the picture into a Peg format picture file named in corresponding Russian, processing each picture file respectively to generate corresponding characteristic character strings, wherein the generation method of the characteristic character strings comprises the following steps:
First step, image reading:
Reading the generated original picture in the Peg format named in Russian into picture processing software, taking "Ming" as an example, the read original picture is shown in FIG. 2
step two, color processing:
the picture added in the first step is processed in the picture processing software, and the 'white (0xFFFFFF) black (0x000000) character' is completely generated, wherein the 'white black character' is that the font is black, the background except the font is white, and the processed picture is as shown in figure 3
Third, cutting treatment:
Processing the picture obtained in the second step through picture processing software, horizontally and vertically cutting off the spare part outside the character, and overlapping the edges of the outermost sides of the upper direction, the lower direction, the left direction and the right direction of the character with the edges of the picture;
The processed picture is shown in fig. 4;
step four, compression treatment:
compressing each pixel point of the picture obtained after the third step to obtain an 8 x 8 picture;
fifthly, generating a characteristic string:
scanning each pixel point of the picture obtained after the fourth step, wherein black is 1, white is 0, a 64-bit character string can be obtained, and the character string obtained after the Chinese character font of the 'Ming' character is processed is as follows:
0000001100000100000100001000101011111110000101000010001000100100
mapping the word strings obtained after processing with pictures, Chinese characters, pinyin and Russian translations in a one-to-one manner, and storing the mapping in a database;
Step 2), a Chinese character font identification realization method and an algorithm:
the first step is as follows: imaging:
the method comprises the steps that Chinese character recognition software with a photographing function is installed on a smart phone connected with the Internet, the smart phone connected with the Internet has the photographing function, a smart camera is used for photographing and taking images of Chinese character patterns needing to be recognized, a flash lamp on the smart phone is turned on during photographing, accordingly, the phenomenon that shadow exists on a picture and the picture quality is affected is avoided, the size of the picture obtained through photographing is set to be 800 x 600, and during photographing, it is required to be ensured that the picture obtained through photographing only has a single Chinese character pattern needing to be recognized;
the second step is that: and (3) treatment:
processing the picture generated in the first step according to the picture processing method in the step 1) of establishing the character string of the Chinese character font to obtain the character string of the Chinese character font in the picture;
the third step: and (3) comparison:
uploading the characteristic character string of the Chinese character font to be identified calculated in the second step to a server, comparing the characteristic character string with all character strings in a database, and finding out the character string with the highest similarity with the characteristic character string of the Chinese character font to be identified, wherein a similarity comparison algorithm is shown as the following table:
The fourth step: displaying:
and after finding the character string with the highest similarity to the characteristic character string of the character pattern to be identified according to the third step, finding picture Chinese characters, pinyin and Russian corresponding to the character string from the database according to the character string, and comparing and learning the pictures, Chinese characters, pinyin and Russian by the background through the Internet for users.
through tests, according to the algorithm, the accuracy of the Chinese character pattern recognition is over 99%.
the above description is only an example of the present invention, and is not intended to limit the present invention. The invention is susceptible to various modifications and alternative forms. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (5)

1. A Chinese character and Russian intercommunication mutual identification technique,
the method is characterized in that:
The method comprises a method for directly corresponding to Russian and a method for directly corresponding to Chinese characters through Russian;
the technical method for intercommunicating and mutually recognizing the Chinese characters and the Russian comprises the following steps:
step 1, establishing data mapping between Chinese characters and corresponding Russian: compiling and writing all Russian languages into Chinese characters, scanning the written Chinese character patterns into electronic versions, generating a picture for each character, storing the picture into a jpeg format picture file named by the corresponding Russian language, and mapping the jpeg format picture file and the corresponding Chinese characters one to build a database;
step 2, realizing network service, and providing the intercommunication mutual recognition function of Chinese characters and Russian: installing a Chinese character text input box with a Chinese character intercommunication and mutual recognition function on hardware connected with the Internet for a user to input Chinese characters, wherein the Chinese character text input box is connected with a database storing a jpeg format picture file, the user selects the Russian intercommunication and mutual recognition function on the Chinese character text input box, and a background service inquires a picture corresponding to Russian from the database mapped by Russian and Chinese character font pictures according to the Chinese characters, transmits the picture to a client and displays the picture on the client for the user to check and use;
the method for identifying corresponding Russian according to Chinese character intercommunication comprises the following steps:
Step 1), establishing a Chinese character characteristic character string database;
compiling and writing all Russian characters into corresponding Chinese characters, scanning the written Chinese characters into electronic versions, generating a picture for each character, storing the picture into a JPEG format picture file named by the corresponding Russian characters, processing each picture file respectively to generate corresponding characteristic strings, wherein the generation method of the characteristic strings comprises the following steps:
Firstly, reading an image;
Reading the generated original picture in the corresponding Russian named JPEG format into picture processing software;
secondly, color processing;
Processing the picture added in the first step in picture processing software to completely generate white black characters, wherein the white black characters are black fonts, and the background except the fonts is white;
thirdly, cutting;
Processing the picture obtained in the second step through picture processing software, cutting off the vacant part outside the character horizontally and vertically, and overlapping the edge of the outermost side of the character with the edge of the picture;
fourthly, compressing;
compressing the picture obtained after the third step to obtain a picture with a standard size;
fifthly, generating a characteristic string;
scanning each pixel point of the picture obtained after the fourth step, taking black as 1 and white as 0 to obtain a 64-bit string,
all Chinese character fonts are processed according to the method to obtain character strings, and the character strings are subjected to one-to-one mapping with pictures and Russian translations, so that a character string database is established;
Step 2), a Chinese character recognition realization method and algorithm:
Step one, imaging:
the method comprises the steps that Chinese character recognition software with a photographing function is installed on hardware equipment connected with the Internet, the equipment connected with the Internet has the photographing function, the hardware equipment is used for photographing Chinese characters to be recognized, the size of an image obtained by photographing is set to be a fixed size, and only a single Chinese character font needing to be recognized is required to be ensured in the image obtained by photographing during photographing.
and a second step of treatment:
Processing the picture generated in the first step according to the picture processing method in the step 1) of establishing the Chinese character font characteristic character string to obtain the Chinese character font characteristic character string in the picture.
Step three, comparison:
and uploading the character string of the Chinese character font to be identified, which is calculated in the second step, to a server, and comparing the character string with all character strings in the database to find the character string with the highest similarity to the character string of the Chinese character font to be identified.
and fourthly, displaying:
And after finding the character string with the highest similarity to the characteristic character string of the Chinese character font to be identified according to the third step, searching the picture and the Russian translation corresponding to the character string from the database according to the character string, transmitting the picture and the Russian translation to software installed on hardware connected with the Internet by a background through the Internet, and displaying the picture and the Russian translation through an interface of the software for comparison, study and use of a user.
2. the method for implementing Chinese character and Russian intercommunication as recited in claim 1, wherein: in the step 1) of the method for mutually identifying corresponding russian characters according to Chinese character font intercommunication, the picture size of the standard size picture generated in the fourth step is 8 × 8, and the unit is millimeter.
3. The method for implementing Chinese character and Russian intercommunication as recited in claim 1, wherein: the size of the picture obtained by photographing in the first step of the step 2) of the method for mutually identifying corresponding russian according to Chinese character font intercommunication is set to be 800 × 600 in unit of millimeter.
4. The method for implementing Chinese character and Russian intercommunication as recited in claim 1, wherein:
in the first step of the method step 2) corresponding to Russian according to Chinese character font intercommunication mutual identification, the hardware connected with the Internet is a computer with a photographing function.
5. the method for implementing Chinese character and Russian intercommunication as recited in claim 1, wherein:
in the first step of the method step 2) corresponding to Russian according to Chinese character font intercommunication mutual identification, hardware connected with the Internet is other intelligent equipment such as a smart phone, a smart watch and the like with a photographing function.
CN201710540896.6A 2017-07-04 2017-07-04 chinese character and Russian intercommunication mutual recognition technical method Pending CN110580348A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710540896.6A CN110580348A (en) 2017-07-04 2017-07-04 chinese character and Russian intercommunication mutual recognition technical method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710540896.6A CN110580348A (en) 2017-07-04 2017-07-04 chinese character and Russian intercommunication mutual recognition technical method

Publications (1)

Publication Number Publication Date
CN110580348A true CN110580348A (en) 2019-12-17

Family

ID=68808705

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710540896.6A Pending CN110580348A (en) 2017-07-04 2017-07-04 chinese character and Russian intercommunication mutual recognition technical method

Country Status (1)

Country Link
CN (1) CN110580348A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5303312A (en) * 1991-04-19 1994-04-12 International Business Machines Corporation Handwriting recognition by character template
CN102637168A (en) * 2012-03-19 2012-08-15 深圳市共进电子股份有限公司 Method for realizing automatic language translation in graphical user interface
CN103778250A (en) * 2014-02-19 2014-05-07 张朝亮 Implement method for Chinese wubi cursive script dictionary query system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5303312A (en) * 1991-04-19 1994-04-12 International Business Machines Corporation Handwriting recognition by character template
CN102637168A (en) * 2012-03-19 2012-08-15 深圳市共进电子股份有限公司 Method for realizing automatic language translation in graphical user interface
CN103778250A (en) * 2014-02-19 2014-05-07 张朝亮 Implement method for Chinese wubi cursive script dictionary query system

Similar Documents

Publication Publication Date Title
US9785627B2 (en) Automated form fill-in via form retrieval
RU2634194C1 (en) Verification of optical character recognition results
CN109753968A (en) Generation method, device, equipment and the medium of character recognition model
CN104253904A (en) Method and smartphone for implementing reading learning
CN111753717B (en) Method, device, equipment and medium for extracting structured information of text
CN109034148A (en) One kind is based on character image identification audio reading method and its device
CN108304815A (en) A kind of data capture method, device, server and storage medium
CN110580359A (en) Chinese character and Arabic intercommunication mutual identification technical method
CN110580343A (en) Chinese character and Urdu intercommunication mutual recognition technical method
CN110516125B (en) Method, device and equipment for identifying abnormal character string and readable storage medium
CN111881900A (en) Corpus generation, translation model training and translation method, apparatus, device and medium
CN110580348A (en) chinese character and Russian intercommunication mutual recognition technical method
US20160283444A1 (en) Human input to relate separate scanned objects
CN110580353A (en) intercommunication mutual identification technical method for Chinese characters and Vietnamese
CN110580349A (en) chinese character and Persian intercommunication mutual identification technical method
CN110580345A (en) chinese character and French intercommunication mutual identification technical method
CN110580346A (en) intercommunication mutual identification technical method for Chinese characters and Bengali
CN110580354A (en) chinese character and Japanese intercommunicating and mutual identifying technical method
CN110580356A (en) Chinese character and German intercommunication mutual identification technical method
CN110580344A (en) intercommunication mutual identification technical method of Chinese characters and Spanish language
CN110580357A (en) chinese character and Korean intercommunication mutual identification technical method
CN110580358A (en) intercommunication mutual identification technical method for Chinese characters and Sanskrit
CN110580355A (en) Intercommunication mutual identification technique for Chinese characters and all language characters
CN110580360A (en) intercommunication mutual identification technique for Chinese characters and all language characters
CN110580350A (en) Chinese character and English intercommunication mutual identification technical method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20191217