AU2021101278A4 - System and Method for Automatic Language Detection for Handwritten Text - Google Patents
System and Method for Automatic Language Detection for Handwritten Text Download PDFInfo
- Publication number
- AU2021101278A4 AU2021101278A4 AU2021101278A AU2021101278A AU2021101278A4 AU 2021101278 A4 AU2021101278 A4 AU 2021101278A4 AU 2021101278 A AU2021101278 A AU 2021101278A AU 2021101278 A AU2021101278 A AU 2021101278A AU 2021101278 A4 AU2021101278 A4 AU 2021101278A4
- Authority
- AU
- Australia
- Prior art keywords
- language
- handwritten
- text
- inputs
- handwritten text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 238000000034 method Methods 0.000 title claims abstract description 31
- 238000001514 detection method Methods 0.000 title abstract description 5
- 239000013598 vector Substances 0.000 claims abstract description 7
- 238000010586 diagram Methods 0.000 claims description 9
- 238000012545 processing Methods 0.000 claims description 4
- 238000013519 translation Methods 0.000 abstract description 9
- 230000008569 process Effects 0.000 abstract description 3
- 230000009466 transformation Effects 0.000 abstract description 2
- 238000013528 artificial neural network Methods 0.000 abstract 1
- 230000000306 recurrent effect Effects 0.000 abstract 1
- 238000000844 transformation Methods 0.000 abstract 1
- 208000006011 Stroke Diseases 0.000 description 21
- 230000014616 translation Effects 0.000 description 8
- 230000011218 segmentation Effects 0.000 description 4
- 238000013459 approach Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 241001522296 Erithacus rubecula Species 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000005714 functional activity Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 238000013077 scoring method Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/42—Data-driven translation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/263—Language identification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/22—Character recognition characterised by the type of writing
- G06V30/226—Character recognition characterised by the type of writing of cursive writing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/18—Extraction of features or characteristics of the image
- G06V30/182—Extraction of features or characteristics of the image by coding the contour of the pattern
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/22—Character recognition characterised by the type of writing
- G06V30/226—Character recognition characterised by the type of writing of cursive writing
- G06V30/2268—Character recognition characterised by the type of writing of cursive writing using stroke segmentation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Multimedia (AREA)
- Character Discrimination (AREA)
Abstract
System and Method for Automatic Language Detection for Handwritten Text
In the view of transformations of technology and embedded with smart system has been taken
more attention since 1990s. Automatic language detection (ALD) for handwritten text are
perform by novel method and system device. Indeed, ALD system is deploy prior to sending
specific descriptions of the handwritten texts to recognition section. However, multi-input could
be considered for this interface, which is associated with coordinates of the inputs entity and time
period recorded for inputs. Meanwhile also need to focus on groups of text (handwritten) inputs
and transform in terms of word based on the coordinates and time span. Hereafter it will be form
of writing strokes for regularising process and individual words will be process for generate
language vectors. Moreover, the language vectors will be recognised the probabilities of
language of the handwritten texts through recurrent neural network (RRN). In addition, based on
language probabilities output, the handwritten inputs are transfer to a specific language
recognition system for determined the language thereof before to translation.
Description
System and Method for Automatic Language Detection for Handwritten Text
This invention relates to a system and method of Automatic language detection (ALD) for handwritten text are perform by novel device.
Automatic handwritten recognition systems permit to user about inputs the handwritten text for transformed into the word form. In current inventions allow to user to input handwritten text and also need to download the language packs for better performed the transformation functions. It has been obserded that online system having the translations system, which is provide strokes of handwritten text into available language (in database). However, for calibrating point of view the confidence scores from each corresponding to meaningful picked the correct results is complex and difficult. And such approach does not work in large scale with good accuracy. Moreover, the language recognizers produce result and sufficient suggestions in several language inputs.
AU 2015318386 B2: Hu, Yulong; Zhang, Yintian; Zhu, Bo;Wei, Si; Hu, Guoping; Hu, Yu; Liu, Qingfeng: An intelligent scoring method and system for a text objective question, the method comprising: acquiring an answer image of a text objective question (101); segmenting the answer image to obtain one or more segmentation results of an answer string to be identified (102); determining whether any of the segmentation results has the same number of characters as the standard answer (103); if no, the answer is determined to be wrong (106); otherwise, calculating identification confidence of the segmentation result having the same number of words as the standard answer, and/or calculating the identification confidence of respective characters in the segmentation result having the same number of words as the standard answer (104); determining whether the answer is correct according to the calculated identification confidence (105). The method can automatically score text objective questions, thus reducing consumption of human resource, and improving scoring efficiency and accuracy.
US 8,014,603 B2: Jose A. Rodriguez Serrano, Florent C. Perronnin; method of characterizing a word image includes traversing the word image stepwise with a window to provide a plurality of window images. For each of the plurality of window images, the method includes splitting the window image to provide a plurality of cells. A feature, such as a gradient direction histogram, is extracted from each of the plurality of cells. The word image can then be characterized based on the features extracted from the plurality of window images.
US 10, 185, 882 B2: Yousef S. I. Elarian; Systems and associated methodology are presented for Ara bic handwriting synthesis including accessing character shape images of an alphabet, determining a connection point location between two or more character shapes based on a calculated right edge position and a calculated left edge position of the character shape images extracting character features that describe language attributes and width attributes of characters of the character shape images , the language attributes including character Kashida attributes, and generating images of cursive text based on the character Kashida attribues and the width attribues.
US 10,643,067 B2: Romain Bednarowicz; Robin M6Linand; Claire Sidoli; Fabien Ric; Khaoula Elagouni; David Hebert; Fabio Valente; Gregory Cousin; Ma1 Nagot; Cyril Cerovic; Anne Bonnaud; A system, method and computer program product for hand drawing diagrams including text and non - text elements on a computing device are provided. The computing device has a processor and a non - transitory computer readable medium for detecting and recognizing hand - drawing diagram element input under control of the processor. Display of input diagram elements in interactive digital ink is performed on a display device associated with the computing device. One or more of the diagram elements are associated with one or more other of the diagram elements in accordance with a class and type of each diagram element. The diagram elements are re - displayed based on one or more interactions with the digital ink received and in accordance with the one or more associations.
US 2018 / 0137350 Al: Felipe Petroski SUCH; Raymond PTUCHA; Frank BROCKLER; Paul HUTKOWSKI; Embodiments of the present disclosure include a method that obtains a digital image. The method includes extracting a word block from the digital image. The method includes processing the word block by evaluating a value of the word block against a dictionary. The method includes outputting a prediction equal to a common word in the dictionary when a confidence factor is greater than a predetermined thresh old. The method includes processing the word block and assigning a descriptor to the word block corresponding to a property of the word block. The method includes processing the word block using the descriptor to prioritize evaluation of the word block. The method includes concatenating a first output and a second output. The method includes predicting a value of the word block.
US 8,077,973 B2: Jianxiong Dong; A method of recognizing a handwritten word of cursive Script includes providing a template of previously classified words, and optically reading a handwritten word so as to form an image representation thereof comprising a bit map of pixels. The external pixel contour of the bit map is extracted and the Vertical peak and minima pixel extrema on upper and lower Zones respectively of this external contour are detected. Feature vectors of the vertical peak and minima pixel extrema are determined and compared to the template so as to generate a match between the handwritten word and a previously classified word. A method for classifying an image representation of a handwritten word of cursive script is also provided. Also provided is an apparatus for recognizing a handwritten word of cursive Script.
US 10 , 156 , 982 B1: Sabri A . Mahmoud; Baligh M . Al -Helali; A character recognition device includes circuitry that is configured to remove duplicate successive points of a plurality of points in a handwritten stroke to form an enhanced handwritten stroke; space the plurality of points a uniform distance apart; detect primary strokes and secondary strokes of the enhanced handwritten stroke; merge the primary strokes; generate a primary merged stroke; extract raw point - based features from local features of the primary merged stroke; extract statistical features from computed statistics associated with the raw point - based features to form primary merged stroke features ; train and classify data from the primary merged stroke features and secondary stroke features to form stroke models; determine a plurality of primary merged stroke model candidates from the stroke models; compute a likelihood value for a combined set of primary stroke candidates and a set of secondary stroke candidates; and determine the handwritten stroke from the computing.
Handwritten text systems are generally used for various applications and now-a-days this technology is booming. Prior to implementation of this technology, which was dependent on writing texts with our own hands and languages. However, it is difficult to store in huge quantity, access physical data and process the efficient manner, due to manual management. Since for long time it has been encountered a severe loss of data because of the traditional method of storing data. In current scenario various technological tools are introduced based on handwritten texts, and because of these tools now it is easier to store a huge data inn single click. The implementation of handwritten text recognition device is a real-world idea for easy storing of precious data. Moreover, the invention reveals and make a model for recognitions of handwritten texts, which will be help to recognized of multiples handwritten text (available in database system i.e., language packages). Fig. 1, shows the functional activity of current invention, and this function is used for translating the handwritten texts to word form text.
The scope of present invention is not only limited to disclosed embodiments but also includes combinations of the disclosed embodiments, as well as modifications to the disclosed embodiments.
[0003] Fig. 1: Schematics block-diagram of current developments. It is comprising with four embedded system like (1) Image's acquisition system, (2) Digitization rendering, (3) Pre processing of inputs (handwritten text), (4) Feature analysis. This system will be help for performing the translation of handwritten text as per database input or may depends on language packages.
In the current inventions are primarily reliant on the ensuing measuring aspect: (a) Recognition approaches are profoundly exertion on the nature of the data (style of handwritten text) to be recognized. (b) As the letters in the word are generally linked together, critically written and may even be missing.
The disclosed embodiments provide of ALD which will be performed before sending representations of the handwritten text to a language recognition tool to scale back performance penalties for text translations. That is by determining a selected language recognition tool to be utilized before text translation rather than translating text across multiple engines for every translation, resource utilization is greatly reduced. Accordingly, techniques are provided herein for efficient performance of ALD for handwritten text and its translation that permits implementations to be utilized on people and authority devices. Performing the described ALD is not source concentrated, unlike earlier explanations, through the pre-determination of languages for handwritten text, and thus does not needed a source heavy server to performing. It is contemplated herein that any sorts of languages could also be determined from handwritten text in accordance with the disclosed embodiments.
In an embodiment, a language determination is formed, word by word, before selecting a language recognition tool and attempting translations of handwritten text. That is, a soft decision is formed, based a minimum of on the handwritten text inputs, such one or specific language recognition tool could also be run to acknowledge the inputs. Groups of strokes could also be determined as words based a minimum of on coordinates of the strokes with reference to the input interface and every other, and therefore the time at which the strokes are made with reference to one another.
10007] The inputs, as words, could also be provided to a language generic tool to detect indicia of the language for the handwritten text inputs before they are sent to a selected language recognizer. The generic tool may include several components such as but not limited to an RNN. The RNN takes featured inputs and generates output vectors. The output vectors of the RNN are provided to the soft decision tool to get language probabilities for the handwritten text.
Afterward, a selected language recognition tool could also be identified and select. The handwritten text inputs could also be provided to the identified specific language recognition tool for a final determination of the language, allowing the handwritten text inputs to be translated by one translation tool. As words are translated, it will be provided on a demonstration device for observing and choice by a user.
EDITORIAL NOTE 2021101278
There is 1 page of claims only.
Claims (4)
1. A technique of portraying of handwritten text and capturing the text image comprising with handwritten text to translating to word form;
2. The method of claim 1, further comprising: for each window image, determining a features vector based on the extracted features of each of the handwritten text;
3. The method of claim 2, wherein the computation of the features vector comprises concatenating the extracted features of handwritten text;
4. The method of claim 1, A complete processing system which executes about instructions stored in tool for performing the method of claim 1.
Fig. 1: Schematics block-diagram of current inventions.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2021101278A AU2021101278A4 (en) | 2021-03-12 | 2021-03-12 | System and Method for Automatic Language Detection for Handwritten Text |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AU2021101278A AU2021101278A4 (en) | 2021-03-12 | 2021-03-12 | System and Method for Automatic Language Detection for Handwritten Text |
Publications (1)
Publication Number | Publication Date |
---|---|
AU2021101278A4 true AU2021101278A4 (en) | 2021-05-06 |
Family
ID=75714374
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
AU2021101278A Ceased AU2021101278A4 (en) | 2021-03-12 | 2021-03-12 | System and Method for Automatic Language Detection for Handwritten Text |
Country Status (1)
Country | Link |
---|---|
AU (1) | AU2021101278A4 (en) |
-
2021
- 2021-03-12 AU AU2021101278A patent/AU2021101278A4/en not_active Ceased
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102266529B1 (en) | Method, apparatus, device and readable storage medium for image-based data processing | |
US11783615B2 (en) | Systems and methods for language driven gesture understanding | |
Tang et al. | Text-independent writer identification via CNN features and joint Bayesian | |
TWI435276B (en) | A method and apparatus for recognition of handwritten symbols | |
US7729538B2 (en) | Spatial recognition and grouping of text and graphics | |
JP5031741B2 (en) | Grammatical analysis of document visual structure | |
CN110929573A (en) | Examination question checking method based on image detection and related equipment | |
Naz et al. | Segmentation techniques for recognition of Arabic-like scripts: A comprehensive survey | |
Wilkinson et al. | Neural Ctrl-F: segmentation-free query-by-string word spotting in handwritten manuscript collections | |
CN104463250A (en) | Sign language recognition translation method based on Davinci technology | |
CN109983473A (en) | Flexible integrated identification and semantic processes | |
CN110689018A (en) | Intelligent marking system and processing method thereof | |
CN114647713A (en) | Knowledge graph question-answering method, device and storage medium based on virtual confrontation | |
Tayyab et al. | Recognition of Visual Arabic Scripting News Ticker From Broadcast Stream | |
Panda et al. | Odia offline typewritten character recognition using template matching with unicode mapping | |
CN113705468A (en) | Digital image identification method based on artificial intelligence and related equipment | |
CN116935411A (en) | Radical-level ancient character recognition method based on character decomposition and reconstruction | |
CN111898528A (en) | Data processing method and device, computer readable medium and electronic equipment | |
AU2021101278A4 (en) | System and Method for Automatic Language Detection for Handwritten Text | |
CN114398482A (en) | Dictionary construction method and device, electronic equipment and storage medium | |
Cui et al. | Chinese calligraphy recognition system based on convolutional neural network | |
CN115346225A (en) | Writing evaluation method, device and equipment | |
Senthilselvi et al. | An Adaptive, Dynamic and Semantic Approach for Understanding of Sign Language based on Convolution Neural Network. | |
Le et al. | An Attention-Based Encoder–Decoder for Recognizing Japanese Historical Documents | |
Jain | Unconstrained Arabic & Urdu text recognition using deep CNN-RNN hybrid networks |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FGI | Letters patent sealed or granted (innovation patent) | ||
MK22 | Patent ceased section 143a(d), or expired - non payment of renewal fee or expiry |