CN111381683A - Photographing recognition input method and software - Google Patents
Photographing recognition input method and software
- Publication number
- CN111381683A (application CN201811620611.0A)
- Authority
- CN
- China
- Prior art keywords
- software
- input
- word
- user
- words
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/02—Input arrangements using manually operated switches, e.g. using keyboards or dials
- G06F3/023—Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
- G06F3/0233—Character input methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/148—Segmentation of character regions
- G06V30/153—Segmentation of character regions using recognition of characters or words
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Multimedia (AREA)
- Human Computer Interaction (AREA)
- Machine Translation (AREA)
- Character Discrimination (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The invention discloses a text input method and software based on the photographing function. It extends the functions of an input method and is particularly suited to mobile terminals, for example when a user looks up unfamiliar characters and words in a dictionary app on a mobile terminal. The method comprises the following steps: 1. the user selects the photographing recognition input method and the software starts; 2. the software calls the photographing function to capture a photo containing the word to be input and displays it in the photo area of the input interface; 3. the user touches or swipes across the word to be input in the photo area with a finger, and the software segments a word picture at the touch or swipe position and calls a character recognition module to recognize the word; 4. the software displays the recognition result for the user to confirm. By photographing and recognizing text, the invention extends the input method functions of mobile terminals and other terminals and makes text input more convenient.
Description
Technical Field
The invention relates to a photographing recognition input method and software, and belongs to the technical field of software.
Background
Input method software is basic software on mobile terminals, personal computers, and other terminals. Existing input method software supports spelling input, handwriting input, voice input, and so on. Each of these methods has its own characteristics and shortcomings: spelling input is not convenient enough, handwriting input fails when the user cannot write the word, and voice input fails when the user cannot pronounce it. The input method of the present invention uses photographing recognition instead. Input can be completed as long as a photo containing the word can be taken, so the method is convenient, is not limited by the user's ability to read or write the word, and is particularly suited to mobile terminals.
Disclosure of Invention
Purpose of the invention: the invention aims to provide a photographing recognition input method that makes input on mobile terminals and other terminals more convenient.
Technical solution: the key steps of the photographing recognition input method are shown in FIG. 1 and are as follows:
(1) after the software starts, it calls the photographing function, and the user captures a photo containing the word to be input, which is displayed in the photo area of the input interface;
(2) the user touches or swipes across the word to be input in the photo area with a finger, and the software segments a word picture at the touch or swipe position and calls a character recognition module to recognize the word;
(3) the software displays the recognition result for the user to confirm.
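The three steps above can be sketched as a small pipeline. This is only an illustration of the control flow: the class name `PhotoInput` and the four callables (`capture`, `segment`, `recognize`, `confirm`) are hypothetical placeholders, not names taken from the patent.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional, Tuple

Point = Tuple[int, int]  # (x, y) position of the touch in photo coordinates


@dataclass
class PhotoInput:
    """Hypothetical wiring of the three steps; every callable is a stand-in."""
    capture: Callable[[], Any]              # step (1): take and return the photo
    segment: Callable[[Any, Point], Any]    # step (2a): cut out the word picture
    recognize: Callable[[Any], str]         # step (2b): character recognition
    confirm: Callable[[str], bool]          # step (3): user accepts or rejects

    def run(self, touch: Point) -> Optional[str]:
        photo = self.capture()
        word_picture = self.segment(photo, touch)
        word = self.recognize(word_picture)
        # Only a confirmed result is handed to the host application.
        return word if self.confirm(word) else None
```

Keeping the stages as separate callables mirrors the module split described later in the text, so a different segmentation or recognition scheme can be swapped in without touching the flow.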
As a further refinement, the software segments the word picture starting from the user's touch position. One scheme is a boundary search method, described as follows:
(1) set a segmentation frame of an initial size, for example 10 × 10 pixels, centered on the touch point;
(2) starting with one side of the segmentation frame, determine whether that side intersects the word; if it does, extend the side outward by one pixel, and repeat until the side no longer intersects the word, which fixes one boundary of the word;
(3) perform step (2) in turn on the other three sides of the segmentation frame until all four boundaries of the word are determined.
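A minimal sketch of this boundary search follows, assuming the photo has already been binarized into a grid in which `True` marks a text pixel. The binarization step, the function names, and the order in which the sides are processed are assumptions made for illustration; the text only fixes the one-pixel growth rule.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom), inclusive pixel indices
Ink = List[List[bool]]           # ink[y][x] is True where the pixel belongs to text


def side_hits_ink(ink: Ink, box: Box, side: str) -> bool:
    """Return True if the given side of the frame crosses at least one text pixel."""
    left, top, right, bottom = box
    if side in ("left", "right"):
        x = left if side == "left" else right
        return any(ink[y][x] for y in range(top, bottom + 1))
    y = top if side == "top" else bottom
    return any(ink[y][x] for x in range(left, right + 1))


def boundary_search(ink: Ink, cx: int, cy: int, size: int = 10) -> Box:
    height, width = len(ink), len(ink[0])
    half = size // 2
    # Step (1): initial frame centered on the touch point, clamped to the image.
    box = [max(cx - half, 0), max(cy - half, 0),
           min(cx - half + size - 1, width - 1),
           min(cy - half + size - 1, height - 1)]
    # For each side: index into box, outward direction, and image limit.
    sides = {"left": (0, -1, 0), "right": (2, 1, width - 1),
             "top": (1, -1, 0), "bottom": (3, 1, height - 1)}
    # Steps (2) and (3): push each side outward one pixel at a time
    # until it no longer crosses any text pixel (or reaches the image edge).
    for side, (i, step, limit) in sides.items():
        while box[i] != limit and side_hits_ink(ink, tuple(box), side):
            box[i] += step
    return tuple(box)
```

Applied literally, a vertical side stops at the first blank column, so it can halt in the gap between two letters. The repeated search while the finger stays down, described next in the text, is what lets the frame keep growing across a whole word.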
Further, so that the user can watch the boundary search and intervene in time, the software displays the segmentation frame on the photo in real time while it dynamically adjusts the frame's boundaries. When the user sees that the word to be input lies entirely within the segmentation frame, the user releases the finger and the software ends the search. If the user does not release the finger, the software keeps the upper and lower boundaries from the last search result and runs the search again for the left and right boundaries of the word, until the user releases the finger to confirm.
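The interaction can be sketched as a loop that redraws the frame after every one-pixel change and polls whether the finger is still down. The callbacks `finger_down` and `draw` are hypothetical stand-ins for the platform's touch and rendering APIs. The text does not say how a repeated pass gets past a blank gap between letters; moving each vertical side one pixel outward before the next pass is an assumption made here so that the example terminates sensibly.

```python
from typing import Callable, Iterator, List, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom), inclusive pixel indices
Ink = List[List[bool]]           # ink[y][x] is True where the pixel belongs to text


def column_has_ink(ink: Ink, x: int, top: int, bottom: int) -> bool:
    return any(ink[y][x] for y in range(top, bottom + 1))


def widen(ink: Ink, box: Box) -> Iterator[Box]:
    """Yield the frame after each one-pixel move of the left, then the right side."""
    left, top, right, bottom = box
    width = len(ink[0])
    while left > 0 and column_has_ink(ink, left, top, bottom):
        left -= 1
        yield (left, top, right, bottom)
    while right < width - 1 and column_has_ink(ink, right, top, bottom):
        right += 1
        yield (left, top, right, bottom)


def track_until_release(ink: Ink, box: Box,
                        finger_down: Callable[[], bool],
                        draw: Callable[[Box], None]) -> Box:
    """Repeat the left/right search with top and bottom fixed while the finger is held."""
    width = len(ink[0])
    while finger_down():
        for box in widen(ink, box):
            draw(box)                      # real-time display of the frame
            if not finger_down():
                return box                 # user released: frame is confirmed
        left, top, right, bottom = box
        # Assumption: step one pixel outward so the next pass can cross a gap.
        nudged = (max(left - 1, 0), top, min(right + 1, width - 1), bottom)
        if nudged == box:                  # frame already spans the image width
            break
        box = nudged
        draw(box)
    return box
```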
As a further refinement, the software can also segment the word picture from the user's swipe position using the boundary search method. The difference lies in how the initial size of the segmentation frame is set:
(1) set a segmentation frame of an initial size starting at the starting point of the swipe, where the initial length of the frame is the length of the finger's swipe track and the initial width is an empirical value, for example 10 pixels;
(2) starting with one side of the segmentation frame, determine whether that side intersects the word; if it does, extend the side outward by one pixel, and repeat until the side no longer intersects the word, which fixes one boundary of the word;
(3) perform step (2) in turn on the other three sides of the segmentation frame until all four boundaries of the word are determined.
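Only step (1) differs from the touch case, so the sketch below covers just the initial frame. It assumes horizontal text, takes the frame length from the horizontal extent of the swipe track, and centers the 10-pixel width on the starting point's vertical position; the function name and these geometric choices are illustrative assumptions, not details given in the text.

```python
from typing import List, Tuple

Point = Tuple[int, int]          # (x, y) in photo coordinates
Box = Tuple[int, int, int, int]  # (left, top, right, bottom), inclusive pixel indices


def frame_from_swipe(track: List[Point], img_w: int, img_h: int,
                     thickness: int = 10) -> Box:
    """Build the initial segmentation frame from a recorded swipe track."""
    if not track:
        raise ValueError("swipe track is empty")
    x0, y0 = track[0]            # starting point of the swipe
    x1 = track[-1][0]            # horizontal position where the swipe ended
    left, right = min(x0, x1), max(x0, x1)
    top = y0 - thickness // 2
    bottom = top + thickness - 1
    # Clamp to the photo so later boundary checks never index out of range.
    return (max(left, 0), max(top, 0),
            min(right, img_w - 1), min(bottom, img_h - 1))
```

The resulting frame is then refined with the same side-by-side growth rule as in steps (2) and (3).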
As before, the software displays the dynamic segmentation frame on the photo in real time so that the user can confirm when segmentation may end; the details are not repeated here.
The boundary search method above is one word segmentation scheme that can be used in the present invention, but it does not limit the scope of protection of the invention. In an implementation, other segmentation schemes may be chosen, or several segmentation schemes may be used to verify one another and so improve the reliability of segmentation.
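The text does not specify how segmentation schemes would verify one another. One common way, shown here purely as an assumption, is to compare the frames produced by two schemes with intersection over union (IoU) and accept the result only when they overlap strongly. The threshold of 0.8 is an arbitrary illustrative value.

```python
from typing import Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom), inclusive pixel indices


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two frames with inclusive pixel coordinates."""
    inter_w = min(a[2], b[2]) - max(a[0], b[0]) + 1
    inter_h = min(a[3], b[3]) - max(a[1], b[1]) + 1
    if inter_w <= 0 or inter_h <= 0:
        return 0.0
    inter = inter_w * inter_h
    area_a = (a[2] - a[0] + 1) * (a[3] - a[1] + 1)
    area_b = (b[2] - b[0] + 1) * (b[3] - b[1] + 1)
    return inter / (area_a + area_b - inter)


def segments_agree(a: Box, b: Box, threshold: float = 0.8) -> bool:
    """Treat two segmentation results as mutually verified if they overlap enough."""
    return iou(a, b) >= threshold
```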
As a further refinement, the software calls a character recognition module to recognize the word in the segmented picture. The character recognition module is composed of several submodules, the set of submodules is extensible, and each submodule corresponds to one language. A recognition submodule may be implemented with mature OCR program code, such as Tesseract OCR or GOCR, and different languages may use different algorithms and programs; which algorithms and programs are used does not limit the scope of the invention.
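The extensible, one-submodule-per-language structure can be sketched as a registry keyed by language. The class and method names are hypothetical, and the recognizers are plain callables; in a real build each one might wrap an OCR engine such as Tesseract or GOCR, which this sketch does not attempt to call.

```python
from typing import Callable, Dict, List

Recognizer = Callable[[bytes], str]  # takes the segmented picture, returns the word


class RecognitionModule:
    """Hypothetical registry holding one recognition submodule per language."""

    def __init__(self) -> None:
        self._submodules: Dict[str, Recognizer] = {}

    def register(self, language: str, recognizer: Recognizer) -> None:
        # Extending the module means registering one more language submodule.
        self._submodules[language] = recognizer

    def languages(self) -> List[str]:
        return sorted(self._submodules)

    def recognize(self, picture: bytes, language: str) -> str:
        try:
            submodule = self._submodules[language]
        except KeyError:
            raise ValueError(f"no recognition submodule for {language!r}") from None
        return submodule(picture)
```

Because each language is looked up at call time, switching languages is a matter of passing a different key, and different languages are free to use different engines behind the same interface.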
Photographing recognition input method software designed according to this technical solution comprises at least the following functional modules:
(1) Photographing module: takes a photo containing the word to be input. After the input method software starts, this module is called to take a photo, and the photo is displayed in the photo area.
(2) Text segmentation module: segments the picture containing the word to be input from the photo. Besides the boundary search scheme described in this specification, other segmentation schemes may be used. The text segmentation module is invoked when the user touches or swipes across the photo with a finger.
(3) Character recognition module: called after segmentation completes to recognize the word in the segmented picture. The module is composed of several character recognition submodules, the set of submodules is extensible, and each submodule corresponds to one language. A recognition submodule may be implemented with mature open-source OCR program code, such as Tesseract OCR or GOCR, and different languages may use different algorithms and programs.
The invention also provides applications of the photographing recognition input method in mobile phone input methods and in literacy software for young children.
Beneficial effects: the invention extends the input method with photographing recognition. Input can be completed as long as a photo containing the word can be taken, so the method is convenient, is not limited by the user's ability to read or write the word, and is particularly suited to mobile terminals. For example, when the input method is used in an electronic dictionary app on a mobile terminal and the user meets an unfamiliar word while reading, the user can photograph the reading material and touch or swipe across the word in the photo; the input method recognizes the word automatically and looks it up in the dictionary. Likewise, a user traveling in an unfamiliar place who meets an unknown sign or guidepost can photograph it, recognize the text, and look it up directly, which is very convenient.
Drawings
FIG. 1 is a flow chart of the steps of the photographing recognition input method of the present invention.
As shown in FIG. 1, after the input method software is invoked, it calls the photographing module to take and display a photo containing the word to be input. The user touches or swipes across the word to be input in the photo, and the software calls the segmentation module to segment the word. After segmentation completes, the software calls the recognition module to recognize the word in the segmented picture and displays the recognition result for the user to confirm.
Detailed description of the preferred embodiments
The technical solution of the present invention is described in detail below, but the scope of the present invention is not limited to the embodiments.
Example 1: the invention is applied to a mobile phone input method. A photographing recognition input method module is added to the mobile phone input method software and consists of the following submodules:
(1) Photographing submodule. When the user opens the input method and selects the "photographing recognition" input method, the photographing submodule is called to capture a photo containing the word to be input and display it in the photo area of the input interface.
(2) Segmentation submodule. When the user touches or swipes across a word in the photo area with a finger, the segmentation submodule is called; it segments the word using the boundary search method and produces the segmented picture.
(3) English word recognition submodule. This submodule is implemented with Tesseract OCR program code. After the software obtains the segmented picture, it calls this submodule to recognize the English word in the segmented picture.
Example 2: the invention is applied to a mobile phone literacy app for young children. A photographing recognition input module is built into the app to help young children learn to read. The input module consists of the following submodules:
(1) Photographing submodule. The child opens the literacy app, which calls the photographing submodule to photograph a literacy card and displays the photo in the photo area.
(2) Segmentation submodule. When the child touches a word in the photo area with a finger, the segmentation submodule is called; it segments the word using the boundary search method and produces the segmented picture.
(3) English word recognition submodule. This submodule is implemented with GOCR program code. After the segmented picture is obtained, the submodule is called to recognize the English word in it, and the app presents the word's meaning, pronunciation, example sentences, and so on to help the child learn to read.
As noted above, while the present invention has been shown and described with reference to certain preferred embodiments, it is not to be construed as limited thereto. Various changes in form and detail may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.
Claims (7)
1. A photographing recognition input method and software, characterized by comprising the following steps:
(1) after the software starts, it calls the photographing function, and the user captures a photo containing the word to be input, which is displayed in the photo area of the input interface;
(2) the user touches or swipes across the word to be input in the photo area with a finger, and the software automatically recognizes the word at the touch or swipe position;
(3) the software displays the recognition result for the user to confirm.
2. The photographing recognition input method and software of claim 1, wherein the software automatically recognizing the word at the touch or swipe position comprises the following two steps:
(1) segmenting a word picture at the touch or swipe position;
(2) calling a character recognition module to recognize the word.
3. The software of claim 2 segmenting the word picture from the user's touch position, wherein: the software displays a dynamic segmentation frame in real time, using the touch point as the reference position, until the frame encloses the word to be input and the user releases the finger to confirm, whereupon the software completes segmentation.
4. The software of claim 2 segmenting the word picture from the user's swipe position, wherein: the software displays a dynamic segmentation frame in real time, using the track of the finger's swipe as the reference, until the frame encloses the word to be input and the user releases the finger to confirm, whereupon the software completes segmentation.
5. The software displaying the dynamic segmentation frame in real time according to claims 3 and 4, wherein: the center position, length, and width of the dynamic frame are adjusted under the control of a segmentation algorithm.
6. The software calling the character recognition module according to claim 2, wherein: the character recognition module supports switching among multiple languages.
7. A photographing recognition input method and software, characterized in that the software comprises the following modules:
(1) a photographing module, for capturing a photo containing the word to be input;
(2) a word segmentation module, which, using the touch or swipe position of the user's finger as the reference position, dynamically adjusts and displays a segmentation frame in real time under the control of a segmentation algorithm to complete word segmentation;
(3) a character recognition module, composed of several character recognition submodules, each corresponding to one language, the set of submodules being extensible.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811620611.0A CN111381683A (en) | 2018-12-28 | 2018-12-28 | Photographing recognition input method and software |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111381683A (en) | 2020-07-07 |
Family
ID=71219079
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811620611.0A Pending CN111381683A (en) | 2018-12-28 | 2018-12-28 | Photographing recognition input method and software |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111381683A (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101059840A (en) * | 2007-05-24 | 2007-10-24 | 深圳市杰特电信控股有限公司 | Words input method using mobile phone shooting style |
CN101667251A (en) * | 2008-09-05 | 2010-03-10 | 三星电子株式会社 | OCR recognition method and device with auxiliary positioning function |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20200707 |