CN112200185A - Method and device for reversely positioning picture by characters and computer storage medium - Google Patents
Method and device for reversely positioning picture by characters and computer storage medium Download PDFInfo
- Publication number
- CN112200185A CN112200185A CN202011076589.5A CN202011076589A CN112200185A CN 112200185 A CN112200185 A CN 112200185A CN 202011076589 A CN202011076589 A CN 202011076589A CN 112200185 A CN112200185 A CN 112200185A
- Authority
- CN
- China
- Prior art keywords
- picture
- information
- positioning
- window
- characters
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 27
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/60—Type of objects
- G06V20/62—Text, e.g. of license plates, overlay texts or captions on TV images
- G06V20/635—Overlay text, e.g. embedded captions in a TV program
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The invention relates to the field of character positioning, in particular to a method and a device for reversely positioning a picture by characters and a computer storage medium, comprising the following steps of: displaying a preset first window and a preset second window on a display interface; receiving a key field which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after cross positioning of picture information and position information; positioning the picture where the key field is located according to the picture information and displaying the picture in a second window; and lightening characters corresponding to the key fields in the picture according to the position information. The invention provides a method and a device for reversely positioning a picture by characters and a computer storage medium, which solve the problem that the existing positioning method cannot accurately position.
Description
Technical Field
The invention relates to the field of document positioning, in particular to a method and a device for reversely positioning a picture by characters and a computer storage medium.
Background
The judicial paperwork refers to the special paperwork formed and used by the judicial authorities of investigation, inspection, judgment, notarization and the like in each link and step of processing various cases. Mainly includes documents with legal effectiveness, such as judgment books, adjudication books, etc.; documents which do not directly take place in legal force, but which have a tangible guarantee of law enforcement, such as decision books, are also included. The number of judicial documents is large, the form of the text is unstructured, and a character reverse positioning picture technology is generally adopted, but the existing character reverse positioning picture method has the following problems:
(1) only a single picture can be positioned, and the selection can not be positioned across pictures.
(2) The editor only supports character positioning of a paragraph where a cursor is located, and characters cannot be accurately positioned.
Disclosure of Invention
The invention provides a method and a device for reversely positioning a picture by characters and a computer storage medium, which are used for solving the problem that the characters cannot be accurately positioned by the existing positioning method.
The technical scheme for solving the problems is as follows: the method for reversely positioning the picture by the characters is characterized by comprising the following steps of:
displaying a preset first window and a preset second window on a display interface;
receiving a key field which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after cross positioning of picture information and position information;
positioning the picture where the key field is located according to the picture information and displaying the picture in a second window;
and lightening characters corresponding to the key fields in the picture according to the position information.
Further, the method also comprises the following steps: the first window is also provided with a catalog for the user to select, and the catalog is generated by selecting a plurality of key fields according to the needs.
Further, the step of lighting up the text corresponding to the key field in the picture according to the position information includes:
searching corresponding characters in the picture according to the position information of the key field;
and judging whether the similarity between the key field and the searched character is greater than a threshold value, and if so, lightening the character.
Further, the picture information is obtained by performing OCR recognition on a plurality of target pictures, and the picture information includes character information and a plurality of paragraph information, where the character information includes each character in the picture and a coordinate of each character.
Further, the position information is information of a key field position obtained by extracting a plurality of paragraphs of information, the position information includes information of a starting position of a paragraph where the key field is located, and the plurality of paragraphs of information are a combination of paragraph information obtained by performing OCR recognition on a plurality of pictures respectively. .
Further, the method for extracting the multi-section information is based on regular expression strong matching and NLP capability algorithm.
In addition, the invention also provides a device for reversely positioning the picture by the characters, which is characterized by comprising the following components: the display module is used for displaying the first window and the second window in the display area;
the receiving module is used for receiving a first character which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after the picture information and the position information are positioned in a crossed mode;
the positioning module is used for positioning the picture where the key field is located according to the picture information;
and the lighting module is used for lighting characters corresponding to the key fields in the picture according to the position information.
The system further comprises a judging module, wherein the judging module is used for judging whether the similarity between the key field and the searched character is greater than a threshold value.
The invention also proposes a computer storage medium, which is characterized in that a computer-executable instruction is stored thereon, which, when being executed by a processor, carries out the method steps of any one of claims 1 to 8.
The invention has the advantages that:
1) the invention can be selected by cross-picture positioning;
2) the invention can accurately position the character where the picture is located and highlight the character;
3) the invention supports the positioning of multiple lines, multiple lines and multiple pages in a line according to the character coordinates covered on the picture.
Drawings
FIG. 1 is a schematic flow chart of example 1 of the present invention;
FIG. 2 is a schematic flow chart of example 2 of the present invention;
FIG. 3 is a diagram illustrating web page positioning according to embodiment 2 of the present invention;
fig. 4 is a schematic diagram of editor positioning in embodiment 2 of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be described clearly and completely with reference to the accompanying drawings of the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention. Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention.
Example 1: the method for reversely positioning the picture by the text as shown in fig. 1 comprises the following steps:
displaying a preset first window and a preset second window on a display interface;
receiving a key field which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after cross positioning of picture information and position information;
positioning the picture where the key field is located according to the picture information and displaying the picture in a second window;
and lightening characters corresponding to the key fields in the picture according to the position information.
As a preferred embodiment of the present invention, the first window further has a directory for the user to select, and the directory is generated by selecting a plurality of key fields according to the requirement.
As a preferred embodiment of the present invention, the step of lighting up the text corresponding to the key field in the picture according to the location information comprises:
searching corresponding characters in the picture according to the position information of the key field;
and judging whether the similarity between the key field and the searched character is greater than a threshold value, and if so, lighting the character.
As a preferred embodiment of the present invention, the picture information is obtained by performing OCR recognition on a plurality of target pictures, and the picture information includes text information and a plurality of paragraph information, where the text information includes each text in the picture and a coordinate of each text.
As a preferred embodiment of the present invention, the position information is information of a key field position obtained by extracting a plurality of pieces of paragraph information, the position information includes information of a start position of a paragraph where the key field is located, and the plurality of pieces of paragraph information are a combination of paragraph information obtained by performing OCR recognition on a plurality of pictures, respectively.
As a preferred embodiment of the present invention, the method for extracting multiple pieces of colony information is based on regular expression strong matching and NLP capability algorithm.
Example 2: the method for reversely positioning the picture by the characters as shown in fig. 2 comprises the following steps:
the method comprises the following steps: generation of coordinate information
The generation method of the coordinate information comprises the following steps:
1. performing OCR recognition on the picture to obtain all characters in the picture, coordinate information of all the characters and information of all paragraphs;
2. combining information of each paragraph obtained by performing OCR recognition on a plurality of pictures into multi-paragraph information, extracting key fields and information of initial positions of the paragraphs where the key fields are located through an algorithm of regular expression strong matching and NLP (non line of sight) capacity, wherein the key fields are business fields such as case types and document types;
3. performing cross positioning on the information of the starting position of the paragraph where the key field is located in the step 2 and the information of the plurality of paragraphs in the step 1;
4. and obtaining coordinate information according to the cross positioning result.
Step 2: web page positioning or editor positioning as required by user end
As shown in fig. 3, when web page positioning is selected, key fields clicked or searched in a directory by a user side are received in a web page, corresponding pictures are positioned according to coordinate information carried by the key fields, characters corresponding to the key fields on the selected pictures are highlighted according to position information, and the contents of the key fields and the characters are the same.
As shown in fig. 4, when the positioning of the editor is selected, the characters selected by the user side are received in the editor, the corresponding picture is positioned through the coordinate information carried by the selected characters, the characters with similarity greater than the threshold value with the selected characters are searched in the picture according to the position information of the selected characters, and the characters are lightened.
Example 3: a device for reversely positioning picture by characters comprises
The display module is used for displaying the first window and the second window in the display area;
the receiving module is used for receiving a first character which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after the picture information and the position information are positioned in a crossed mode;
the positioning module is used for positioning the picture where the key field is located according to the picture information;
and the lighting module is used for lighting characters corresponding to the key fields in the picture according to the position information.
As a preferred embodiment of the present invention: the judging module is used for judging whether the similarity between the key field and the searched character is greater than a threshold value.
Example 4: a computer storage medium having stored thereon computer-executable instructions that, when executed by a processor, perform the steps of the text-reverse positioning picture method of embodiments 1-4.
The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all equivalent structures or equivalent flow transformations made by using the contents of the specification and the drawings, or applied directly or indirectly to other related systems, are included in the scope of the present invention.
Claims (9)
1. A method for reversely positioning pictures by characters is characterized by comprising the following steps:
displaying a preset first window and a preset second window on a display interface;
receiving a key field which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after cross positioning of picture information and position information;
positioning the picture where the key field is located according to the picture information and displaying the picture in a second window;
and lightening characters corresponding to the key fields in the picture according to the position information.
2. The method of claim 1, further comprising the steps of:
the first window is also provided with a catalog for the user to select, and the catalog is generated by selecting a plurality of key fields according to the needs.
3. The method as claimed in claim 1, wherein the step of lighting up the text corresponding to the key field in the picture according to the position information comprises:
searching corresponding characters in the picture according to the position information of the key field;
and judging whether the similarity between the key field and the searched character is greater than a threshold value, and if so, lightening the character.
4. The method according to any one of claims 1-3, wherein the picture information is obtained by performing OCR recognition on the target picture, and the picture information includes text information and paragraph information, wherein the text information includes each text in the picture and coordinates of each text.
5. The method as claimed in claim 4, wherein the position information is a key field position obtained by extracting a plurality of paragraphs, the position information includes a start position of a paragraph where the key field is located, and the plurality of paragraphs are a combination of paragraph information obtained by performing OCR recognition on a plurality of pictures respectively.
6. The method for reverse positioning of pictures by characters according to claim 5, wherein the method for extracting the multi-paragraph information is based on regular expression strong matching and NLP capability algorithm.
7. A device for reversely positioning pictures by characters is characterized by comprising
The display module is used for displaying the first window and the second window in the display area;
the receiving module is used for receiving a first character which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after the picture information and the position information are positioned in a crossed mode;
the positioning module is used for positioning the picture where the key field is located according to the picture information;
and the lighting module is used for lighting characters corresponding to the key fields in the picture according to the position information.
8. The apparatus for reverse positioning picture by letters according to claim 7, further comprising a determining module for determining whether the similarity between the key field and the found letter is greater than a threshold.
9. A computer storage medium having stored thereon computer-executable instructions which, when executed by a processor, carry out the method steps of any of claims 1 to 8.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011076589.5A CN112200185A (en) | 2020-10-10 | 2020-10-10 | Method and device for reversely positioning picture by characters and computer storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011076589.5A CN112200185A (en) | 2020-10-10 | 2020-10-10 | Method and device for reversely positioning picture by characters and computer storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112200185A true CN112200185A (en) | 2021-01-08 |
Family
ID=74013682
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011076589.5A Pending CN112200185A (en) | 2020-10-10 | 2020-10-10 | Method and device for reversely positioning picture by characters and computer storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112200185A (en) |
Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20110103190A (en) * | 2010-03-12 | 2011-09-20 | 강대현 | Method and apparatus of inputting keyword by selection on image |
US20120288203A1 (en) * | 2011-05-13 | 2012-11-15 | Fujitsu Limited | Method and device for acquiring keywords |
CN104252475A (en) * | 2013-06-27 | 2014-12-31 | 腾讯科技(深圳)有限公司 | Method and device for positioning text messages in picture |
US20150317530A1 (en) * | 2012-03-14 | 2015-11-05 | Omron Corporation | Key word detection device, control method, and display apparatus |
CN110059559A (en) * | 2019-03-15 | 2019-07-26 | 深圳壹账通智能科技有限公司 | The processing method and its electronic equipment of OCR identification file |
CN110263616A (en) * | 2019-04-29 | 2019-09-20 | 五八有限公司 | A kind of character recognition method, device, electronic equipment and storage medium |
CN110442744A (en) * | 2019-08-09 | 2019-11-12 | 泰康保险集团股份有限公司 | Extract method, apparatus, electronic equipment and the readable medium of target information in image |
CN110991456A (en) * | 2019-12-05 | 2020-04-10 | 北京百度网讯科技有限公司 | Bill identification method and device |
US20200151886A1 (en) * | 2018-11-08 | 2020-05-14 | Industrial Technology Research Institute | Information display system and information display method |
CN111160193A (en) * | 2019-12-20 | 2020-05-15 | 中国平安财产保险股份有限公司 | Key information extraction method, device and storage medium |
CN111291572A (en) * | 2020-01-20 | 2020-06-16 | Oppo广东移动通信有限公司 | Character typesetting method and device and computer readable storage medium |
CN111310750A (en) * | 2018-12-11 | 2020-06-19 | 阿里巴巴集团控股有限公司 | Information processing method and device, computing equipment and medium |
CN111476227A (en) * | 2020-03-17 | 2020-07-31 | 平安科技(深圳)有限公司 | Target field recognition method and device based on OCR (optical character recognition) and storage medium |
CN111582169A (en) * | 2020-05-08 | 2020-08-25 | 腾讯科技(深圳)有限公司 | Image recognition data error correction method, device, computer equipment and storage medium |
CN111695439A (en) * | 2020-05-20 | 2020-09-22 | 平安科技(深圳)有限公司 | Image structured data extraction method, electronic device and storage medium |
-
2020
- 2020-10-10 CN CN202011076589.5A patent/CN112200185A/en active Pending
Patent Citations (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR20110103190A (en) * | 2010-03-12 | 2011-09-20 | 강대현 | Method and apparatus of inputting keyword by selection on image |
US20120288203A1 (en) * | 2011-05-13 | 2012-11-15 | Fujitsu Limited | Method and device for acquiring keywords |
US20150317530A1 (en) * | 2012-03-14 | 2015-11-05 | Omron Corporation | Key word detection device, control method, and display apparatus |
CN104252475A (en) * | 2013-06-27 | 2014-12-31 | 腾讯科技(深圳)有限公司 | Method and device for positioning text messages in picture |
US20200151886A1 (en) * | 2018-11-08 | 2020-05-14 | Industrial Technology Research Institute | Information display system and information display method |
CN111310750A (en) * | 2018-12-11 | 2020-06-19 | 阿里巴巴集团控股有限公司 | Information processing method and device, computing equipment and medium |
CN110059559A (en) * | 2019-03-15 | 2019-07-26 | 深圳壹账通智能科技有限公司 | The processing method and its electronic equipment of OCR identification file |
CN110263616A (en) * | 2019-04-29 | 2019-09-20 | 五八有限公司 | A kind of character recognition method, device, electronic equipment and storage medium |
CN110442744A (en) * | 2019-08-09 | 2019-11-12 | 泰康保险集团股份有限公司 | Extract method, apparatus, electronic equipment and the readable medium of target information in image |
CN110991456A (en) * | 2019-12-05 | 2020-04-10 | 北京百度网讯科技有限公司 | Bill identification method and device |
CN111160193A (en) * | 2019-12-20 | 2020-05-15 | 中国平安财产保险股份有限公司 | Key information extraction method, device and storage medium |
CN111291572A (en) * | 2020-01-20 | 2020-06-16 | Oppo广东移动通信有限公司 | Character typesetting method and device and computer readable storage medium |
CN111476227A (en) * | 2020-03-17 | 2020-07-31 | 平安科技(深圳)有限公司 | Target field recognition method and device based on OCR (optical character recognition) and storage medium |
CN111582169A (en) * | 2020-05-08 | 2020-08-25 | 腾讯科技(深圳)有限公司 | Image recognition data error correction method, device, computer equipment and storage medium |
CN111695439A (en) * | 2020-05-20 | 2020-09-22 | 平安科技(深圳)有限公司 | Image structured data extraction method, electronic device and storage medium |
Non-Patent Citations (1)
Title |
---|
廖晓彬;: "基于深度学习的浏览器OCR插件设计与实现", 信息与电脑(理论版), no. 10, 25 May 2018 (2018-05-25) * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20220075832A1 (en) | Related Notes And Multi-Layer Search In Personal And Shared Content | |
US7730050B2 (en) | Information retrieval apparatus | |
US10365792B2 (en) | Generating visualizations of facet values for facets defined over a collection of objects | |
CN110321470B (en) | Document processing method, device, computer equipment and storage medium | |
US8838657B1 (en) | Document fingerprints using block encoding of text | |
US20040044958A1 (en) | Systems and methods for inserting a metadata tag in a document | |
CN110909123B (en) | Data extraction method and device, terminal equipment and storage medium | |
WO2012012808A2 (en) | Method for document search and analysis | |
US20120284250A1 (en) | Enhanced search engine | |
CN104541288A (en) | Handwritten document processing apparatus and method | |
JP5516918B2 (en) | Image element search | |
CN104750791A (en) | Image retrieval method and device | |
US10261987B1 (en) | Pre-processing E-book in scanned format | |
CN115687655A (en) | PDF document-based knowledge graph construction method, system, equipment and storage medium | |
KR102089797B1 (en) | Protecting personal information leakage interception system | |
CN111967367B (en) | Image content extraction method and device and electronic equipment | |
CN103559512B (en) | A kind of Text region output intent and system | |
CN112084342A (en) | Test question generation method and device, computer equipment and storage medium | |
CN111310750A (en) | Information processing method and device, computing equipment and medium | |
CN113806472A (en) | Method and equipment for realizing full-text retrieval of character, picture and image type scanning piece | |
US20120109638A1 (en) | Electronic device and method for extracting component names using the same | |
CN112200185A (en) | Method and device for reversely positioning picture by characters and computer storage medium | |
CN111368693A (en) | Identification method and device for identity card information | |
US12045280B2 (en) | Method and system for facilitating keyword-based searching in images | |
US20130332824A1 (en) | Embedded font processing method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |