CN112200185A

CN112200185A - Method and device for reversely positioning picture by characters and computer storage medium

Info

Publication number: CN112200185A
Application number: CN202011076589.5A
Authority: CN
Inventors: 程高伟; 公慧; 王锰; 王守栋; 杨茗茵
Original assignee: Casic Wisdom Industrial Development Co ltd
Current assignee: Casic Wisdom Industrial Development Co ltd
Priority date: 2020-10-10
Filing date: 2020-10-10
Publication date: 2021-01-08

Abstract

The invention relates to the field of character positioning, in particular to a method and a device for reversely positioning a picture by characters and a computer storage medium, comprising the following steps of: displaying a preset first window and a preset second window on a display interface; receiving a key field which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after cross positioning of picture information and position information; positioning the picture where the key field is located according to the picture information and displaying the picture in a second window; and lightening characters corresponding to the key fields in the picture according to the position information. The invention provides a method and a device for reversely positioning a picture by characters and a computer storage medium, which solve the problem that the existing positioning method cannot accurately position.

Description

Method and device for reversely positioning picture by characters and computer storage medium

Technical Field

The invention relates to the field of document positioning, in particular to a method and a device for reversely positioning a picture by characters and a computer storage medium.

Background

The judicial paperwork refers to the special paperwork formed and used by the judicial authorities of investigation, inspection, judgment, notarization and the like in each link and step of processing various cases. Mainly includes documents with legal effectiveness, such as judgment books, adjudication books, etc.; documents which do not directly take place in legal force, but which have a tangible guarantee of law enforcement, such as decision books, are also included. The number of judicial documents is large, the form of the text is unstructured, and a character reverse positioning picture technology is generally adopted, but the existing character reverse positioning picture method has the following problems:

(1) only a single picture can be positioned, and the selection can not be positioned across pictures.

(2) The editor only supports character positioning of a paragraph where a cursor is located, and characters cannot be accurately positioned.

Disclosure of Invention

The invention provides a method and a device for reversely positioning a picture by characters and a computer storage medium, which are used for solving the problem that the characters cannot be accurately positioned by the existing positioning method.

The technical scheme for solving the problems is as follows: the method for reversely positioning the picture by the characters is characterized by comprising the following steps of:

displaying a preset first window and a preset second window on a display interface;

receiving a key field which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after cross positioning of picture information and position information;

positioning the picture where the key field is located according to the picture information and displaying the picture in a second window;

and lightening characters corresponding to the key fields in the picture according to the position information.

Further, the method also comprises the following steps: the first window is also provided with a catalog for the user to select, and the catalog is generated by selecting a plurality of key fields according to the needs.

Further, the step of lighting up the text corresponding to the key field in the picture according to the position information includes:

searching corresponding characters in the picture according to the position information of the key field;

and judging whether the similarity between the key field and the searched character is greater than a threshold value, and if so, lightening the character.

Further, the picture information is obtained by performing OCR recognition on a plurality of target pictures, and the picture information includes character information and a plurality of paragraph information, where the character information includes each character in the picture and a coordinate of each character.

Further, the position information is information of a key field position obtained by extracting a plurality of paragraphs of information, the position information includes information of a starting position of a paragraph where the key field is located, and the plurality of paragraphs of information are a combination of paragraph information obtained by performing OCR recognition on a plurality of pictures respectively. .

Further, the method for extracting the multi-section information is based on regular expression strong matching and NLP capability algorithm.

In addition, the invention also provides a device for reversely positioning the picture by the characters, which is characterized by comprising the following components: the display module is used for displaying the first window and the second window in the display area;

the receiving module is used for receiving a first character which is selected by a user side in a first window and carries coordinate information, wherein the coordinate information is generated after the picture information and the position information are positioned in a crossed mode;

the positioning module is used for positioning the picture where the key field is located according to the picture information;

and the lighting module is used for lighting characters corresponding to the key fields in the picture according to the position information.

The system further comprises a judging module, wherein the judging module is used for judging whether the similarity between the key field and the searched character is greater than a threshold value.

The invention also proposes a computer storage medium, which is characterized in that a computer-executable instruction is stored thereon, which, when being executed by a processor, carries out the method steps of any one of claims 1 to 8.

The invention has the advantages that:

1) the invention can be selected by cross-picture positioning;

2) the invention can accurately position the character where the picture is located and highlight the character;

3) the invention supports the positioning of multiple lines, multiple lines and multiple pages in a line according to the character coordinates covered on the picture.

Drawings

FIG. 1 is a schematic flow chart of example 1 of the present invention;

FIG. 2 is a schematic flow chart of example 2 of the present invention;

FIG. 3 is a diagram illustrating web page positioning according to embodiment 2 of the present invention;

fig. 4 is a schematic diagram of editor positioning in embodiment 2 of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be described clearly and completely with reference to the accompanying drawings of the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be obtained by a person skilled in the art without any inventive step based on the embodiments of the present invention, are within the scope of the present invention. Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention.

Example 1: the method for reversely positioning the picture by the text as shown in fig. 1 comprises the following steps:

As a preferred embodiment of the present invention, the first window further has a directory for the user to select, and the directory is generated by selecting a plurality of key fields according to the requirement.

As a preferred embodiment of the present invention, the step of lighting up the text corresponding to the key field in the picture according to the location information comprises:

and judging whether the similarity between the key field and the searched character is greater than a threshold value, and if so, lighting the character.

As a preferred embodiment of the present invention, the picture information is obtained by performing OCR recognition on a plurality of target pictures, and the picture information includes text information and a plurality of paragraph information, where the text information includes each text in the picture and a coordinate of each text.

As a preferred embodiment of the present invention, the position information is information of a key field position obtained by extracting a plurality of pieces of paragraph information, the position information includes information of a start position of a paragraph where the key field is located, and the plurality of pieces of paragraph information are a combination of paragraph information obtained by performing OCR recognition on a plurality of pictures, respectively.

As a preferred embodiment of the present invention, the method for extracting multiple pieces of colony information is based on regular expression strong matching and NLP capability algorithm.

Example 2: the method for reversely positioning the picture by the characters as shown in fig. 2 comprises the following steps:

the method comprises the following steps: generation of coordinate information

The generation method of the coordinate information comprises the following steps:

1. performing OCR recognition on the picture to obtain all characters in the picture, coordinate information of all the characters and information of all paragraphs;

2. combining information of each paragraph obtained by performing OCR recognition on a plurality of pictures into multi-paragraph information, extracting key fields and information of initial positions of the paragraphs where the key fields are located through an algorithm of regular expression strong matching and NLP (non line of sight) capacity, wherein the key fields are business fields such as case types and document types;

3. performing cross positioning on the information of the starting position of the paragraph where the key field is located in the step 2 and the information of the plurality of paragraphs in the step 1;

4. and obtaining coordinate information according to the cross positioning result.

Step 2: web page positioning or editor positioning as required by user end

As shown in fig. 3, when web page positioning is selected, key fields clicked or searched in a directory by a user side are received in a web page, corresponding pictures are positioned according to coordinate information carried by the key fields, characters corresponding to the key fields on the selected pictures are highlighted according to position information, and the contents of the key fields and the characters are the same.

As shown in fig. 4, when the positioning of the editor is selected, the characters selected by the user side are received in the editor, the corresponding picture is positioned through the coordinate information carried by the selected characters, the characters with similarity greater than the threshold value with the selected characters are searched in the picture according to the position information of the selected characters, and the characters are lightened.

Example 3: a device for reversely positioning picture by characters comprises

The display module is used for displaying the first window and the second window in the display area;

As a preferred embodiment of the present invention: the judging module is used for judging whether the similarity between the key field and the searched character is greater than a threshold value.

Example 4: a computer storage medium having stored thereon computer-executable instructions that, when executed by a processor, perform the steps of the text-reverse positioning picture method of embodiments 1-4.

The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all equivalent structures or equivalent flow transformations made by using the contents of the specification and the drawings, or applied directly or indirectly to other related systems, are included in the scope of the present invention.

Claims

1. A method for reversely positioning pictures by characters is characterized by comprising the following steps:

2. The method of claim 1, further comprising the steps of:

the first window is also provided with a catalog for the user to select, and the catalog is generated by selecting a plurality of key fields according to the needs.

3. The method as claimed in claim 1, wherein the step of lighting up the text corresponding to the key field in the picture according to the position information comprises:

4. The method according to any one of claims 1-3, wherein the picture information is obtained by performing OCR recognition on the target picture, and the picture information includes text information and paragraph information, wherein the text information includes each text in the picture and coordinates of each text.

5. The method as claimed in claim 4, wherein the position information is a key field position obtained by extracting a plurality of paragraphs, the position information includes a start position of a paragraph where the key field is located, and the plurality of paragraphs are a combination of paragraph information obtained by performing OCR recognition on a plurality of pictures respectively.

6. The method for reverse positioning of pictures by characters according to claim 5, wherein the method for extracting the multi-paragraph information is based on regular expression strong matching and NLP capability algorithm.

7. A device for reversely positioning pictures by characters is characterized by comprising

8. The apparatus for reverse positioning picture by letters according to claim 7, further comprising a determining module for determining whether the similarity between the key field and the found letter is greater than a threshold.

9. A computer storage medium having stored thereon computer-executable instructions which, when executed by a processor, carry out the method steps of any of claims 1 to 8.