CN114299515A

CN114299515A - Method and device for identifying bold font and electronic equipment

Info

Publication number: CN114299515A
Application number: CN202111475916.9A
Authority: CN
Inventors: 万晓东
Original assignee: Xinao Xinzhi Technology Co ltd
Current assignee: Xinao Xinzhi Technology Co ltd
Priority date: 2021-12-06
Filing date: 2021-12-06
Publication date: 2022-04-08

Abstract

The application discloses a method and a device for identifying a bolded font and electronic equipment, wherein the method comprises the following steps: acquiring the area corresponding to the row area of the identified document; judging whether a first ratio between the area corresponding to the line region and the area of the pixel region corresponding to the same line region in the original document is larger than a first preset ratio or not; if so, determining the line area as a first font bold area; otherwise, determining the line area as a non-first font bold area. Whether the line area in the identified document is the font-bold area or not is determined based on the method, manual comparison is not needed, and convenience and rapidness are achieved.

Description

Method and device for identifying bold font and electronic equipment

Technical Field

The present application relates to the field of data processing technologies, and in particular, to a method and an apparatus for recognizing a bold font, and an electronic device.

Background

Currently, an Optical Character Recognition (OCR) technology is generally used in a process of processing a printed document, and the OCR technology is: the characters of various bills, newspapers, books, manuscripts and other printed matters are converted into image information through optical input modes such as scanning and the like, and then the image information is converted into usable computer input information by utilizing a character recognition technology.

With the OCR technology, the process of inputting the printed document into the computer manually can be omitted, but the bold font in the printed document is usually the important content of the document, and the OCR technology cannot mark the bold font in the printed document, so that the important content cannot be embodied in the generated recognition document, and therefore, a method for recognizing the bold font in the printed document is urgently needed.

At present, the method for recognizing the bold font is mainly to compare the recognition document generated after the OCR processing with the original printing document and then manually mark the bold font in the recognition document.

Disclosure of Invention

The application provides a method and a device for identifying a bold font, and electronic equipment, wherein whether a line area in an identification document is the font bold area is determined by judging whether a first ratio between the area corresponding to the line area in the identification document and the area of a pixel area corresponding to the same line area in an original document is larger than a first preset ratio, and the method for identifying the bold font does not need manual comparison, and is convenient and fast.

In a first aspect, the present application provides a method for identifying a bold font, where the method includes:

acquiring the area corresponding to a row area of an identified document, wherein the identified document is a document generated by subjecting an original document to Optical Character Recognition (OCR);

judging whether a first ratio between the area corresponding to the line region and the area of a pixel region corresponding to the same line region in the original document is larger than a first preset ratio or not;

if so, determining the line area as a first font bold area;

otherwise, determining the line area as a non-first font bold area.

By the method, whether the line area in the identification document is the font-bold area or not is determined by judging whether the first ratio between the area corresponding to the line area in the identification document and the area of the pixel area corresponding to the same line area in the original document is larger than the first preset ratio or not, and the method for identifying the bold font does not need manual comparison and is convenient and fast.

In a possible design, before obtaining the area corresponding to the row area of the identification document, the method further includes:

performing OCR processing on the original document to generate the recognition document;

extracting the row region coordinates of the identification document;

and calculating the area corresponding to the row area according to the row area coordinate.

By the method, the area corresponding to the line area is calculated and is used for subsequently judging whether the line area is a font bold area or not.

Further, the determining whether a first ratio between the area corresponding to the line region and the area of the pixel region corresponding to the same line region in the original document is greater than a first preset ratio includes:

calculating the area of a pixel region corresponding to the same line region in the original document;

calculating the first ratio between the area corresponding to the row area and the area of the pixel area;

and judging whether the first occupation ratio is larger than the first preset occupation ratio or not.

In one possible design, after determining that the line area is the first font bold area, the method further includes:

calculating a second ratio between the whole area of the page where the line region is located and the whole pixel block corresponding to the same page in the original document;

judging whether the second ratio is larger than a second preset ratio or not;

if so, determining the line area as a second font bold area;

otherwise, determining the line area as a non-second font bold area.

By the method, the determined first font bold area is further confirmed, and the second font bold area with higher accuracy is obtained.

In a second aspect, the present application provides a bolded font recognition apparatus, the apparatus comprising:

the acquisition module is used for acquiring the area corresponding to the row area of the identified document, wherein the identified document is a document generated by subjecting an original document to Optical Character Recognition (OCR);

the judging module is used for judging whether a first ratio between the area corresponding to the line region and the area of the pixel region corresponding to the same line region in the original document is larger than a first preset ratio or not;

a determining module, configured to determine that the line area is a first font bold area if the first occupation ratio is greater than the first preset occupation ratio; and if the first occupation ratio is smaller than or equal to the first preset occupation ratio, determining the line area as a non-first font bold area.

In one possible design, the apparatus further includes:

the processing module is used for carrying out OCR processing on the original document to generate the recognition document;

an extraction module for extracting the line region coordinates of the identification document;

and the calculation module is used for calculating the area corresponding to the row area according to the row area coordinate.

Further, the calculation module is further configured to calculate areas of pixel regions corresponding to the same row of regions in the original document; calculating the first ratio between the area corresponding to the row area and the area of the pixel area;

the determination module is further configured to determine whether the first duty ratio is greater than the first preset duty ratio.

In a possible design, the calculating module is further configured to calculate a second ratio between the entire area of the page where the line region is located and an entire pixel block corresponding to the same page in the original document;

the judging module is further configured to judge whether the second ratio is greater than a second preset ratio;

the determining module is further configured to determine that the line area is a second font bold area if the second occupation ratio is greater than the second preset occupation ratio; and if the second occupation ratio is smaller than or equal to the second preset occupation ratio, determining the line area as a non-second font bold area.

In a third aspect, the present application provides an electronic device, comprising:

a memory for storing a computer program;

and the processor is used for realizing the steps of the bold font identification method when executing the computer program stored in the memory.

In a fourth aspect, the present application provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, performs the above-mentioned bolded font identification method steps.

Based on the method for identifying the bold font, whether the line area in the identified document is the font bold area or not is determined by judging whether a first occupation ratio between the area corresponding to the line area in the identified document and the area of the pixel area corresponding to the same line area in the original document is larger than a first preset occupation ratio or not.

For each of the second to fourth aspects and possible technical effects of each aspect, reference is made to the above description of the possible technical effects of the first aspect or various possible schemes of the first aspect, and repeated description is omitted here.

Drawings

Fig. 1 is a flowchart of a method for identifying a bold font according to the present application;

FIG. 2 is a schematic diagram of a row region provided herein;

FIG. 3 is a schematic illustration of another row region provided herein;

fig. 4 is a schematic structural diagram of a bold font recognition apparatus provided in the present application;

fig. 5 is a schematic structural diagram of an electronic device provided in the present application.

Detailed Description

In order to make the objects, technical solutions and advantages of the present application more clear, the present application will be further described in detail with reference to the accompanying drawings. The particular methods of operation in the method embodiments may also be applied to apparatus embodiments or system embodiments. It should be noted that "a plurality" is understood as "at least two" in the description of the present application. "and/or" describes the association relationship of the associated objects, meaning that there may be three relationships, e.g., a and/or B, which may mean: a exists alone, A and B exist simultaneously, and B exists alone. A is connected with B and can represent: a and B are directly connected and A and B are connected through C. In addition, in the description of the present application, the terms "first," "second," and the like are used for descriptive purposes only and are not intended to indicate or imply relative importance nor order to be construed.

The embodiments of the present application will be described in detail below with reference to the accompanying drawings.

In the process of identifying the printed document, the OCR technology is used to omit the process of inputting the printed document into a computer manually, but the bold font in the printed document is usually the key content of the document, and the OCR technology cannot mark the bold font in the printed document, so that the key content cannot be embodied in the generated identified document, and therefore, a method for identifying the bold font in the printed document is urgently needed.

Based on the above problem, the application provides a method for identifying a bold font, which determines whether a line region in an identification document is a font bold region by judging whether a first ratio between an area corresponding to the line region in the identification document and an area of a pixel region corresponding to the same line region in an original document is larger than a first preset ratio, and the method for identifying the bold font is convenient and fast without manual comparison. The method and the device in the embodiment of the application are based on the same technical concept, and because the principles of the problems solved by the method and the device are similar, the device and the embodiment of the method can be mutually referred, and repeated parts are not repeated.

As shown in fig. 1, a flowchart of a method for recognizing a bold font provided by the present application specifically includes the following steps:

s11, acquiring the area corresponding to the line area of the identified document;

in the embodiment of the present application, identifying a document is a document generated after an original document is subjected to optical character recognition OCR, and acquiring an area corresponding to a row region of the identified document is mainly implemented based on coordinates of the row region, where the specific method for acquiring an area corresponding to a row region of the identified document includes:

performing OCR processing on an original document to generate an identified document, wherein the original document can be a printed document, extracting the coordinate information of the row area of the identified document, and calculating the area corresponding to the row area according to the extracted coordinate information.

For example, referring to fig. 2, in fig. 2, a black solid frame is a boundary line for identifying a document, a black "×" part is a text region, and a region between two dotted lines and the boundary line is a current line region, wherein each dotted line is located in the middle between two lines of text regions, and an area corresponding to the current line region can be calculated by coordinate values of end points of the dotted lines.

S12, judging whether a first ratio between the area corresponding to the line region and the area of the pixel region corresponding to the same line region in the original document is larger than a first preset ratio;

in the embodiment of the present application, after the area corresponding to the line region is obtained, the area of the pixel region corresponding to the same line region in the original document is further calculated, where the area of the pixel region corresponding to the same line region in the original document is the area of the black text portion of the line region, the area of the pixel region may be calculated by obtaining the black edge coordinate value of each text, further, a first ratio between the area corresponding to the line region and the area of the pixel region is calculated, and if the area of the current line region is S1 and the area of the pixel region corresponding to the same line region in the original document is S2, the calculated first ratio is S2/S1.

After the first ratio is obtained, whether the first ratio is greater than a first preset ratio is judged, if yes, step S13 is executed; otherwise, step S14 is executed.

S13, if yes, determining the line area as a first font bold area;

s14, otherwise, determining the line area as the non-first font bold area.

In the application, the bold font is identified, the line regions are sequentially acquired from front to back of the identification document, then the acquired line regions are processed according to the method, and finally all font bold regions in the whole identification document are identified.

Based on the method, whether the line area in the identification document is the font-bold area or not is determined by judging whether a first ratio between the area corresponding to the line area in the identification document and the area of the pixel area corresponding to the same line area in the original document is larger than a first preset ratio or not, and the method for identifying the bold font does not need manual comparison and is convenient and fast.

However, in the process of identifying the font-bold area according to the method, because the original document has different font sizes and different font-bold modes, or the binary maps corresponding to the bold fonts in different areas in the original document are different, the first preset ratio is difficult to accurately take values, wherein the binary maps indicate a black area and a white area of the document, the black area represents the font, and the white area represents the blank part.

For example, referring to fig. 3, a black solid frame in fig. 3 is a boundary line for identifying a document, a black "-" portion is a text region, and a region between two dotted lines and the boundary line is a current line region, where each dotted line is located at the middle between two text regions, a font of a 4 th line is larger but is not a bold font, where a first percentage corresponding to the 4 th line is 70%, and at this time, if the first preset percentage is set to be 70%, the first preset percentage is not applicable to other line regions, which may cause a recognition omission of a bold font corresponding to a line region with a smaller font, such as a line region corresponding to the 3 rd line, because a font of a line region corresponding to the 3 rd line is smaller, although a font of the region is a bold font, the corresponding first percentage is only 68%.

Under the above circumstances, in order to ensure convenience and rapidness and avoid the situation of missing recognition, in the process of taking a value of the first preset occupation ratio, the value of the first preset occupation ratio is generally lower than the theoretical value, so that the determined font thickening region can contain a non-font thickening region, the recognition result needs to be further processed, and the recognition accuracy is improved.

In order to solve the above problem and improve the recognition accuracy of the bolded font, after determining that the travel area is the first font bolded area according to the step flow shown in fig. 1, the recognized first font bolded area needs to be further confirmed, and the specific confirmation method includes:

judging whether the second ratio is larger than a second preset ratio, wherein the value of the second preset ratio is larger than the first preset ratio, generally higher than about 20% in the actual operation process, and can also be adjusted according to the actual condition, and the specific limitation is not made herein;

if the second occupation ratio is larger than a second preset occupation ratio, further determining the line area as a second font thickening area;

otherwise, determining the line area as a non-second font bold area.

Based on the font-thickening identification method, whether the line area in the identification document is the font-thickening area is determined by judging whether a first ratio between the area corresponding to the line area in the identification document and the area of the pixel area corresponding to the same line area in the original document is larger than a first preset ratio, and the method for identifying the thickened font is convenient and quick without manual comparison.

Based on the same inventive concept, an embodiment of the present application further provides a bold font identification device, as shown in fig. 4, which is a schematic structural diagram of the bold font identification device in the present application, and the device includes:

an obtaining module 41, configured to obtain an area corresponding to a row region of an identified document, where the identified document is a document generated by subjecting an original document to optical character recognition OCR;

a determining module 42, configured to determine whether a first ratio between an area corresponding to the line region and an area of a pixel region corresponding to the same line region in the original document is greater than a first preset ratio;

a determining module 43, configured to determine that the line area is a first font bold area if the first occupation ratio is greater than the first preset occupation ratio; and if the first occupation ratio is smaller than or equal to the first preset occupation ratio, determining the line area as a non-first font bold area.

In one possible design, the apparatus further includes:

Based on the font-thickening identification device, whether the line area in the identification document is the font-thickening area or not is determined by judging whether a first ratio between the area corresponding to the line area in the identification document and the area of the pixel area corresponding to the same line area in the original document is larger than a first preset ratio or not.

Based on the same inventive concept, an embodiment of the present application further provides an electronic device, where the electronic device can implement the function of the foregoing bold font identification apparatus, and with reference to fig. 5, the electronic device includes:

at least one processor 51, and a memory 52 connected to the at least one processor 51, in this embodiment, a specific connection medium between the processor 51 and the memory 52 is not limited, and fig. 5 illustrates an example in which the processor 51 and the memory 52 are connected through a bus 50. The bus 50 is shown in fig. 5 by a thick line, and the connection between other components is merely illustrative and not intended to be limiting. The bus 50 may be divided into an address bus, a data bus, a control bus, etc., and is shown with only one thick line in fig. 5 for ease of illustration, but does not represent only one bus or type of bus. Alternatively, the processor 51 may also be referred to as a controller, without limitation to name a few.

In the embodiment of the present application, the memory 52 stores instructions executable by the at least one processor 51, and the at least one processor 51 can execute the bold font identification method discussed above by executing the instructions stored in the memory 52. The processor 51 may implement the functions of the various modules in the apparatus shown in fig. 4.

The processor 51 is a control center of the apparatus, and may be connected to various parts of the entire control device by various interfaces and lines, and perform various functions of the apparatus and process data by executing or executing instructions stored in the memory 52 and calling data stored in the memory 52, thereby performing overall monitoring of the apparatus.

In one possible design, processor 51 may include one or more processing units, and processor 51 may integrate an application processor, which primarily handles operating systems, user interfaces, application programs, and the like, and a modem processor, which primarily handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 51. In some embodiments, the processor 51 and the memory 52 may be implemented on the same chip, or in some embodiments, they may be implemented separately on separate chips.

The processor 51 may be a general-purpose processor, such as a Central Processing Unit (CPU), digital signal processor, application specific integrated circuit, field programmable gate array or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or the like, that may implement or perform the methods, steps, and logic blocks disclosed in embodiments of the present application. A general purpose processor may be a microprocessor or any conventional processor or the like. The steps of the method for recognizing the bolded font disclosed in the embodiments of the present application may be directly implemented by a hardware processor, or implemented by a combination of hardware and software modules in the processor.

The memory 52, which is a non-volatile computer-readable storage medium, may be used to store non-volatile software programs, non-volatile computer-executable programs, and modules. The Memory 52 may include at least one type of storage medium, and may include, for example, a flash Memory, a hard disk, a multimedia card, a card-type Memory, a Random Access Memory (RAM), a Static Random Access Memory (SRAM), a Programmable Read Only Memory (PROM), a Read Only Memory (ROM), a charge Erasable Programmable Read Only Memory (EEPROM), a magnetic Memory, a magnetic disk, an optical disk, and the like. The memory 52 is any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer, but is not limited to such. The memory 52 in the embodiments of the present application may also be circuitry or any other device capable of performing a storage function for storing program instructions and/or data.

The processor 51 is programmed to solidify the codes corresponding to the bold font identification method described in the foregoing embodiment into the chip, so that the chip can execute the steps of the bold font identification method of the embodiment shown in fig. 1 when running. How to program the processor 51 is well known to those skilled in the art and will not be described in detail here.

Based on the same inventive concept, the present application also provides a storage medium storing computer instructions, which when executed on a computer, cause the computer to perform the method for recognizing bold font discussed above.

In some possible embodiments, the various aspects of the bold font identification method provided herein may also be implemented in the form of a program product comprising program code for causing a control apparatus to perform the steps of the bold font identification method according to various exemplary embodiments of the present application described above in this specification when the program product is run on a device.

As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.

The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.

These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.

It will be apparent to those skilled in the art that various changes and modifications may be made in the present application without departing from the spirit and scope of the application. Thus, if such modifications and variations of the present application fall within the scope of the claims of the present application and their equivalents, the present application is intended to include such modifications and variations as well.

Claims

1. A method for identifying a bolded font, the method comprising:

if so, determining the line area as a first font bold area;

otherwise, determining the line area as a non-first font bold area.

2. The method of claim 1, wherein prior to obtaining the area corresponding to the line region of the identified document, further comprising:

extracting the row region coordinates of the identification document;

3. The method of claim 1, wherein the determining whether a first ratio between an area corresponding to the line region and an area of a pixel region corresponding to the same line region in the original document is greater than a first preset ratio comprises:

4. The method of claim 1, wherein after determining that the line region is a first font bold region, further comprising:

judging whether the second ratio is larger than a second preset ratio or not;

if so, determining the line area as a second font bold area;

otherwise, determining the line area as a non-second font bold area.

5. A bolded font recognition apparatus, the apparatus comprising:

6. The apparatus of claim 5, wherein the apparatus further comprises:

7. The apparatus of claim 5, wherein the calculation module is further configured to calculate pixel region areas corresponding to a same row region in an original document; calculating the first ratio between the area corresponding to the row area and the area of the pixel area;

8. The apparatus of claim 5, wherein the computing module is further configured to compute a second ratio between an overall area of a page in which the line region is located and an overall pixel block corresponding to a same page in the original document;

9. An electronic device, comprising:

a memory for storing a computer program;

a processor for implementing the method steps of any one of claims 1-4 when executing the computer program stored on the memory.

10. A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, which computer program, when being executed by a processor, carries out the method steps of any one of claims 1-4.