CN114139492A

CN114139492A - Image processing method, intelligent terminal and storage medium

Info

Publication number: CN114139492A
Application number: CN202111450927.1A
Authority: CN
Inventors: 陈俊升
Original assignee: Shenzhen Transsion Holdings Co Ltd
Current assignee: Shenzhen Transsion Holdings Co Ltd
Priority date: 2021-11-30
Filing date: 2021-11-30
Publication date: 2022-03-04

Abstract

The application provides an image processing method, an intelligent terminal and a storage medium. The method comprises the following steps: S11, acquiring a first text in a first image; S12, determining or generating a second text corresponding to the first text, and processing the first image to obtain a second image; and S13, overlaying the second image on the first image, and displaying the second text on the second image. Because the second text is displayed in place over the first image, only the second text is displayed on the page. This avoids the excessive and disordered page content that results when the first image and the second text are displayed on the page at the same time, so that a user can conveniently view the translation result and user experience is improved.

Description

Image processing method, intelligent terminal and storage medium

Technical Field

The present application relates to the field of image processing technologies, and in particular, to an image processing method, an intelligent terminal, and a storage medium.

Background

At present, life and work involve many languages, so language translation is increasingly important. Translation refers to converting information in one language into information in another language while keeping it accurate, fluent and elegant. Text in an image to be translated therefore needs to be translated.

In some implementations, an image to be translated is input, a first text in the image is recognized and translated, and a second text is determined or generated; the second text is then displayed below the first text, or displayed on top of the first text.

In the course of conceiving and implementing the present application, the inventors found that at least the following problem exists: when the image to be translated is processed, the second text is displayed below the first text, or on top of the first text, so that the displayed page contains too much content, appears disordered and looks fragmented. As a result, a user cannot conveniently view the translation result, and the user experience is poor.

The foregoing description is provided for general background information and is not admitted to be prior art.

Disclosure of Invention

In view of the above technical problems, the present application provides an image processing method, an intelligent terminal and a storage medium, which solve the technical problem that the content displayed on a page is excessive and disordered, causing the page content to appear fragmented, and which are simple and convenient to operate.

In order to solve the above technical problem, the present application provides an image processing method, including the steps of:

S11, acquiring a first text in a first image;

S12, determining or generating a second text corresponding to the first text, and processing the first image to obtain a second image;

S13, overlaying the second image on the first image, and displaying the second text on the second image.

Optionally, step S12 includes:

carrying out character recognition processing and translation processing on the first image by utilizing an optical character recognition technology; and/or, performing cutting and erasing processing on the first image to obtain a second image.

Optionally, performing character recognition processing and translation processing on the first image by using an optical character recognition technology, and determining or generating a second text corresponding to the first text, including:

carrying out character recognition processing on the first image by utilizing an optical character recognition technology to obtain a first text and positioning information of the first text;

and translating the first text to obtain the second text.
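The two sub-steps above can be sketched as follows. This is a minimal illustration only: the application does not name an OCR engine or a translation service, so both are passed in as callables, and `recognize_and_translate`, `fake_ocr`, `fake_translate` and the `(left, top, right, bottom)` box layout are hypothetical names and assumptions made for the example.

```python
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]    # (left, top, right, bottom), assumed layout
OcrResult = List[Tuple[str, Box]]  # recognized text with its positioning information


def recognize_and_translate(
    image: object,
    ocr: Callable[[object], OcrResult],
    translate: Callable[[str], str],
) -> List[Tuple[str, str, Box]]:
    """Return (first_text, second_text, box) for every recognized region."""
    results = []
    for first_text, box in ocr(image):
        second_text = translate(first_text)  # translation of the first text
        results.append((first_text, second_text, box))
    return results


# Stand-ins for a real OCR engine and translation service (illustrative only).
def fake_ocr(image: object) -> OcrResult:
    return [("你好", (10, 10, 60, 30)), ("世界", (10, 40, 60, 60))]


def fake_translate(text: str) -> str:
    return {"你好": "Hello", "世界": "World"}.get(text, text)


pairs = recognize_and_translate(None, fake_ocr, fake_translate)
```

Keeping the box next to each text pair means the later cutting and overlaying steps can reuse the same positioning information.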

Optionally, the cutting and erasing the first image to obtain a second image includes:

according to the positioning information of the first text, the first text is cut in the first image to obtain a preliminary image;

and erasing the text in the preliminary image to obtain a second image.
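A minimal sketch of cutting and erasing on a toy grayscale image is given below. The application does not specify how text is erased; filling the patch with the most common value on its border is an assumption made here, and the names `cut` and `erase` are hypothetical.

```python
from collections import Counter
from typing import List, Tuple

Image = List[List[int]]          # grayscale pixels, addressed as image[y][x]
Box = Tuple[int, int, int, int]  # (left, top, right, bottom), right/bottom exclusive


def cut(image: Image, box: Box) -> Image:
    """Copy the region covered by box out of the image (the preliminary image)."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def erase(patch: Image) -> Image:
    """Replace every pixel with the most common border value of the patch.

    Assumption: the border of a non-empty patch is mostly background, so its
    most frequent value approximates the background colour.
    """
    border = patch[0] + patch[-1] + [row[0] for row in patch] + [row[-1] for row in patch]
    background = Counter(border).most_common(1)[0][0]
    return [[background] * len(row) for row in patch]


first_image = [
    [9, 9, 9, 9, 9],
    [9, 9, 0, 9, 9],  # 0 stands for a dark text pixel
    [9, 9, 0, 9, 9],
    [9, 9, 9, 9, 9],
]
preliminary = cut(first_image, (1, 0, 4, 4))
second_image = erase(preliminary)
```

A practical implementation would more likely use an inpainting routine for non-uniform backgrounds; the flat fill is only the simplest stand-in.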

Optionally, the positioning information of the first text includes paragraph information of the first text, and/or text line information within the paragraph information;

optionally, the paragraph information of the first text is paragraph coordinate information of four corners of an area where each paragraph in the first text is located; the text line information in the paragraph information is the text line coordinate information of the four corners of the area where each text line in each paragraph in the first text is located.
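For illustration, the positioning information described above could be held in a small data structure such as the following. The sketch assumes pixel coordinates with the origin at the top-left corner; the names `Quad`, `TextLine`, `Paragraph` and `bounding_box` are hypothetical and do not appear in the application.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Point = Tuple[int, int]  # (x, y) in pixels; origin assumed at the top-left


@dataclass(frozen=True)
class Quad:
    """Four corner points of a region: top-left, top-right, bottom-right, bottom-left."""
    corners: Tuple[Point, Point, Point, Point]

    def bounding_box(self) -> Tuple[int, int, int, int]:
        """Axis-aligned box (left, top, right, bottom) enclosing the four corners."""
        xs = [x for x, _ in self.corners]
        ys = [y for _, y in self.corners]
        return min(xs), min(ys), max(xs), max(ys)


@dataclass
class TextLine:
    text: str
    quad: Quad  # text line coordinate information


@dataclass
class Paragraph:
    quad: Quad  # paragraph coordinate information
    lines: List[TextLine] = field(default_factory=list)
```

Storing four corners rather than a rectangle allows slightly rotated text regions to be described; `bounding_box` reduces them to a rectangle when a simple crop is sufficient.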

Optionally, the cutting the first text in the first image according to the positioning information of the first text to obtain a preliminary image includes:

according to paragraph coordinate information of four corners of an area where each paragraph in the first text is located, cutting the first text in the first image to obtain a first image;

and according to the text line coordinate information of the four corners of the area where each text line is located in each paragraph, cutting each text line in the first image to obtain a preliminary image.

Alternatively, according to paragraph coordinate information of four corners of the area where each paragraph in the first text is located, cutting the first text to obtain a preliminary image.
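The two-stage cutting (paragraph first, then each text line) can be sketched as below. The sketch assumes axis-aligned boxes given in first-image coordinates and assumes every line box lies inside its paragraph box; `crop` and `cut_lines` are hypothetical names.

```python
from typing import List, Tuple

Image = List[List[int]]          # pixels addressed as image[y][x]
Box = Tuple[int, int, int, int]  # (left, top, right, bottom), right/bottom exclusive


def crop(image: Image, box: Box) -> Image:
    """Copy the region covered by box out of the image."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def cut_lines(image: Image, paragraph_box: Box, line_boxes: List[Box]) -> List[Image]:
    """Cut the paragraph first, then cut each text line out of the paragraph crop.

    line_boxes are given in first-image coordinates, so they are shifted into
    the paragraph crop's coordinate system before cropping.
    """
    p_left, p_top, _, _ = paragraph_box
    paragraph = crop(image, paragraph_box)
    pieces = []
    for left, top, right, bottom in line_boxes:
        local = (left - p_left, top - p_top, right - p_left, bottom - p_top)
        pieces.append(crop(paragraph, local))
    return pieces


# Each pixel encodes its own position (y * 10 + x) so the crops are easy to check.
image = [[y * 10 + x for x in range(6)] for y in range(6)]
pieces = cut_lines(image, (1, 1, 5, 5), [(1, 1, 5, 2), (1, 3, 5, 4)])
```

Cutting by paragraph alone corresponds to calling `crop` once with the paragraph box.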

Optionally, overlaying the second image on the first image comprises:

and covering a second image on the first image according to the positioning information of the first text.
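Covering the second image on the first image at the position given by the positioning information can be sketched as a simple paste. The function name `overlay` and the choice to clip pixels falling outside the first image are assumptions made for this example.

```python
from typing import List

Image = List[List[int]]  # pixels addressed as image[y][x]


def overlay(first_image: Image, second_image: Image, left: int, top: int) -> Image:
    """Return a copy of first_image with second_image pasted at (left, top)."""
    result = [row[:] for row in first_image]
    for dy, patch_row in enumerate(second_image):
        y = top + dy
        if not 0 <= y < len(result):
            continue  # ignore rows that fall outside the first image
        for dx, value in enumerate(patch_row):
            x = left + dx
            if 0 <= x < len(result[y]):
                result[y][x] = value
    return result


base = [[0] * 4 for _ in range(3)]
covered = overlay(base, [[7, 7], [7, 7]], left=1, top=1)
```

Here `left` and `top` would come from the top-left corner of the region from which the second image was cut, so the erased patch lands exactly over the original text.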

In a second aspect, the present application provides an image processing apparatus comprising:

an acquisition unit configured to acquire a first text in a first image;

the processing unit is used for determining or generating a second text corresponding to the first text, and processing the first image to obtain a second image;

the covering unit is used for covering the second image on the first image;

a display unit for displaying the second text on the second image.

Optionally, the processing unit includes:

the translation module is used for carrying out character recognition processing and translation processing on the first image by utilizing an optical character recognition technology; and/or,

the cutting and erasing module is used for cutting and erasing the first image to obtain a second image.

Optionally, the translation module includes:

the character recognition submodule is used for carrying out character recognition processing on the first image by utilizing an optical character recognition technology to obtain a first text and positioning information of the first text;

and the translation submodule is used for translating the first text to obtain the second text.

Optionally, the cutting and erasing module includes:

the cutting sub-module is used for cutting the first text in the first image according to the positioning information of the first text to obtain a preliminary image;

and the erasing submodule is used for erasing the text in the preliminary image to obtain a second image.

Optionally, the cutting sub-module comprises:

the first cutting sub-module is used for cutting the first text in the first image according to paragraph coordinate information of four corners of an area where each paragraph in the first text is located to obtain a first image;

and the second cutting submodule is used for cutting each text line in the first image according to the text line coordinate information of the four corners of the area where each text line is located in each paragraph, so as to obtain a preliminary image.

Optionally, the cutting sub-module comprises:

and the third cutting sub-module is used for cutting the first text according to paragraph coordinate information of four corners of the area where each paragraph in the first text is located to obtain a preliminary image.

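Optionally, the covering unit is specifically configured to cover the second image on the first image according to the positioning information of the first text.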

The application further provides an intelligent terminal, the intelligent terminal includes: a memory, a processor, wherein the memory has stored thereon an image processing program, which when executed by the processor implements the steps of any of the methods described above.

The present application also provides a computer-readable storage medium, which stores a computer program that, when executed by a processor, performs the steps of the method as set forth in any one of the above.

As described above, the image processing method, the intelligent terminal, and the storage medium of the present application, applied to the intelligent terminal, include the steps of: S11, acquiring a first text in a first image; S12, determining or generating a second text corresponding to the first text, and processing the first image to obtain a second image; and S13, overlaying the second image on the first image, and displaying the second text on the second image. Through this technical scheme, the page displays only the second text, which avoids the excessive and disordered page content that results when the first image and the second text are displayed on the page simultaneously. A user can conveniently view the translation result, the technical problem that excessive and disordered content causes the page content to appear fragmented is solved, and user experience is further improved.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present application and together with the description, serve to explain the principles of the application. In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings needed to be used in the description of the embodiments will be briefly described below, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without inventive exercise.

Fig. 1 is a schematic diagram of a hardware structure of an intelligent terminal implementing various embodiments of the present application;

Fig. 2 is a communication network system architecture diagram according to an embodiment of the present application;

Fig. 3 is a flowchart illustrating an image processing method according to a first embodiment;

Fig. 4 is a flowchart illustrating still another image processing method according to the first embodiment;

Fig. 5 is a scene schematic diagram of a first image provided in an embodiment of the present application;

Fig. 6 is a schematic view of a scene of a second image provided in an embodiment of the present application;

Fig. 7 is a schematic view of a scene in which a second image overlaps a first image according to an embodiment of the present disclosure;

Fig. 8 is a flowchart showing an image processing apparatus according to the first embodiment;

Fig. 9 is a flowchart illustrating still another image processing apparatus according to the first embodiment.

The implementation, functional features and advantages of the objectives of the present application will be further explained with reference to the accompanying drawings. With the above figures, there are shown specific embodiments of the present application, which will be described in more detail below. These drawings and written description are not intended to limit the scope of the inventive concepts in any manner, but rather to illustrate the inventive concepts to those skilled in the art by reference to specific embodiments.

Detailed Description

Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The embodiments described in the following exemplary embodiments do not represent all embodiments consistent with the present application. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present application, as detailed in the appended claims.

It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, the recitation of an element by the phrase "comprising an … …" does not exclude the presence of additional like elements in the process, method, article, or apparatus that comprises the element, and further, where similarly-named elements, features, or elements in different embodiments of the disclosure may have the same meaning, or may have different meanings, that particular meaning should be determined by their interpretation in the embodiment or further by context with the embodiment.

It should be understood that although the terms first, second, third, etc. may be used herein to describe various information, such information should not be limited to these terms. These terms are only used to distinguish one type of information from another. For example, first information may also be referred to as second information, and similarly, second information may also be referred to as first information, without departing from the scope herein. The word "if" as used herein may be interpreted as "when" or "upon" or "in response to a determination", depending on the context. Also, as used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, unless the context indicates otherwise. It will be further understood that the terms "comprises," "comprising," "includes" and/or "including," when used in this specification, specify the presence of stated features, steps, operations, elements, components, items, species, and/or groups, but do not preclude the presence or addition of one or more other features, steps, operations, elements, components, items, species, and/or groups thereof. The terms "or," "and/or," "including at least one of the following," and the like, as used herein, are to be construed as inclusive, meaning any one or any combination. For example, "includes at least one of: A, B, C" means "any of the following: A; B; C; A and B; A and C; B and C; A and B and C"; as a further example, "A, B or C" or "A, B and/or C" means "any of the following: A; B; C; A and B; A and C; B and C; A and B and C". An exception to this definition will occur only when a combination of elements, functions, steps or operations is inherently mutually exclusive in some way.

It should be understood that, although the steps in the flowcharts in the embodiments of the present application are shown in order as indicated by the arrows, the steps are not necessarily performed in that order. Unless explicitly stated herein, the steps are not strictly limited to the order shown and may be performed in other orders. Moreover, at least some of the steps in the figures may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times and in different orders, and may be performed in turn or alternately with other steps or with at least some of the sub-steps or stages of other steps.

The word "if", as used herein, may be interpreted as "when" or "upon" or "in response to a determination" or "in response to a detection", depending on the context. Similarly, the phrases "if determined" or "if detected (a stated condition or event)" may be interpreted as "when determined" or "in response to a determination" or "when detected (a stated condition or event)" or "in response to a detection (a stated condition or event)", depending on the context.

It should be noted that step numbers such as S11 and S12 are used herein for the purpose of more clearly and briefly describing the corresponding content, and do not constitute a substantial limitation on the sequence, and those skilled in the art may perform S12 first and then S11 in specific implementation, which should be within the scope of the present application.

It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.

In the following description, suffixes such as "module", "component", or "unit" used to denote elements are used only for the convenience of description of the present application, and have no specific meaning in themselves. Thus, "module", "component" or "unit" may be used mixedly.

The smart terminal may be implemented in various forms. For example, the smart terminal described in the present application may include smart terminals such as a mobile phone, a tablet computer, a notebook computer, a palmtop computer, a Personal Digital Assistant (PDA), a Portable Media Player (PMP), a navigation device, a wearable device, a smart band, a pedometer, and the like, and fixed terminals such as a Digital TV, a desktop computer, and the like.

The following description will be given taking a mobile terminal as an example, and it will be understood by those skilled in the art that the configuration according to the embodiment of the present application can be applied to a fixed type terminal in addition to elements particularly used for mobile purposes.

Referring to Fig. 1, which is a schematic diagram of a hardware structure of a mobile terminal for implementing various embodiments of the present application, the mobile terminal 100 may include: RF (Radio Frequency) unit 101, WiFi module 102, audio output unit 103, A/V (audio/video) input unit 104, sensor 105, display unit 106, user input unit 107, interface unit 108, memory 109, processor 110, and power supply 111. Those skilled in the art will appreciate that the mobile terminal architecture shown in Fig. 1 is not intended to be limiting of mobile terminals, which may include more or fewer components than those shown, or some components may be combined, or a different arrangement of components.

The following describes each component of the mobile terminal in detail with reference to fig. 1:

The radio frequency unit 101 may be configured to receive and transmit signals during information transmission and reception or during a call. Specifically, it receives downlink information from a base station and passes the downlink information to the processor 110 for processing; in addition, it transmits uplink data to the base station. Typically, the radio frequency unit 101 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like. In addition, the radio frequency unit 101 can also communicate with a network and other devices through wireless communication. The wireless communication may use any communication standard or protocol, including but not limited to GSM (Global System for Mobile communications), GPRS (General Packet Radio Service), CDMA2000 (Code Division Multiple Access 2000), WCDMA (Wideband Code Division Multiple Access), TD-SCDMA (Time Division-Synchronous Code Division Multiple Access), FDD-LTE (Frequency Division Duplex Long Term Evolution), TDD-LTE (Time Division Duplex Long Term Evolution), and fifth generation (5G) mobile communication systems.

WiFi belongs to short-distance wireless transmission technology, and the mobile terminal can help a user to receive and send e-mails, browse webpages, access streaming media and the like through the WiFi module 102, and provides wireless broadband internet access for the user. Although fig. 1 shows the WiFi module 102, it is understood that it does not belong to the essential constitution of the mobile terminal, and may be omitted entirely as needed within the scope not changing the essence of the invention.

The audio output unit 103 may convert audio data received by the radio frequency unit 101 or the WiFi module 102 or stored in the memory 109 into an audio signal and output as sound when the mobile terminal 100 is in a call signal reception mode, a call mode, a recording mode, a voice recognition mode, a broadcast reception mode, or the like. Also, the audio output unit 103 may also provide audio output related to a specific function performed by the mobile terminal 100 (e.g., a call signal reception sound, a message reception sound, etc.). The audio output unit 103 may include a speaker, a buzzer, and the like.

The A/V input unit 104 is used to receive audio or video signals. The A/V input unit 104 may include a Graphics Processing Unit (GPU) 1041 and a microphone 1042. The graphics processor 1041 processes image data of still pictures or video obtained by an image capturing device (e.g., a camera) in a video capturing mode or an image capturing mode. The processed image frames may be displayed on the display unit 106. The image frames processed by the graphics processor 1041 may be stored in the memory 109 (or other storage medium) or transmitted via the radio frequency unit 101 or the WiFi module 102. The microphone 1042 may receive sound in a phone call mode, a recording mode, a voice recognition mode, or the like, and may process such sound into audio data. In the phone call mode, the processed audio (voice) data may be converted into a format that can be transmitted to a mobile communication base station via the radio frequency unit 101 and then output. The microphone 1042 may implement various types of noise cancellation (or suppression) algorithms to cancel (or suppress) noise or interference generated in the course of receiving and transmitting audio signals.

The mobile terminal 100 also includes at least one sensor 105, such as a light sensor, a motion sensor, and other sensors. Optionally, the light sensor includes an ambient light sensor that may adjust the brightness of the display panel 1061 according to the brightness of ambient light, and a proximity sensor that may turn off the display panel 1061 and/or the backlight when the mobile terminal 100 is moved to the ear. As one of the motion sensors, the accelerometer sensor can detect the magnitude of acceleration in each direction (generally, three axes), can detect the magnitude and direction of gravity when stationary, and can be used for applications of recognizing the posture of a mobile phone (such as horizontal and vertical screen switching, related games, magnetometer posture calibration), vibration recognition related functions (such as pedometer and tapping), and the like; as for other sensors such as a fingerprint sensor, a pressure sensor, an iris sensor, a molecular sensor, a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor, which can be configured on the mobile phone, further description is omitted here.

The display unit 106 is used to display information input by a user or information provided to the user. The Display unit 106 may include a Display panel 1061, and the Display panel 1061 may be configured in the form of a Liquid Crystal Display (LCD), an Organic Light-Emitting Diode (OLED), or the like.

The user input unit 107 may be used to receive input numeric or character information and generate key signal inputs related to user settings and function control of the mobile terminal. Optionally, the user input unit 107 may include a touch panel 1071 and other input devices 1072. The touch panel 1071, also referred to as a touch screen, may collect a touch operation performed by a user on or near the touch panel 1071 (e.g., an operation performed by the user on or near the touch panel 1071 using a finger, a stylus, or any other suitable object or accessory), and drive a corresponding connection device according to a predetermined program. The touch panel 1071 may include two parts: a touch detection device and a touch controller. Optionally, the touch detection device detects a touch orientation of a user, detects a signal caused by a touch operation, and transmits the signal to the touch controller; the touch controller receives touch information from the touch detection device, converts the touch information into touch point coordinates, sends the touch point coordinates to the processor 110, and can receive and execute commands sent by the processor 110. In addition, the touch panel 1071 may be implemented in various types, such as resistive, capacitive, infrared, and surface acoustic wave types. In addition to the touch panel 1071, the user input unit 107 may include other input devices 1072. Optionally, the other input devices 1072 may include, but are not limited to, one or more of a physical keyboard, function keys (e.g., volume control keys, switch keys, etc.), a trackball, a mouse, a joystick, and the like.

Alternatively, the touch panel 1071 may cover the display panel 1061, and when the touch panel 1071 detects a touch operation thereon or nearby, the touch panel 1071 transmits the touch operation to the processor 110 to determine the type of the touch event, and then the processor 110 provides a corresponding visual output on the display panel 1061 according to the type of the touch event. Although the touch panel 1071 and the display panel 1061 are shown in fig. 1 as two separate components to implement the input and output functions of the mobile terminal, in some embodiments, the touch panel 1071 and the display panel 1061 may be integrated to implement the input and output functions of the mobile terminal, and is not limited herein.

The interface unit 108 serves as an interface through which at least one external device is connected to the mobile terminal 100. For example, the external device may include a wired or wireless headset port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device having an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port, and the like. The interface unit 108 may be used to receive input (e.g., data information, power, etc.) from external devices and transmit the received input to one or more elements within the mobile terminal 100 or may be used to transmit data between the mobile terminal 100 and external devices.

The memory 109 may be used to store software programs as well as various data. The memory 109 may mainly include a program storage area and a data storage area, and optionally, the program storage area may store an operating system, an application program (such as a sound playing function, an image playing function, and the like) required by at least one function, and the like; the storage data area may store data (such as audio data, a phonebook, etc.) created according to the use of the cellular phone, and the like. Further, the memory 109 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.

The processor 110 is a control center of the mobile terminal, connects various parts of the entire mobile terminal using various interfaces and lines, and performs various functions of the mobile terminal and processes data by operating or executing software programs and/or modules stored in the memory 109 and calling data stored in the memory 109, thereby performing overall monitoring of the mobile terminal. Processor 110 may include one or more processing units; preferably, the processor 110 may integrate an application processor and a modem processor, optionally, the application processor mainly handles operating systems, user interfaces, application programs, etc., and the modem processor mainly handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 110.

The mobile terminal 100 may further include a power supply 111 (e.g., a battery) for supplying power to various components, and preferably, the power supply 111 may be logically connected to the processor 110 via a power management system, so as to manage charging, discharging, and power consumption management functions via the power management system.

Although not shown in fig. 1, the mobile terminal 100 may further include a bluetooth module or the like, which is not described in detail herein.

In order to facilitate understanding of the embodiments of the present application, a communication network system on which the mobile terminal of the present application is based is described below.

Referring to fig. 2, fig. 2 is an architecture diagram of a communication Network system according to an embodiment of the present disclosure, where the communication Network system is an LTE system of a universal mobile telecommunications technology, and the LTE system includes a UE (User Equipment) 201, an E-UTRAN (Evolved UMTS Terrestrial Radio Access Network) 202, an EPC (Evolved Packet Core) 203, and an IP service 204 of an operator, which are in communication connection in sequence.

Optionally, the UE201 may be the terminal 100 described above, and is not described herein again.

The E-UTRAN202 includes eNodeB2021 and other eNodeBs 2022, among others. Optionally, the eNodeB2021 may be connected with other eNodeBs 2022 through a backhaul (e.g., X2 interface), the eNodeB2021 is connected to the EPC203, and the eNodeB2021 may provide the UE201 access to the EPC203.

The EPC203 may include an MME (Mobility Management Entity) 2031, an HSS (Home Subscriber Server) 2032, other MMEs 2033, an SGW (Serving gateway) 2034, a PGW (PDN gateway) 2035, and a PCRF (Policy and Charging Rules Function) 2036, and the like. Optionally, the MME2031 is a control node that handles signaling between the UE201 and the EPC203, providing bearer and connection management. HSS2032 is used to provide registers to manage functions such as home location register (not shown) and holds subscriber specific information about service characteristics, data rates, etc. All user data may be sent through SGW2034, PGW2035 may provide IP address assignment for UE201 and other functions, and PCRF2036 is a policy and charging control policy decision point for traffic data flow and IP bearer resources, which selects and provides available policy and charging control decisions for a policy and charging enforcement function (not shown).

The IP services 204 may include the internet, intranets, IMS (IP Multimedia Subsystem), or other IP services, among others.

Although the LTE system is described as an example, it should be understood by those skilled in the art that the present application is not limited to the LTE system, but may also be applied to other wireless communication systems, such as GSM, CDMA2000, WCDMA, TD-SCDMA, and future new network systems (e.g. 5G), and the like.

Based on the above mobile terminal hardware structure and communication network system, various embodiments of the present application are provided.

First embodiment

Fig. 3 is a schematic flowchart of an image processing method according to an embodiment of the present application, and as shown in fig. 3, the method includes:

S11, acquiring the first text in the first image.

For example, the execution subject of this embodiment may be an intelligent terminal, an image processing apparatus or device, or another apparatus or device that can execute this embodiment, which is not limited here. This embodiment is described with the intelligent terminal as the execution subject.

First, a first image needs to be acquired. The first image may be captured by a camera, read from a memory, obtained from a web page, or received from another device. The first image may be a picture or the like. When a user performs a translation operation on the first image, a first operation instruction is generated. After receiving the first operation instruction, the intelligent terminal performs character recognition processing and translation processing on the first image by using an optical character recognition (OCR) technology, and determines or generates the first text.

S12, determining or generating a second text corresponding to the first text, and processing the first image to obtain a second image.

The intelligent terminal performs Character Recognition processing and translation processing on a first text in the first image by using an Optical Character Recognition technology (OCR), determines or generates a second text, and performs cutting and erasing processing on the first image to obtain a second image.

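S13, overlaying the second image on the first image, and displaying the second text on the second image.

Illustratively, the second image is overlaid on the first image, and the second text is displayed on the second image, resulting in a translation result image.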
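The flow of the first embodiment can be sketched in code as follows. This is only a minimal illustration under stated assumptions, not the implementation of the application: the image is modeled as a nested list of grayscale pixels, and `process_image`, `recognize`, `translate` and `background` are hypothetical names standing in for the OCR engine, the translation engine and the erase color, none of which is specified in the text.

```python
from typing import Callable, List, Tuple

Image = List[List[int]]          # grayscale pixels, row-major (assumed model)
Box = Tuple[int, int, int, int]  # left, top, right, bottom (right/bottom exclusive)


def process_image(
    first_image: Image,
    recognize: Callable[[Image], List[Tuple[str, Box]]],  # hypothetical OCR stub
    translate: Callable[[str], str],                       # hypothetical translator stub
    background: int = 255,                                 # assumed erase color
) -> Tuple[Image, List[Tuple[str, Box]]]:
    """Return the covered image and the second text with its placement."""
    # S11: acquire the first text (and where it sits) from the first image.
    first_text = recognize(first_image)

    result = [row[:] for row in first_image]  # the first image itself is not modified
    labels = []
    for text, (left, top, right, bottom) in first_text:
        # S12: generate the second text, then cut and erase the text region
        # of the first image to obtain the second image.
        second_text = translate(text)
        preliminary = [row[left:right] for row in first_image[top:bottom]]
        second_image = [[background] * len(row) for row in preliminary]

        # S13: overlay the second image on the first image ...
        for dy, row in enumerate(second_image):
            result[top + dy][left:right] = row
        # ... and record where the second text is to be displayed.
        labels.append((second_text, (left, top, right, bottom)))
    return result, labels
```

Drawing the second text onto the covered region is left to the caller here, since text rendering depends on the display framework of the terminal.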

Second embodiment

Fig. 4 is a schematic flowchart of an image processing method according to an embodiment of the present application, and as shown in fig. 4, the method includes:

and S21, acquiring the first text in the first image.

For example, for this step, reference may be made to step S11 in fig. 3, and details are not described again.

S22, performing character recognition processing on the first image by using an optical character recognition technology to obtain a first text and positioning information of the first text; and translating the first text to obtain a second text.

For example, the intelligent terminal may perform character recognition processing on the first image by using an optical character recognition technology to obtain the first text and the positioning information of the first text, where the positioning information includes paragraph information and line information. The intelligent terminal then translates the first text to determine or generate the second text.

S23, cutting the first text in the first image according to the positioning information of the first text to obtain a preliminary image; and erasing the text in the preliminary image to obtain a second image.

In one example, the positioning information of the first text comprises paragraph information of the first text, and/or text line information within the paragraph information; optionally, the paragraph information of the first text is paragraph coordinate information of four corners of an area where each paragraph in the first text is located; the text line information in the paragraph information is the text line coordinate information of the four corners of the area where each text line in each paragraph in the first text is located.
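One possible in-memory representation of this positioning information is sketched below. The class names `Region`, `TextLine` and `Paragraph` are illustrative assumptions and do not appear in the application; the sketch only shows how four corner coordinates per paragraph and per text line could be stored and reduced to an axis-aligned box for cutting.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Point = Tuple[int, int]  # (x, y) pixel coordinate


@dataclass
class Region:
    """Four corner points of the area occupied by a paragraph or a text line."""
    corners: Tuple[Point, Point, Point, Point]

    def bounding_box(self) -> Tuple[int, int, int, int]:
        """Axis-aligned (left, top, right, bottom) box enclosing the corners."""
        xs = [x for x, _ in self.corners]
        ys = [y for _, y in self.corners]
        return min(xs), min(ys), max(xs), max(ys)


@dataclass
class TextLine:
    text: str
    region: Region  # text line coordinate information


@dataclass
class Paragraph:
    region: Region  # paragraph coordinate information
    lines: List[TextLine] = field(default_factory=list)

    @property
    def text(self) -> str:
        """Paragraph text, joined from its lines with spaces (an assumption)."""
        return " ".join(line.text for line in self.lines)
```

Keeping the four corners rather than only a box leaves room for slightly rotated text, while `bounding_box` gives a simple rectangle when that is all the cutting step needs.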

Step S23 includes two ways:

First mode of step S23: according to paragraph coordinate information of four corners of an area where each paragraph in the first text is located, cutting the first text in the first image to obtain an image related to the paragraph; and according to the text line coordinate information of the four corners of the area where each text line is located in each paragraph, cutting each text line in the image related to the paragraph to obtain a preliminary image.

Second mode of step S23: according to paragraph coordinate information of four corners of the area where each paragraph in the first text is located, cutting the first text to obtain a preliminary image.

Exemplarily, the intelligent terminal may cut the first text in the first image according to the positioning information of the first text to obtain a preliminary image, and then erase the text in the preliminary image to obtain a second image. The first text may be cut to obtain the preliminary image in either of the following modes:

In a first mode, the intelligent terminal cuts the first text in the first image according to paragraph coordinate information of four corners of an area where each paragraph in the first text is located, and obtains an image related to the paragraph; then, according to the text line coordinate information of the four corners of the area where each text line is located in each paragraph, the intelligent terminal cuts each text line in the image related to the paragraph to obtain a preliminary image related to the text line.

In a second mode, the intelligent terminal cuts the first text according to paragraph coordinate information of four corners of an area where each paragraph in the first text is located, and obtains a preliminary image about the paragraph.
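The two cutting modes and the subsequent erasing can be sketched as follows. The application does not say how the text is erased, so filling the region with its most common pixel value is purely an assumption made for this sketch; the function names and the box representation (left, top, right, bottom in first-image coordinates) are likewise hypothetical.

```python
from collections import Counter
from typing import List, Tuple

Image = List[List[int]]          # grayscale pixels, row-major (assumed model)
Box = Tuple[int, int, int, int]  # left, top, right, bottom (right/bottom exclusive)


def crop(image: Image, box: Box) -> Image:
    """Cut the rectangular region described by box out of image."""
    left, top, right, bottom = box
    return [row[left:right] for row in image[top:bottom]]


def crop_by_lines(image: Image, paragraph_box: Box, line_boxes: List[Box]) -> List[Image]:
    """First mode: cut out the paragraph, then each text line inside it."""
    p_left, p_top, _, _ = paragraph_box
    paragraph_image = crop(image, paragraph_box)
    # Line boxes are given in first-image coordinates, so they are shifted
    # into the paragraph image's coordinate system before cutting.
    return [
        crop(paragraph_image, (l - p_left, t - p_top, r - p_left, b - p_top))
        for (l, t, r, b) in line_boxes
    ]


def crop_by_paragraph(image: Image, paragraph_box: Box) -> Image:
    """Second mode: cut out the whole paragraph as the preliminary image."""
    return crop(image, paragraph_box)


def erase(preliminary: Image) -> Image:
    """Blank the text by filling with the most common pixel value.

    Treating the most common value as the background is an assumption of
    this sketch; it fails for regions where text pixels dominate.
    """
    counts = Counter(pixel for row in preliminary for pixel in row)
    if not counts:
        return []
    background = counts.most_common(1)[0][0]
    return [[background] * len(row) for row in preliminary]
```

The first mode yields one small preliminary image per text line, which keeps the background between lines untouched; the second mode yields one larger image per paragraph and needs fewer cutting operations.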

S24, overlaying the second image on the first image according to the positioning information of the first text.

For example, the positioning information includes paragraph coordinate information of four corners of a region where each paragraph in the first text is located, and/or text line coordinate information of four corners of a region where each text line in each paragraph in the first text is located. There is therefore a position correspondence between the paragraph coordinate information, the text line coordinate information, and the second image. The intelligent terminal may overlay the second image on the first image according to this position correspondence, and finally display the second text on the second image to obtain the translation result image.

For example, fig. 5 is a schematic view of a scene of a first image provided in the embodiment of the present application, fig. 6 is a schematic view of a scene of a second image provided in the embodiment of the present application, and fig. 7 is a schematic view of a scene in which the second image provided in the embodiment of the present application is overlaid on the first image. As can be seen from fig. 5, the first image includes 3 lines of first text. As can be seen from fig. 6, the second image includes 3 blank lines. As can be seen from fig. 7, the 3 blank second images are overlaid on the 3 lines of first text in fig. 5, respectively, and the second text is displayed on the blank second images.
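The position-based overlay and the three-line scene of figs. 5 to 7 can be imitated with a small text-based sketch. Everything here is an assumption made for illustration: the image is a grid of characters in which "x" marks first-text pixels and "." marks background, and `overlay`, `first_image` and `line_boxes` are hypothetical names.

```python
from typing import List, Tuple

Image = List[List[str]]          # one character per pixel, for a readable demo
Box = Tuple[int, int, int, int]  # left, top, right, bottom (right/bottom exclusive)


def overlay(first_image: Image, second_image: Image, box: Box) -> Image:
    """Return a copy of first_image with second_image pasted at box."""
    left, top, right, bottom = box
    if len(second_image) != bottom - top or any(
        len(row) != right - left for row in second_image
    ):
        raise ValueError("second image does not match its positioning box")
    result = [row[:] for row in first_image]
    for dy, row in enumerate(second_image):
        result[top + dy][left:right] = row
    return result


# Scene of figs. 5 to 7: three text lines, each covered by a blank second image.
first_image = [list(row) for row in [
    ".......",
    ".xxxxx.",
    ".xxx...",
    ".xxxx..",
    ".......",
]]
line_boxes = [(1, 1, 6, 2), (1, 2, 4, 3), (1, 3, 5, 4)]

covered = first_image
for left, top, right, bottom in line_boxes:
    blank = [["."] * (right - left) for _ in range(bottom - top)]
    covered = overlay(covered, blank, (left, top, right, bottom))
```

Because each blank second image is pasted at the box recorded for its text line, the same boxes can later be reused to place the second text.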

With the above scheme, the second image is obtained by cutting and erasing the first image. Because cutting and erasing are local operations, they consume no network traffic, which improves the processing speed. In addition, the second text is displayed in place on the first image, so that the page displays only the second text. This avoids the excessive and cluttered page content that results when the first text and the second text are displayed on the page at the same time, makes the translation result easier for the user to view, and greatly improves the user experience.

Third embodiment

Fig. 8 is a schematic structural diagram of an image processing apparatus according to an embodiment of the present application, and as shown in fig. 8, the apparatus includes:

An acquiring unit 31, configured to acquire a first text in the first image.

A processing unit 32, configured to determine or generate a second text corresponding to the first text, and process the first image to obtain a second image.

An overlaying unit 33, configured to overlay the second image on the first image.

A display unit 34, configured to display the second text on the second image.

The apparatus of this embodiment may execute the technical solution of the above method; the specific implementation process and the technical principle are the same and are not described herein again.
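How the four units cooperate can be sketched as a simple composition. This is an illustrative assumption rather than the structure of the apparatus in fig. 8: each unit is modeled as a plain callable, and the class name `ImageProcessingApparatus`, its field names and the `run` method are hypothetical.

```python
from dataclasses import dataclass
from typing import Any, Callable, Tuple


@dataclass
class ImageProcessingApparatus:
    # Unit 31: first image -> first text
    acquiring_unit: Callable[[Any], Any]
    # Unit 32: (first image, first text) -> (second text, second image)
    processing_unit: Callable[[Any, Any], Tuple[Any, Any]]
    # Unit 33: (first image, second image) -> covered image
    overlaying_unit: Callable[[Any, Any], Any]
    # Unit 34: (covered image, second text) -> translation result image
    display_unit: Callable[[Any, Any], Any]

    def run(self, first_image: Any) -> Any:
        """Chain the units in the order used by the method embodiments."""
        first_text = self.acquiring_unit(first_image)
        second_text, second_image = self.processing_unit(first_image, first_text)
        covered = self.overlaying_unit(first_image, second_image)
        return self.display_unit(covered, second_text)
```

Modeling the units as interchangeable callables reflects the statement that units may be merged, divided or replaced according to actual needs.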

Fourth embodiment

Fig. 9 is a schematic structural diagram of another image processing apparatus according to an embodiment of the present application. Based on the embodiment shown in fig. 8, as shown in fig. 9, the processing unit 32 includes:

a translation module 321, configured to perform character recognition processing and translation processing on the first image by using an optical character recognition technology; and/or,

a cutting and erasing module 322, configured to perform cutting and erasing processing on the first image to obtain a second image.

In one example, the translation module 321 includes:

The character recognition submodule 3211 is configured to perform character recognition processing on the first image by using an optical character recognition technology to obtain a first text and positioning information of the first text.

The translation submodule 3212 is configured to perform translation processing on the first text to obtain a second text.

In one example, the cut-erase module 322 includes:

The cropping sub-module 3221 is configured to crop the first text in the first image according to the positioning information of the first text, so as to obtain a preliminary image.

The erasing sub-module 3222 is configured to perform erasing processing on the text in the preliminary image to obtain a second image.

In one example, the positioning information of the first text includes paragraph information of the first text, and/or text line information within the paragraph information.

In one example, the cropping sub-module 3221 includes:

The first cropping sub-module 32211 is configured to crop the first text in the first image according to paragraph coordinate information of four corners of the area where each paragraph in the first text is located, so as to obtain an image related to the paragraph.

The second cropping sub-module 32212 is configured to crop each text line in the image related to the paragraph according to the text line coordinate information of the four corners of the area where each text line is located in each paragraph, so as to obtain a preliminary image.

In one example, the cropping sub-module 3221 includes:

The third cropping sub-module 32213 is configured to crop the first text according to paragraph coordinate information of four corners of the area where each paragraph in the first text is located, so as to obtain a preliminary image.

In one example, the overlaying unit 33 is specifically configured to:

overlay the second image on the first image according to the positioning information of the first text.

The embodiment of the present application further provides an intelligent terminal, where the intelligent terminal includes a memory and a processor, and the memory stores an image processing program, and the image processing program is executed by the processor to implement the steps of the image processing method in any of the above embodiments.

The embodiment of the present application further provides a computer-readable storage medium, where an image processing program is stored on the storage medium, and when the image processing program is executed by a processor, the image processing program implements the steps of the image processing method in any of the above embodiments.

The embodiments of the intelligent terminal and the computer-readable storage medium provided in the present application may include all technical features of any one of the embodiments of the image processing method. Their expanded description and explanation are basically the same as those of the method embodiments and are not described herein again.

Embodiments of the present application also provide a computer program product, which includes computer program code, when the computer program code runs on a computer, the computer is caused to execute the method in the above various possible embodiments.

Embodiments of the present application further provide a chip, which includes a memory and a processor, where the memory is used to store a computer program, and the processor is used to call and run the computer program from the memory, so that a device in which the chip is installed executes the method in the above various possible embodiments.

It is to be understood that the foregoing scenarios are only examples, and do not constitute a limitation on application scenarios of the technical solutions provided in the embodiments of the present application, and the technical solutions of the present application may also be applied to other scenarios. For example, as can be known by those skilled in the art, with the evolution of system architecture and the emergence of new service scenarios, the technical solution provided in the embodiments of the present application is also applicable to similar technical problems.

The above-mentioned serial numbers of the embodiments of the present application are merely for description and do not represent the merits of the embodiments.

The steps in the method of the embodiment of the application can be sequentially adjusted, combined and deleted according to actual needs.

The units in the device in the embodiment of the application can be merged, divided and deleted according to actual needs.

In the present application, the same or similar terms, concepts, technical solutions, and/or application scenarios are generally described in detail only at their first occurrence. For brevity, the detailed description is generally not repeated later; for any such item that is not described in detail later, reference may be made to the earlier detailed description when understanding the technical solutions of the present application.

In the present application, each embodiment is described with emphasis, and reference may be made to the description of other embodiments for parts that are not described or illustrated in any embodiment.

The technical features of the technical solutions of the present application may be arbitrarily combined. For brevity of description, not all possible combinations of the technical features in the embodiments are described; however, as long as a combination of the technical features involves no contradiction, it should be considered as falling within the scope described in the present application.

Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present application may be embodied in the form of a software product, which is stored in a storage medium (e.g., ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal device (e.g., a mobile phone, a computer, a server, a controlled terminal, or a network device) to execute the method of each embodiment of the present application.

The above embodiments may be implemented wholly or partially by software, hardware, firmware, or any combination thereof. When implemented in software, they may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the procedures or functions according to the embodiments of the present application are generated in whole or in part. The computer may be a general purpose computer, a special purpose computer, a network of computers, or another programmable device. The computer instructions may be stored on a computer-readable storage medium or transmitted from one computer-readable storage medium to another; for example, the computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center by wire (e.g., coaxial cable, optical fiber, digital subscriber line) or wirelessly (e.g., infrared, radio, microwave). The computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device, such as a server or a data center, that integrates one or more available media. The available medium may be a magnetic medium (e.g., floppy disk, storage disk, magnetic tape), an optical medium (e.g., DVD), or a semiconductor medium (e.g., Solid State Disk (SSD)), among others.

The above description is only a preferred embodiment of the present application, and not intended to limit the scope of the present application, and all modifications of equivalent structures and equivalent processes, which are made by the contents of the specification and the drawings of the present application, or which are directly or indirectly applied to other related technical fields, are included in the scope of the present application.

Claims

1. An image processing method, comprising:

S11, acquiring a first text in the first image;

2. The method according to claim 1, wherein step S12 includes:

carrying out character recognition processing and translation processing on the first image by utilizing an optical character recognition technology; and/or,

and performing cutting and erasing processing on the first image to obtain a second image.

3. The method of claim 2, wherein performing character recognition processing and translation processing on the first image by using an optical character recognition technology to determine or generate a second text corresponding to the first text comprises:

and translating the first text to obtain the second text.

4. The method of claim 2, wherein performing a cut-and-erase process on the first image to obtain a second image comprises:

and erasing the text in the preliminary image to obtain a second image.

5. The method according to claim 4, wherein the positioning information of the first text comprises paragraph information of the first text, and/or text line information in the paragraph information.

6. The method according to claim 5, wherein the cutting the first text in the first image according to the positioning information of the first text to obtain a preliminary image comprises:

7. The method according to claim 5, wherein the cutting the first text in the first image according to the positioning information of the first text to obtain a preliminary image comprises:

8. The method of any of claims 2 to 7, wherein overlaying the second image on the first image comprises:

9. An intelligent terminal, characterized in that the intelligent terminal includes a memory and a processor, wherein the memory has stored thereon an image processing program which, when executed by the processor, implements the steps of the image processing method according to any one of claims 1 to 8.

10. A computer-readable storage medium, characterized in that the storage medium has stored thereon a computer program which, when being executed by a processor, carries out the steps of the image processing method according to any one of claims 1 to 8.