CN113010718A

CN113010718A - Photographing question searching method, device, equipment and storage medium

Info

Publication number: CN113010718A
Application number: CN202110200264.1A
Authority: CN
Inventors: 徐荣荣; 栾舒涵; 苏丽荣
Original assignee: Beijing Baidu Netcom Science and Technology Co Ltd
Current assignee: Beijing Baidu Netcom Science and Technology Co Ltd
Priority date: 2021-02-23
Filing date: 2021-02-23
Publication date: 2021-06-22

Abstract

The application discloses a photographing question searching method, device, equipment and storage medium, and relates to the field of artificial intelligence such as natural language processing and computer vision. One embodiment of the method comprises: shooting and uploading a multi-question image, wherein the multi-question image comprises a plurality of exercises; receiving coordinates of a plurality of exercises obtained by performing multi-question detection on the multi-question image; performing frame selection on the multi-question image based on the coordinates of the multiple exercises to obtain selection frames of the multiple exercises; uploading a selection frame of at least one of the plurality of exercises; and receiving a question searching result obtained by searching questions based on at least one question. The embodiment supports multi-question identification and automatic question framing, simplifies the operation of users when photographing and searching the questions, optimizes the interactive experience of photographing and searching the questions, and makes photographing and searching the questions more convenient.

Description

Photographing question searching method, device, equipment and storage medium

Technical Field

The embodiment of the application relates to the field of computers, in particular to the field of artificial intelligence such as natural language processing and computer vision, and particularly relates to a photographing and question searching method, device, equipment and storage medium.

Background

At present, in order to solve the problem of difficult work of students, various applications for photographing and searching questions are diversified. The existing shooting and question searching applications enable a user to shoot a single question, submit a picture with the single question for searching, and finally return a question searching result. The shooting and problem searching function provided by the shooting and problem searching application can only shoot and search one problem at a time. If the user wants to continue searching for other problems after searching for one problem, the user needs to shoot again and search for the problem again.

Disclosure of Invention

The embodiment of the application provides a photographing question searching method, device and equipment and a storage medium.

In a first aspect, an embodiment of the present application provides a method for searching for a question by taking a picture, including: shooting and uploading a multi-question image, wherein the multi-question image comprises a plurality of exercises; receiving coordinates of a plurality of exercises obtained by performing multi-question detection on the multi-question image; performing frame selection on the multi-question image based on the coordinates of the multiple exercises to obtain selection frames of the multiple exercises; uploading a selection frame of at least one of the plurality of exercises; and receiving a question searching result obtained by searching questions based on at least one question.

In a second aspect, an embodiment of the present application provides a device for searching for a question by taking a picture, including: a first uploading module configured to capture and upload a multi-topic image, wherein the multi-topic image comprises a plurality of problems; a first receiving module configured to receive coordinates of a plurality of problems obtained by multi-problem detection of a multi-problem image; the frame selection module is configured to perform frame selection on the multi-question image based on the coordinates of the multiple questions to obtain selection frames of the multiple questions; a second upload module configured to upload a selection frame of at least one of the plurality of exercises; and the second receiving module is configured to receive a question searching result obtained by searching questions based on at least one question.

In a third aspect, an embodiment of the present application provides an electronic device, including: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method as described in any one of the implementations of the first aspect.

In a fourth aspect, embodiments of the present application propose a non-transitory computer-readable storage medium storing computer instructions for causing a computer to perform the method as described in any one of the implementations of the first aspect.

In a fifth aspect, the present application provides a computer program product, which includes a computer program that, when executed by a processor, implements the method as described in any implementation manner of the first aspect.

The method, the device, the equipment and the storage medium for shooting and searching the problems, provided by the embodiment of the application, are characterized in that firstly, a multi-problem image comprising a plurality of problems is shot and uploaded, and coordinates of the plurality of problems obtained by carrying out multi-problem detection on the multi-problem image are received; then, frame selection is carried out on the multi-question image based on the coordinates of the multiple exercises to obtain selection frames of the multiple exercises; and finally, uploading a selection frame of at least one of the plurality of problems, and receiving a problem searching result obtained by searching problems based on the at least one problem. The shooting and searching questions support multi-question identification and automatic frame question, repeated shooting by a user is not needed, manual frame question of the user is not needed, operation of the user during shooting and searching the questions is simplified, interactive experience of shooting and searching the questions is optimized, and the shooting and searching questions are more convenient. The shooting and question searching can be applied to the intelligent learning tablet, and the core competitiveness of the intelligent learning tablet is improved.

It should be understood that the statements in this section do not necessarily identify key or critical features of the embodiments of the present disclosure, nor do they limit the scope of the present disclosure. Other features of the present disclosure will become apparent from the following description.

Drawings

Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings. The drawings are included to provide a better understanding of the present solution and are not intended to limit the present application. Wherein:

FIG. 1 is a flow diagram of one embodiment of a photo topic searching method according to the present application;

FIG. 2A is a schematic view of a question sheet;

FIG. 2B is a schematic diagram of scanning an animated float;

FIG. 2C is a schematic view of a boxed coordinate display page;

FIG. 2D is a schematic diagram of a results page;

FIG. 3 is a photograph question-searching interaction flow diagram;

FIG. 4 is a flow chart of yet another embodiment of a photo topic searching method according to the present application;

FIG. 5 is a flow chart of another embodiment of a photo topic searching method according to the present application;

fig. 6A is a schematic diagram of first guidance information;

fig. 6B is a diagram of second guidance information;

fig. 6C is a diagram of third guidance information;

fig. 6D is a diagram of fourth guidance information;

fig. 6E is a diagram of fifth guidance information;

FIG. 7 is a schematic diagram illustrating an embodiment of a photographing topic searching apparatus according to the present application;

fig. 8 is a block diagram of an electronic device for implementing a photo topic searching method according to an embodiment of the present application.

Detailed Description

The following description of the exemplary embodiments of the present application, taken in conjunction with the accompanying drawings, includes various details of the embodiments of the application for the understanding of the same, which are to be considered exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.

It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.

FIG. 1 shows a flow 100 of one embodiment of a photo topic search method according to the present application. The method for searching questions by photographing comprises the following steps:

step 101, shooting and uploading multiple images.

In this embodiment, the client may capture multiple images and upload the images to the server. Wherein, the multi-topic image can comprise a plurality of exercises.

Typically, a photo-taking question-searching application is installed on the client. When the user opens the photo title search application, a title page can be displayed. The photo-topic search application can support not only single-topic image shooting but also multi-topic image shooting. The user can switch the shooting mode on the shooting page of the shooting question searching application. FIG. 2A shows a schematic view of a question page. As shown in fig. 2A, the user can switch to the shooting question mode by clicking the "shooting question" on the shooting page. The user clicks the 'multi-question shooting' on the shooting page, and then the mode can be switched to the multi-question shooting mode. Meanwhile, the method prompts that a plurality of questions can be photographed at one time and the characters are aligned with the reference line. The user can place the problem document in an appropriate position so that the problem that the user desires to photograph is presented in the viewing frame of the photographic page. The user clicks a shooting button on the shooting page, and the problem image can be shot. During the process of shooting the problem image, the view-finding frame of the shooting page scans the floating layer of the animation. FIG. 2B shows a schematic diagram of scanning an animated float. After the scanning is finished, the exercise image can be obtained. If only one problem is presented in the view-finding frame, the user can click the shooting button after switching to the single problem shooting mode, and the problem image shot at the moment is the single problem image. If a plurality of exercises are presented in the view-finding frame, the user can click the shooting button after switching to the multi-exercise shooting mode, and the exercise image shot at the moment is the multi-exercise image.

In practical applications, the exercise document may include, in addition to the exercise, other contents such as exercise type, question answering description, and exercise explanation. In some embodiments, the framing box presents content in addition to the problem. In this case, the captured problem image includes other contents in addition to the problem. For example, a multi-issue image may include an answer specification and a plurality of exercises. As another example, a multi-problem image may include a plurality of problems and problem interpretations of at least some of the problems.

In addition, the shooting page of the shooting question searching application can also display shooting guide information. The hardware gyroscope combined with the tablet guides and prompts according to various use scenes of a user. Under the condition that the protective sleeve of the flat plate has a plurality of placing angles, the photographing effect can be ensured to be optimal.

Step 102, receiving coordinates of multiple problems obtained by performing multiple problem detection on multiple problem images.

In this embodiment, the server may perform multi-question detection on the multi-question image, obtain coordinates of each question in the multi-question image, and send the coordinates to the client. Thus, the client receives the coordinates of each problem in the multi-problem image. Wherein the coordinates of the problem may determine the frame of the problem. The frame of the problem is the Bounding box (Bounding box) of the problem. For example, the coordinates of the problem may include the coordinates of at least three vertices of a frame of the problem.

Generally, when detecting a multi-topic image, the server identifies whether the content in the multi-topic image is a problem. If the exercise is a question, the server detects the coordinates of the exercise. If the content is not a problem, the server may end the detection of the content, or may continue to process the content. The processing method may include, but is not limited to, coordinate detection, content deletion, content annotation, content refinement, and the like. For example, if the server recognizes the answer instruction in the multi-question image, the detection of the answer instruction may be ended, and the coordinates of the answer instruction may not be output. For example, if the server recognizes the problem explanation in the multi-problem image, the server can not only detect the coordinates of the problem explanation, but also recognize the content of the problem explanation, thereby further improving the content of the problem explanation.

And 103, selecting frames of the multi-question images based on the coordinates of the multiple questions to obtain selected frames of the multiple questions.

In this embodiment, the client may perform frame selection on the multi-question image based on the coordinates of each question to obtain a frame selection of each question. Wherein, the rectangle with the coordinate of the problem as the vertex is the frame of the problem.

In some optional implementations of this embodiment, to facilitate the user to search for problems, the selection boxes of each problem are labeled with serial numbers. Fig. 2C shows a schematic diagram of a boxed coordinate presentation page. As shown in FIG. 2C, the frame selection coordinate display page has three selection frames of problems, and the sequence numbers are (i), (ii) and (iii) in sequence.

And 104, uploading a selection frame of at least one of the plurality of exercises.

In this embodiment, the client may upload a selection frame of at least one of the plurality of exercises. Specifically, the client can divide a frame of at least one exercise from the multi-question image and upload the frame to the server.

In some optional implementations of the present embodiment, the client may upload a selection frame of all the problems in the multi-problem image. And (4) strengthening the shooting button, searching the questions by clicking the shooting button, and obtaining the question searching results of all the questions without the selection of a user.

In some optional implementation manners of this embodiment, the user may select a frame of a question from frames of multiple questions, and the client may divide the selected frame from the image of multiple questions and upload the divided frame to the server. And searching for the questions based on the user selection, thereby realizing targeted searching for the questions. Wherein the user may select the selection box by clicking on the selection box of the problem. In addition, the preset area of the shooting page may display information prompting the user to select a checkbox for a question, and the content thereof may be, for example, "find a question below, select a question that you want to search". The information is usually displayed in the upper right corner of the multi-topic image and does not change with the click change of the button click in the page. After the user clicks on a frame of a problem, the loading state is entered.

In some optional implementations of the present embodiment, the checkbox supports manual fine-tuning in order to ensure that the complete problem content is included within the checkbox. Specifically, the user selects a selection box for any one of the problems, which selection box may be highlighted and on which adjustment buttons may be presented. The user operates the adjusting button to correspondingly adjust the selected selection frame. Wherein the adjusting button can be dragged and stretched. When the user drags the adjustment button, the position of the selected box can be adjusted according to the dragging operation of the user. When the user stretches the adjusting button, the size of the selected selection frame can be adjusted according to the stretching operation of the user.

Step 105, receiving a question searching result obtained by searching questions based on at least one question.

In this embodiment, the server may perform a question search based on the content in the received frame, obtain a question search result, and send the question search result to the client. Thus, the client receives the result of the search question.

In general, the server may identify the text in the received box. For example, the box is recognized by using an OCR (Optical Character Recognition) technique, so as to obtain the text content in the box. Then, the server can search in the question bank based on the text content in the selection frame, and a question searching result can be obtained. FIG. 2D shows a schematic diagram of a results page. As shown in FIG. 2D, the results page displays four answers to the problem: answer one, answer two, answer three and answer four. Each answer comprises a question, an examination point, a resolution, an answer and a knowledge point explanation.

The method for searching for the problems by photographing comprises the steps of firstly photographing and uploading a multi-problem image comprising a plurality of problems, and receiving coordinates of the plurality of problems obtained by carrying out multi-problem detection on the multi-problem image; then, frame selection is carried out on the multi-question image based on the coordinates of the multiple exercises to obtain selection frames of the multiple exercises; and finally, uploading a selection frame of at least one of the plurality of problems, and receiving a problem searching result obtained by searching problems based on the at least one problem. The shooting and searching questions support multi-question identification and automatic frame question, repeated shooting by a user is not needed, manual frame question of the user is not needed, operation of the user during shooting and searching the questions is simplified, interactive experience of shooting and searching the questions is optimized, and the shooting and searching questions are more convenient. The shooting and question searching can be applied to the intelligent learning tablet, and the core competitiveness of the intelligent learning tablet is improved.

With continued reference to FIG. 3, a photo title interaction flow diagram is shown. As shown in fig. 3, the user clicks an icon of the photo title application on the client, the client sends a click request to aries, and the aries issues launchAPPPlayload to the client. And clicking a shooting button by a user to shoot the multi-topic pictures, and uploading the multi-topic pictures to the aries by the terminal. The aries requests the multi-topic interface from the multi-mode, the multi-mode returns multi-topic coordinates to the aries, and the aries returns multi-topic coordinates to the terminal. And the terminal selects the frames of the multi-question pictures according to the coordinates to obtain the selection frames of the multiple exercises. And clicking a selection frame of a problem by the user, and capturing a picture of the selection frame of the problem according to the coordinate of the problem. And uploading the frame selection picture to aries by the terminal, and uploading the frame selection picture to the multimode by the aries. And issuing corresponding search question results to the aries in a multi-mode. aries issues a result to the Swan request. Swan issues the results of the search questions to the end.

With further reference to FIG. 4, a flow 400 of yet another embodiment of a photo topic search method in accordance with the present application is illustrated. The method for searching questions by photographing comprises the following steps:

step 401, in response to the detection of the front shooting scene, displaying guidance information for installing the image acquisition device and placing the flat panel.

In the present embodiment, after entering the shooting page, the client can detect the shooting scene. In response to detecting the front shooting scene, the client may display guidance information for installing the image capture device and placing the tablet. Wherein, the scene of opening leading camera is leading scene of shooing. The image acquisition device can be a front camera, also called an intelligent eye. The guiding information for installing the image acquisition device and placing the flat plate can be pictures and characters. The user can install image acquisition device according to the guide information of installing image acquisition device and putting the flat board to place the inboard constant head tank department of protective sheath (for example first draw-in groove department of protective sheath) with the flat board. Fig. 6A shows a schematic diagram of the first guidance information. As shown in fig. 6A, the first guide information guides the user to install the smart eye and to stand up the tablet.

And 402, responding to the condition that the image acquisition device is not installed, and displaying guide information for installing the image acquisition device.

In this embodiment, if the image capturing device is not installed, the client may display the guidance information for installing the image capturing device no matter whether the angle of the flat panel is correct. The guiding information for installing the image acquisition device can be pictures and characters. The user can install the image capturing device according to the guiding information for installing the image capturing device.

And step 403, in response to that the image acquisition device is installed but the angle of the flat plate is incorrect, displaying guide information for placing the flat plate.

In this embodiment, if the image capturing device is installed but the angle of the flat panel is incorrect, the client may display guidance information for placing the flat panel. The guiding information of the placing plate can be pictures and characters. The user can place the inboard constant head tank department of protective sheath with the flat board according to the guide information who puts the flat board. Fig. 6C shows a schematic diagram of the third guidance information. As shown in fig. 6C, the third guidance message guides the user to put the smart eye and hold the tablet at the first card slot.

And step 404, displaying guide information for placing the exercise document in response to the fact that the image acquisition device is installed and the angle of the flat plate is correct.

In this embodiment, if the image capturing device is installed and the angle of the flat panel is correct, the client can display the guiding information for placing the exercise document. The guide information for placing the exercise document can be pictures and characters. The user can place the exercise document in the trapezoidal region covering picture according to the guide information for placing the exercise document, and the characters are aligned with the reference lines. Typically, the problem document is placed under the plate, keeping the problem centered and aligned with the bottom of the plate, i.e., the problem document is placed in the trapezoidal area mask image. Fig. 6B shows a schematic diagram of the second guidance information. If shown in FIG. 6B, the second guidance information guides the user to place the problem document in the trapezoidal area and to keep the horizontal placement.

In some alternative implementations of the present embodiment, the guidance information is typically displayed when the photo title function is first turned on, typically in the form of a GIF animation. In addition, in order to avoid the long-time display from obstructing the user's sight, the guide information may be automatically faded out after displaying for a preset time period (e.g., 3 seconds).

Step 405, shooting and uploading multiple images.

Step 406, receiving coordinates of multiple problems obtained by performing multiple problem detection on the multiple problem image.

Step 407, selecting frames of the multi-question images based on the coordinates of the multiple questions to obtain selected frames of the multiple questions.

Step 408, uploading a selection frame of at least one of the plurality of exercises.

Step 409, receiving a question searching result obtained by searching questions based on at least one question.

In the present embodiment, the specific operations of steps 405-409 have been described in detail in step 101-105 in the embodiment shown in fig. 1, and are not described herein again.

As can be seen from fig. 4, compared with the embodiment corresponding to fig. 1, the method for searching for questions by taking a picture in the present embodiment adds a step of displaying the guide information of the front shooting scene. Therefore, the scheme described in the embodiment is combined with the hardware gyroscope of the tablet to guide and prompt the user for the front shooting scene. Under the condition that the protective sleeve of the tablet has a plurality of placing angles, the tablet is used by a user and the posture of the tablet is guided to be strengthened in the page of the exercise. And guiding the user to correctly place the flat plate and righting the title, thereby ensuring that the photographing effect is optimal.

With further reference to FIG. 5, a flow 500 of another embodiment of a photo topic search method in accordance with the present application is illustrated. The method for searching questions by photographing comprises the following steps:

step 501, in response to detecting the rear shooting scene, displaying guidance information for placing the tablet and the problem document.

In the present embodiment, after entering the shooting page, the client can detect the shooting scene. In response to detecting the rear-facing shooting scene, the client may display guidance information for placing the tablet with the problem document. Wherein, the scene of opening the rear camera is the rear shooting scene. The guidance information for displaying the placing plate and the exercise document can be pictures and characters. The user can put the dull and stereotyped and exercise document parallel placement with the dull and stereotyped and exercise document according to showing the guide information who puts dull and stereotyped and exercise document, places the exercise document in well word check sheet picture, parallel exercise document paper, the reference line is aligned to the characters. Fig. 6D shows a schematic diagram of fourth guidance information. As shown in fig. 6D, the fourth guide information guides the user to place the tablet in parallel with the problem document. Fig. 6E shows a schematic diagram of fifth guidance information. As shown in fig. 6E, the fifth guide information guides the user to place the problem document in front of the tablet.

Step 502, shooting and uploading multiple images.

Step 503, receiving the coordinates of the plurality of exercises obtained by performing the multi-question detection on the multi-question image.

Step 504, selecting the multi-question image based on the coordinates of the multiple questions to obtain a selected frame of the multiple questions.

Step 505, uploading a selection frame of at least one of the plurality of exercises.

Step 506, receiving a question searching result obtained by searching questions based on at least one question.

In the present embodiment, the specific operations of steps 502-506 have been described in detail in steps 101-105 in the embodiment shown in fig. 1, and are not described herein again.

As can be seen from fig. 5, compared with the embodiment corresponding to fig. 1, the method for searching for questions by taking a picture in the present embodiment adds a step of displaying the guide information of the post shooting scene. Therefore, the scheme described in the embodiment is combined with the hardware gyroscope of the tablet to guide and prompt the user for the rear shooting scene. And (5) strengthening guidance for the user by using a tablet and a subject shooting posture in a subject shooting page. And guiding the user to correctly place the flat plate and righting the title, thereby ensuring that the photographing effect is optimal.

With further reference to fig. 7, as an implementation of the method shown in the above-mentioned figures, the present application provides an embodiment of a photographing topic searching apparatus, where the embodiment of the apparatus corresponds to the embodiment of the method shown in fig. 1, and the apparatus can be applied to various electronic devices.

As shown in fig. 7, the photographing question searching apparatus 700 of the present embodiment may include: a first uploading module 701, a first receiving module 702, a frame selection module 703, a second uploading module 704 and a second receiving module 705. The first uploading module 701 is configured to capture and upload a multi-question image, wherein the multi-question image includes a plurality of questions; a first receiving module 702 configured to receive coordinates of a plurality of problems obtained by multi-problem detection on a multi-problem image; a framing module 703 configured to frame the multi-question image based on the coordinates of the multiple questions to obtain frames for the multiple questions; a second upload module 704 configured to upload a selection frame of at least one of the plurality of problems; the second receiving module 705 is configured to receive a question searching result obtained by searching questions based on at least one question.

In the present embodiment, in the photographing question searching apparatus 700: the specific processing of the first uploading module 701, the first receiving module 702, the frame selection module 703, the second uploading module 704 and the second receiving module 705 and the technical effects thereof can refer to the related description of step 101 and step 105 in the corresponding embodiment of fig. 1, which is not repeated herein.

In some optional implementations of this embodiment, the photographing question searching apparatus 700 further includes: a tagging module configured to tag a sequence number for a checkbox of a plurality of problems.

In some optional implementations of this embodiment, the second uploading module 704 includes: an upload sub-module configured to upload the selected frame in response to selecting the frame from the frames of the plurality of problems.

In some optional implementations of this embodiment, the second uploading module 704 further includes: a presentation sub-module configured to present the adjustment button on the selected selection box; an adjustment sub-module configured to adjust the selected frame in response to detecting operation of the adjustment button.

In some optional implementations of this embodiment, the photographing question searching apparatus 700 further includes: and the display module is configured to display the photographing guide information on the photographing page.

In some optional implementations of this embodiment, the display module includes: and the first display sub-module is configured to display guiding information for installing the image acquisition device and placing the flat panel in response to the detection of the front shooting scene.

In some optional implementations of this embodiment, the display module further includes: a second display sub-module configured to display guidance information for mounting the image capture device in response to the image capture device not being mounted; a third display sub-module configured to display guidance information for placing the flat panel in response to the image capture device being installed but the angle of the flat panel being incorrect; and the fourth display sub-module is configured to display guide information for placing the problem document in response to the image acquisition device being installed and the angle of the flat plate being correct.

In some optional implementations of this embodiment, the display module includes: and the fifth display sub-module is configured to display guide information for placing the tablet and the problem document in response to the detection of the rear shooting scene.

In some optional implementation manners of this embodiment, the guidance information is displayed when the photographing question searching function is started for the first time, and automatically fades out after displaying the preset time length.

There is also provided, in accordance with an embodiment of the present application, an electronic device, a readable storage medium, and a computer program product.

FIG. 8 illustrates a schematic block diagram of an example electronic device 800 that can be used to implement embodiments of the present disclosure. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital processing, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be examples only, and are not meant to limit implementations of the disclosure described and/or claimed herein.

As shown in fig. 8, the apparatus 800 includes a computing unit 801 that can perform various appropriate actions and processes according to a computer program stored in a Read Only Memory (ROM)802 or a computer program loaded from a storage unit 808 into a Random Access Memory (RAM) 803. In the RAM 803, various programs and data required for the operation of the device 800 can also be stored. The calculation unit 801, the ROM 802, and the RAM 803 are connected to each other by a bus 804. An input/output (I/O) interface 805 is also connected to bus 804.

A number of components in the device 800 are connected to the I/O interface 805, including: an input unit 806, such as a keyboard, a mouse, or the like; an output unit 807 such as various types of displays, speakers, and the like; a storage unit 808, such as a magnetic disk, optical disk, or the like; and a communication unit 809 such as a network card, modem, wireless communication transceiver, etc. The communication unit 809 allows the device 800 to exchange information/data with other devices via a computer network such as the internet and/or various telecommunication networks.

Computing unit 801 may be a variety of general and/or special purpose processing components with processing and computing capabilities. Some examples of the computing unit 801 include, but are not limited to, a Central Processing Unit (CPU), a Graphics Processing Unit (GPU), various dedicated Artificial Intelligence (AI) computing chips, various computing units running machine learning model algorithms, a Digital Signal Processor (DSP), and any suitable processor, controller, microcontroller, and the like. The calculation unit 801 executes the respective methods and processes described above, such as a photo title search method. For example, in some embodiments, the photo title method may be implemented as a computer software program tangibly embodied in a machine-readable medium, such as storage unit 808. In some embodiments, part or all of the computer program can be loaded and/or installed onto device 800 via ROM 802 and/or communications unit 809. When loaded into RAM 803 and executed by computing unit 801, a computer program may perform one or more of the steps of the photo title method described above. Alternatively, in other embodiments, the computing unit 801 may be configured to perform the photo title method in any other suitable manner (e.g., by way of firmware).

Various implementations of the systems and techniques described here above may be implemented in digital electronic circuitry, integrated circuitry, Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), system on a chip (SOCs), load programmable logic devices (CPLDs), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be special or general purpose, receiving data and instructions from, and transmitting data and instructions to, a storage system, at least one input device, and at least one output device.

Program code for implementing the methods of the present disclosure may be written in any combination of one or more programming languages. These program codes may be provided to a processor or controller of a general purpose computer, special purpose computer, or other programmable data processing apparatus, such that the program codes, when executed by the processor or controller, cause the functions/operations specified in the flowchart and/or block diagram to be performed. The program code may execute entirely on the machine, partly on the machine, as a stand-alone software package partly on the machine and partly on a remote machine or entirely on the remote machine or server.

In the context of this disclosure, a machine-readable medium may be a tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. A machine-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any suitable combination of the foregoing. More specific examples of a machine-readable storage medium would include an electrical connection based on one or more wires, a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.

To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic, speech, or tactile input.

The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), Wide Area Networks (WANs), and the Internet.

The computer system may include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.

It should be understood that various forms of the flows shown above may be used, with steps reordered, added, or deleted. For example, the steps described in the present disclosure may be executed in parallel, sequentially, or in different orders, as long as the desired results of the technical solutions disclosed in the present disclosure can be achieved, and the present disclosure is not limited herein.

The above detailed description should not be construed as limiting the scope of the disclosure. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made in accordance with design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present disclosure should be included in the scope of protection of the present disclosure.

Claims

1. A method for searching questions by photographing comprises the following steps:

shooting and uploading a multi-question image, wherein the multi-question image comprises a plurality of exercises;

receiving coordinates of the exercises obtained by multi-question detection on the multi-question image;

selecting the multi-question image based on the coordinates of the multiple exercises to obtain selected frames of the multiple exercises;

uploading a selection frame of at least one of the plurality of exercises;

and receiving a question searching result obtained by searching questions based on the at least one question.

2. The method of claim 1, wherein after the framing the multi-topic image based on the coordinates of the plurality of topics, framing the plurality of topics, further comprising:

and marking serial numbers for the selection frames of the plurality of exercises.

3. The method of claim 1 or 2, wherein uploading a checkbox of at least one of the plurality of problems comprises:

in response to selecting a frame from the frames of the plurality of problems, uploading the selected frame.

4. The method of claim 3, wherein prior to the uploading the selected checkbox, further comprising:

presenting an adjustment button on the selected selection box;

adjusting the selected selection box in response to detecting operation of the adjustment button.

5. The method of claim 1, wherein prior to said capturing and uploading of the multi-topic image, further comprising:

and displaying the photographing guide information on the photographing page.

6. The method of claim 5, wherein the displaying the photograph guidance information on the photograph page comprises:

and displaying guide information for installing the image acquisition device and placing the flat plate in response to the detection of the front shooting scene.

7. The method of claim 6, wherein the displaying the photograph guidance information on the photograph page further comprises:

in response to the image acquisition device not being installed, displaying guidance information for installing the image acquisition device;

displaying guide information of the placing flat plate in response to the fact that the image acquisition device is installed but the angle of the flat plate is incorrect;

and displaying guide information for placing the exercise document in response to the fact that the image acquisition device is installed and the angle of the flat plate is correct.

8. The method of claim 5, wherein the displaying the photograph guidance information on the photograph page comprises:

and displaying guide information for placing the flat plate and the exercise document in response to the detection of the rear shooting scene.

9. The method according to one of claims 5 to 8, wherein the guidance information is displayed when the photo title function is turned on for the first time and fades out automatically after displaying a preset time period.

10. A photographing question searching device comprises:

a first upload module configured to capture and upload a multi-topic image, wherein the multi-topic image comprises a plurality of problems;

a first receiving module configured to receive coordinates of the plurality of problems obtained by multi-problem detection on the multi-problem image;

a framing module configured to frame the multi-question image based on the coordinates of the plurality of questions to obtain frames of the plurality of questions;

a second upload module configured to upload a selection frame of at least one of the plurality of exercises;

and the second receiving module is configured to receive a question searching result obtained by searching questions based on the at least one question.

11. The apparatus of claim 10, wherein the apparatus further comprises:

a labeling module configured to label the checkboxes of the plurality of problems with serial numbers.

12. The apparatus of claim 10 or 11, wherein the second upload module comprises:

an upload sub-module configured to upload a selected frame in response to selecting a frame from the frames of the plurality of problems.

13. The apparatus of claim 12, wherein the second upload module further comprises:

a presentation sub-module configured to present an adjustment button on the selected selection box;

an adjustment sub-module configured to adjust the selected frame in response to detecting operation of the adjustment button.

14. The apparatus of claim 10, wherein the apparatus further comprises:

and the display module is configured to display the photographing guide information on the photographing page.

15. The apparatus of claim 14, wherein the display module comprises:

and the first display sub-module is configured to display guiding information for installing the image acquisition device and placing the flat panel in response to the detection of the front shooting scene.

16. The apparatus of claim 15, wherein the display module further comprises:

a second display sub-module configured to display guidance information for mounting the image capture device in response to the image capture device not being mounted;

a third display sub-module configured to display guidance information for placing the flat panel in response to the image capture device being installed but the angle of the flat panel being incorrect;

and the fourth display sub-module is configured to display guide information for placing the problem document in response to the image acquisition device being installed and the angle of the flat plate being correct.

17. The apparatus of claim 14, wherein the display module comprises:

and the fifth display sub-module is configured to display guide information for placing the tablet and the problem document in response to the detection of the rear shooting scene.

18. The apparatus according to one of claims 14-17, wherein the guiding message is displayed when the photo-taking question-searching function is turned on for the first time and fades out automatically after displaying a preset time period.

19. An electronic device, comprising:

at least one processor; and

a memory communicatively coupled to the at least one processor; wherein,

the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-9.

20. A non-transitory computer readable storage medium having stored thereon computer instructions for causing the computer to perform the method of any one of claims 1-9.

21. A computer program product comprising a computer program which, when executed by a processor, implements the method according to any one of claims 1-9.