WO2021039437A1 - Image processing device, portable terminal, image processing method, and program - Google Patents


Info

Publication number
WO2021039437A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
unit
image processing
drug
recognition
Prior art date
Application number
PCT/JP2020/030872
Other languages
French (fr)
Japanese (ja)
Inventor
Shinji Haneda (真司 羽田)
Original Assignee
FUJIFILM Toyama Chemical Co., Ltd. (富士フイルム富山化学株式会社)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by FUJIFILM Toyama Chemical Co., Ltd.
Priority to JP2021542745A priority Critical patent/JP7225416B2/en
Publication of WO2021039437A1 publication Critical patent/WO2021039437A1/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 18/00 Pattern recognition

Definitions

  • the present invention relates to an image processing device, a mobile terminal, an image processing method and a program, and particularly relates to a technique for recognizing a stamp or print added to a recognition object.
  • Patent Document 1 describes a drug recognition device capable of accurately recognizing the type of a drug bearing a stamp.
  • In that device, an illumination unit capable of illuminating the engraved drug from a plurality of illumination directions surrounding the drug switches the illumination direction in sequence.
  • An imaging unit photographs the drug each time the illumination direction of the illumination unit is switched.
  • A feature image extraction unit analyzes the captured image (drug image) for each illumination direction acquired by the imaging unit, and extracts, for each drug image, a feature image corresponding to the shadow of the engraving.
  • A feature image integration unit integrates the feature images for each illumination direction extracted by the feature image extraction unit to generate an integrated image.
  • A recognition unit recognizes the marking included in the integrated image generated by the feature image integration unit, and recognizes the type of the drug based on the recognition result of the marking.
  • The drug recognition device described in Patent Document 1 can thus acquire an integrated image in which the engraving is emphasized, but the device becomes large because it requires a plurality of illumination units with different illumination directions for the drug.
  • The present invention has been made in view of such circumstances, and an object thereof is to provide an image processing device, a mobile terminal, an image processing method, and a program capable of easily acquiring an image in which the marking or printing added to a recognition object is emphasized.
  • The image processing apparatus according to one aspect of the present invention comprises: a recognizer that has been machine-learned using a learning data set for a plurality of different recognition objects to which a marking or printing is added, the data set consisting of pairs of a first image in which the marking or printing of the recognition object is not emphasized and a second image in which the marking or printing is emphasized; an image input unit that inputs to the recognizer a third image, which is an image of an arbitrary recognition object to which a marking or printing is added and in which the marking or printing is not emphasized; and an image output unit that outputs the recognition result obtained from the recognizer when the third image is input to it.
  • According to this configuration, when an image of an arbitrary recognition object to which an engraving or printing is added is input to the recognizer, a recognition result indicating the marking or printing can be output.
  • the recognition result is a fourth image in which the marking or printing added to an arbitrary recognition object is emphasized.
  • Preferably, the apparatus further comprises an image generation unit that combines the third image and the fourth image to generate a fifth image in which the engraving or printing is emphasized.
  • the image output unit outputs the recognition result to the display unit and displays the recognition result on the display unit.
  • the recognition target is a drug.
  • the image output unit outputs the recognition result to the drug recognition device.
  • Preferably, the recognizer is composed of a convolutional neural network that is machine-learned using the first image of the training data set as the input image and the second image as the correct output image.
  • Preferably, the second image included in the training data set includes an image that has been subjected to enhancement processing that emphasizes the marking or printing added to the recognition object, based on a plurality of images of the recognition object captured with different illumination directions.
  • Preferably, the image input unit includes a camera unit that captures an image including an arbitrary recognition object, and an image extraction unit that extracts a region corresponding to the recognition object from the captured image captured by the camera unit, and the image extracted by the image extraction unit is input to the recognizer as the third image.
  • the invention according to still another aspect is a mobile terminal provided with the above-mentioned image processing device.
  • The image processing method according to another aspect of the present invention comprises: a step of preparing a learning data set for a plurality of different recognition objects to which a marking or printing is added, consisting of pairs of a first image in which the marking or printing of the recognition object is not emphasized and a second image in which the marking or printing is emphasized; a step of causing a recognizer to perform machine learning using the learning data set; a step of inputting to the recognizer a third image, which is an image of an arbitrary recognition object to which a marking or printing is added and in which the marking or printing is not emphasized; and a step of outputting the recognition result obtained from the recognizer when the third image is input.
  • the recognition result is a fourth image in which the marking or printing added to an arbitrary recognition object is emphasized.
  • Preferably, the method includes a step in which an image generation unit synthesizes the third image and the fourth image to generate a fifth image in which the marking or printing is emphasized.
  • the step of outputting the recognition result is to output the recognition result to the display unit and display the recognition result on the display unit.
  • the object to be recognized is a drug.
  • the recognition result is output to the drug recognition device in the step of outputting the recognition result.
  • Preferably, the second image included in the training data set includes an image that has been subjected to enhancement processing that emphasizes the marking or printing added to the recognition object, based on a plurality of images of the recognition object captured with different illumination directions.
  • the program according to still another aspect of the present invention is installed in a computer to make the computer function as the above-mentioned image processing device.
  • FIG. 1 is a system configuration diagram showing an embodiment of a drug identification system including a mobile terminal according to the present invention.
  • FIG. 2 is an external view of a smartphone constituting the drug identification system shown in FIG.
  • FIG. 3 is a block diagram showing an internal configuration of the smartphone shown in FIG.
  • FIG. 4 is a block diagram showing the electrical configuration of the drug identification system shown in FIG.
  • FIG. 5 is a block diagram showing a hardware configuration of an image processing device including a machine learning device.
  • FIG. 6 is a diagram showing an example of a learning data set stored in the database shown in FIG.
  • FIG. 7 is a functional block diagram showing the functions of the machine learning device, which is a main component of the image processing device shown in FIG. 5.
  • FIG. 8 is a flowchart showing an embodiment of the image processing method according to the present invention, and in particular, is a diagram showing processing of a learning phase in a machine learning device.
  • FIG. 9 is a flowchart showing an embodiment of the image processing method according to the present invention, and in particular, is a diagram showing processing of a drug recognition phase by a smartphone.
  • FIG. 1 is a system configuration diagram showing an embodiment of a drug identification system including a mobile terminal according to the present invention.
  • the drug identification system is composed of a smartphone 100 which is a mobile terminal with a camera and a server 200 which functions as a drug identification device.
  • The smartphone 100 and the server 200 are connected via a network 2, such as the Internet or a LAN (Local Area Network), so that data communication between them is possible.
  • the smartphone 100 has a camera unit, and the camera unit captures the drug 10 which is a recognition target.
  • The smartphone 100 includes the image processing device according to the present invention, which processes a captured image (third image) of the drug 10, and either displays the processed image (fourth image) on the display unit or transmits it to the server 200 via the network 2. Details of the image processing device will be described later.
  • The server 200 identifies the drug 10 based on the fourth image of the drug 10 uploaded from the smartphone 100, and outputs the identification result (for example, drug identification information consisting of a drug name, a product name, an abbreviation, or a combination thereof) to the smartphone 100 that transmitted the fourth image of the drug 10.
  • identification code information for identifying the type of drug is attached to the surface of the drug (tablet).
  • This identification code information is generally applied by engraving or by printing.
  • the server 200 can improve the discriminating power of the drug by using the identification code information attached to the drug.
  • the engraving on the drug means that the identification code information is formed by forming a groove, which is a depressed region, on the surface of the drug.
  • the groove is not limited to the one formed by digging the surface, and may be formed by pressing the surface. Further, the engraving may include a marking that does not have an identification function such as a score line.
  • the printing attached to the drug means that the identification code information is formed by applying edible ink or the like to the surface of the drug in contact or non-contact.
  • In this specification, "attached by printing" is synonymous with "printed."
  • The smartphone 100 shown in FIG. 2 has a flat-plate housing 102, and is provided, on one surface of the housing 102, with a display unit 120 in which a display panel 121 (display) and an operation panel 122 (input unit) are integrally formed.
  • the display panel 121 is composed of a liquid crystal panel, and the display unit 120 of this example is a liquid crystal display.
  • the housing 102 includes a speaker 131, a microphone 132, an operation unit 140, and a camera unit 141.
  • the camera unit 141 includes at least one of a camera (in-camera) provided on the same surface side as the display unit 120 and a camera (out-camera (not shown)) provided on the surface side opposite to the display unit 120.
  • FIG. 3 is a block diagram showing the internal configuration of the smartphone 100 shown in FIG.
  • The smartphone 100 includes, as main components, a wireless communication unit 110, a display unit 120, a call unit 130, an operation unit 140, a camera unit 141, a storage unit 150, an external input/output unit 160 (image output unit), a GPS (Global Positioning System) receiving unit 170, a motion sensor unit 180, a power supply unit 190, and a main control unit 101. Further, as a main function, the smartphone 100 is provided with a wireless communication function for performing mobile wireless communication via a base station device and a mobile communication network.
  • the wireless communication unit 110 performs wireless communication with the base station device connected to the mobile communication network according to the instruction of the main control unit 101.
  • the wireless communication is used to send and receive various file data such as voice data and image data, e-mail data, and receive web data and streaming data.
  • The display unit 120 is a so-called touch-panel display in which an operation panel 122 is arranged on the screen of a display panel 121; under the control of the main control unit 101, it displays images (still images and moving images), character information, and the like to visually convey information to the user, and detects user operations on the displayed information.
  • the display panel 121 uses an LCD (Liquid Crystal Display) as a display device.
  • The display panel 121 is not limited to an LCD, and may be, for example, an OLED (organic light-emitting diode) display.
  • the operation panel 122 is a device provided in a state in which an image displayed on the display surface of the display panel 121 can be visually recognized, and detects one or a plurality of coordinates operated by a user's finger or a stylus.
  • the operation panel 122 outputs a detection signal generated due to the operation to the main control unit 101.
  • the main control unit 101 detects the operation position (coordinates) on the display panel 121 based on the received detection signal.
  • The call unit 130 includes a speaker 131 and a microphone 132; it converts the user's voice input through the microphone 132 into voice data that can be processed by the main control unit 101 and outputs the data to the main control unit 101, and it decodes voice data received by the wireless communication unit 110 or the external input/output unit 160 and outputs it from the speaker 131.
  • the operation unit 140 is a hardware key using a key switch or the like, and receives an instruction from the user.
  • The operation unit 140 is, for example, a push-button switch mounted on the side surface of the housing 102 of the smartphone 100, which is turned on when pressed with a finger or the like and turned off by the restoring force of a spring or the like when the finger is released.
  • The storage unit 150 stores the control program and control data of the main control unit 101, address data associating the names and telephone numbers of communication partners, sent and received e-mail data, web data downloaded by web browsing, and downloaded content data, and temporarily stores streaming data and the like.
  • the storage unit 150 is composed of an internal storage unit 151 and an external storage unit 152 having a detachable external memory slot.
  • Each of the internal storage unit 151 and the external storage unit 152 constituting the storage unit 150 is realized by using a storage medium such as a flash-memory type, hard-disk type, multimedia-card-micro type, or card-type memory, a RAM (Random Access Memory), or a ROM (Read Only Memory).
  • The external input/output unit 160 serves as an interface with all external devices connected to the smartphone 100, and connects directly or indirectly to other external devices via communication (for example, USB (Universal Serial Bus) or IEEE 1394) or a network (for example, a wireless LAN (Local Area Network) or Bluetooth (registered trademark)).
  • The GPS receiving unit 170 receives GPS signals transmitted from GPS satellites ST1 to STn according to instructions from the main control unit 101, executes positioning calculation processing based on the plurality of received GPS signals, and acquires position information (GPS information) specified by the latitude, longitude, and altitude of the smartphone 100.
  • When position information can be acquired from the wireless communication unit 110 and/or the external input/output unit 160 (for example, a wireless LAN), the GPS receiving unit 170 can also detect the position using that position information.
  • The motion sensor unit 180 includes, for example, a three-axis acceleration sensor, and detects the physical movement of the smartphone 100 according to instructions from the main control unit 101; by detecting the physical movement, the moving direction and acceleration of the smartphone 100 are obtained, and the detection result is output to the main control unit 101.
  • the power supply unit 190 supplies the electric power stored in the battery (not shown) to each unit of the smartphone 100 according to the instruction of the main control unit 101.
  • the main control unit 101 includes a microprocessor, operates according to the control program and control data stored in the storage unit 150, and controls each part of the smartphone 100 in an integrated manner.
  • the main control unit 101 includes a mobile communication control function that controls each unit of the communication system and a software processing function in order to perform voice communication and data communication through the wireless communication unit 110.
  • the software processing function is realized by operating the main control unit 101 according to the software (program) stored in the storage unit 150.
  • The software processing functions include, for example, an e-mail function for sending and receiving e-mail by controlling the external input/output unit 160, a web browsing function for browsing web pages, and a function that causes the smartphone 100 to operate as the image processing device according to the present invention.
  • The software that causes the smartphone 100 to function as the image processing device according to the present invention can be installed on the smartphone 100 by downloading the corresponding software from the server 200 functioning as the drug identification device, or from the site of the business operator that operates the server 200.
  • the main control unit 101 has an image processing function such as displaying an image on the display unit 120 based on image data (still image or moving image data) such as received data or downloaded streaming data.
  • the main control unit 101 executes display control for the display unit 120 and operation detection control for detecting a user operation through the operation unit 140 and the operation panel 122.
  • The camera unit 141 converts the image data obtained by imaging into compressed image data such as JPEG (Joint Photographic Experts Group), records it in the storage unit 150, and can output it through the external input/output unit 160 or the wireless communication unit 110.
  • The camera unit 141 can be used for various functions of the smartphone 100; in this example, it is used to photograph the drug when identifying it. The image from the camera unit 141 can also be used in the software.
  • FIG. 4 is a block diagram showing the electrical configuration of the drug identification system shown in FIG.
  • A program (application) according to the present invention is installed in the smartphone 100, and the main control unit 101 of the smartphone 100 executes this application to function as an image extraction unit 101A, a recognizer 101B, an image generation unit 101C, and a communication control unit 101D.
  • the camera unit 141 and the image extraction unit 101A function as an image input unit for inputting an image (third image) of the drug into the recognizer 101B.
  • the photographed image of the drug photographed by the camera unit 141 is input to the main control unit 101.
  • The image extraction unit 101A of the main control unit 101 extracts a region corresponding to the drug, which is the recognition target, from the input captured image, and inputs the image of the extracted region (drug image) to the recognizer 101B.
  • The drug image is preferably extracted (cut out) by detecting the outer shape of the drug and cropping according to that outer shape; for example, a rectangular region circumscribing the outer shape of the drug can be cut out.
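The outline-based cropping described above can be sketched in plain Python, assuming a grayscale image as a nested list in which the drug is brighter than the background; the function name and threshold are illustrative, not taken from the patent, and a real implementation would detect the outline with proper image-processing primitives.

```python
def crop_drug_region(image, threshold=128):
    """Crop the rectangular region circumscribing all pixels brighter
    than `threshold` (a crude stand-in for outline detection)."""
    rows = [r for r, row in enumerate(image) if any(p > threshold for p in row)]
    cols = [c for c in range(len(image[0]))
            if any(row[c] > threshold for row in image)]
    if not rows or not cols:
        return []  # no drug-like region found
    top, bottom = min(rows), max(rows)
    left, right = min(cols), max(cols)
    return [row[left:right + 1] for row in image[top:bottom + 1]]

# A 4x5 image with a bright 2x2 "drug" in the middle.
img = [
    [0,   0,   0, 0, 0],
    [0, 200, 210, 0, 0],
    [0, 190, 220, 0, 0],
    [0,   0,   0, 0, 0],
]
print(crop_drug_region(img))  # -> [[200, 210], [190, 220]]
```

The returned sub-image would then play the role of the third image fed to the recognizer 101B.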
  • A convolutional neural network (CNN: Convolutional Neural Network), which is one of the deep learning models, can be applied to the recognizer 101B.
  • The recognizer 101B has been machine-learned using a learning data set for a plurality of different drugs to which a marking or printing is added, consisting of pairs of an image in which the marking or printing of the drug is not emphasized (first image) and an image in which the marking or printing is emphasized (second image).
  • The recognizer 101B does not need to have a learning function itself; it may be configured as a trained model by acquiring the parameters of a model (CNN) that has been machine-learned by an external machine learning device.
  • FIG. 5 is a block diagram showing a hardware configuration of an image processing device 300 including a machine learning device.
  • For the image processing device 300 shown in FIG. 5, a personal computer or a workstation can be used.
  • The image processing device 300 of this example is mainly composed of an image input unit 312, a database 314, a storage unit 316, an operation unit 318, a CPU (Central Processing Unit) 320, a RAM (Random Access Memory) 322, a ROM (Read Only Memory) 324, and a display unit 326.
  • the image input unit 312 is a part for inputting an image of a recognition object (“drug” in this example) to which a stamp or print is added, and inputting a learning data set or the like to be stored in the database 314.
  • Database 314 is a storage unit that stores the learning data set.
  • FIG. 6 is a diagram showing an example of a learning data set stored in the database 314 shown in FIG.
  • The learning data set consists of pairs of images of a plurality of drugs of different types (first images 25) and images in which the marking or printing of each drug corresponding to a first image 25 is emphasized (second images 27).
  • The first image 25 and the second image 27 are the input image and the correct-answer data, respectively, used during machine learning of the learning model.
  • The first images 25 can be collected by photographing the drugs. Generally, the marking is not clearly visible in a first image 25.
  • The second image 27 is an image showing the marking or printing of the drug.
  • The second image 27 can be obtained by displaying the first image 25 on the display unit 326 and having the user fill in the engraved or printed portion on the screen of the display unit 326 using the operation unit 318.
  • The second image is not limited to one created manually; an integrated image (an image subjected to enhancement processing for emphasizing the engraving or printing) generated by the drug recognition device described in Patent Document 1 or the like can also be used. That is, as the second image 27, it is possible to use an image that has been subjected to enhancement processing that emphasizes the marking or printing added to the drug, based on a plurality of images of the drug captured with different illumination directions.
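One plausible way to derive such an enhanced second image from directionally lit shots is to exploit the fact that an engraved groove falls into shadow under some illumination directions but not others, so the per-pixel brightness range across the shots is large exactly at the engraving. A minimal sketch (this integration rule is an assumption for illustration, not the enhancement processing of Patent Document 1):

```python
def integrate_directional_images(images):
    """Per-pixel brightness range (max - min) across images of the same
    drug lit from different directions; engraved pixels, whose shadows
    come and go with the light direction, get high values."""
    h, w = len(images[0]), len(images[0][0])
    return [[max(img[r][c] for img in images) - min(img[r][c] for img in images)
             for c in range(w)] for r in range(h)]

# Two 1x3 shots: the middle pixel is shadowed only under the first light.
shots = [
    [[100, 20, 100]],   # light from the left: groove in shadow
    [[100, 95, 100]],   # light from the right: groove lit
]
print(integrate_directional_images(shots))  # -> [[0, 75, 0]]
```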
  • FIG. 7 is a functional block diagram showing the functions of the machine learning device 350, which is a main component of the image processing device 300 shown in FIG. 5; the machine learning device 350 is composed of hardware including the CPU 320, the storage unit 316, the RAM 322, and the ROM 324 shown in FIG. 5.
  • The machine learning device 350 mainly includes a recognizer 352, and a loss value calculation unit 354 and a parameter control unit 356 that function as a learning unit causing the recognizer 352 to perform machine learning.
  • the CNN model is applied to the recognizer 352 in this example.
  • the recognizer 352 has a plurality of layer structures and holds a plurality of parameters.
  • the recognizer 352 can change from an unlearned model to a trained model by updating the parameters from the initial values to the optimum values.
  • The initial values of the parameters of the recognizer 352 may be arbitrary, or, for example, the parameters of a trained image-classification model may be applied. In the latter case, good machine learning can be performed with a relatively small number of training data sets by performing transfer learning using the learning data set shown in FIG. 6.
  • The recognizer 352 includes an input layer 352A, an intermediate layer 352B having a plurality of sets each composed of a convolutional layer and a pooling layer, and an output layer 352C, with a plurality of "nodes" in each layer connected by "edges".
  • the first image 25 of the learning data set (FIG. 6) is input to the input layer 352A as an input image.
  • the intermediate layer 352B has a plurality of sets including a convolutional layer and a pooling layer as one set, and is a portion for extracting features from the first image 25 input from the input layer 352A.
  • the convolutional layer filters nearby nodes in the previous layer (performs a convolutional operation using the filter) and acquires a "feature map".
  • the pooling layer reduces the feature map output from the convolution layer to a new feature map.
  • the "convolution layer” plays a role of feature extraction such as edge extraction from an image, and the "pooling layer” plays a role of imparting robustness so that the extracted features are not affected by translation or the like.
  • The intermediate layer 352B is not limited to the case where a convolutional layer and a pooling layer form one set; it also includes cases where convolutional layers are consecutive and cases where a normalization layer is included.
  • The convolutional layer in the final stage outputs a feature map (image) having the same size as the input image, showing the features of the drug (engraving and the like).
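The convolution performed by each convolutional layer, sliding a small filter over nearby pixels to produce a feature map, can be sketched in plain Python ("valid" padding, single channel; a real CNN layer would add padding, a bias term, and a nonlinearity):

```python
def conv2d(image, kernel):
    """'Valid' 2-D convolution (no padding): each output pixel is the
    weighted sum of the kernel over the corresponding input window."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [[sum(image[r + i][c + j] * kernel[i][j]
                 for i in range(kh) for j in range(kw))
             for c in range(out_w)] for r in range(out_h)]

# A horizontal edge filter run over a 3x3 image with a bright bottom row.
image = [[0, 0, 0],
         [0, 0, 0],
         [9, 9, 9]]
edge = [[-1, -1, -1],
        [ 0,  0,  0],
        [ 1,  1,  1]]
print(conv2d(image, edge))  # -> [[27]]
```

The pooling layer would then downsample such feature maps, giving the robustness to translation mentioned above.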
  • the output layer 352C is a portion that outputs the recognition result of the recognizer 352 (in this example, an image in which the marking or the like is emphasized).
  • The loss value calculation unit 354 acquires the recognition result (output image) output from the output layer 352C of the recognizer 352 and the second image 27 (correct-answer data) paired with the first image 25, and calculates the loss value between the two.
  • As a method of calculating the loss value, for example, the Jaccard coefficient or the Dice coefficient can be used.
  • Based on the loss value calculated by the loss value calculation unit 354, the parameter control unit 356 adjusts the parameters of the recognizer 352 (such as the filter coefficients of each convolutional layer) by the error backpropagation method so as to minimize the distance in the feature space between the correct-answer data and the output of the recognizer 352, or to maximize their similarity.
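For binary masks (engraving pixel versus background), the Jaccard and Dice coefficients mentioned above reduce to simple overlap ratios, and a loss value to minimize can be taken as one minus the coefficient. A sketch over flat 0/1 lists (the helper names are hypothetical):

```python
def dice(pred, target):
    """Dice coefficient 2|A∩B| / (|A| + |B|) for binary masks."""
    inter = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    return 2 * inter / total if total else 1.0

def jaccard(pred, target):
    """Jaccard coefficient |A∩B| / |A∪B| for binary masks."""
    inter = sum(p * t for p, t in zip(pred, target))
    union = sum(1 for p, t in zip(pred, target) if p or t)
    return inter / union if union else 1.0

pred   = [1, 1, 0, 0]   # recognizer output
target = [1, 0, 1, 0]   # correct-answer mask (second image)
print(dice(pred, target))      # -> 0.5  (2*1 / (2+2))
print(jaccard(pred, target))   # -> 0.333... (1 / 3)
print(1 - dice(pred, target))  # loss value to minimize -> 0.5
```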
  • The trained recognizer 352 uses an image of an arbitrary drug (third image) acquired by the image input unit 312 as an input image, recognizes the marking of the drug from the input image, and outputs the recognition result (fourth image) to the image output unit 360.
  • The recognizer 101B of the smartphone 100 acquires from the machine learning device 350 shown in FIG. 7 the same parameters as those of the trained recognizer 352, and by setting the acquired parameters, has the same recognition function as the trained recognizer 352.
  • The image generation unit 101C combines the third image of the drug (the image of the recognition-target drug in which the marking or printing is not emphasized), captured by the camera unit 141 and extracted by the image extraction unit 101A, with the recognition result (fourth image) produced by the recognizer 101B, to generate a composite image (fifth image) in which the marking or printing of the drug is emphasized.
  • The fourth image, like the second image 27 shown in FIG. 6, is an image showing only the marking or printing of the drug, with high brightness in the stamped or printed portion. Therefore, the image generation unit 101C can generate a fifth image in which the engraved or printed portion of the drug is emphasized in black by subtracting the fourth image from the third image. For a third image with low brightness (for example, an image of a black drug), the image generation unit 101C can generate a fifth image in which the stamped or printed portion of the drug is emphasized in white by adding the fourth image to the third image.
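The subtraction and addition described above can be sketched per pixel over flattened 8-bit images; the clamping to the 0–255 range is an assumed detail not spelled out in the text:

```python
def compose(third, fourth, mode="subtract"):
    """Combine the captured drug image (third) with the recognized
    marking image (fourth) to emphasize the engraving or printing."""
    def clamp(v):
        return max(0, min(255, v))
    if mode == "subtract":  # bright drug: render the marking in black
        return [clamp(t - f) for t, f in zip(third, fourth)]
    return [clamp(t + f) for t, f in zip(third, fourth)]  # dark drug: white

third = [200, 200, 200]   # bright drug surface
fourth = [0, 255, 0]      # recognizer found a marking at the middle pixel
print(compose(third, fourth, "subtract"))  # -> [200, 0, 200]

dark = [30, 30, 30]       # black drug
print(compose(dark, fourth, "add"))        # -> [30, 255, 30]
```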
  • A display control unit (not shown) functioning as an image output unit outputs the recognition result (fourth image) by the recognizer 101B, or the fifth image incorporating the fourth image, to the display unit 120 and causes the display unit 120 to display it.
  • The user can display the fourth or fifth image on the display unit 120 of the smartphone 100 simply by photographing the drug with the smartphone 100, and can easily visually recognize the engraving or printing added to the drug from the fourth or fifth image.
  • Since the recognition result (fourth image) by the recognizer 101B, or the fifth image incorporating the fourth image, emphasizes the marking or printing added to the drug, it is suitable for discriminating or auditing the drug.
  • The communication control unit 101D and the wireless communication unit 110, which function as image output units, transmit the recognition result (fourth image) by the recognizer 101B, or the fifth image incorporating the fourth image, to the server 200 via the network 2, and acquire, via the network 2, the identification result of the drug to be identified, which the server 200 identifies based on the fourth or fifth image.
  • The server 200 shown in FIG. 4 functions as a drug identification device, and is mainly composed of a communication unit 210, a CPU (Central Processing Unit) 220, a drug DB (database) 230, a memory 240, and a drug identification unit 250.
  • The CPU 220 controls each part of the server 200; it causes the communication unit 210 to function as an image receiving unit that receives the fourth or fifth image of the drug transmitted from the smartphone 100, and causes the drug identification unit 250 to execute drug identification processing based on the received image.
  • The drug DB 230 registers and manages drug images (images of the front and back sides of each drug) in association with drug identification information such as the drug name.
  • The image of each drug registered in the drug DB 230 (registered drug) is used as a template image for identifying which registered drug the drug to be identified corresponds to.
  • The memory 240 includes a storage unit in which the program providing the drug identification service is stored, and a portion serving as the work area of the CPU 220.
  • The drug identification unit 250 performs template matching between the image of the drug to be identified (fourth or fifth image) received via the communication unit 210 and the template images of the registered drugs in the drug DB 230, and acquires an identification result such as the drug identification information (including images of the registered drugs) of the registered drug with the highest degree of matching, or of a plurality of registered drugs with high degrees of matching.
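A rough sketch of this matching step follows (Python/NumPy). The scoring function, the `identify` helper, and the two-pixel "templates" are illustrative assumptions, not the patent's algorithm; a production system would match real registered drug images, for example with OpenCV template matching.

```python
import numpy as np

def ncc(a: np.ndarray, b: np.ndarray) -> float:
    """Zero-mean normalized cross-correlation between two same-sized images."""
    av = a.astype(np.float64).ravel() - a.mean()
    bv = b.astype(np.float64).ravel() - b.mean()
    denom = np.linalg.norm(av) * np.linalg.norm(bv)
    return float(av @ bv / denom) if denom else 0.0

def identify(query: np.ndarray, drug_db: dict) -> list:
    """Return registered drug names sorted by degree of matching, best first."""
    return sorted(drug_db, key=lambda name: ncc(query, drug_db[name]),
                  reverse=True)

# Toy templates standing in for the registered front-side drug images.
drug_db = {
    "drug A": np.array([[0, 255], [255, 0]], dtype=np.uint8),
    "drug B": np.array([[255, 0], [0, 255]], dtype=np.uint8),
}
query = np.array([[10, 240], [250, 5]], dtype=np.uint8)   # noisy view of A
print(identify(query, drug_db)[0])                        # drug A
```

Returning the full sorted list mirrors the patent's option of reporting either the single best match or several high-scoring candidates.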
  • The CPU 220 transmits the drug identification result from the drug identification unit 250, via the communication unit 210, to the smartphone 100 that transmitted the fourth or fifth image.
  • Alternatively, the server 200 may be provided with the smartphone 100's function of generating the fourth or fifth image, in which case the fourth or fifth image generated by the server 200, together with the drug identification result, may be transmitted to the smartphone 100.
  • In this configuration, the smartphone 100 captures an image of the drug to be identified and transmits the captured drug image to the server 200 as-is, thereby acquiring from the server 200 both an image in which the engraving or printing is emphasized and the drug recognition result.
  • <Image processing method> FIGS. 8 and 9 are flowcharts each showing an embodiment of the image processing method according to the present invention.
  • FIG. 8 shows the processing of the learning phase in the machine learning device 350 shown in FIG. 7.
  • First, a learning data set of a plurality of different drugs bearing engraving or printing is prepared (step S10).
  • The learning data set is a data set for machine learning in which, as shown in FIG. 6, a first image 25 in which the marking or printing is not emphasized and a second image 27 in which the marking or printing is emphasized form a pair; it is stored in the database 314 (FIG. 5).
  • The machine learning device 350 then causes the recognizer 352 to perform machine learning using the learning data set stored in the database 314 (step S12).
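The patent describes the recognizer as a convolutional neural network trained with the first images as inputs and the second images as target outputs. As an illustrative stand-in only, the same supervised scheme (feed in the unemphasized image, fit the emphasized target) can be sketched with a tiny linear model trained by gradient descent; the synthetic 16-pixel "image" pairs and all names below are our own:

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic learning data set: 200 pairs of 16-pixel "first images" X and
# "second images" T, here related by a fixed affine map the model must learn.
X = rng.random((200, 16)) - 0.5
T = 2.0 * X + 0.25

W = np.zeros((16, 16))                  # toy recognizer parameters
b = np.zeros(16)
lr = 0.5
for _ in range(2000):                   # step S12: fit outputs to second images
    Y = X @ W + b                       # recognizer output for the first images
    grad = Y - T                        # gradient of 0.5 * MSE w.r.t. Y
    W -= lr * X.T @ grad / len(X)
    b -= lr * grad.mean(axis=0)

loss = float(np.mean((X @ W + b - T) ** 2))
print(f"final training loss: {loss:.2e}")
```

After training, evaluating `X_new @ W + b` for a new unemphasized image plays the role of the recognition phase in FIG. 9; the patent's CNN replaces this linear map with learned convolutional layers.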
  • FIG. 9 shows the processing of the drug recognition phase by the smartphone 100 shown in FIG. 4 and the like.
  • The smartphone 100 of this example includes a recognizer 101B in which the same parameters as those of the trained recognizer 252 are set.
  • The recognizer 101B therefore has the same recognition function as the trained recognizer 252.
  • Next, an image (third image) of an arbitrary drug to be recognized, bearing a stamp or print, is input from the image input unit to the recognizer 101B as the input image (step S20). That is, the drug to be recognized is photographed by the camera unit 141, which functions as the image input unit; an image of the region corresponding to the drug (drug image) is extracted from the captured image; and the extracted drug image (third image) is input to the recognizer 101B.
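The extraction sub-step of S20 can be illustrated with a simple brightness-threshold bounding-box crop (the function name and the threshold are illustrative assumptions; the patent does not specify the extraction algorithm):

```python
import numpy as np

def extract_drug_region(captured: np.ndarray, thresh: int = 40) -> np.ndarray:
    """Crop the bounding box of pixels brighter than `thresh`, as a stand-in
    for extracting the drug region from the camera image (step S20)."""
    ys, xs = np.nonzero(captured > thresh)
    if ys.size == 0:                       # no drug found: return frame as-is
        return captured
    return captured[ys.min():ys.max() + 1, xs.min():xs.max() + 1]

frame = np.zeros((8, 8), dtype=np.uint8)   # dark background
frame[2:5, 3:7] = 180                      # the tablet occupies this region
roi = extract_drug_region(frame)           # third image handed to recognizer 101B
print(roi.shape)                           # (3, 4)
```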
  • The recognizer 101B outputs, as the recognition result for the input third image, an image (fourth image) showing the marking or printing added to the drug to be recognized (step S22).
  • The image generation unit 101C combines the third image (drug image) with the fourth image output from the recognizer 101B to generate a composite image (fifth image) in which the marking or printing of the drug is emphasized (step S24).
  • The display control unit that functions as an image output unit outputs the fifth image generated in step S24 to the display unit 120, and causes the display unit 120 to display the fifth image in which the marking or printing of the drug to be recognized is emphasized (step S26).
  • By photographing the drug with the smartphone 100, the user can display the fifth image on the display unit 120 of the smartphone 100 and easily visually recognize the marking or printing added to the drug.
  • The communication control unit 101D and the wireless communication unit 110, which function as the image output unit, transmit the fifth image generated in step S24 to the server 200 via the network 2 (step S28).
  • The server 200 acquires an identification result, such as drug identification information including the name of the drug to be recognized, based on the fifth image and transmits the acquired identification result to the smartphone 100; the smartphone 100 receives the drug identification result from the server 200 (step S30).
  • The display control unit of the smartphone 100 outputs the drug identification result received from the server 200 to the display unit 120 and causes the display unit 120 to display it (step S32).
  • By photographing the drug with the smartphone 100, the user can display the identification result, such as the name of the drug, on the display unit 120 of the smartphone 100.
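Putting steps S20 through S32 together, the recognition phase can be sketched end to end. Everything below (the threshold "recognizer", the in-memory `Server`, and the pixel-agreement score) is a mock standing in for the trained recognizer 101B and the networked server 200:

```python
import numpy as np

def recognizer(img):                        # S22: mock marking image (fourth image)
    return np.where(img < 100, 255, 0).astype(np.uint8)

def combine(img, marking):                  # S24: black-emphasized fifth image
    return np.clip(img.astype(int) - marking, 0, 255).astype(np.uint8)

class Server:                               # S28-S30: mock identification service
    def __init__(self, db):
        self.db = db                        # registered name -> template image
    def identify(self, img):
        # Pick the registered drug whose template agrees with the most pixels.
        return max(self.db, key=lambda n: float((img == self.db[n]).mean()))

drug = np.full((4, 4), 200, dtype=np.uint8)
drug[1, 1] = drug[2, 2] = 60                # engraved pixels photograph darker
fourth = recognizer(drug)                   # S22
fifth = combine(drug, fourth)               # S24: engraving forced to black
db = {"drug A": fifth, "drug B": np.zeros((4, 4), np.uint8)}
print(Server(db).identify(fifth))           # S32: drug A
```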
  • The image processing device of the present embodiment can be incorporated into a drug recognition device, allowing the drug recognition device to be made smaller and less expensive.
  • The mobile terminal according to the present invention is not limited to a smartphone; it may be a tablet terminal with a camera function, a mobile phone, a PDA (Personal Digital Assistant), or the like.
  • In the embodiment above, a drug is used as the recognition object, but the present invention is not limited to this and can also be applied to the recognition of other objects, such as engraved metal parts and precious metals.
  • The various processors include a CPU (Central Processing Unit) and a GPU (Graphics Processing Unit), which are general-purpose processors that execute programs to function as various processing units; a programmable logic device (PLD), such as an FPGA (Field Programmable Gate Array), whose circuit configuration can be changed after manufacture; and a dedicated electric circuit, such as an ASIC (Application Specific Integrated Circuit), which is a processor having a circuit configuration designed specifically to execute specific processing.
  • One processing unit constituting the image processing device may be composed of one of the above-mentioned various processors, or may be composed of two or more processors of the same type or different types.
  • For example, one processing unit may be composed of a plurality of FPGAs, or of a combination of a CPU and an FPGA.
  • A plurality of processing units may also be configured by one processor.
  • As a first example of configuring a plurality of processing units with a single processor, as typified by computers such as clients and servers, one processor may be configured by a combination of one or more CPUs and software, and this processor may function as the plurality of processing units.
  • Second, as typified by a System On Chip (SoC), there is a form of using a processor that realizes the functions of an entire system, including the plurality of processing units, with a single IC (Integrated Circuit) chip.
  • the various processing units are configured by using one or more of the above-mentioned various processors as a hardware structure.
  • the hardware structure of these various processors is, more specifically, an electric circuit (circuitry) in which circuit elements such as semiconductor elements are combined.
  • The present invention also includes a program that, by being installed in a computer, causes the computer to function as the image processing device according to the present invention, and a storage medium on which this program is recorded.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

Provided are an image processing device, a portable terminal, an image processing method, and a program with which it is possible to easily acquire an image in which the marking or printing added to an object to be recognized is emphasized. A smartphone (100) comprises: a recognizer (101B) trained by machine learning on a learning dataset for a plurality of different marked or printed medicines, each set consisting of a first medicine image in which the marking or printing is not emphasized and a second medicine image in which it is emphasized; a camera unit (141) functioning as an image input unit that inputs to the recognizer (101B) a third image, which is an image of an arbitrary marked or printed medicine in which the marking or printing is not emphasized; and an image output unit that outputs the recognition result obtained from the recognizer (101B) to a display unit (120) when the third image has been input to the recognizer (101B).

Description

Image processing device, portable terminal, image processing method, and program
 The present invention relates to an image processing device, a mobile terminal, an image processing method, and a program, and particularly to a technique for recognizing a stamp or print added to a recognition object.
 Conventionally, a drug recognition device capable of accurately recognizing the type of a drug bearing a stamp has been proposed (Patent Document 1).
 In the drug recognition device described in Patent Document 1, a lighting unit capable of illuminating the stamped drug from a plurality of lighting directions surrounding the drug switches the lighting direction in order. An imaging unit photographs the drug repeatedly, each time the illumination direction of the lighting unit is switched. A feature image extraction unit analyzes the captured image (drug image) for each illumination direction and extracts, for each drug image, a feature image corresponding to the shadows of the stamp. A feature image integration unit integrates the feature images for the respective illumination directions to generate an integrated image. A recognition unit recognizes the stamp contained in the integrated image and recognizes the type of the drug based on the recognition result of the stamp.
JP 2015-68765 A
 The drug recognition device described in Patent Document 1 can acquire an integrated image in which the stamp is emphasized, but it requires a plurality of lighting units with mutually different illumination directions, which makes the device large.
 In addition, because the drug recognition device described in Patent Document 1 turns on the plurality of lighting units sequentially and photographs a drug image for each illumination direction multiple times at staggered timings, the photographing time becomes long. Moreover, the drug may move during the multiple shots, in which case a good integrated image cannot be generated.
 The present invention has been made in view of such circumstances, and an object thereof is to provide an image processing device, a mobile terminal, an image processing method, and a program capable of easily acquiring an image in which the stamp or print added to a recognition object is emphasized.
 To achieve the above object, an image processing device according to one aspect of the present invention comprises: a recognizer on which machine learning has been performed using a learning data set for a plurality of different recognition objects bearing a stamp or print, the data set pairing a first image in which the stamp or print on the recognition object is not emphasized with a second image in which the stamp or print is emphasized; an image input unit that inputs to the recognizer a third image, which is an image of an arbitrary recognition object bearing a stamp or print and in which the stamp or print is not emphasized; and an image output unit that outputs the recognition result obtained from the recognizer when the third image is input to the recognizer.
 According to this aspect of the present invention, by constructing a recognizer trained with the above learning data set, inputting an image of an arbitrary recognition object bearing a stamp or print causes the recognizer to output a recognition result indicating that stamp or print.
 In an image processing device according to another aspect of the present invention, the recognition result is a fourth image in which the stamp or print added to the arbitrary recognition object is emphasized.
 An image processing device according to still another aspect of the present invention preferably comprises an image generation unit that combines the third image and the fourth image to generate a fifth image in which the stamp or print is emphasized.
 In an image processing device according to still another aspect of the present invention, the image output unit preferably outputs the recognition result to a display unit and causes the display unit to display the recognition result.
 In an image processing device according to still another aspect of the present invention, the recognition object is a drug.
 In an image processing device according to still another aspect of the present invention, the image output unit preferably outputs the recognition result to a drug recognition device.
 In an image processing device according to still another aspect of the present invention, the recognizer is preferably composed of a convolutional neural network on which machine learning has been performed with the first images of the learning data set as input images and the second images as output images.
 In an image processing device according to still another aspect of the present invention, the second images included in the learning data set preferably include images that have undergone enhancement processing to emphasize the stamp or print added to the recognition object, based on a plurality of images of the recognition object illuminated with light from mutually different directions.
 In an image processing device according to still another aspect of the present invention, the image input unit preferably includes a camera unit that captures an image including the arbitrary recognition object, and an image extraction unit that extracts the region corresponding to the recognition object from the image captured by the camera unit, and the image extracted by the image extraction unit is input to the recognizer as the third image.
 An invention according to still another aspect is a mobile terminal provided with the above image processing device.
 An image processing method according to still another aspect of the present invention includes: a step of preparing a learning data set for a plurality of different recognition objects bearing a stamp or print, the data set pairing a first image in which the stamp or print is not emphasized with a second image in which the stamp or print is emphasized; a step of causing a recognizer to perform machine learning using the learning data set; a step of inputting to the trained recognizer a third image, which is an image of an arbitrary recognition object bearing a stamp or print and in which the stamp or print is not emphasized; and a step in which an image output unit outputs the recognition result obtained from the recognizer when the third image is input to the recognizer.
 In an image processing method according to still another aspect of the present invention, the recognition result is preferably a fourth image in which the stamp or print added to the arbitrary recognition object is emphasized.
 An image processing method according to still another aspect of the present invention preferably includes a step in which an image generation unit combines the third image and the fourth image to generate a fifth image in which the stamp or print is emphasized.
 In an image processing method according to still another aspect of the present invention, the step of outputting the recognition result preferably outputs the recognition result to a display unit and causes the display unit to display the recognition result.
 In an image processing method according to still another aspect of the present invention, the recognition object is a drug.
 In an image processing method according to still another aspect of the present invention, the step of outputting the recognition result preferably outputs the recognition result to a drug recognition device.
 In an image processing method according to still another aspect of the present invention, the second images included in the learning data set preferably include images that have undergone enhancement processing to emphasize the stamp or print added to the recognition object, based on a plurality of images of the recognition object illuminated with light from mutually different directions.
 A program according to still another aspect of the present invention, by being installed in a computer, causes the computer to function as the above image processing device.
 According to the present invention, an image in which the stamp or print added to a recognition object is emphasized can be easily acquired.
FIG. 1 is a system configuration diagram showing an embodiment of a drug identification system including a mobile terminal according to the present invention.
FIG. 2 is an external view of a smartphone constituting the drug identification system shown in FIG. 1.
FIG. 3 is a block diagram showing the internal configuration of the smartphone shown in FIG. 2.
FIG. 4 is a block diagram showing the electrical configuration of the drug identification system shown in FIG. 1.
FIG. 5 is a block diagram showing the hardware configuration of an image processing device including a machine learning device.
FIG. 6 is a diagram showing an example of a learning data set stored in the database shown in FIG. 5.
FIG. 7 is a functional block diagram showing the functions of the machine learning device, which is a main component of the image processing device shown in FIG. 5.
FIG. 8 is a flowchart showing an embodiment of the image processing method according to the present invention, particularly the processing of the learning phase in the machine learning device.
FIG. 9 is a flowchart showing an embodiment of the image processing method according to the present invention, particularly the processing of the drug recognition phase by the smartphone.
 Hereinafter, preferred embodiments of the image processing device, mobile terminal, image processing method, and program according to the present invention will be described with reference to the accompanying drawings.
 [Configuration of the drug identification system]
 FIG. 1 is a system configuration diagram showing an embodiment of a drug identification system including a mobile terminal according to the present invention.
 As shown in FIG. 1, the drug identification system is composed of a smartphone 100, which is a camera-equipped mobile terminal, and a server 200, which functions as a drug identification device. The smartphone 100 and the server 200 are connected so that data communication is possible via a network 2 such as the Internet or a LAN (Local Area Network).
 The smartphone 100 has a camera unit and photographs the drug 10, which is the recognition object, with the camera unit. The smartphone 100 includes the image processing device according to the present invention, which processes the captured image of the drug 10 (third image), and displays the processed image (fourth image) on its display unit or transmits it to the server 200 via the network 2. Details of the image processing device will be described later.
 The server 200 identifies the drug 10 based on the fourth image of the drug 10 uploaded from the smartphone 100, and transmits the identification result (for example, drug identification information consisting of a drug name, a product name, an abbreviation, or a combination thereof) to the smartphone 100 that transmitted the fourth image of the drug 10.
 Identification code information for identifying the type of a drug (tablet) is attached to the surface of the drug. This identification code information is generally attached by stamping (engraving) or printing.
 The server 200 can improve its ability to identify drugs by using the identification code information attached to a drug.
 A stamp attached to a drug means identification code information formed by forming grooves, which are depressed regions, in the surface of the drug. The grooves are not limited to those formed by carving the surface and may also be formed by pressing the surface. The stamp may also include markings without an identification function, such as score lines.
 Printing attached to a drug means identification code information formed by applying edible ink or the like to the surface of the drug in a contact or non-contact manner. Here, "attached by printing" (印字) is used synonymously with "attached by printing" (印刷).
 <スマートフォンの構成>
 図2に示すスマートフォン100は、平板状の筐体102を有し、筐体102の一方の面に表示部としての表示パネル121と、入力部としての操作パネル122とが一体となって形成される表示部120が設けられる。表示パネル121は液晶パネルから構成されており、本例の表示部120は液晶ディスプレイである。
<Smartphone configuration>
The smartphone 100 shown in FIG. 2 has a flat-plate housing 102, and a display panel 121 as a display unit and an operation panel 122 as an input unit are integrally formed on one surface of the housing 102. Display unit 120 is provided. The display panel 121 is composed of a liquid crystal panel, and the display unit 120 of this example is a liquid crystal display.
 また、その筐体102は、スピーカ131と、マイクロホン132と、操作部140と、カメラ部141とを備える。カメラ部141は、表示部120と同じ面側に設けられたカメラ(インカメラ)と、表示部120と反対の面側に設けられたカメラ(図示しないアウトカメラ)のうちの少なくとも一方を含む。 Further, the housing 102 includes a speaker 131, a microphone 132, an operation unit 140, and a camera unit 141. The camera unit 141 includes at least one of a camera (in-camera) provided on the same surface side as the display unit 120 and a camera (out-camera (not shown)) provided on the surface side opposite to the display unit 120.
 図3は、図2に示したスマートフォン100の内部構成を示すブロック図である。 FIG. 3 is a block diagram showing the internal configuration of the smartphone 100 shown in FIG.
 図3に示すようにスマートフォン100は、主たる構成要素として、無線通信部110と、表示部120と、通話部130と、操作部140と、カメラ部141と、記憶部150と、外部入出力部160(画像出力部)と、GPS(global positioning system)受信部170と、モーションセンサ部180と、電源部190と、主制御部101とを備える。また、スマートフォン100の主たる機能として、基地局装置と移動通信網とを介した移動無線通信を行う無線通信機能を備える。 As shown in FIG. 3, the smartphone 100 has, as main components, a wireless communication unit 110, a display unit 120, a call unit 130, an operation unit 140, a camera unit 141, a storage unit 150, and an external input / output unit. It includes 160 (image output unit), a GPS (global positioning system) receiving unit 170, a motion sensor unit 180, a power supply unit 190, and a main control unit 101. Further, as a main function of the smartphone 100, it is provided with a wireless communication function for performing mobile wireless communication via a base station device and a mobile communication network.
 無線通信部110は、主制御部101の指示に従って、移動通信網に接続された基地局装置との間で無線通信を行う。その無線通信が使用されて、音声データ及び画像データ等の各種ファイルデータや電子メールデータなどの送受信、及びウェブデータやストリーミングデータなどの受信が行われる。 The wireless communication unit 110 performs wireless communication with the base station device connected to the mobile communication network according to the instruction of the main control unit 101. The wireless communication is used to send and receive various file data such as voice data and image data, e-mail data, and receive web data and streaming data.
 表示部120は、表示パネル121の画面上に配設された操作パネル122を備えたいわゆるタッチパネル付きディスプレイであり、主制御部101の制御により、画像(静止画及び動画)や文字情報などを表示して視覚的にユーザに情報を伝達し、また表示した情報に対するユーザ操作を検出する。 The display unit 120 is a display with a so-called touch panel provided with an operation panel 122 arranged on the screen of the display panel 121, and displays images (still images and moving images), character information, and the like under the control of the main control unit 101. The information is visually transmitted to the user, and the user operation on the displayed information is detected.
 表示パネル121は、LCD(Liquid Crystal Display)を表示デバイスとして用いる。尚、表示パネル121は、LCDに限らず、例えば、OLED(organic light emitting diode)でもよい。 The display panel 121 uses an LCD (Liquid Crystal Display) as a display device. The display panel 121 is not limited to the LCD, and may be, for example, an OLED (organic light emission diode).
 操作パネル122は、表示パネル121の表示面上に表示される画像が視認可能な状態で設けられ、ユーザの指や尖筆によって操作される1又は複数の座標を検出するデバイスである。そのデバイスがユーザの指や尖筆によって操作されると、操作パネル122は、操作に起因して発生する検出信号を主制御部101に出力する。次いで、主制御部101は、受信した検出信号に基づいて、表示パネル121上の操作位置(座標)を検出する。 The operation panel 122 is a device provided in a state in which an image displayed on the display surface of the display panel 121 can be visually recognized, and detects one or a plurality of coordinates operated by a user's finger or a stylus. When the device is operated by the user's finger or stylus, the operation panel 122 outputs a detection signal generated due to the operation to the main control unit 101. Next, the main control unit 101 detects the operation position (coordinates) on the display panel 121 based on the received detection signal.
 通話部130は、スピーカ131及びマイクロホン132を備え、マイクロホン132を通じて入力されたユーザの音声を主制御部101にて処理可能な音声データに変換して主制御部101に出力したり、無線通信部110或いは外部入出力部160により受信された音声データを復号してスピーカ131から出力したりする。 The call unit 130 includes a speaker 131 and a microphone 132, converts a user's voice input through the microphone 132 into voice data that can be processed by the main control unit 101, and outputs the data to the main control unit 101, or a wireless communication unit. The audio data received by the 110 or the external input / output unit 160 is decoded and output from the speaker 131.
 操作部140は、キースイッチなどを用いたハードウエアキーであって、ユーザからの指示を受け付ける。例えば、図2に示すように、操作部140は、スマートフォン100の筐体102の側面に搭載され、指などで押下されるとスイッチオン状態となり、指を離すとバネなどの復元力によってスイッチオフ状態となる押しボタン式のスイッチである。 The operation unit 140 is a hardware key using a key switch or the like, and receives an instruction from the user. For example, as shown in FIG. 2, the operation unit 140 is mounted on the side surface of the housing 102 of the smartphone 100, and is switched on when pressed with a finger or the like, and switched off by a restoring force such as a spring when the finger is released. It is a push button type switch that is in a state.
 記憶部150は、主制御部101の制御プログラムや制御データ、通信相手の名称や電話番号などを対応づけたアドレスデータ、送受信した電子メールのデータ、ウェブブラウジングによりダウンロードしたウェブデータ、及びダウンロードしたコンテンツデータ等を記憶し、またストリーミングデータなどを一時的に記憶する。 The storage unit 150 stores the control program and control data of the main control unit 101, address data associating the names and telephone numbers of communication partners, sent and received e-mail data, web data downloaded by web browsing, and downloaded content data, and also temporarily stores streaming data and the like.
 また、記憶部150は、内部記憶部151と着脱自在な外部メモリスロットを有する外部記憶部152とにより構成される。尚、記憶部150を構成する内部記憶部151及び外部記憶部152のそれぞれは、フラッシュメモリタイプ、ハードディスクタイプ、マルチメディアカードマイクロタイプ、カードタイプのメモリ、RAM(Random Access Memory)、或いはROM(Read Only Memory)などの格納媒体を用いて実現される。 Further, the storage unit 150 is composed of an internal storage unit 151 and an external storage unit 152 having a detachable external memory slot. Each of the internal storage unit 151 and the external storage unit 152 constituting the storage unit 150 is realized by a storage medium such as a flash-memory type, hard-disk type, multimedia-card-micro type, or card type memory, a RAM (Random Access Memory), or a ROM (Read Only Memory).
 外部入出力部160は、スマートフォン100に連結される全ての外部機器とのインターフェースの役割を果たし、通信等(例えば、USB(Universal Serial Bus)、IEEE1394など)又はネットワーク(例えば、無線LAN(Local Area Network)、ブルートゥース(Bluetooth)(登録商標)により他の外部機器に直接的又は間接的に接続する。 The external input/output unit 160 serves as an interface with all external devices connected to the smartphone 100, and connects directly or indirectly to other external devices by communication (for example, USB (Universal Serial Bus) or IEEE 1394) or by a network (for example, a wireless LAN (Local Area Network) or Bluetooth (registered trademark)).
 GPS受信部170は、主制御部101の指示に従って、GPS衛星ST1、ST2~STnから送信されるGPS信号を受信し、受信した複数のGPS信号に基づく測位演算処理を実行し、スマートフォン100の緯度、経度及び高度によって特定される位置情報(GPS情報)を取得する。GPS受信部170は、無線通信部110及び/又は外部入出力部160(例えば、無線LAN)から位置情報を取得できる場合には、その位置情報を用いて位置を検出することもできる。 The GPS receiving unit 170 receives GPS signals transmitted from GPS satellites ST1, ST2 to STn according to instructions from the main control unit 101, executes positioning calculation processing based on the plurality of received GPS signals, and acquires position information (GPS information) specified by the latitude, longitude, and altitude of the smartphone 100. When the GPS receiving unit 170 can acquire position information from the wireless communication unit 110 and/or the external input/output unit 160 (for example, a wireless LAN), it can also detect the position using that position information.
 モーションセンサ部180は、例えば、3軸の加速度センサなどを備え、主制御部101の指示に従って、スマートフォン100の物理的な動きを検出する。スマートフォン100の物理的な動きを検出することにより、スマートフォン100の動く方向や加速度が検出される。その検出の結果は、主制御部101に出力される。 The motion sensor unit 180 includes, for example, a three-axis acceleration sensor, and detects the physical movement of the smartphone 100 according to the instruction of the main control unit 101. By detecting the physical movement of the smartphone 100, the moving direction and acceleration of the smartphone 100 are detected. The result of the detection is output to the main control unit 101.
 電源部190は、主制御部101の指示に従って、スマートフォン100の各部に、バッテリ(図示しない)に蓄えられる電力を供給する。 The power supply unit 190 supplies the electric power stored in the battery (not shown) to each unit of the smartphone 100 according to the instruction of the main control unit 101.
 主制御部101は、マイクロプロセッサを備え、記憶部150が記憶する制御プログラムや制御データに従って動作し、スマートフォン100の各部を統括して制御する。また、主制御部101は、無線通信部110を通じて音声通信及びデータ通信を行うために、通信系の各部を制御する移動通信制御機能と、ソフトウエア処理機能とを備える。 The main control unit 101 includes a microprocessor, operates according to the control program and control data stored in the storage unit 150, and controls each part of the smartphone 100 in an integrated manner. In addition, the main control unit 101 includes a mobile communication control function that controls each unit of the communication system and a software processing function in order to perform voice communication and data communication through the wireless communication unit 110.
 ソフトウエア処理機能は、記憶部150が記憶するソフトウエア(プログラム)に従って主制御部101が動作することにより実現される。ソフトウエア処理機能は、例えば、外部入出力部160を制御することで電子メールの送受信を行う電子メール機能、及びウェブページを閲覧するウェブブラウジング機能の他、スマートフォン100を本発明に係る画像処理装置として機能させる。スマートフォン100を本発明に係る画像処理装置として機能させるソフトウエア(本発明に係るプログラム)は、薬剤識別装置として機能するサーバ200又はサーバ200を運営する事業者のサイト等から対応するソフトウエアをダウンロードすることによりスマートフォン100にインストールすることができる。 The software processing function is realized by the main control unit 101 operating according to software (programs) stored in the storage unit 150. The software processing function includes, for example, an e-mail function for sending and receiving e-mail by controlling the external input/output unit 160 and a web browsing function for browsing web pages, and also causes the smartphone 100 to function as the image processing device according to the present invention. The software that causes the smartphone 100 to function as the image processing device according to the present invention (the program according to the present invention) can be installed on the smartphone 100 by downloading the corresponding software from the server 200 functioning as the drug identification device, or from the site of the business operator that operates the server 200.
 また、主制御部101は、受信データやダウンロードしたストリーミングデータなどの画像データ(静止画や動画のデータ)に基づいて、映像を表示部120に表示する等の画像処理機能を備える。 Further, the main control unit 101 has an image processing function such as displaying an image on the display unit 120 based on image data (still image or moving image data) such as received data or downloaded streaming data.
 更に、主制御部101は、表示部120に対する表示制御と、操作部140や操作パネル122を通じたユーザ操作を検出する操作検出制御とを実行する。 Further, the main control unit 101 executes display control for the display unit 120 and operation detection control for detecting a user operation through the operation unit 140 and the operation panel 122.
 カメラ部141は、主制御部101の制御により、撮像によって得た画像データを例えばJPEG(Joint Photographic Experts Group)などの圧縮した画像データに変換し、その画像データを記憶部150に記録したり、外部入出力部160や無線通信部110を通じて出力したりすることができる。 Under the control of the main control unit 101, the camera unit 141 converts image data obtained by imaging into compressed image data such as JPEG (Joint Photographic Experts Group), and can record the image data in the storage unit 150 or output it through the external input/output unit 160 or the wireless communication unit 110.
 また、カメラ部141は、スマートフォン100の各種機能に利用することができる。本例では、薬剤を識別する場合、薬剤の撮影に利用される。カメラ部141からの画像をソフトウエア内で利用することもできる。 Further, the camera unit 141 can be used for various functions of the smartphone 100. In this example, when identifying a drug, it is used for photographing the drug. The image from the camera unit 141 can also be used in the software.
 <薬剤識別システムの電気的な構成>
 図4は、図1に示した薬剤識別システムの電気的な構成を示すブロック図である。
<Electrical configuration of drug identification system>
FIG. 4 is a block diagram showing the electrical configuration of the drug identification system shown in FIG.
 スマートフォン100には、本発明に係るプログラム(アプリケーション)がインストールされており、スマートフォン100の主制御部101は、このアプリケーションを実行することにより、画像抽出部101A、認識器101B、画像生成部101C、及び通信制御部101Dとして機能する。 A program (application) according to the present invention is installed on the smartphone 100, and by executing this application, the main control unit 101 of the smartphone 100 functions as an image extraction unit 101A, a recognizer 101B, an image generation unit 101C, and a communication control unit 101D.
 カメラ部141及び画像抽出部101Aは、薬剤の画像(第3画像)を認識器101Bに入力させる画像入力部として機能する。カメラ部141により撮影された薬剤の撮影画像は、主制御部101に入力される。主制御部101の画像抽出部は、入力した撮影画像から認識対象物である薬剤に対応する領域を抽出し、抽出した領域の画像(薬剤画像)を認識器101Bに入力させる。薬剤画像の抽出(切り出し)は、薬剤の外形を検出し、薬剤の外形にしたがって切り出すことが好ましく、例えば、薬剤の外形が内接する矩形領域を切り出すことができる。 The camera unit 141 and the image extraction unit 101A function as an image input unit that inputs an image of the drug (third image) to the recognizer 101B. The captured image of the drug photographed by the camera unit 141 is input to the main control unit 101. The image extraction unit of the main control unit 101 extracts the region corresponding to the drug, which is the recognition object, from the input captured image, and inputs the image of the extracted region (drug image) to the recognizer 101B. The drug image is preferably extracted (cut out) by detecting the outer shape of the drug and cutting along that outer shape; for example, a rectangular region in which the outer shape of the drug is inscribed (a bounding rectangle) can be cut out.
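The bounding-rectangle cropping described above can be sketched in a few lines. This is a minimal illustration, not the patent's implementation: it assumes a binary mask of the drug region has already been obtained (for example, by outline detection), and the function and variable names are illustrative.

```python
import numpy as np

def crop_drug_region(image: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Crop the rectangular region that bounds the drug.

    `image` is an H x W (or H x W x C) captured image and `mask` is an
    H x W boolean array marking pixels detected as belonging to the drug.
    How the mask is obtained (thresholding, contour detection, etc.) is
    left open here.
    """
    ys, xs = np.nonzero(mask)
    if ys.size == 0:
        raise ValueError("no drug region detected in the mask")
    top, bottom = ys.min(), ys.max()
    left, right = xs.min(), xs.max()
    # Slice out the smallest axis-aligned rectangle containing the mask.
    return image[top:bottom + 1, left:right + 1]
```

The cropped image would then play the role of the third image fed to the recognizer 101B.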
 認識器101Bは、深層学習モデルの一つである畳み込みニューラルネットワーク(CNN:Convolution Neural Network)を適用することができる。認識器101Bは、刻印又は印字が付加された複数の異なる薬剤の学習データセットであって、薬剤の刻印又は印字が強調されていない画像(第1画像)と、薬剤の刻印又は印字が強調された画像(第2画像)とをセットとする学習用の学習データセットにより機械学習が行われたものである。尚、認識器101Bは、認識器101B自体が学習機能を有する必要はなく、外部の機械学習装置により機械学習が行われたモデル(CNN)のパラメータを取得することで、学習済みモデルとして構成されたものでもよい。 A convolutional neural network (CNN), one of the deep learning models, can be applied to the recognizer 101B. The recognizer 101B has been machine-learned with learning data sets for a plurality of different drugs to which stamps or prints are added, each set pairing an image in which the stamp or print on the drug is not emphasized (first image) with an image in which the stamp or print is emphasized (second image). The recognizer 101B itself does not need to have a learning function; it may be configured as a trained model by acquiring the parameters of a model (CNN) on which machine learning has been performed by an external machine learning device.
 図5は、機械学習装置を含む画像処理装置300のハードウエア構成を示すブロック図である。 FIG. 5 is a block diagram showing a hardware configuration of an image processing device 300 including a machine learning device.
 図5に示す画像処理装置300としては、パーソナルコンピュータ又はワークステーションを使用することができる。本例の画像処理装置300は、主として画像入力部312と、データベース314と、記憶部316と、操作部318と、CPU(Central Processing Unit)320と、RAM(Random Access Memory)322と、ROM(Read Only Memory)324と、表示部326とから構成されている。 A personal computer or a workstation can be used as the image processing device 300 shown in FIG. 5. The image processing device 300 of this example is mainly composed of an image input unit 312, a database 314, a storage unit 316, an operation unit 318, a CPU (Central Processing Unit) 320, a RAM (Random Access Memory) 322, a ROM (Read Only Memory) 324, and a display unit 326.
 画像入力部312は、刻印又は印字が付加された認識対象物(本例では「薬剤」)を撮影した画像を入力し、また、データベース314に保存する学習データセット等を入力する部分である。 The image input unit 312 is a part for inputting an image of a recognition object (“drug” in this example) to which a stamp or print is added, and inputting a learning data set or the like to be stored in the database 314.
 データベース314は、学習データセットを記憶する記憶部である。 Database 314 is a storage unit that stores the learning data set.
 図6は、図5に示したデータベース314に保存される学習データセットの一例を示す図である。 FIG. 6 is a diagram showing an example of a learning data set stored in the database 314 shown in FIG.
 学習データセットは、種類の異なる複数の薬剤の画像(第1画像25)と、第1画像25に対応する各薬剤の刻印又は印字が強調された画像(第2画像27)とがセットになっている。第1画像25と第2画像27は、それぞれ学習モデルの機械学習時に使用される入力画像と正解データである。第1画像25は、薬剤を撮影することで収集することができる。一般に第1画像25における刻印は鮮明に写っていない。 The learning data set pairs images of a plurality of drugs of different types (first images 25) with images in which the stamp or print of each drug corresponding to the first image 25 is emphasized (second images 27). The first image 25 and the second image 27 are, respectively, the input image and the correct-answer data used during machine learning of the learning model. The first images 25 can be collected by photographing drugs. In general, the stamp in the first image 25 is not clearly visible.
 第2画像27は、薬剤の刻印又は印字を示す画像である。第2画像27は、第1画像25を表示部326に表示させ、ユーザが操作部318を使用して、表示部326の画面上で刻印部分又は印字部分を塗りつぶすことで取得することができる。 The second image 27 is an image showing the marking or printing of the drug. The second image 27 can be obtained by displaying the first image 25 on the display unit 326 and the user using the operation unit 318 to fill the engraved portion or the printed portion on the screen of the display unit 326.
 また、第2画像は、手動で作成されるものに限らず、特許文献1等に記載の薬剤認識装置により生成される統合画像(刻印又は印字を強調する強調処理が施された画像)を使用することができる。即ち、第2画像27は、薬剤への光の照明方向がそれぞれ異なる薬剤の複数の画像に基づいて、薬剤に付加された刻印又は印字を強調する強調処理が施された画像を使用することができる。 Further, the second image is not limited to one created manually; an integrated image (an image subjected to enhancement processing that emphasizes the stamp or print) generated by the drug recognition device described in Patent Document 1 or the like can be used. That is, as the second image 27, it is possible to use an image subjected to enhancement processing that emphasizes the stamp or print added to the drug, based on a plurality of images of the drug illuminated from different directions.
 図7は、図5に示した画像処理装置300の主要な構成部分である機械学習装置350の機能を示す機能ブロック図であり、図5に示したCPU320、記憶部316、RAM322、ROM324等のハードウエアにより構成される。 FIG. 7 is a functional block diagram showing the functions of the machine learning device 350, a main component of the image processing device 300 shown in FIG. 5; it is configured by hardware such as the CPU 320, storage unit 316, RAM 322, and ROM 324 shown in FIG. 5.
 図7において、機械学習装置350は、主として認識器352と、認識器352に機械学習させる学習部として機能する損失値算出部354及びパラメータ制御部356とを備えている。 In FIG. 7, the machine learning device 350 mainly includes a recognizer 352, a loss value calculation unit 354 and a parameter control unit 356 that function as a learning unit that causes the recognizer 352 to perform machine learning.
 本例の認識器352は、CNNのモデルが適用される。認識器352は、複数のレイヤー構造を有し、複数のパラメータを保持している。認識器352は、パラメータが初期値から最適値に更新されることで、未学習モデルから学習済みモデルに変化しうる。認識器352のパラメータの初期値は、任意の値でもよいし、例えば、画像の分類等を行う画像系の学習済みモデルのパラメータを適用してもよい。後者の場合、図6に示した学習データセットによる転移学習を行うことで、比較的少ない学習データセットで良好な機械学習を行うことが可能である。 The CNN model is applied to the recognizer 352 in this example. The recognizer 352 has a plurality of layer structures and holds a plurality of parameters. The recognizer 352 can change from an unlearned model to a trained model by updating the parameters from the initial values to the optimum values. The initial value of the parameter of the recognizer 352 may be an arbitrary value, or for example, the parameter of the trained model of the image system for classifying images may be applied. In the latter case, good machine learning can be performed with a relatively small number of training data sets by performing transfer learning using the learning data set shown in FIG.
 この認識器352は、入力層352Aと、畳み込み層とプーリング層から構成された複数セットを有する中間層352Bと、出力層352Cとを備え、各層は複数の「ノード」が「エッジ」で結ばれる構造となっている。 The recognizer 352 includes an input layer 352A, an intermediate layer 352B having a plurality of sets each composed of a convolutional layer and a pooling layer, and an output layer 352C, and has a structure in which, in each layer, a plurality of "nodes" are connected by "edges".
 学習フェーズでは、入力層352Aには、学習データセット(図6)の第1画像25が入力画像として入力される。 In the learning phase, the first image 25 of the learning data set (FIG. 6) is input to the input layer 352A as an input image.
 中間層352Bは、畳み込み層とプーリング層とを1セットとする複数セットを有し、入力層352Aから入力した第1画像25から特徴を抽出する部分である。畳み込み層は、前の層で近くにあるノードにフィルタ処理し(フィルタを使用した畳み込み演算を行い)、「特徴マップ」を取得する。プーリング層は、畳み込み層から出力された特徴マップを縮小して新たな特徴マップとする。「畳み込み層」は、画像からのエッジ抽出等の特徴抽出の役割を担い、「プーリング層」は抽出された特徴が、平行移動などによる影響を受けないようにロバスト性を与える役割を担う。尚、中間層352Bには、畳み込み層とプーリング層とを1セットとする場合に限らず、畳み込み層が連続する場合や正規化層も含まれる。また、最終段の畳み込み層は、入力画像と同じサイズの特徴マップ(画像)であって、薬剤の特徴(刻印等)を示す特徴マップを出力する部分である。 The intermediate layer 352B has a plurality of sets, each consisting of a convolutional layer and a pooling layer, and is the part that extracts features from the first image 25 input through the input layer 352A. A convolutional layer performs filter processing on nearby nodes in the previous layer (a convolution operation using a filter) to obtain a "feature map". A pooling layer reduces the feature map output from the convolutional layer into a new feature map. The "convolutional layer" plays the role of feature extraction, such as edge extraction from an image, and the "pooling layer" plays the role of giving robustness so that the extracted features are not affected by translation or the like. The intermediate layer 352B is not limited to sets of one convolutional layer and one pooling layer; it may also include consecutive convolutional layers and a normalization layer. Further, the final-stage convolutional layer is the part that outputs a feature map (image) of the same size as the input image, showing the features (stamp, etc.) of the drug.
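What a convolutional layer and a pooling layer compute can be illustrated with a minimal NumPy sketch. This is not the implementation of the recognizer 352; it only shows, under simplified assumptions (single channel, "valid" padding, 2x2 max pooling), one convolution step and one pooling step, with illustrative names.

```python
import numpy as np

def conv2d_valid(image: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """'Valid' 2-D convolution (strictly, cross-correlation, as in CNNs)."""
    kh, kw = kernel.shape
    h = image.shape[0] - kh + 1
    w = image.shape[1] - kw + 1
    out = np.empty((h, w))
    for i in range(h):
        for j in range(w):
            # Each output pixel is the filtered sum of a nearby patch.
            out[i, j] = (image[i:i + kh, j:j + kw] * kernel).sum()
    return out

def max_pool2(fmap: np.ndarray) -> np.ndarray:
    """2x2 max pooling with stride 2 (odd edges are truncated)."""
    h, w = fmap.shape[0] // 2 * 2, fmap.shape[1] // 2 * 2
    f = fmap[:h, :w]
    return f.reshape(h // 2, 2, w // 2, 2).max(axis=(1, 3))
```

A real intermediate layer stacks many such filters with learned coefficients; here the point is only the shrinking feature-map shape and the translation-robustness that pooling provides.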
 出力層352Cは、認識器352の認識結果(本例では、刻印等が強調された画像)を出力する部分である。 The output layer 352C is a portion that outputs the recognition result of the recognizer 352 (in this example, an image in which the marking or the like is emphasized).
 損失値算出部354は、認識器352の出力層352Cから出力される認識結果(出力画像)と、第1画像25とペアの第2画像27(正解データ)とを取得し、両者間の損失値を算出する。損失値の算出方法は、例えば、ジャッカード係数やダイス係数を用いることが考えられる。 The loss value calculation unit 354 acquires the recognition result (output image) output from the output layer 352C of the recognizer 352 and the second image 27 (correct-answer data) paired with the first image 25, and calculates the loss value between the two. As a method of calculating the loss value, for example, the Jaccard coefficient or the Dice coefficient can be used.
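The two coefficients named above can be computed as follows for binary masks. The exact formulation used by the loss value calculation unit 354 is not specified in the text, so this is only an illustrative version; the small epsilon guarding against empty masks is an added assumption.

```python
import numpy as np

def dice_coefficient(pred, truth, eps: float = 1e-7) -> float:
    """Dice coefficient between a predicted mask and the correct-answer mask."""
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    intersection = np.logical_and(pred, truth).sum()
    return float((2.0 * intersection + eps) / (pred.sum() + truth.sum() + eps))

def jaccard_coefficient(pred, truth, eps: float = 1e-7) -> float:
    """Jaccard coefficient (intersection over union) between two masks."""
    pred = np.asarray(pred, dtype=bool)
    truth = np.asarray(truth, dtype=bool)
    intersection = np.logical_and(pred, truth).sum()
    union = np.logical_or(pred, truth).sum()
    return float((intersection + eps) / (union + eps))
```

Both coefficients approach 1 for a perfect match, so a loss value can be taken as 1 minus the coefficient.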
 パラメータ制御部356は、損失値算出部354により算出された損失値を元に、誤差逆伝播法により、正解データと認識器352の出力との特徴量空間での距離を最小化させ、又は類似度を最大化させるべく、認識器352のパラメータ(各畳み込み層のフィルタの係数等)を調整する。 Based on the loss value calculated by the loss value calculation unit 354, the parameter control unit 356 adjusts the parameters of the recognizer 352 (such as the filter coefficients of each convolutional layer) by the error backpropagation method so as to minimize the distance in the feature space between the correct-answer data and the output of the recognizer 352, or to maximize their similarity.
 このパラメータの調整処理を繰り返し行い、損失値算出部354により算出される損失値が収束するまで繰り返し学習を行う。 The adjustment process of this parameter is repeated, and the learning is repeated until the loss value calculated by the loss value calculation unit 354 converges.
 このようにしてデータベース314に格納された学習データセットを使用し、パラメータが最適化された学習済みの認識器352を作成する。 Using the training data set stored in the database 314 in this way, a trained recognizer 352 with optimized parameters is created.
 認識フェーズでは、学習済みの認識器352は、画像入力部312により取得された任意の薬剤の画像(第3画像)を入力画像とし、入力画像から薬剤の刻印等を認識し、認識結果(第4画像)を画像出力部360に出力する。 In the recognition phase, the trained recognizer 352 takes an image of an arbitrary drug (third image) acquired by the image input unit 312 as an input image, recognizes the stamp or the like of the drug from the input image, and outputs the recognition result (fourth image) to the image output unit 360.
 図4に戻って、スマートフォン100の認識器101Bは、図7に示した機械学習装置350から、学習済みの認識器352のパラメータと同じパラメータを取得し、取得したパラメータが設定されることで学習済みの認識器352と同じ認識機能を有すものとなる。 Returning to FIG. 4, the recognizer 101B of the smartphone 100 acquires from the machine learning device 350 shown in FIG. 7 the same parameters as those of the trained recognizer 352; by setting the acquired parameters, it comes to have the same recognition function as the trained recognizer 352.
 画像生成部101Cは、カメラ部141により撮影され、画像抽出部101Aにより抽出された薬剤の第3画像(刻印又は印字が強調されていない認識対象の薬剤の画像)と、認識器101Bにより認識された認識結果(第4画像)とを合成し、薬剤の刻印又は印字が強調された合成画像(第5画像)を生成する。 The image generation unit 101C is recognized by the recognizer 101B and the third image of the drug (the image of the drug to be recognized whose marking or printing is not emphasized) taken by the camera unit 141 and extracted by the image extraction unit 101A. The recognition result (fourth image) is combined to generate a composite image (fifth image) in which the marking or printing of the drug is emphasized.
 ここで、第4画像は、図6に示した第2画像27と同様に、薬剤の刻印又は印字のみを示す画像であり、刻印部分又は印字部分の輝度が高い画像である。したがって、画像生成部101Cは、第3画像から第4画像を減算することで、薬剤の刻印又は印字部分が黒く強調された第5画像を生成することができる。尚、輝度の低い第3画像(例えば、黒い薬剤を撮影した画像)の場合、画像生成部101Cは、第3画像に第4画像を加算することで、薬剤の刻印又は印字部分が白く強調された第5画像を生成することができる。 Here, like the second image 27 shown in FIG. 6, the fourth image is an image showing only the stamp or print of the drug, in which the stamped or printed portion has high brightness. Therefore, by subtracting the fourth image from the third image, the image generation unit 101C can generate a fifth image in which the stamped or printed portion of the drug is emphasized in black. In the case of a third image with low brightness (for example, an image of a black drug), the image generation unit 101C can generate a fifth image in which the stamped or printed portion of the drug is emphasized in white by adding the fourth image to the third image.
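The composition rule in this paragraph (subtract the fourth image for a bright drug, add it for a dark drug) can be sketched as follows. The 8-bit grayscale assumption and the function name are illustrative, not from the patent.

```python
import numpy as np

def compose_emphasized_image(third: np.ndarray, fourth: np.ndarray,
                             dark_drug: bool = False) -> np.ndarray:
    """Combine the drug image (third image) with the recognized marking
    image (fourth image) into an emphasized composite (fifth image).

    For a bright drug the marking is subtracted so it appears black;
    for a dark drug it is added so it appears white.
    """
    # Work in a wider signed type so subtraction/addition cannot wrap around.
    t = third.astype(np.int16)
    f = fourth.astype(np.int16)
    fifth = t + f if dark_drug else t - f
    return np.clip(fifth, 0, 255).astype(np.uint8)
```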
 画像出力部として機能する表示制御部(図示せず)は、認識器101Bによる認識結果(第4画像)、又は第4画像を含む第5画像を表示部120に出力し、表示部120に表示させる。 A display control unit (not shown) functioning as an image output unit outputs the recognition result of the recognizer 101B (fourth image), or the fifth image containing the fourth image, to the display unit 120 and causes the display unit 120 to display it.
 これにより、ユーザは、スマートフォン100で薬剤を撮影することで、スマートフォン100の表示部120に第4画像又は第5画像を表示させることができ、第4画像又は第5画像により薬剤に付加された刻印又は印字を容易に視認することができる。尚、薬剤を動画撮影する場合には、動画の第4画像又は第5画像を表示部120に表示させることも可能である。 As a result, by photographing the drug with the smartphone 100, the user can display the fourth image or the fifth image on the display unit 120 of the smartphone 100 and can easily visually recognize the stamp or print added to the drug from the fourth image or the fifth image. When the drug is captured as a moving image, the fourth image or the fifth image of the moving image can also be displayed on the display unit 120.
 また、認識器101Bによる認識結果(第4画像)、又は第4画像を含む第5画像は、薬剤に付加された刻印又は印字が強調されているため、薬剤の鑑別又は監査を行うのに好適である。 Further, since the recognition result of the recognizer 101B (fourth image), or the fifth image containing the fourth image, emphasizes the stamp or print added to the drug, it is suitable for discriminating or auditing drugs.
 本例では、画像出力部として機能する通信制御部101D及び無線通信部110は、認識器101Bによる認識結果(第4画像)、又は第4画像を含む第5画像を、ネットワーク2を介してサーバ200に送信し、第4画像又は第5画像に基づいてサーバ200により識別された、識別対象の薬剤の識別結果をネットワーク2を介して取得する。 In this example, the communication control unit 101D and the wireless communication unit 110, which function as an image output unit, transmit the recognition result of the recognizer 101B (fourth image), or the fifth image containing the fourth image, to the server 200 via the network 2, and acquire, via the network 2, the identification result of the drug to be identified that the server 200 has determined based on the fourth image or the fifth image.
 <サーバ200>
 図4に示すサーバ200は、薬剤識別装置として機能するものであり、主として通信部210、CPU(Central Processing Unit)220、薬剤DB(database)230、メモリ240、及び薬剤識別部250から構成されている。
<Server 200>
The server 200 shown in FIG. 4 functions as a drug identification device and is mainly composed of a communication unit 210, a CPU (Central Processing Unit) 220, a drug DB (database) 230, a memory 240, and a drug identification unit 250.
 CPU220は、サーバ200の各部を統括制御する部分であり、スマートフォン100から送信された薬剤の第4画像又は第5画像を受け付ける画像受付部として通信部210を機能させ、受け付けた第4画像又は第5画像に基づいて薬剤識別部250により薬剤の識別処理を実行させる。 The CPU 220 controls each part of the server 200 in an integrated manner; it causes the communication unit 210 to function as an image receiving unit that receives the fourth image or the fifth image of the drug transmitted from the smartphone 100, and causes the drug identification unit 250 to execute the drug identification process based on the received fourth image or fifth image.
 薬剤DB230は、薬剤の名前等の薬剤識別情報と関連付けて薬剤の画像(薬剤の表側及び裏側の薬剤画像)を登録及び管理する部分である。薬剤DB230に登録された薬剤(登録薬剤)の薬剤画像は、識別対象の薬剤が、登録薬剤のうちのいずれの登録薬剤に対応するかを識別するためのテンプレート画像として使用される。 The drug DB 230 registers and manages drug images (drug images of the front side and the back side of the drug) in association with drug identification information such as the name of the drug. The drug images of the drugs registered in the drug DB 230 (registered drugs) are used as template images for identifying which of the registered drugs the drug to be identified corresponds to.
 メモリ240は、薬剤識別サービスを提供するプログラムが格納される記憶部、及びCPU220の作業領域となる部分を含む。 The memory 240 includes a storage unit in which a program for providing a drug identification service is stored, and a portion serving as a work area of the CPU 220.
 薬剤識別部250は、通信部210を介して受け付けた識別対象の薬剤の画像(第4画像又は第5画像)と、薬剤DB230に登録された登録薬剤のテンプレート画像とのテンプレートマッチングを行い、マッチング度合が最大となる登録薬剤、又はマッチング度合が高い上位の複数の登録薬剤の薬剤識別情報(登録薬剤の画像を含む)等の識別結果を取得する。 The drug identification unit 250 performs template matching between the image of the drug to be identified (fourth image or fifth image) received via the communication unit 210 and the template images of the registered drugs registered in the drug DB 230, and acquires identification results such as the drug identification information (including images of the registered drugs) of the registered drug with the highest degree of matching, or of the top several registered drugs with high degrees of matching.
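The template matching step can be illustrated with a simple normalized cross-correlation over same-sized grayscale images. The patent does not specify the matching measure, so this sketch is only one plausible choice, and all names (`match_score`, `identify_drug`, the registered-drug dictionary) are illustrative.

```python
import numpy as np

def match_score(query: np.ndarray, template: np.ndarray) -> float:
    """Normalized cross-correlation between two same-sized grayscale images.

    Returns a value in [-1, 1]; 1 means a perfect (mean-removed) match.
    """
    q = query.astype(np.float64).ravel()
    t = template.astype(np.float64).ravel()
    q -= q.mean()
    t -= t.mean()
    denom = np.linalg.norm(q) * np.linalg.norm(t)
    return float(q @ t / denom) if denom else 0.0

def identify_drug(query: np.ndarray, registered: dict) -> list:
    """Rank registered drugs by matching degree, best first.

    `registered` maps a drug name to its template image.
    """
    scores = {name: match_score(query, tmpl) for name, tmpl in registered.items()}
    return sorted(scores, key=scores.get, reverse=True)
```

The top entry corresponds to the registered drug with the highest matching degree; the rest of the ranking corresponds to the "top several registered drugs" mentioned above.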
 CPU220は、薬剤識別部250による薬剤の識別結果を、第4画像又は第5画像を送信したスマートフォン100に通信部210を介して送信する。 The CPU 220 transmits the drug identification result by the drug identification unit 250 to the smartphone 100 that transmitted the fourth image or the fifth image via the communication unit 210.
 尚、第4画像又は第5画像を生成するスマートフォン100の機能をサーバ200に持たせ、サーバ200により生成された第4画像又は第5画像をスマートフォン100に送信したり、薬剤の識別結果をスマートフォン100に送信するようにしてもよい。この場合、スマートフォン100は、識別対象の薬剤の画像を撮影し、撮影した薬剤の画像をそのままサーバ200に送信することで、刻印又は印字が強調された画像をサーバ200から取得し、あるいは撮影した薬剤の認識結果をサーバ200から取得することができる。 Note that the server 200 may be given the functions of the smartphone 100 that generate the fourth image or the fifth image, and the fourth image or fifth image generated by the server 200, or the drug identification result, may be transmitted to the smartphone 100. In this case, the smartphone 100 photographs the drug to be identified and transmits the captured drug image as-is to the server 200, thereby acquiring from the server 200 an image in which the stamp or print is emphasized, or the recognition result of the photographed drug.
 <画像処理方法>
 図8及び図9は、それぞれ本発明に係る画像処理方法の実施形態を示すフローチャートである。
<Image processing method>
8 and 9 are flowcharts showing embodiments of the image processing method according to the present invention, respectively.
 図8は、図7に示した機械学習装置350での学習フェーズの処理を示している。 FIG. 8 shows the processing of the learning phase in the machine learning device 350 shown in FIG. 7.
 図8において、刻印又は印字が付加された複数の異なる薬剤の学習データセットを準備する(ステップS10)。学習データセットは、図6に示したように刻印又は印字が強調されていない第1画像25と、刻印又は印字が強調された第2画像27とをセットとする機械学習用の学習データセットであり、データベース314(図5)に保存される。 In FIG. 8, learning data sets for a plurality of different drugs to which stamps or prints are added are prepared (step S10). The learning data sets are machine-learning data sets, each pairing a first image 25 in which the stamp or print is not emphasized with a second image 27 in which the stamp or print is emphasized, as shown in FIG. 6, and are stored in the database 314 (FIG. 5).
 図7を使用して説明したように機械学習装置350は、データベース314に保存された学習データセットにより認識器352に機械学習を行わせる(ステップS12)。 As described with reference to FIG. 7, the machine learning device 350 causes the recognizer 352 to perform machine learning using the learning data set stored in the database 314 (step S12).
 これにより、パラメータが最適化された学習済みの認識器252が作成される。 As a result, a trained recognizer 252 with optimized parameters is created.
 図9は、図4等に示したスマートフォン100による薬剤の認識フェーズの処理を示している。 FIG. 9 shows the processing of the drug recognition phase by the smartphone 100 shown in FIG. 4 and the like.
 本例のスマートフォン100は、学習済みの認識器252のパラメータと同じパラメータが設定された認識器101Bを備えている。この認識器101Bは、学習済みの認識器252と同じ認識機能を有している。 The smartphone 100 of this example includes a recognizer 101B in which the same parameters as those of the learned recognizer 252 are set. The recognizer 101B has the same recognition function as the learned recognizer 252.
 図9において、画像入力部から刻印又は印字が付加された任意の認識対象である薬剤の画像(第3画像)を、入力画像として認識器101Bに入力させる(ステップS20)。即ち、画像入力部として機能するカメラ部141により認識対象の薬剤を撮影し、撮影画像から薬剤に対応する領域の画像(薬剤画像)を抽出し、抽出した薬剤画像(第3画像)を認識器101Bに入力させる。 In FIG. 9, an image (third image) of an arbitrary drug to be recognized, to which a stamp or print is added, is input from the image input unit to the recognizer 101B as an input image (step S20). That is, the drug to be recognized is photographed by the camera unit 141 functioning as an image input unit, the image of the region corresponding to the drug (drug image) is extracted from the captured image, and the extracted drug image (third image) is input to the recognizer 101B.
 認識器101Bは、入力する第3画像に対する認識結果として、認識対象の薬剤に付加された刻印又は印字を示す画像(第4画像)を出力する(ステップS22)。 The recognizer 101B outputs an image (fourth image) showing the marking or printing added to the drug to be recognized as the recognition result for the input third image (step S22).
 画像生成部101Cは、第3画像(薬剤画像)と認識器101Bから出力される第4画像とを合成し、薬剤の刻印又は印字が強調された合成画像(第5画像)を生成する(ステップS24)。 The image generation unit 101C combines the third image (drug image) with the fourth image output from the recognizer 101B, and generates a composite image (fifth image) in which the stamp or print of the drug is emphasized (step S24).
 画像出力部として機能する表示制御部は、ステップS24で生成された第5画像を表示部120に出力し、認識対象の薬剤の刻印又は印字が強調された第5画像を表示部120に表示させる(ステップS26)。 The display control unit that functions as an image output unit outputs the fifth image generated in step S24 to the display unit 120, and causes the display unit 120 to display the fifth image in which the marking or printing of the drug to be recognized is emphasized. (Step S26).
 これにより、ユーザは、スマートフォン100で薬剤を撮影することで、スマートフォン100の表示部120に第5画像を表示させることができ、第5画像により薬剤に付加された刻印又は印字を容易に視認することができる。 As a result, by photographing the drug with the smartphone 100, the user can display the fifth image on the display unit 120 of the smartphone 100 and can easily visually recognize the stamp or print added to the drug from the fifth image.
 また、画像出力部として機能する通信制御部101D及び無線通信部110は、ステップS24で生成された第5画像を、ネットワーク2を介してサーバ200に送信する(ステップS28)。 Further, the communication control unit 101D and the wireless communication unit 110 that function as the image output unit transmit the fifth image generated in step S24 to the server 200 via the network 2 (step S28).
 サーバ200は、第5画像に基づいて認識対象の薬剤の名前等の薬剤識別情報等の識別結果を取得し、取得した識別結果をスマートフォン100に送信し、スマートフォン100は、サーバ200から薬剤の識別結果を受信する(ステップS30)。 The server 200 acquires identification results, such as drug identification information (e.g., the name of the drug to be recognized), based on the fifth image and transmits the acquired identification results to the smartphone 100, and the smartphone 100 receives the drug identification results from the server 200 (step S30).
 スマートフォン100の表示制御部は、サーバ200から受信した薬剤の識別結果を表示部120に出力し、薬剤の識別結果を表示部120に表示させる(ステップS32)。 The display control unit of the smartphone 100 outputs the drug identification result received from the server 200 to the display unit 120, and displays the drug identification result on the display unit 120 (step S32).
 これにより、ユーザは、スマートフォン100で薬剤を撮影することで、スマートフォン100の表示部120に薬剤の薬剤名等の識別結果を表示させることができる。 As a result, the user can display the identification result such as the drug name of the drug on the display unit 120 of the smartphone 100 by photographing the drug with the smartphone 100.
 [その他]
 本実施形態の画像処理装置は、薬剤認識装置に組み込むことが可能であり、これにより薬剤認識装置を小型化及び安価にすることが可能である。
[Other]
The image processing device of the present embodiment can be incorporated into a drug recognition device, which makes it possible to make the drug recognition device smaller and less expensive.
 また、本発明に係る携帯端末は、スマートフォンに限らず、カメラ機能を有するタブレット端末、携帯電話機、PDA(Personal Digital Assistants)等でもよい。 Further, the mobile terminal according to the present invention is not limited to a smartphone, but may be a tablet terminal having a camera function, a mobile phone, a PDA (Personal Digital Assistants), or the like.
 更に本実施形態では、薬剤を認識対象物としたが、これに限らず、本発明は、刻印が付加された金属部品、貴金属等の他の認識対象物の認識にも適用できる。 Further, in the present embodiment, the drug is used as a recognition object, but the present invention is not limited to this, and the present invention can be applied to the recognition of other recognition objects such as metal parts with engravings and precious metals.
 また、本発明に係る画像処理装置を実現するハードウエアは、各種のプロセッサ(processor)で構成できる。各種プロセッサには、プログラムを実行して各種の処理部として機能する汎用的なプロセッサであるCPU(Central Processing Unit)、GPU(Graphics Processing Unit)、FPGA(Field Programmable Gate Array)などの製造後に回路構成を変更可能なプロセッサであるプログラマブルロジックデバイス(Programmable Logic Device;PLD)、ASIC(Application Specific Integrated Circuit)などの特定の処理を実行させるために専用に設計された回路構成を有するプロセッサである専用電気回路などが含まれる。画像処理装置を構成する1つの処理部は、上記各種プロセッサのうちの1つで構成されていてもよいし、同種又は異種の2つ以上のプロセッサで構成されてもよい。例えば、1つの処理部は、複数のFPGA、あるいは、CPUとFPGAの組み合わせによって構成されてもよい。また、複数の処理部を1つのプロセッサで構成してもよい。複数の処理部を1つのプロセッサで構成する例としては、第1に、クライアントやサーバなどのコンピュータに代表されるように、1つ以上のCPUとソフトウエアの組み合わせで1つのプロセッサを構成し、このプロセッサが複数の処理部として機能する形態がある。第2に、システムオンチップ(System On Chip;SoC)などに代表されるように、複数の処理部を含むシステム全体の機能を1つのIC(Integrated Circuit)チップで実現するプロセッサを使用する形態がある。このように、各種の処理部は、ハードウエア的な構造として、上記各種プロセッサを1つ以上用いて構成される。更に、これらの各種のプロセッサのハードウエア的な構造は、より具体的には、半導体素子などの回路素子を組み合わせた電気回路(circuitry)である。 The hardware that realizes the image processing device according to the present invention can be configured by various processors. The various processors include a CPU (Central Processing Unit) and a GPU (Graphics Processing Unit), which are general-purpose processors that execute programs to function as various processing units; programmable logic devices (PLDs) such as FPGAs (Field Programmable Gate Arrays), which are processors whose circuit configuration can be changed after manufacture; and dedicated electric circuits such as ASICs (Application Specific Integrated Circuits), which are processors having a circuit configuration designed specifically to execute specific processing. One processing unit constituting the image processing device may be composed of one of these various processors, or of two or more processors of the same or different types. For example, one processing unit may be composed of a plurality of FPGAs, or of a combination of a CPU and an FPGA. Further, a plurality of processing units may be configured by one processor. As examples of configuring a plurality of processing units with one processor: first, as typified by computers such as clients and servers, one processor may be configured by a combination of one or more CPUs and software, and this processor may function as the plurality of processing units; second, as typified by a System On Chip (SoC), a processor may be used that realizes the functions of an entire system including the plurality of processing units with a single IC (Integrated Circuit) chip. As described above, the various processing units are configured using one or more of the above various processors as their hardware structure. More specifically, the hardware structure of these various processors is electric circuitry combining circuit elements such as semiconductor elements.
 Further, the present invention includes a program that, when installed on a computer, causes the computer to function as the image processing device according to the present invention, and a storage medium on which this program is recorded.
 Further, the present invention is not limited to the above-described embodiment, and it goes without saying that various modifications can be made without departing from the spirit of the present invention.
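The device described above and claimed below feeds an unemphasized "third image" through a trained recognizer and outputs the result. The following pure-Python sketch illustrates only that pipeline shape (image input unit → recognizer → image output unit); all function names are hypothetical, and the recognizer here is a stand-in contrast stretch, not the patent's trained convolutional neural network.

```python
# Hypothetical sketch of the claim-1 pipeline. Images are lists of rows of
# 8-bit grayscale values; low values correspond to the dark engraved stamp.

def image_input_unit(raw):
    """Hand the captured image to the recognizer as the 'third image'.
    A real implementation would first extract the drug region (claim 9)."""
    return [row[:] for row in raw]  # defensive copy

def recognizer(third):
    """Stand-in for the trained CNN: a min-max contrast stretch, so that
    engraving pixels become darker relative to the tablet surface."""
    flat = [p for row in third for p in row]
    lo, hi = min(flat), max(flat)
    span = (hi - lo) or 1  # avoid division by zero on flat images
    return [[(p - lo) * 255 // span for p in row] for row in third]

def image_output_unit(fourth):
    """Output the recognition result (here: return it to the caller;
    claim 4 would instead send it to a display unit)."""
    return fourth

# Toy 3x3 tablet patch: ~120 = surface, ~60 = engraved stroke.
third = [[120, 118, 60], [119, 58, 121], [62, 120, 117]]
fourth = image_output_unit(recognizer(image_input_unit(third)))
```

After the stretch, the engraved pixels sit near 0 and the surface near 255, which is the "emphasized" contrast the recognizer is trained to produce.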
2 Network
10 Drug
25 First image
27 Second image
100 Smartphone
101 Main control unit
101A Image extraction unit
101B Recognizer
101C Image generation unit
101D Communication control unit
102 Housing
110 Wireless communication unit
120 Display unit
121 Display panel
122 Operation panel
130 Call unit
131 Speaker
132 Microphone
140 Operation unit
141 Camera unit
150 Storage unit
151 Internal storage unit
152 External storage unit
160 External input/output unit
170 GPS receiving unit
180 Motion sensor unit
190 Power supply unit
200 Server
210 Communication unit
220 CPU
230 Drug DB
240 Memory
250 Drug identification unit
252 Recognizer
300 Image processing device
312 Image input unit
314 Database
316 Storage unit
318 Operation unit
320 CPU
322 RAM
324 ROM
326 Display unit
350 Machine learning device
352 Recognizer
352A Input layer
352B Intermediate layer
352C Output layer
354 Loss value calculation unit
356 Parameter control unit
360 Image output unit
S10 to S32 Steps

Claims (19)

  1.  An image processing device comprising:
     a recognizer on which machine learning has been performed using a learning data set of a plurality of different recognition objects each bearing a stamp or print, the learning data set pairing a first image in which the stamp or print of the recognition object is not emphasized with a second image in which the stamp or print is emphasized;
     an image input unit that inputs, to the recognizer, a third image that is an image of an arbitrary recognition object bearing a stamp or print, in which the stamp or print is not emphasized; and
     an image output unit that outputs a recognition result obtained from the recognizer when the third image is input to the recognizer.
  2.  The image processing device according to claim 1, wherein the recognition result is a fourth image in which the stamp or print added to the arbitrary recognition object is emphasized.
  3.  The image processing device according to claim 2, further comprising an image generation unit that combines the third image and the fourth image to generate a fifth image in which the stamp or print is emphasized.
  4.  The image processing device according to any one of claims 1 to 3, wherein the image output unit outputs the recognition result to a display unit and causes the display unit to display the recognition result.
  5.  The image processing device according to any one of claims 1 to 4, wherein the recognition object is a drug.
  6.  The image processing device according to claim 5, wherein the image output unit outputs the recognition result to a drug recognition device.
  7.  The image processing device according to any one of claims 1 to 6, wherein the recognizer is constituted by a convolutional neural network on which machine learning has been performed using the first image of the learning data set as an input image and the second image as an output image.
  8.  The image processing device according to any one of claims 1 to 7, wherein the second image included in the learning data set includes an image that has been subjected to enhancement processing that emphasizes the stamp or print added to the recognition object, based on a plurality of images of the recognition object captured with mutually different illumination directions of light on the recognition object.
  9.  The image processing device according to any one of claims 1 to 8, wherein the image input unit includes a camera unit that captures an image including an arbitrary recognition object and an image extraction unit that extracts a region corresponding to the recognition object from the captured image captured by the camera unit, and inputs the image extracted by the image extraction unit to the recognizer as the third image.
  10.  A mobile terminal comprising the image processing device according to any one of claims 1 to 9.
  11.  An image processing method comprising:
     a step of preparing a learning data set of a plurality of different recognition objects each bearing a stamp or print, the learning data set pairing a first image in which the stamp or print of the recognition object is not emphasized with a second image in which the stamp or print is emphasized;
     a step of causing a recognizer to perform machine learning using the learning data set;
     a step of inputting, to the recognizer on which the machine learning has been performed, a third image that is an image of an arbitrary recognition object bearing a stamp or print, in which the stamp or print is not emphasized; and
     a step in which an image output unit outputs a recognition result obtained from the recognizer when the third image is input to the recognizer.
  12.  The image processing method according to claim 11, wherein the recognition result is a fourth image in which the stamp or print added to the arbitrary recognition object is emphasized.
  13.  The image processing method according to claim 12, further comprising a step in which an image generation unit combines the third image and the fourth image to generate a fifth image in which the stamp or print is emphasized.
  14.  The image processing method according to any one of claims 11 to 13, wherein the step of outputting the recognition result outputs the recognition result to a display unit and causes the display unit to display the recognition result.
  15.  The image processing method according to any one of claims 11 to 13, wherein the recognition object is a drug.
  16.  The image processing method according to claim 15, wherein the step of outputting the recognition result outputs the recognition result to a drug recognition device.
  17.  The image processing method according to any one of claims 11 to 16, wherein the second image included in the learning data set includes an image that has been subjected to enhancement processing that emphasizes the stamp or print added to the recognition object, based on a plurality of images of the recognition object captured with mutually different illumination directions of light on the recognition object.
  18.  A program that, when installed on a computer, causes the computer to function as the image processing device according to any one of claims 1 to 9.
  19.  A non-transitory computer-readable recording medium that, when instructions stored on the recording medium are read by a computer, causes the computer to function as the image processing device according to any one of claims 1 to 9.
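Claim 3 (and claim 13) combines the third image with the recognizer's fourth image to generate a fifth image. The compositing rule is not specified in this chunk, so the sketch below uses a simple per-pixel weighted blend as one plausible choice; the function name and the `weight` parameter are illustrative, not the patent's specified method.

```python
# Hedged sketch of the claim-3 image generation unit: blend the original
# "third image" with the stamp-emphasized "fourth image" so the engraving
# stands out while the tablet's overall appearance is retained.

def generate_fifth_image(third, fourth, weight=0.5):
    """Per-pixel linear blend of two equally sized grayscale images.
    weight=0 returns the third image unchanged; weight=1 returns the
    fourth image unchanged."""
    fifth = []
    for row3, row4 in zip(third, fourth):
        fifth.append([round((1 - weight) * p3 + weight * p4)
                      for p3, p4 in zip(row3, row4)])
    return fifth

# Toy values: ~120 = tablet surface, 60 = engraved stroke in the original;
# the fourth image has stretched those to near 255 and near 0 respectively.
third  = [[120, 60], [119, 121]]
fourth = [[250,  8], [246, 255]]
fifth = generate_fifth_image(third, fourth)
```

With an equal blend, engraved pixels (60 → 34) end up much darker than surface pixels (120 → 185), which is the emphasized contrast the fifth image is meant to show.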
PCT/JP2020/030872 2019-08-27 2020-08-14 Image processing device, portable terminal, image processing method, and program WO2021039437A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2021542745A JP7225416B2 (en) 2019-08-27 2020-08-14 Image processing device, mobile terminal, image processing method and program

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2019-154363 2019-08-27
JP2019154363 2019-08-27

Publications (1)

Publication Number Publication Date
WO2021039437A1 2021-03-04

Family

ID=74684564

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2020/030872 WO2021039437A1 (en) 2019-08-27 2020-08-14 Image processing device, portable terminal, image processing method, and program

Country Status (2)

Country Link
JP (1) JP7225416B2 (en)
WO (1) WO2021039437A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2017049974A (en) * 2015-09-04 2017-03-09 キヤノン株式会社 Discriminator generator, quality determine method, and program
JP2018027242A (en) * 2016-08-18 2018-02-22 安川情報システム株式会社 Tablet detection method, tablet detection device, and table detection program
US20180260665A1 (en) * 2017-03-07 2018-09-13 Board Of Trustees Of Michigan State University Deep learning system for recognizing pills in images

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102644081B1 (en) * 2017-04-14 2024-03-07 가부시키가이샤 유야마 세이사쿠쇼 Pharmaceutical sorting device, sorting container and medication return method
WO2019039016A1 (en) * 2017-08-25 2019-02-28 富士フイルム株式会社 Medicine inspection assistance device, image processing device, image processing method, and program

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2017049974A (en) * 2015-09-04 2017-03-09 キヤノン株式会社 Discriminator generator, quality determine method, and program
JP2018027242A (en) * 2016-08-18 2018-02-22 安川情報システム株式会社 Tablet detection method, tablet detection device, and table detection program
US20180260665A1 (en) * 2017-03-07 2018-09-13 Board Of Trustees Of Michigan State University Deep learning system for recognizing pills in images

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
ZENG, XIAO ET AL.: "MobileDeepPill: A Small-Footprint Mobile Deep Learning System for Recognizing Unconstrained Pill Images", PROCEEDINGS OF THE 15TH ANNUAL INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS, APPLICATIONS, AND SERVICES, 23 June 2017 (2017-06-23), pages 56-67, XP058369188, DOI: 10.1145/3081333.3081336 *

Also Published As

Publication number Publication date
JPWO2021039437A1 (en) 2021-03-04
JP7225416B2 (en) 2023-02-20

Similar Documents

Publication Publication Date Title
US10780347B2 (en) Print system, server, print method, and program
US8094139B2 (en) Method in electronic pen, computer program product, and electronic pen
JP2018067315A (en) Method and system for providing augmented reality content using user-edited image
CN110059652B (en) Face image processing method, device and storage medium
WO2020162262A1 (en) Medicine identification system, medicine identification device, medicine identification method, and program
US10931880B2 (en) Electronic device and method for providing information thereof
JP7286208B2 (en) Biometric face detection method, biometric face detection device, electronic device, and computer program
CN109033276A (en) Method for pushing, device, storage medium and the electronic equipment of paster
CN111339737A (en) Entity linking method, device, equipment and storage medium
CN110378318B (en) Character recognition method and device, computer equipment and storage medium
CN110442879A (en) A kind of method and terminal of content translation
JP5754293B2 (en) Entry support system and server device
WO2021039437A1 (en) Image processing device, portable terminal, image processing method, and program
US20230368552A1 (en) Drug identification device, drug identification method and program, drug identification system, drug loading table, illumination device, imaging assistance device, trained model, and learning device
CN112053360A (en) Image segmentation method and device, computer equipment and storage medium
CN112818979A (en) Text recognition method, device, equipment and storage medium
CN108765321B (en) Shooting repair method and device, storage medium and terminal equipment
CN110717060A (en) Image mask filtering method and device and storage medium
CN114333029A (en) Template image generation method, device and storage medium
CN113298040A (en) Key point detection method and device, electronic equipment and computer-readable storage medium
CN111479060B (en) Image acquisition method and device, storage medium and electronic equipment
CN114550185B (en) Document generation method, related device, equipment and storage medium
US20240112476A1 (en) Drug identification apparatus, drug identification method and program
JP2023119446A (en) Medical product information registration system, medical product information registration method, information processing device, medical information system, and program
JP2024000578A (en) Image correction device, character recognition device, image correction method, character recognition method, program and recording medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20858172

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2021542745

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20858172

Country of ref document: EP

Kind code of ref document: A1