WO2018040105A1 - System and method for food recognition, food model training method, refrigerator and server - Google Patents

System and method for food recognition, food model training method, refrigerator and server Download PDF

Info

Publication number
WO2018040105A1
WO2018040105A1 PCT/CN2016/098120 CN2016098120W WO2018040105A1 WO 2018040105 A1 WO2018040105 A1 WO 2018040105A1 CN 2016098120 W CN2016098120 W CN 2016098120W WO 2018040105 A1 WO2018040105 A1 WO 2018040105A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
food material
refrigerator
training
model
Prior art date
Application number
PCT/CN2016/098120
Other languages
French (fr)
Chinese (zh)
Inventor
杨世清
石周
唐红强
Original Assignee
合肥华凌股份有限公司
合肥美的电冰箱有限公司
美的集团股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 合肥华凌股份有限公司, 合肥美的电冰箱有限公司, 美的集团股份有限公司 filed Critical 合肥华凌股份有限公司
Priority to PCT/CN2016/098120 priority Critical patent/WO2018040105A1/en
Publication of WO2018040105A1 publication Critical patent/WO2018040105A1/en

Links

Images

Classifications

    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F25REFRIGERATION OR COOLING; COMBINED HEATING AND REFRIGERATION SYSTEMS; HEAT PUMP SYSTEMS; MANUFACTURE OR STORAGE OF ICE; LIQUEFACTION SOLIDIFICATION OF GASES
    • F25DREFRIGERATORS; COLD ROOMS; ICE-BOXES; COOLING OR FREEZING APPARATUS NOT OTHERWISE PROVIDED FOR
    • F25D11/00Self-contained movable devices, e.g. domestic refrigerators
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor

Definitions

  • the invention belongs to the technical field of electrical appliances manufacturing, and particularly relates to a food material identification system, a food material identification method, a food material model training method, a refrigerator and a server.
  • refrigerators not only carry the function of preserving food, but also become part of the home network, providing more for family members.
  • Intelligent service As a module for front-end information collection provides a basis for the establishment of a subsequent food library.
  • the traditional image recognition technology has low recognition rate and low real-time performance in complex scenes, and can not be well applied to the identification of a large number of ingredients in the refrigerator.
  • the computational complexity is not easy to use in embedded systems.
  • the present invention aims to solve at least one of the technical problems in the related art to some extent.
  • an object of the present invention is to provide a food material identification system which has a high recognition rate and a strong model generalization ability.
  • the invention also provides a food material identification method, an food material model training method, and a refrigerator and a server.
  • a food material identification system includes: a refrigerator, the refrigerator includes an image collection device, the image collection device is configured to collect an image of the foodstuff in the refrigerator; and the server obtains the refrigerator.
  • An image of the inner food material, and the image is identified according to a food material model, the food material model being a neural network obtained by training through a deep learning algorithm.
  • the food material identification system of the embodiment of the invention combines local photographing and remote recognition, and uses the computing power of the server to apply the deep learning algorithm to image recognition, and the recognition rate is improved, which can be better applied to the foodstuffs with a larger amount in the refrigerator. Identification.
  • the image collection device includes: a camera module for collecting an image of the foodstuff in the refrigerator; a communication module, configured to send an image of the foodstuff in the refrigerator to the server; and a control module, respectively Controlling the camera module and the communication module.
  • the image capturing device further includes: a lighting module, configured to illuminate an environment in which the foodstuff in the refrigerator is located, and is more convenient for the camera module to collect the image of the foodstuff.
  • a lighting module configured to illuminate an environment in which the foodstuff in the refrigerator is located, and is more convenient for the camera module to collect the image of the foodstuff.
  • control module after receiving the door closing signal sent by the controller of the refrigerator, controls the camera module to collect an image of the foodstuff in the refrigerator, and controls the communication module to send the image to the office Said server.
  • the food material model is obtained by training: taking an image of the foodstuff in the refrigerator that is pre-acquired and calibrated as a training image, and determining a model parameter according to the depth learning algorithm based on the training image to obtain the food material model.
  • the food material model includes a convolutional neural network, a recurrent neural network, and a circulating neural network.
  • a food material identification method comprises the steps of: obtaining an image of the foodstuff in the refrigerator; and identifying the image according to the food material model to obtain the food material information, wherein the food material model is obtained by training through a deep learning algorithm Neural network.
  • the recognition rate is improved, and the more complicated food material identification in the refrigerator can be better dealt with.
  • the food material model is obtained by training: taking an image of the foodstuff in the refrigerator collected and calibrated in advance as a training image, and determining a model parameter according to the depth learning algorithm based on the training image to obtain the food material model.
  • the food material model includes one of a convolutional neural network, a recurrent neural network, and a circulating neural network.
  • a food material model training method includes the steps of: obtaining an image of a food material in a refrigerator that is pre-acquired and calibrated as a training image; and processing the training image according to a depth learning algorithm. Determine model parameters to obtain a food model.
  • the food material model training method uses the deep learning algorithm to take the food image in the refrigerator as input data, and obtains the food material model which can be applied to the identification of the food in the refrigerator to improve the recognition rate.
  • the food material model includes one of a convolutional neural network, a recurrent neural network, and a circulating neural network.
  • the convolutional neural network comprises a convolutional layer, a pooling layer, an excitation layer and a fully connected layer
  • the input feature of the first layer is the training image
  • the output features of each layer are used as input features of the next layer
  • the depth learning algorithm processing the training image further comprises: convolution layer performing feature compression on the input feature by convolution operation; the pooling layer performing pooling processing on the input feature; and the excitation layer obtaining the output feature by using the input feature through the excitation function
  • the output features are normalized; the input features of the fully connected layer and the output features are all reconnected between all nodes.
  • the input data is pooled (downsampled), which can reduce the redundancy of parameters. Normalizing the data after inputting the excitation function can improve the effectiveness of backpropagation and improve the generalization ability of the model.
  • the convolutional neural network includes a plurality of convolutional layers, a pooling layer, and an excitation layer, and a fully connected layer of the tail of the convolutional neural network, wherein different convolutional cores of the convolutional layer acquire different Output characteristics.
  • the input parameters are convoluted multiple times through different convolution kernels, and the parameters are more global.
  • the food material model training method further comprises: adjusting the model parameters of the food material model according to the actual food material information, thereby ensuring the generalization ability and the learning ability of the model.
  • a refrigerator includes: an image collecting device, the image capturing The apparatus is for taking an image of the foodstuff in the refrigerator and transmitting the image to a server, so that the server identifies the image according to the foodstuff model to obtain the foodstuff information, wherein the foodstuff model is trained by a deep learning algorithm And get the neural network.
  • the refrigerator of the embodiment of the invention sends the image of the collected food material to the server through the image collecting device, and provides a data foundation for the server to use the deep learning algorithm to identify the food material in the refrigerator.
  • the image collection device includes: a camera module for collecting an image of the foodstuff in the refrigerator; a communication module, configured to send an image of the foodstuff in the refrigerator to the server; and a control module, respectively Controlling the camera module and the communication module.
  • the image collecting device further includes: a lighting module, configured to illuminate an environment in which the foodstuff in the refrigerator is located, and more conveniently collect the image of the foodstuff.
  • control module after receiving the door closing signal sent by the controller of the refrigerator, controls the camera module to collect an image of the foodstuff in the refrigerator, and controls the communication module to send the image to the office Said server.
  • the server obtains an image of the foodstuff in the refrigerator, and identifies the image according to the food material model to obtain the food material information, wherein the food material model passes Neural network obtained by deep learning algorithm training.
  • the server of the embodiment of the present invention applies the deep learning algorithm to image recognition based on its powerful computing capability, and has a simple method and improved recognition rate, and can be better applied to the identification of a large number of food materials in the refrigerator.
  • the food material model is obtained by training: taking an image of the foodstuff in the refrigerator collected and calibrated in advance as a training image, and determining a model parameter according to the depth learning algorithm based on the training image to obtain the food material model.
  • the food material model includes one of a convolutional neural network, a recurrent neural network, and a circulating neural network.
  • FIG. 1 is a block diagram of a food material identification system in accordance with an embodiment of the present invention.
  • FIG. 2 is a block diagram of a food material identification system in accordance with one embodiment of the present invention.
  • FIG. 3 is a block diagram of a refrigerator in accordance with another embodiment of the present invention.
  • FIG. 4 is a flow chart of a method for identifying a foodstuff according to an embodiment of the present invention.
  • FIG. 5 is a flowchart of a food material model training method according to an embodiment of the present invention.
  • FIG. 6 is a process flow diagram of a convolutional neural network based algorithm in accordance with an embodiment of the present invention.
  • FIG. 7 is a flow chart of feature extraction of a training image in accordance with another embodiment of the present invention.
  • Figure 8 is a flow chart of a food material model training process in accordance with yet another embodiment of the present invention.
  • FIG. 9 is a block diagram of a refrigerator in accordance with one embodiment of the present invention.
  • FIG. 10 is a block diagram of a refrigerator in accordance with another embodiment of the present invention.
  • the food material identification system 1000 includes a refrigerator 100 and a server 200.
  • the refrigerator 100 includes an image capture device 10 for collecting images of foodstuffs in the refrigerator.
  • the refrigerator 100 and the server 200 can perform data interaction.
  • the server 200 obtains an image of the foodstuff in the refrigerator, and identifies the obtained image according to the food material model to obtain the food material information, such as the category of the foodstuff and its coordinates, wherein the food material model is a neural network obtained by training the deep learning algorithm, and the deep learning is performed.
  • the algorithm introduces the identification of the food in the refrigerator, and by increasing the number of neural network layers, the mapping relationship between output and input can be better, and the recognition accuracy is improved.
  • the food material identification system 1000 of the embodiment of the present invention combines local photographing and remote recognition, and utilizes the computing power of the server 200 to apply the deep learning algorithm to image recognition.
  • the method is simple, the recognition rate is improved, and the method can be better applied to the refrigerator 100. Identification of a large number of ingredients.
  • the image capture device 10 includes a camera module 11, a communication module 12, and a control module 13.
  • the camera module 11 is configured to collect an image of the foodstuff in the refrigerator 100
  • the communication module 12 is configured to send an image of the foodstuff in the refrigerator 100 to the server 200
  • the control module 13 is configured to respectively control the camera module 11 and the communication module 12.
  • the image capture device 10 further includes a lighting module 14 for illuminating the environment in which the foodstuffs in the refrigerator 100 are located, so that the camera module 11 can capture the image of the foodstuff.
  • the control module 13 controls the camera module 11 to collect an image of the foodstuff in the refrigerator after receiving the door closing signal from the controller of the refrigerator 100, and controls the communication module 12 to send the image to the server 200.
  • the refrigerator in the embodiment of the present invention can be provided with an image capturing device 10, including a control module 13, such as a CPU (Central Processing Unit), and a refrigerator thereof.
  • a control module 13 such as a CPU (Central Processing Unit)
  • the control circuit 01 of the refrigerator 100 is controlled by a controller 02 such as an MCU (Microcontroller Unit), the CPU and the MCU can perform data interaction, and the CPU peripheral circuit can mount related peripherals, in the present invention.
  • the related peripherals include a lighting module 14, a camera module 11 and a communication module 12 such as a WIFI module.
  • the CPU controls the camera module 11 to cooperate with the illumination module 14 to complete the shooting of the local food image.
  • the control module 13 controlling the camera module 11 and the illumination module 14 to jointly capture an image, uploading the captured image to the server 200 through the WIFI module, and the server 200 performs identification of the food material information, after which the server 200 can feed the identification information to the local or provide Give relevant personnel the recognition results.
  • the food product model deployed on the server 200 can be obtained by training: taking an image of the food in the refrigerator that is pre-acquired and calibrated as a training image, and determining a model parameter according to the depth learning algorithm based on the training image.
  • the food material model may include one of a convolutional neural network, a recurrent neural network, and a cyclic neural network, and the model parameters may be determined by a BP algorithm.
  • the food model training can be completed on the off-line side, taking the image of the food in the refrigerator or in the simulated refrigerator environment and manually marking the position of various ingredients, obtaining the input vector through the operation of the deep learning algorithm, and then pre-calibrating As a result, the whole connection is performed, and the parameters of the model are determined by training a large number of training images, which is equivalent to determining the coefficient in the functional relationship.
  • the parameter training is completed, the food material model is obtained, and the food material model is deployed on the server 200.
  • the training process for the food model is described in detail below.
  • the food material identification method includes the following steps:
  • S1 obtaining an image of the food in the refrigerator.
  • an image of the food in the refrigerator is collected by an image acquisition device and sent to a server.
  • the image is identified according to the food material model to obtain the food material information, wherein the food material model is a neural network obtained by training through a deep learning algorithm.
  • the food material model is obtained by training in the following manner: an image of the foodstuff in the refrigerator that is pre-acquired and calibrated is used as a training image, and the model parameter is determined based on the training image according to the depth learning algorithm to obtain the food material model.
  • the food material model may include one of a convolutional neural network, a recurrent neural network, and a cyclic neural network, and the model parameters may be determined by a BP algorithm.
  • the food material identification method of the embodiment of the invention identifies the food material information by using the deep learning method, and the recognition rate is improved, and the method is simple, and the complex material identification in the refrigerator can be better dealt with.
  • FIG. 5 is a flowchart of a food material model training method according to an embodiment of the present invention. As shown in FIG. 5, the food material model training method includes the following steps:
  • the training image is processed according to a depth learning algorithm to determine a model parameter to obtain a food material model.
  • the food material model may include a convolutional neural network, a recurrent neural network, and a circulating neural network.
  • a convolutional neural network e.g., a convolutional neural network
  • a recurrent neural network e.g., a recurrent neural network
  • a circulating neural network e.g., a convolutional neural network
  • the training process of the model parameters of the neural network can be implemented by the BP algorithm, which has been described in many cases and will not be described here.
  • There are different processing flows for different neural networks and their differences in different data sets are mainly determined by the structure of the neural network.
  • the input parameters are different for the same neural network structure, and the model parameters are also different.
  • the food material model uses the image of the food material in the refrigerator as input data, determines the model parameters according to the deep learning method, and then deploys the food material model on the identification server, and applies the food material identification in the refrigerator to improve the recognition rate and facilitate Identification of complex situations.
  • the food material model in the refrigerator can be used as the input data by using the deep learning algorithm, and the food material model can be applied to the identification of the foodstuff in the refrigerator, and the recognition rate is improved.
  • the training process of the food model namely the food neural network
  • the deep network can better fit the relationship between the input food image and the output recognition result, and has a high recognition rate.
  • the convolutional neural network comprises a convolutional layer, a pooling layer, an excitation layer and a fully connected layer
  • the input features of the first layer are training images
  • the output features of each layer are used as input features of the next layer.
  • the processing of the training image according to the deep learning algorithm further comprises: the convolution layer performs feature compression on the input feature by convolution operation; the pooling layer performs pooling on the input feature, and the input data is pooled (downsampled), The redundancy of the parameters is reduced; the excitation layer obtains the output features through the excitation function, and normalizes the output features, and the commonly used excitation functions include Relu, Maxout, sigmoid, and the like.
  • the input data after the excitation function is normalized, which can improve the effectiveness of backpropagation and improve the generalization ability of the model; the input features of the fully connected layer and the output features have the right to reconnect between all nodes, ie, The output data of one layer and the output data of the next layer are fully connected.
  • the full connection layer is usually at the end of the network, and the data of the last layer is the calibration result of the training image.
  • the processing flow of the convolutional neural network is as shown in FIG. 6.
  • the feature extraction is first performed to obtain the feature image, that is, the convolution operation is performed, and the feature image is downsampled, that is, pooled, and then, The feature vector obtained after the normalization process is input to the neural network as fully connected layer data.
  • the parameters of the food material model are determined by a large amount of image training, thereby obtaining a food material model.
  • the food model can adjust and design the network structure according to the actual situation and needs.
  • the food material model adopts the convolutional neural network structure
  • the training image is convoluted to obtain a feature image
  • the feature image is pooled.
  • the redundant information contained in the input vector can be reduced, the dimension is reduced, and the amount of computation is greatly reduced.
  • the feature extraction process of the convolutional neural network includes: S710, obtaining a training image; S720, using a convolution kernel to perform convolution operation on the training image, S730, adding offset; S740, obtaining features image.
  • the training image is processed by the selected convolution kernel, firstly based on the principle of local perception field.
  • the principle of local perception field It is believed that people's perception of the outside world is from local to global, and the spatial connection of images is also a close connection of local pixels, while the correlation of pixels far away is weak. Therefore, it is not necessary for each neuron to perceive the global image. It only needs to perceive the local part. Then, the local information is combined at a higher level to obtain global information.
  • the statistical features of the various parts of the image are the same, which also means that the features learned in this part can also be used in another part, so the same learning characteristics can be used for all positions on the training image.
  • the same convolution kernel can be used for processing to obtain a feature image.
  • convolution operations Through convolution operations, dimensionality reduction of the image to be processed can be achieved. Using a feature to describe an image has a loss of information. Different convolution kernels can be used to convolve the training image separately to obtain a series of feature images.
  • the next step can be used to classify these features.
  • all the extracted features can be used to train the classifier, but the feature image expresses only a certain feature of the image, and there is still a large redundancy.
  • redundancy With the spatial correlation of static pictures, redundancy can be further reduced by downsampling.
  • This kind of aggregation operation is called pooling. According to the pooling method, it is also called average pooling or maximum pooling.
  • the problem of generalization ability is reduced, and the excitation function causes the data distribution to change, and the data is normalized after the excitation layer, that is, the data of the excitation function is normalized. deal with.
  • the normalization process can also increase the training speed. Normalization can simply rescale the input parameters of each layer to a mean of 0 and a distribution of variance. Through normalization, generalization ability can be improved and training speed can be improved.
  • the food model can be completed at the offline end, as shown in Figure 8.
  • the food model training process includes: S810, the training image is manually marked with the position of various ingredients; S820, convolution operation; S830, pool processing; S840, normalized The input operation obtains an input vector; in S850, the normalized processed data is fully connected with the calibration result of the training image.
  • the convolutional neural network generally includes a plurality of convolution layers, a pooling layer, and an excitation layer, and a convolutional neural network tail.
  • Fully connected layer That is to say, the training image is subjected to multiple convolution operations, pooling processes, and normalization, wherein different convolution kernels of the convolutional layer can be understood as acquiring different output features.
  • Features obtained by a layer of convolution operation are often local, and the higher the number of layers, the more comprehensive the features obtained. Therefore, the use of multi-layer convolution can make the features more global, and the food model can be made as needed. Specific design.
  • model parameters of the food material model can be adjusted based on the actual food material information.
  • the image of the food material uploaded to the server may be determined by manual calibration, and the actual food material information may be compared with the recognition result. For the image identifying the error or the food not included in the previous food product model, the above may be adopted.
  • the training process of the food model is retrained to adjust the parameters of the food model.
  • the food material training method of the present application can be based on the food material information for the food material model.
  • the model parameters are adjusted to suit the new ingredients.
  • there is no effective monitoring for the recognition result and the food with low recognition rate cannot be optimized.
  • the method of the present application uses the supervised learning method to train the food material model, and detects the recognition result, and Model parameter adjustment can improve the generalization ability and learning ability of the model.
  • FIG. 9 is a block diagram of a refrigerator including an image capture device 10, as shown in FIG. 9, in accordance with an embodiment of the present invention.
  • the image capture device 10 is configured to adopt an image of the foodstuff in the refrigerator, and transmit the image to a server, so that the server recognizes the image according to the food material model to obtain the food material information, wherein the food material model is a nerve obtained by training through a deep learning algorithm.
  • the internet is a block diagram of a refrigerator including an image capture device 10, as shown in FIG. 9, in accordance with an embodiment of the present invention.
  • the image capture device 10 is configured to adopt an image of the foodstuff in the refrigerator, and transmit the image to a server, so that the server recognizes the image according to the food material model to obtain the food material information, wherein the food material model is a nerve obtained by training through a deep learning algorithm.
  • the internet The internet.
  • the refrigerator 100 of the embodiment of the present invention sends an image of the collected food material to the server through the image capturing device 10, and provides a data foundation for the server to use the deep learning algorithm to identify the foodstuff in the refrigerator.
  • the image capture device 10 includes a camera module 11, a communication module 12, and a control module 13.
  • the camera module 11 is configured to collect an image of the foodstuff in the refrigerator;
  • the communication module 12 is configured to send an image of the foodstuff in the refrigerator to the server; and
  • the control module 13 is configured to respectively control the camera module 11 and the communication module 12.
  • the image capture device 10 further includes a lighting module 14 for illuminating the environment in which the foodstuffs in the refrigerator 100 are located, so that the camera module 11 can be photographed to obtain an image of the foodstuff.
  • the control module 13 controls the camera module 11 to collect an image of the foodstuff in the refrigerator after receiving the door closing signal from the controller of the refrigerator 100, and controls the communication module 12 to send the image to the server 200.
  • a food product model can be deployed in the server 100.
  • the server 100 obtains an image of the foodstuff in the refrigerator, and identifies the image according to the foodstuff model to obtain the foodstuff information, such as the category of the foodstuff and its coordinates, wherein the foodstuff model is A neural network obtained by training in deep learning algorithms.
  • the deep learning algorithm is introduced into the identification of the food in the refrigerator, and the mapping relationship between the output and the input can be better by increasing the number of neural network layers, thereby improving the recognition accuracy.
  • the server 200 of the embodiment of the present invention applies the deep learning algorithm to image recognition based on its powerful computing capability, and has a simple method and improved recognition rate, and can be better applied to the identification of a large number of food materials in the refrigerator.
  • the food material model is obtained by training in the following manner: an image of the food in the refrigerator that is pre-acquired and calibrated is used as a training image, and the model parameter is determined according to the depth learning algorithm based on the training image to obtain a food material model.
  • the food material model may include one of a convolutional neural network, a recurrent neural network, and a cyclic neural network, and the model parameters may be determined by a BP algorithm.
  • the food model training can be completed at the offline end, the image of the food in the refrigerator is photographed or the image of the food in the simulated refrigerator environment is photographed, and the position of various ingredients is manually marked, and the input vector is obtained by the operation of the deep learning algorithm, and further Fully connected to the pre-calibrated results, trained through a large number of images to determine model parameters, equivalent to The coefficient in the function relationship is obtained, and the parameter training is completed to obtain the food material model, and the food material model is deployed on the server 200, so that when the image recognition is performed, the server 200 can input the obtained food material image in the refrigerator as an input, according to the food material model. Realize the identification of the location and category of the ingredients, and provide identification results, which can be fed back to the refrigerator or provided to the appropriate personnel.
  • the embodiments of the present invention provide a food material identification system, an food material identification method, a food material model training method, a refrigerator, and a server, and apply a deep learning algorithm to the food material identification, using a combination of local camera and online recognition.
  • the computing power of the server performs image recognition.
  • image recognition In the training stage of the food material model, a large number of food material images are collected, and the food ingredients in the image are marked, for example, 4-point mark, and the spatial coordinate information and the pixel information are processed for the food material to establish a deep neural network model for various food materials.
  • the feature mapping method is used to extract various characteristic parameters of the image, that is, the features of the image are preserved, and the redundancy is removed, and the operation is greatly reduced.
  • the identification phase after uploading the local image, the food product model is matched on the server side to generate the food and its coordinates.
  • the inspection stage by manually calibrating the food and position of the image to be measured, comparing the recognition results of the server, the recognition rate result is obtained, and the model parameters are adjusted through retraining to improve the actual performance.
  • any process or method description in the flowcharts or otherwise described herein may be understood to include one or more steps for implementing a particular logic function or process. Modules, segments or portions of code of executable instructions, and the scope of preferred embodiments of the invention includes additional implementations, which may not be in the order shown or discussed, including in a substantially simultaneous manner depending on the functionality involved. The functions are performed in the reverse order, which should be understood by those skilled in the art to which the embodiments of the present invention pertain.
  • a "computer-readable medium” can be any apparatus that can contain, store, communicate, propagate, or transport a program for use in an instruction execution system, apparatus, or device, or in conjunction with the instruction execution system, apparatus, or device.
  • computer readable media include the following: electrical connections (electronic devices) having one or more wires, portable computer disk cartridges (magnetic devices), random access memory (RAM), Read only memory (ROM), erasable editable read only memory (EPROM or flash memory), fiber optic devices, and portable compact disk read only memory (CDROM).
  • the computer readable medium may even be a paper or other suitable medium on which the program can be printed, as it may be optically scanned, for example by paper or other medium, followed by editing, interpretation or, if appropriate, other suitable The method is processed to obtain the program electronically and then stored in computer memory.
  • portions of the invention may be implemented in hardware, software, firmware or a combination thereof.
  • multiple steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution system to realise.
  • a suitable instruction execution system to realise.
  • if implemented in hardware as in another embodiment, it can be implemented by any one or combination of the following techniques well known in the art: having logic gates for implementing logic functions on data signals.
  • Discrete logic circuits application specific integrated circuits with suitable combinational logic gates, programmable gate arrays (PGAs), field programmable gate arrays (FPGAs), etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Combustion & Propulsion (AREA)
  • Mechanical Engineering (AREA)
  • Thermal Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Cold Air Circulating Systems And Constructional Details In Refrigerators (AREA)
  • Image Analysis (AREA)

Abstract

Disclosed is a food recognition system. The food recognition system comprises: a refrigerator and a server, wherein the refrigerator comprises an image collection apparatus used for collecting images of food in the refrigerator. The server acquires the images of food in the refrigerator and recognizes the images according to a food model so as to acquire food information. The food model is a neural network acquired by training by means of a deep learning algorithm. The food recognition system has a high recognition ability and model generalization ability. Also disclosed are a food recognition method, a food model training method, a refrigerator and a server.

Description

食材识别系统和方法、食材模型训练方法、冰箱和服务器Food identification system and method, food model training method, refrigerator and server 技术领域Technical field
本发明属于电器制造技术领域,尤其涉及一种食材识别系统,以及食材识别方法、食材模型训练方法以及冰箱和服务器。The invention belongs to the technical field of electrical appliances manufacturing, and particularly relates to a food material identification system, a food material identification method, a food material model training method, a refrigerator and a server.
背景技术Background technique
随着集成电路、人工智能、互联网技术的发展,传统的白家电业出现了新的定位,例如,冰箱不仅仅承载了保存食物的功能,也成为家庭网络的一部分,可为家庭成员提供更多的智能化服务。其中,食物识别作为前端信息采集的模块,为后续的食物库的建立提供依据。With the development of integrated circuits, artificial intelligence, and Internet technologies, the traditional white goods industry has taken on a new position. For example, refrigerators not only carry the function of preserving food, but also become part of the home network, providing more for family members. Intelligent service. Among them, food identification as a module for front-end information collection provides a basis for the establishment of a subsequent food library.
传统的图像识别技术在复杂场景下识别率较低,实时性不高,不能够很好地应用于冰箱内的大量食材识别。在就是传统的识别算法计算复杂度,不易用于嵌入式系统。The traditional image recognition technology has low recognition rate and low real-time performance in complex scenes, and can not be well applied to the identification of a large number of ingredients in the refrigerator. In the traditional recognition algorithm, the computational complexity is not easy to use in embedded systems.
发明内容Summary of the invention
本发明旨在至少在一定程度上解决相关技术中的技术问题之一。The present invention aims to solve at least one of the technical problems in the related art to some extent.
为此,本发明的一个目的在于提出一种食材识别系统,该食材识别系统,识别率高,模型泛化能力强。To this end, an object of the present invention is to provide a food material identification system which has a high recognition rate and a strong model generalization ability.
本发明还提出一种食材识别方法、食材模型训练方法以及冰箱和服务器。The invention also provides a food material identification method, an food material model training method, and a refrigerator and a server.
为了解决上述问题,本发明一方面提出的食材识别系统,包括:冰箱,所述冰箱包括图像采集装置,所述图像采集装置用于采集冰箱内食材的图像;服务器,所述服务器获得所述冰箱内食材的图像,并根据食材模型对所述图像进行识别以获得食材信息,所述食材模型为通过深度学习算法训练而获得的神经网络。In order to solve the above problems, a food material identification system according to an aspect of the present invention includes: a refrigerator, the refrigerator includes an image collection device, the image collection device is configured to collect an image of the foodstuff in the refrigerator; and the server obtains the refrigerator. An image of the inner food material, and the image is identified according to a food material model, the food material model being a neural network obtained by training through a deep learning algorithm.
本发明实施例的食材识别系统,将本地拍照和远程识别结合,利用服务器的运算能力,将深度学习算法应用于图像识别,识别率提高,可以更好地应用于对冰箱内数量较多的食材的识别。The food material identification system of the embodiment of the invention combines local photographing and remote recognition, and uses the computing power of the server to apply the deep learning algorithm to image recognition, and the recognition rate is improved, which can be better applied to the foodstuffs with a larger amount in the refrigerator. Identification.
进一步地,所述图像采集装置包括:摄像模块,用于采集所述冰箱内食材的图像;通信模块,用于将所述冰箱内食材的图像发送至所述服务器;和控制模块,用于分别对所述摄像模块和所述通信模块进行控制。Further, the image collection device includes: a camera module for collecting an image of the foodstuff in the refrigerator; a communication module, configured to send an image of the foodstuff in the refrigerator to the server; and a control module, respectively Controlling the camera module and the communication module.
所述图像采集装置还包括:照明模块,用于对冰箱内食材所处环境进行照明,更加方便摄像模块采集食材图像。The image capturing device further includes: a lighting module, configured to illuminate an environment in which the foodstuff in the refrigerator is located, and is more convenient for the camera module to collect the image of the foodstuff.
具体地,所述控制模块,在接收到所述冰箱的控制器发出的关门信号之后,控制所述摄像模块采集所述冰箱内食材的图像,并控制所述通信模块将所述图像发送至所述服务器。 Specifically, the control module, after receiving the door closing signal sent by the controller of the refrigerator, controls the camera module to collect an image of the foodstuff in the refrigerator, and controls the communication module to send the image to the office Said server.
进一步地,所述食材模型通过以下方式训练来获得:以预先采集和标定的冰箱内食材的图像作为训练图像,基于所述训练图像根据所述深度学习算法确定模型参数以获得所述食材模型。Further, the food material model is obtained by training: taking an image of the foodstuff in the refrigerator that is pre-acquired and calibrated as a training image, and determining a model parameter according to the depth learning algorithm based on the training image to obtain the food material model.
具体地,所述食材模型包括卷积神经网络、递归神经网络和循环神经网络。Specifically, the food material model includes a convolutional neural network, a recurrent neural network, and a circulating neural network.
本发明另一方面提出的食材识别方法,包括以下步骤:获得冰箱内食材的图像;根据食材模型对所述图像进行识别以获得食材信息,其中,所述食材模型为通过深度学习算法训练而获得的神经网络。A food material identification method according to another aspect of the present invention comprises the steps of: obtaining an image of the foodstuff in the refrigerator; and identifying the image according to the food material model to obtain the food material information, wherein the food material model is obtained by training through a deep learning algorithm Neural network.
本发明实施例的食材识别方法,通过采用深度学习方法来识别食材信息,识别率提高,可以更好地应对冰箱内比较复杂的食材识别。According to the food material identification method of the embodiment of the present invention, by using the deep learning method to identify the food material information, the recognition rate is improved, and the more complicated food material identification in the refrigerator can be better dealt with.
其中,所述食材模型通过以下方式训练来获得:以预先采集和标定的冰箱内食材的图像作为训练图像,基于所述训练图像根据所述深度学习算法来确定模型参数以获得所述食材模型。Wherein, the food material model is obtained by training: taking an image of the foodstuff in the refrigerator collected and calibrated in advance as a training image, and determining a model parameter according to the depth learning algorithm based on the training image to obtain the food material model.
具体地,所述食材模型包括卷积神经网络、递归神经网络和循环神经网络中的一种。Specifically, the food material model includes one of a convolutional neural network, a recurrent neural network, and a circulating neural network.
为了解决上述问题,本发明又一方面实施例提出的食材模型训练方法,包括以下步骤:获得预先采集和标定的冰箱内食材的图像作为训练图像;根据深度学习算法对所述训练图像进行处理来确定模型参数以获得食材模型。In order to solve the above problems, a food material model training method according to another embodiment of the present invention includes the steps of: obtaining an image of a food material in a refrigerator that is pre-acquired and calibrated as a training image; and processing the training image according to a depth learning algorithm. Determine model parameters to obtain a food model.
本发明实施例的食材模型训练方法,通过采用深度学习算法,以冰箱内食材图像作为输入数据,获得食材模型可以应用于冰箱内食材的识别,提高识别率。The food material model training method according to the embodiment of the present invention uses the deep learning algorithm to take the food image in the refrigerator as input data, and obtains the food material model which can be applied to the identification of the food in the refrigerator to improve the recognition rate.
具体地,所述食材模型包括卷积神经网络、递归神经网络和循环神经网络中的一种。Specifically, the food material model includes one of a convolutional neural network, a recurrent neural network, and a circulating neural network.
其中,所述卷积神经网络包括卷积层、池化层、激励层和全连接层,第一层的输入特征为所述训练图像,每层的输出特征作为下一层的输入特征,根据深度学习算法对所述训练图像进行处理进一步包括:卷积层对于输入特征通过卷积运算进行特征压缩;池化层对输入特征进行池化处理;激励层将输入特征通过激励函数得到输出特征,并对所述输出特征进行归一化处理;全连接层的输入特征和输出特征所有结点之间有权重连接。其中,将输入数据进行池化(降采样),可以减少参数的冗余度。输入经过激励函数之后的数据进行归一化处理,可以提高反向传播的有效性,提高模型泛化能力。Wherein the convolutional neural network comprises a convolutional layer, a pooling layer, an excitation layer and a fully connected layer, the input feature of the first layer is the training image, and the output features of each layer are used as input features of the next layer, according to The depth learning algorithm processing the training image further comprises: convolution layer performing feature compression on the input feature by convolution operation; the pooling layer performing pooling processing on the input feature; and the excitation layer obtaining the output feature by using the input feature through the excitation function, The output features are normalized; the input features of the fully connected layer and the output features are all reconnected between all nodes. Among them, the input data is pooled (downsampled), which can reduce the redundancy of parameters. Normalizing the data after inputting the excitation function can improve the effectiveness of backpropagation and improve the generalization ability of the model.
通常地,所述卷积神经网络包括多个的卷积层、池化层和激励层,以及所述卷积神经网络尾部的全连接层,其中,卷积层的不同卷积核获取不同的输出特征。对输入参数通过不同的卷积核进行多次卷积运算,参数更加全局化。Generally, the convolutional neural network includes a plurality of convolutional layers, a pooling layer, and an excitation layer, and a fully connected layer of the tail of the convolutional neural network, wherein different convolutional cores of the convolutional layer acquire different Output characteristics. The input parameters are convoluted multiple times through different convolution kernels, and the parameters are more global.
另外,该食材模型训练方法还包括:根据实际食材信息对所述食材模型的模型参数进行调整,可以保证模型的泛化能力和学习能力。In addition, the food material model training method further comprises: adjusting the model parameters of the food material model according to the actual food material information, thereby ensuring the generalization ability and the learning ability of the model.
为了解决上述问题,本发明再一方面提出的冰箱,包括:图像采集装置,所述图像采集 装置用于采用冰箱内食材的图像,并将所述图像传输至服务器,以使所述服务器根据食材模型对所述图像进行识别以获得食材信息,其中,所述食材模型为通过深度学习算法训练而获得的神经网络。In order to solve the above problems, a refrigerator according to still another aspect of the present invention includes: an image collecting device, the image capturing The apparatus is for taking an image of the foodstuff in the refrigerator and transmitting the image to a server, so that the server identifies the image according to the foodstuff model to obtain the foodstuff information, wherein the foodstuff model is trained by a deep learning algorithm And get the neural network.
本发明实施例的冰箱,通过图像采集装置将采集的食材的图像发送至服务器,为服务器采用深度学习算法对冰箱内食材的识别提供数据基础。The refrigerator of the embodiment of the invention sends the image of the collected food material to the server through the image collecting device, and provides a data foundation for the server to use the deep learning algorithm to identify the food material in the refrigerator.
进一步地,所述图像采集装置包括:摄像模块,用于采集所述冰箱内食材的图像;通信模块,用于将所述冰箱内食材的图像发送至所述服务器;和控制模块,用于分别对所述摄像模块和所述通信模块进行控制。Further, the image collection device includes: a camera module for collecting an image of the foodstuff in the refrigerator; a communication module, configured to send an image of the foodstuff in the refrigerator to the server; and a control module, respectively Controlling the camera module and the communication module.
所述图像采集装置还包括:照明模块,用于对冰箱内食材所处环境进行照明,更加方便采集食材图像。The image collecting device further includes: a lighting module, configured to illuminate an environment in which the foodstuff in the refrigerator is located, and more conveniently collect the image of the foodstuff.
具体地,所述控制模块,在接收到所述冰箱的控制器发出的关门信号之后,控制所述摄像模块采集所述冰箱内食材的图像,并控制所述通信模块将所述图像发送至所述服务器。Specifically, the control module, after receiving the door closing signal sent by the controller of the refrigerator, controls the camera module to collect an image of the foodstuff in the refrigerator, and controls the communication module to send the image to the office Said server.
为了解决上述问题,本发明又一方面提出的服务器,所述服务器,获得所述冰箱内食材的图像,并根据食材模型对所述图像进行识别以获得食材信息,其中,所述食材模型为通过深度学习算法训练而获得的神经网络。In order to solve the above problems, a server according to still another aspect of the present invention, the server obtains an image of the foodstuff in the refrigerator, and identifies the image according to the food material model to obtain the food material information, wherein the food material model passes Neural network obtained by deep learning algorithm training.
本发明实施例的服务器,基于其强大的运算能力,将深度学习算法应用于图像识别,方法简单,识别率提高,可以更好地应用于对冰箱内数量较多的食材的识别。The server of the embodiment of the present invention applies the deep learning algorithm to image recognition based on its powerful computing capability, and has a simple method and improved recognition rate, and can be better applied to the identification of a large number of food materials in the refrigerator.
其中,所述食材模型通过以下方式训练来获得:以预先采集和标定的冰箱内食材的图像作为训练图像,基于所述训练图像根据所述深度学习算法来确定模型参数以获得所述食材模型。Wherein, the food material model is obtained by training: taking an image of the foodstuff in the refrigerator collected and calibrated in advance as a training image, and determining a model parameter according to the depth learning algorithm based on the training image to obtain the food material model.
具体地,所述食材模型包括卷积神经网络、递归神经网络和循环神经网络中的一种。Specifically, the food material model includes one of a convolutional neural network, a recurrent neural network, and a circulating neural network.
附图说明DRAWINGS
图1是根据本发明实施例的食材识别系统的框图;1 is a block diagram of a food material identification system in accordance with an embodiment of the present invention;
图2是根据本发明的一个实施例的食材识别系统的框图;2 is a block diagram of a food material identification system in accordance with one embodiment of the present invention;
图3是根据本发明的另一个实施例的冰箱的框图;Figure 3 is a block diagram of a refrigerator in accordance with another embodiment of the present invention;
图4是根据本发明实施例的食材识别方法的流程图;4 is a flow chart of a method for identifying a foodstuff according to an embodiment of the present invention;
图5是根据本发明实施例的食材模型训练方法的流程图;FIG. 5 is a flowchart of a food material model training method according to an embodiment of the present invention; FIG.
图6是根据本发明的一个具体实施例的基于卷积神经网络算法的处理流程图;6 is a process flow diagram of a convolutional neural network based algorithm in accordance with an embodiment of the present invention;
图7是根据本发明的另一个具体实施例的对训练图像进行特征提取的流程图;7 is a flow chart of feature extraction of a training image in accordance with another embodiment of the present invention;
图8是根据本发明的又一个具体实施例的食材模型训练过程的流程图;Figure 8 is a flow chart of a food material model training process in accordance with yet another embodiment of the present invention;
图9是根据本发明的一个实施例的冰箱的框图;以及 Figure 9 is a block diagram of a refrigerator in accordance with one embodiment of the present invention;
图10是根据本发明的另一个实施例的冰箱的框图。Figure 10 is a block diagram of a refrigerator in accordance with another embodiment of the present invention.
具体实施方式detailed description
下面详细描述本发明的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,旨在用于解释本发明,而不能理解为对本发明的限制。The embodiments of the present invention are described in detail below, and the examples of the embodiments are illustrated in the drawings, wherein the same or similar reference numerals are used to refer to the same or similar elements or elements having the same or similar functions. The embodiments described below with reference to the drawings are intended to be illustrative of the invention and are not to be construed as limiting.
下面参照附图描述根据本发明实施例提出的食材识别系统、食材识别方法、食材模型训练方法和冰箱和服务器。The food material identification system, the food material identification method, the food material model training method, and the refrigerator and the server according to an embodiment of the present invention are described below with reference to the accompanying drawings.
图1是根据本发明的一个实施例的食材识别系统的框图,如图1所示,该食材识别系统1000包括冰箱100和服务器200。1 is a block diagram of a food material identification system according to an embodiment of the present invention. As shown in FIG. 1, the food material identification system 1000 includes a refrigerator 100 and a server 200.
冰箱100包括图像采集装置10,图像采集装置10用于采集冰箱内食材的图像。冰箱100与服务器200可以进行数据交互。服务器200获得冰箱内食材的图像,并根据食材模型对获得图像进行识别以获得食材信息,例如食材的品类及其坐标,其中,食材模型为通过深度学习算法训练而获得的神经网络,将深度学习算法引入冰箱内食材的识别,通过增加神经网络层数可以更好地输出和输入的映射关系,提高识别准确度。The refrigerator 100 includes an image capture device 10 for collecting images of foodstuffs in the refrigerator. The refrigerator 100 and the server 200 can perform data interaction. The server 200 obtains an image of the foodstuff in the refrigerator, and identifies the obtained image according to the food material model to obtain the food material information, such as the category of the foodstuff and its coordinates, wherein the food material model is a neural network obtained by training the deep learning algorithm, and the deep learning is performed. The algorithm introduces the identification of the food in the refrigerator, and by increasing the number of neural network layers, the mapping relationship between output and input can be better, and the recognition accuracy is improved.
本发明实施例的食材识别系统1000,将本地拍照和远程识别结合,利用服务器200的运算能力,将深度学习算法应用于图像识别,方法简单,识别率提高,可以更好地应用于对冰箱100内数量较多的食材的识别。The food material identification system 1000 of the embodiment of the present invention combines local photographing and remote recognition, and utilizes the computing power of the server 200 to apply the deep learning algorithm to image recognition. The method is simple, the recognition rate is improved, and the method can be better applied to the refrigerator 100. Identification of a large number of ingredients.
进一步地,如图2所示,图像采集装置10包括摄像模块11、通信模块12和控制模块13。其中,摄像模块11用于采集冰箱100内食材的图像;通信模块12用于将冰箱100内食材的图像发送至服务器200;控制模块13用于分别对摄像模块11和通信模块12进行控制。Further, as shown in FIG. 2, the image capture device 10 includes a camera module 11, a communication module 12, and a control module 13. The camera module 11 is configured to collect an image of the foodstuff in the refrigerator 100; the communication module 12 is configured to send an image of the foodstuff in the refrigerator 100 to the server 200; and the control module 13 is configured to respectively control the camera module 11 and the communication module 12.
如图2所示,图像采集装置10还包括照明模块14,照明模块14用于对冰箱100内食材所处环境进行照明,方便摄像模块11拍摄以获得食材图像。As shown in FIG. 2, the image capture device 10 further includes a lighting module 14 for illuminating the environment in which the foodstuffs in the refrigerator 100 are located, so that the camera module 11 can capture the image of the foodstuff.
其中,控制模块13在接收到冰箱100的控制器发出的关门信号之后,控制摄像模块11采集冰箱内食材的图像,并控制通信模块12将该图像发送至服务器200。The control module 13 controls the camera module 11 to collect an image of the foodstuff in the refrigerator after receiving the door closing signal from the controller of the refrigerator 100, and controls the communication module 12 to send the image to the server 200.
具体地,如图3所示,相较于传统冰箱,本发明实施例中的冰箱,在硬件上可以设置图像采集装置10,包括控制模块13例如CPU(Central Processing Unit,中央处理器)及其相关外设,冰箱100的控制电路01通过控制器02例如MCU(Microcontroller Unit,微控制单元)进行控制,CPU与MCU可以进行数据交互,CPU外围电路可以挂载相关的外设,在本发明的实施例中,相关外设包括照明模块14、摄像模块11和通信模块12例如WIFI模块。其中,CPU控制摄像模块11配合照明模块14完成本地食材图像的拍摄。在进行食材识别过程中,当MCU检测到关门信号时,意味着冰箱内的食材可能会发生变化,此时,控制模块 13控制摄像模块11和照明模块14协同拍摄图像,将拍摄的图像通过WIFI模块将拍摄的图像上传至服务器200,服务器200进行食材信息的识别,之后,服务器200可以件识别信息反馈给本地或者提供给相关人员,获得识别结果。Specifically, as shown in FIG. 3, the refrigerator in the embodiment of the present invention can be provided with an image capturing device 10, including a control module 13, such as a CPU (Central Processing Unit), and a refrigerator thereof. Related peripherals, the control circuit 01 of the refrigerator 100 is controlled by a controller 02 such as an MCU (Microcontroller Unit), the CPU and the MCU can perform data interaction, and the CPU peripheral circuit can mount related peripherals, in the present invention. In an embodiment, the related peripherals include a lighting module 14, a camera module 11 and a communication module 12 such as a WIFI module. The CPU controls the camera module 11 to cooperate with the illumination module 14 to complete the shooting of the local food image. In the process of identifying the ingredients, when the MCU detects the door closing signal, it means that the ingredients in the refrigerator may change. At this time, the control module 13 controlling the camera module 11 and the illumination module 14 to jointly capture an image, uploading the captured image to the server 200 through the WIFI module, and the server 200 performs identification of the food material information, after which the server 200 can feed the identification information to the local or provide Give relevant personnel the recognition results.
在本发明的实施例中,部署于服务器200上的食材模型可以通过以下方式训练来获得:以预先采集和标定的冰箱内食材的图像作为训练图像,基于该训练图像根据深度学习算法确定模型参数以获得食材模型。其中,食材模型可以包括卷积神经网络、递归神经网络和循环神经网络中的一种,可以通过BP算法确定模型参数。In an embodiment of the present invention, the food product model deployed on the server 200 can be obtained by training: taking an image of the food in the refrigerator that is pre-acquired and calibrated as a training image, and determining a model parameter according to the depth learning algorithm based on the training image. To get a model of the ingredients. Among them, the food material model may include one of a convolutional neural network, a recurrent neural network, and a cyclic neural network, and the model parameters may be determined by a BP algorithm.
具体来说,食材模型训练可以在离线端完成,在冰箱内或者模拟冰箱内环境下拍摄食材图像并经过手工标记各种食材的位置,通过深度学习算法的操作获得输入矢量,进而与预先的标定结果进行全连接,通过大量的训练图像进行训练确定模型的参数,相当于确定函数关系中的系数,参数训练完成即获得食材模型,将食材模型部署在服务器200上。对于食材模型的训练过程以下有详细说明。Specifically, the food model training can be completed on the off-line side, taking the image of the food in the refrigerator or in the simulated refrigerator environment and manually marking the position of various ingredients, obtaining the input vector through the operation of the deep learning algorithm, and then pre-calibrating As a result, the whole connection is performed, and the parameters of the model are determined by training a large number of training images, which is equivalent to determining the coefficient in the functional relationship. When the parameter training is completed, the food material model is obtained, and the food material model is deployed on the server 200. The training process for the food model is described in detail below.
下面参照附图描述根据本发明另一方面实施例的食材识别方法。A food material identification method according to another embodiment of the present invention will now be described with reference to the accompanying drawings.
图4是根据本发明实施例的食材识别方法的流程图,如图4所示,该食材识别方法包括以下步骤:4 is a flow chart of a method for identifying a food material according to an embodiment of the present invention. As shown in FIG. 4, the food material identification method includes the following steps:
S1,获得冰箱内食材的图像。例如,通过图像采集装置采集冰箱内食材图像,并发送至服务器。S1, obtaining an image of the food in the refrigerator. For example, an image of the food in the refrigerator is collected by an image acquisition device and sent to a server.
S2,根据食材模型对图像进行识别以获得食材信息,其中,食材模型为通过深度学习算法训练而获得的神经网络。S2, the image is identified according to the food material model to obtain the food material information, wherein the food material model is a neural network obtained by training through a deep learning algorithm.
具体地,食材模型通过以下方式训练来获得:以预先采集和标定的冰箱内食材的图像作为训练图像,基于该训练图像根据深度学习算法确定模型参数以获得食材模型。其中,食材模型可以包括卷积神经网络、递归神经网络和循环神经网络中的一种,可以通过BP算法确定模型参数。Specifically, the food material model is obtained by training in the following manner: an image of the foodstuff in the refrigerator that is pre-acquired and calibrated is used as a training image, and the model parameter is determined based on the training image according to the depth learning algorithm to obtain the food material model. Among them, the food material model may include one of a convolutional neural network, a recurrent neural network, and a cyclic neural network, and the model parameters may be determined by a BP algorithm.
本发明实施例的食材识别方法,通过采用深度学习方法来识别食材信息,识别率提高,方法简单,可以更好地应对冰箱内比较复杂的食材识别。The food material identification method of the embodiment of the invention identifies the food material information by using the deep learning method, and the recognition rate is improved, and the method is simple, and the complex material identification in the refrigerator can be better dealt with.
下面对本发明实施例的食材模型训练方法进行说明。图5是根据本发明的实施例的食材模型训练方法的流程图,如图5所示,该食材模型训练方法包括以下步骤:The food material model training method of the embodiment of the present invention will be described below. FIG. 5 is a flowchart of a food material model training method according to an embodiment of the present invention. As shown in FIG. 5, the food material model training method includes the following steps:
S10,获得预先采集和标定的冰箱内食材的图像作为训练图像,例如通过冰箱的图像采集装置采集冰箱内食材图像或者拍摄模拟冰箱环境下的图像作为训练图像,并标定食材的位置,以及上传至食材模型训练端,食材模型训练可以在离线端进行。S10, obtaining an image of the foodstuff in the refrigerator collected and calibrated in advance as a training image, for example, collecting an image of the foodstuff in the refrigerator by using an image collecting device of the refrigerator or photographing an image in a simulated refrigerator environment as a training image, and calibrating the position of the foodstuff, and uploading to the image The food model training end, the food model training can be carried out at the offline end.
S20,根据深度学习算法对训练图像进行处理来确定模型参数以获得食材模型。S20. The training image is processed according to a depth learning algorithm to determine a model parameter to obtain a food material model.
在本发明的实施例中,食材模型可以包括卷积神经网络、递归神经网络和循环神经网络 中的一种。本领域技术人员了解,神经网络的模型参数的训练过程可以通过BP算法实现,该算法已经多有记载,在这里不再赘述。对于不同的神经网络有不同的处理流程,其在不同数据集上表现的不同主要是由神经网络的结构决定。对于相同的神经网络结构其输入数据不同,其模型参数也是不同的。在本发明实施例中,食材模型由冰箱内食材图像作为输入数据,根据深度学习方法确定模型参数,进而将该食材模型部署于识别服务器上,应用于冰箱内食材识别,可以提高识别率,方便复杂情况的识别。In an embodiment of the invention, the food material model may include a convolutional neural network, a recurrent neural network, and a circulating neural network. One of them. Those skilled in the art understand that the training process of the model parameters of the neural network can be implemented by the BP algorithm, which has been described in many cases and will not be described here. There are different processing flows for different neural networks, and their differences in different data sets are mainly determined by the structure of the neural network. The input parameters are different for the same neural network structure, and the model parameters are also different. In the embodiment of the present invention, the food material model uses the image of the food material in the refrigerator as input data, determines the model parameters according to the deep learning method, and then deploys the food material model on the identification server, and applies the food material identification in the refrigerator to improve the recognition rate and facilitate Identification of complex situations.
本发明实施例的食材模型训练方法,通过采用深度学习算法,以冰箱内食材图像作为输入数据,获得食材模型可以应用于冰箱内食材的识别,识别率提高。According to the food material model training method of the embodiment of the present invention, the food material model in the refrigerator can be used as the input data by using the deep learning algorithm, and the food material model can be applied to the identification of the foodstuff in the refrigerator, and the recognition rate is improved.
下面以卷积神经网络为例,对食材模型即食材神经网络的训练过程进行说明。基于卷积神经网络的深度学习方法,通过深层的网络可以更好地拟合输入食材图像和输出识别结果之间的关系,具有较高的识别率。Taking the convolutional neural network as an example, the training process of the food model, namely the food neural network, is described. Based on the deep learning method of convolutional neural network, the deep network can better fit the relationship between the input food image and the output recognition result, and has a high recognition rate.
在本发明的实施例中,卷积神经网络包括卷积层、池化层、激励层和全连接层,第一层的输入特征为训练图像,每层的输出特征作为下一层的输入特征,根据深度学习算法对训练图像进行处理进一步包括:卷积层对于输入特征通过卷积运算进行特征压缩;池化层对输入特征进行池化处理,将输入数据进行池化(降采样),可以减少参数的冗余度;激励层将输入特征通过激励函数得到输出特征,并对所述输出特征进行归一化处理,常用的激励函数包括Relu,Maxout,sigmoid等。其中,输入经过激励函数之后的数据进行归一化处理,可以提高反向传播的有效性,提高模型泛化能力;全连接层的输入特征和输出特征所有结点之间有权重连接,即将前一层的输出数据和下一层的输出数据进行全连接,全连接层通常在网络的尾部,此时最后一层的数据为训练图像的标定结果。In an embodiment of the invention, the convolutional neural network comprises a convolutional layer, a pooling layer, an excitation layer and a fully connected layer, the input features of the first layer are training images, and the output features of each layer are used as input features of the next layer. The processing of the training image according to the deep learning algorithm further comprises: the convolution layer performs feature compression on the input feature by convolution operation; the pooling layer performs pooling on the input feature, and the input data is pooled (downsampled), The redundancy of the parameters is reduced; the excitation layer obtains the output features through the excitation function, and normalizes the output features, and the commonly used excitation functions include Relu, Maxout, sigmoid, and the like. Wherein, the input data after the excitation function is normalized, which can improve the effectiveness of backpropagation and improve the generalization ability of the model; the input features of the fully connected layer and the output features have the right to reconnect between all nodes, ie, The output data of one layer and the output data of the next layer are fully connected. The full connection layer is usually at the end of the network, and the data of the last layer is the calibration result of the training image.
概括地说,卷积神经网络的处理流程如图6所示,对于输入的训练图像,首先进行特征提取得到特征图像即进行卷积运算,对于特征图像进行降采样即进行池化处理,继而,归一化处理后得到的特征矢量作为全连接层数据输入到神经网络。通过大量的图像训练确定食材模型的参数,从而获得食材模型。当然,食材模型可以根据实际情况和需要对网络结构进行相应调整和设计。In a nutshell, the processing flow of the convolutional neural network is as shown in FIG. 6. For the input training image, the feature extraction is first performed to obtain the feature image, that is, the convolution operation is performed, and the feature image is downsampled, that is, pooled, and then, The feature vector obtained after the normalization process is input to the neural network as fully connected layer data. The parameters of the food material model are determined by a large amount of image training, thereby obtaining a food material model. Of course, the food model can adjust and design the network structure according to the actual situation and needs.
具体来说,食材模型在采用卷积神经网络结构时,获得冰箱内食材图像即训练图像之后,对训练图像进行卷积运算以获得特征图像,并对该特征图像进行池化处理。通过卷积和池化的操作可以降低输入矢量包含的冗余信息,降低维度,大大地降低运算量。Specifically, when the food material model adopts the convolutional neural network structure, after obtaining the image of the food material in the refrigerator, that is, the training image, the training image is convoluted to obtain a feature image, and the feature image is pooled. Through convolution and pooling operations, the redundant information contained in the input vector can be reduced, the dimension is reduced, and the amount of computation is greatly reduced.
下面对采用卷积神经网络结构的食材模型的训练的各个步骤进一步说明。The various steps of the training of the food material model using the convolutional neural network structure are further explained below.
如图7所示,卷积神经网络的特征提取过程,包括:S710,获得一幅训练图像;S720,采用某种卷积核对训练图像进行卷积运算,S730,增加偏置;S740,获得特征图像。As shown in FIG. 7, the feature extraction process of the convolutional neural network includes: S710, obtaining a training image; S720, using a convolution kernel to perform convolution operation on the training image, S730, adding offset; S740, obtaining features image.
其中,通过选定的卷积核对训练图像进行处理,首先,是基于局部感知野的原理。一般 认为,人对外界的认知是从局部到全局的,而图像的空间联系也是局部的像素联系较为紧密,而距离较远的像素相关性则较弱。因而,每个神经元其实没有必要对全局图像进行感知,只需要对局部进行感知,然后,在更高层将局部的信息综合起来就得到了全局的信息。Among them, the training image is processed by the selected convolution kernel, firstly based on the principle of local perception field. General It is believed that people's perception of the outside world is from local to global, and the spatial connection of images is also a close connection of local pixels, while the correlation of pixels far away is weak. Therefore, it is not necessary for each neuron to perceive the global image. It only needs to perceive the local part. Then, the local information is combined at a higher level to obtain global information.
其次,可以认为图像的各个部分的统计特征是相同的,这也意味着在这一部分学习的特征也能用在另一部分上,所以对于此训练图像上的所有位置,都能使用同样的学习特征。就可以选用同样的卷积核进行处理,得到一副特征图像。Secondly, it can be considered that the statistical features of the various parts of the image are the same, which also means that the features learned in this part can also be used in another part, so the same learning characteristics can be used for all positions on the training image. . The same convolution kernel can be used for processing to obtain a feature image.
通过卷积运算,可以实现对待处理图像的降维。用一种特征描述图像会有信息量的丢失,可以采用不同的卷积核分别对训练图像进行卷积操作,得到一系列的特征图像。Through convolution operations, dimensionality reduction of the image to be processed can be achieved. Using a feature to describe an image has a loss of information. Different convolution kernels can be used to convolve the training image separately to obtain a series of feature images.
在通过卷积运算获得了特征图像之后,下一步可以利用这些特征去做分类。理论上讲,可以用所有提取得到的特征去训练分类器,但是特征图像表达的只是图像的某种特征,依然存在着比较大的冗余度。利用静态图片的空间相关性,可以通过降采样的方式进一步降低冗余度。这种聚合的操作就叫做池化,根据池化的方法,也称为平均池化或者最大池化。After obtaining the feature image by the convolution operation, the next step can be used to classify these features. In theory, all the extracted features can be used to train the classifier, but the feature image expresses only a certain feature of the image, and there is still a large redundancy. With the spatial correlation of static pictures, redundancy can be further reduced by downsampling. This kind of aggregation operation is called pooling. According to the pooling method, it is also called average pooling or maximum pooling.
为了防止训练数据和测试数据的分布不同,导致泛化能力降低的问题,以及激励函数导致数据分布发生变化,在激励层之后将数据进行归一化操作,即将通过激励函数的数据进行归一化处理。同时,如果每批训练数据的分布不同,采用归一化处理也可以使训练速度增加。归一化处理可以简单的将每层的输入参数重新规整至0均值、1为方差的分布。通过归一化处理,可以提高泛化能力,提高训练速度。In order to prevent the distribution of training data and test data from being different, the problem of generalization ability is reduced, and the excitation function causes the data distribution to change, and the data is normalized after the excitation layer, that is, the data of the excitation function is normalized. deal with. At the same time, if the distribution of training data is different, the normalization process can also increase the training speed. Normalization can simply rescale the input parameters of each layer to a mean of 0 and a distribution of variance. Through normalization, generalization ability can be improved and training speed can be improved.
食材模型可以在离线端完成训练,如图8所示,食材模型训练过程包括:S810,训练图像经过手工标记各种食材的位置;S820,卷积运算;S830,池化处理;S840,归一化操作得到输入矢量;S850,将归一化处理后的数据与训练图像的标定结果进行全连接。The food model can be completed at the offline end, as shown in Figure 8. The food model training process includes: S810, the training image is manually marked with the position of various ingredients; S820, convolution operation; S830, pool processing; S840, normalized The input operation obtains an input vector; in S850, the normalized processed data is fully connected with the calibration result of the training image.
参照图6所示,采用的是一层卷积运算,而在实际应用中,通常地,卷积神经网络包括多个的卷积层、池化层和激励层,以及卷积神经网络尾部的全连接层。即言,对训练图像进行多次的卷积运算、池化处理以及归一化,其中,卷积层的不同卷积核可以理解为获取不同的输出特征。采用一层卷积运算获得的特征往往是局部的,层数越高则获得的特征就会越全面化,所以,采用多层卷积可以使得获得特征更加全局化,可以根据需要对食材模型进行特定的设计。Referring to FIG. 6, a layer convolution operation is employed, and in practical applications, the convolutional neural network generally includes a plurality of convolution layers, a pooling layer, and an excitation layer, and a convolutional neural network tail. Fully connected layer. That is to say, the training image is subjected to multiple convolution operations, pooling processes, and normalization, wherein different convolution kernels of the convolutional layer can be understood as acquiring different output features. Features obtained by a layer of convolution operation are often local, and the higher the number of layers, the more comprehensive the features obtained. Therefore, the use of multi-layer convolution can make the features more global, and the food model can be made as needed. Specific design.
另外,可以根据实际食材信息对食材模型的模型参数进行调整。具体地,上传至服务器的食材图像可以通过人工标定的方式确定实际食材信息,并将实际食材信息与识别结果进行对比,对于识别错误的图像,或者之前食材模型中没有包含的食物,可以通过上述食材模型的训练过程进行再训练,来调整食材模型参数。In addition, the model parameters of the food material model can be adjusted based on the actual food material information. Specifically, the image of the food material uploaded to the server may be determined by manual calibration, and the actual food material information may be compared with the recognition result. For the image identifying the error or the food not included in the previous food product model, the above may be adopted. The training process of the food model is retrained to adjust the parameters of the food model.
相较于相关技术中无法应对食物种类的变化、新包装的更换等情况,以及对于识别种类增加,固定的食材模型无法应对,本申请的食材训练方法,可以根据食材信息对食材模型的 模型参数进行调整,适应新的食材种类。另外,相对于相关技术中,对于识别结果没有有效的监测,针对识别率较低的食物无法进行优化,本申请的方法,采用监督学习的方式训练食材模型,并对识别结果进行检测,以及对模型参数调整,可以提高模型的泛化能力和学习能力。Compared with the related art, it is unable to cope with the change of the food type, the replacement of the new package, and the like, and the fixed food model cannot be coped with the increase of the identification type. The food material training method of the present application can be based on the food material information for the food material model. The model parameters are adjusted to suit the new ingredients. In addition, compared with the related art, there is no effective monitoring for the recognition result, and the food with low recognition rate cannot be optimized. The method of the present application uses the supervised learning method to train the food material model, and detects the recognition result, and Model parameter adjustment can improve the generalization ability and learning ability of the model.
下面参照附图描述根据本发明的有一方面实施例提出的冰箱。A refrigerator according to an embodiment of the present invention will be described below with reference to the accompanying drawings.
图9是根据本发明的一个实施例的冰箱的框图,如图9所示,该冰箱100包括图像采集装置10。图像采集装置10用于采用冰箱内食材的图像,并将该图像传输至服务器,以使服务器根据食材模型对图像进行识别以获得食材信息,其中,食材模型为通过深度学习算法训练而获得的神经网络。9 is a block diagram of a refrigerator including an image capture device 10, as shown in FIG. 9, in accordance with an embodiment of the present invention. The image capture device 10 is configured to adopt an image of the foodstuff in the refrigerator, and transmit the image to a server, so that the server recognizes the image according to the food material model to obtain the food material information, wherein the food material model is a nerve obtained by training through a deep learning algorithm. The internet.
本发明实施例的冰箱100,通过图像采集装置10将采集的食材的图像发送至服务器,为服务器采用深度学习算法对冰箱内食材的识别提供数据基础。The refrigerator 100 of the embodiment of the present invention sends an image of the collected food material to the server through the image capturing device 10, and provides a data foundation for the server to use the deep learning algorithm to identify the foodstuff in the refrigerator.
进一步地,如图10所示,该图像采集装置10包括摄像模块11、通信模块12和控制模块13。其中,摄像模块11用于采集冰箱内食材的图像;通信模块12用于将冰箱内食材的图像发送至服务器;控制模块13用于分别对摄像模块11和通信模块12进行控制。Further, as shown in FIG. 10, the image capture device 10 includes a camera module 11, a communication module 12, and a control module 13. The camera module 11 is configured to collect an image of the foodstuff in the refrigerator; the communication module 12 is configured to send an image of the foodstuff in the refrigerator to the server; and the control module 13 is configured to respectively control the camera module 11 and the communication module 12.
参照图10所示,图像采集装置10还包括照明模块14,照明模块14用于对冰箱100内食材所处环境进行照明,方便摄像模块11拍摄以获得食材图像。Referring to FIG. 10, the image capture device 10 further includes a lighting module 14 for illuminating the environment in which the foodstuffs in the refrigerator 100 are located, so that the camera module 11 can be photographed to obtain an image of the foodstuff.
其中,控制模块13在接收到冰箱100的控制器发出的关门信号之后,控制摄像模块11采集冰箱内食材的图像,并控制通信模块12将该图像发送至服务器200。The control module 13 controls the camera module 11 to collect an image of the foodstuff in the refrigerator after receiving the door closing signal from the controller of the refrigerator 100, and controls the communication module 12 to send the image to the server 200.
下面对本发明实施例的服务器进行说明。服务器100中可以部署食材模型。具体来说,服务器200对冰箱内食材进行识别时,服务器100获得冰箱内食材的图像,并根据食材模型对该图像进行识别以获得食材信息,例如食材的品类及其坐标,其中,食材模型为通过深度学习算法训练而获得的神经网络。将深度学习算法引入冰箱内食材的识别,通过增加神经网络层数可以更好地输出和输入的映射关系,提高识别准确度。The server of the embodiment of the present invention will be described below. A food product model can be deployed in the server 100. Specifically, when the server 200 identifies the foodstuff in the refrigerator, the server 100 obtains an image of the foodstuff in the refrigerator, and identifies the image according to the foodstuff model to obtain the foodstuff information, such as the category of the foodstuff and its coordinates, wherein the foodstuff model is A neural network obtained by training in deep learning algorithms. The deep learning algorithm is introduced into the identification of the food in the refrigerator, and the mapping relationship between the output and the input can be better by increasing the number of neural network layers, thereby improving the recognition accuracy.
本发明实施例的服务器200,基于其强大的运算能力,将深度学习算法应用于图像识别,方法简单,识别率提高,可以更好地应用于对冰箱内数量较多的食材的识别。The server 200 of the embodiment of the present invention applies the deep learning algorithm to image recognition based on its powerful computing capability, and has a simple method and improved recognition rate, and can be better applied to the identification of a large number of food materials in the refrigerator.
其中,食材模型通过以下方式训练来获得:以预先采集和标定的冰箱内食材的图像作为训练图像,基于该训练图像根据深度学习算法确定模型参数以获得食材模型。在本发明的实施例中,食材模型可以包括卷积神经网络、递归神经网络和循环神经网络中的一种,可以通过BP算法确定模型参数。The food material model is obtained by training in the following manner: an image of the food in the refrigerator that is pre-acquired and calibrated is used as a training image, and the model parameter is determined according to the depth learning algorithm based on the training image to obtain a food material model. In an embodiment of the present invention, the food material model may include one of a convolutional neural network, a recurrent neural network, and a cyclic neural network, and the model parameters may be determined by a BP algorithm.
具体来说,食材模型训练可以在离线端完成,拍摄的冰箱内的食材图像或者拍摄模拟冰箱环境下的食材图像并经过手工标记各种食材的位置,通过深度学习算法的操作获得输入矢量,进而与预先的标定结果进行全连接,通过大量的图像进行训练确定模型参数,相当于确 定函数关系中的系数,参数训练完成即获得食材模型,将食材模型部署在服务器200上,从而,在进行图像识别时,服务器200可以将获得的冰箱内的食材图像作为输入,根据该食材模型实现对食材的位置和品类的识别,并提供识别结果,识别结果可以反馈会冰箱或者提供给相应人员。Specifically, the food model training can be completed at the offline end, the image of the food in the refrigerator is photographed or the image of the food in the simulated refrigerator environment is photographed, and the position of various ingredients is manually marked, and the input vector is obtained by the operation of the deep learning algorithm, and further Fully connected to the pre-calibrated results, trained through a large number of images to determine model parameters, equivalent to The coefficient in the function relationship is obtained, and the parameter training is completed to obtain the food material model, and the food material model is deployed on the server 200, so that when the image recognition is performed, the server 200 can input the obtained food material image in the refrigerator as an input, according to the food material model. Realize the identification of the location and category of the ingredients, and provide identification results, which can be fed back to the refrigerator or provided to the appropriate personnel.
综上所述,本发明实施例提供了食材识别系统、食材识别方法以及食材模型训练方法和冰箱、服务器,并且将深度学习算法应用在食材识别上,以本地摄像和在线识别结合的方式,利用服务器的运算能力进行图像识别。其中,在食材模型训练阶段,采集大量食材图像,对图像中的食材进行标记例如4点标记,对食材在空间坐标信息和像素信息经过处理,为各种食材建立深度神经网络模型。在食材模型训练过程中,采用特征映射的方法提取图像的各种特征参数,即保留图像的特征,并去处冗余度,大幅度减小运算。在识别阶段,将本地图像上传后,在服务器端匹配食材模型,生成食材以及其坐标。在检验阶段,通过人工标定被测图像的食材和位置,对比服务器的识别结果,得到识别率结果,并通过再训练调整模型参数,提升实际性能。In summary, the embodiments of the present invention provide a food material identification system, an food material identification method, a food material model training method, a refrigerator, and a server, and apply a deep learning algorithm to the food material identification, using a combination of local camera and online recognition. The computing power of the server performs image recognition. Among them, in the training stage of the food material model, a large number of food material images are collected, and the food ingredients in the image are marked, for example, 4-point mark, and the spatial coordinate information and the pixel information are processed for the food material to establish a deep neural network model for various food materials. In the training process of the food material model, the feature mapping method is used to extract various characteristic parameters of the image, that is, the features of the image are preserved, and the redundancy is removed, and the operation is greatly reduced. In the identification phase, after uploading the local image, the food product model is matched on the server side to generate the food and its coordinates. In the inspection stage, by manually calibrating the food and position of the image to be measured, comparing the recognition results of the server, the recognition rate result is obtained, and the model parameters are adjusted through retraining to improve the actual performance.
需要说明的是,在本说明书的描述中,流程图中或在此以其他方式描述的任何过程或方法描述可以被理解为,表示包括一个或更多个用于实现特定逻辑功能或过程的步骤的可执行指令的代码的模块、片段或部分,并且本发明的优选实施方式的范围包括另外的实现,其中可以不按所示出或讨论的顺序,包括根据所涉及的功能按基本同时的方式或按相反的顺序,来执行功能,这应被本发明的实施例所属技术领域的技术人员所理解。It should be noted that, in the description of the specification, any process or method description in the flowcharts or otherwise described herein may be understood to include one or more steps for implementing a particular logic function or process. Modules, segments or portions of code of executable instructions, and the scope of preferred embodiments of the invention includes additional implementations, which may not be in the order shown or discussed, including in a substantially simultaneous manner depending on the functionality involved. The functions are performed in the reverse order, which should be understood by those skilled in the art to which the embodiments of the present invention pertain.
在流程图中表示或在此以其他方式描述的逻辑和/或步骤,例如,可以被认为是用于实现逻辑功能的可执行指令的定序列表,可以具体实现在任何计算机可读介质中,以供指令执行系统、装置或设备(如基于计算机的系统、包括处理器的系统或其他可以从指令执行系统、装置或设备取指令并执行指令的系统)使用,或结合这些指令执行系统、装置或设备而使用。就本说明书而言,"计算机可读介质"可以是任何可以包含、存储、通信、传播或传输程序以供指令执行系统、装置或设备或结合这些指令执行系统、装置或设备而使用的装置。计算机可读介质的更具体的示例(非穷尽性列表)包括以下:具有一个或多个布线的电连接部(电子装置),便携式计算机盘盒(磁装置),随机存取存储器(RAM),只读存储器(ROM),可擦除可编辑只读存储器(EPROM或闪速存储器),光纤装置,以及便携式光盘只读存储器(CDROM)。另外,计算机可读介质甚至可以是可在其上打印所述程序的纸或其他合适的介质,因为可以例如通过对纸或其他介质进行光学扫描,接着进行编辑、解译或必要时以其他合适方式进行处理来以电子方式获得所述程序,然后将其存储在计算机存储器中。The logic and/or steps represented in the flowchart or otherwise described herein, for example, may be considered as an ordered list of executable instructions for implementing logical functions, and may be embodied in any computer readable medium, Used in conjunction with, or in conjunction with, an instruction execution system, apparatus, or device (eg, a computer-based system, a system including a processor, or other system that can fetch instructions and execute instructions from an instruction execution system, apparatus, or device) Or use with equipment. For the purposes of this specification, a "computer-readable medium" can be any apparatus that can contain, store, communicate, propagate, or transport a program for use in an instruction execution system, apparatus, or device, or in conjunction with the instruction execution system, apparatus, or device. More specific examples (non-exhaustive list) of computer readable media include the following: electrical connections (electronic devices) having one or more wires, portable computer disk cartridges (magnetic devices), random access memory (RAM), Read only memory (ROM), erasable editable read only memory (EPROM or flash memory), fiber optic devices, and portable compact disk read only memory (CDROM). In addition, the computer readable medium may even be a paper or other suitable medium on which the program can be printed, as it may be optically scanned, for example by paper or other medium, followed by editing, interpretation or, if appropriate, other suitable The method is processed to obtain the program electronically and then stored in computer memory.
应当理解,本发明的各部分可以用硬件、软件、固件或它们的组合来实现。在上述实施方式中,多个步骤或方法可以用存储在存储器中且由合适的指令执行系统执行的软件或固件 来实现。例如,如果用硬件来实现,和在另一实施方式中一样,可用本领域公知的下列技术中的任一项或他们的组合来实现:具有用于对数据信号实现逻辑功能的逻辑门电路的离散逻辑电路,具有合适的组合逻辑门电路的专用集成电路,可编程门阵列(PGA),现场可编程门阵列(FPGA)等。It should be understood that portions of the invention may be implemented in hardware, software, firmware or a combination thereof. In the above embodiments, multiple steps or methods may be implemented in software or firmware stored in a memory and executed by a suitable instruction execution system to realise. For example, if implemented in hardware, as in another embodiment, it can be implemented by any one or combination of the following techniques well known in the art: having logic gates for implementing logic functions on data signals. Discrete logic circuits, application specific integrated circuits with suitable combinational logic gates, programmable gate arrays (PGAs), field programmable gate arrays (FPGAs), etc.
本技术领域的普通技术人员可以理解实现上述实施例方法携带的全部或部分步骤是可以通过程序来指令相关的硬件完成,所述的程序可以存储于一种计算机可读存储介质中,该程序在执行时,包括方法实施例的步骤之一或其组合。One of ordinary skill in the art can understand that all or part of the steps carried by the method of implementing the above embodiments can be completed by a program to instruct related hardware, and the program can be stored in a computer readable storage medium. When executed, one or a combination of the steps of the method embodiments is included.
在本说明书的描述中,参考术语“一个实施例”、“一些实施例”、“示例”、“具体示例”、或“一些示例”等的描述意指结合该实施例或示例描述的具体特征、结构、材料或者特点包含于本发明的至少一个实施例或示例中。在本说明书中,对上述术语的示意性表述不必须针对的是相同的实施例或示例。而且,描述的具体特征、结构、材料或者特点可以在任一个或多个实施例或示例中以合适的方式结合。此外,在不相互矛盾的情况下,本领域的技术人员可以将本说明书中描述的不同实施例或示例以及不同实施例或示例的特征进行结合和组合。In the description of the present specification, the description with reference to the terms "one embodiment", "some embodiments", "example", "specific example", or "some examples" and the like means a specific feature described in connection with the embodiment or example. A structure, material or feature is included in at least one embodiment or example of the invention. In the present specification, the schematic representation of the above terms is not necessarily directed to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in a suitable manner in any one or more embodiments or examples. In addition, various embodiments or examples described in the specification, as well as features of various embodiments or examples, may be combined and combined.
尽管上面已经示出和描述了本发明的实施例,可以理解的是,上述实施例是示例性的,不能理解为对本发明的限制,本领域的普通技术人员在本发明的范围内可以对上述实施例进行变化、修改、替换和变型。 Although the embodiments of the present invention have been shown and described, it is understood that the above-described embodiments are illustrative and are not to be construed as limiting the scope of the invention. The embodiments are subject to variations, modifications, substitutions and variations.

Claims (21)

  1. 一种食材识别系统,其特征在于,包括:An food material identification system, comprising:
    冰箱,所述冰箱包括图像采集装置,所述图像采集装置用于采集冰箱内食材的图像;a refrigerator, the refrigerator includes an image capture device, and the image capture device is configured to collect an image of the foodstuff in the refrigerator;
    服务器,所述服务器获得所述冰箱内食材的图像,并根据食材模型对所述图像进行识别以获得食材信息,所述食材模型为通过深度学习算法训练而获得的神经网络。a server that obtains an image of the foodstuff in the refrigerator and identifies the image according to a food product model, the food material model being a neural network obtained by training through a deep learning algorithm.
  2. 如权利要求1所述的食材识别系统,其特征在于,所述图像采集装置包括:The food material identification system of claim 1 wherein said image capture device comprises:
    摄像模块,用于采集所述冰箱内食材的图像;a camera module, configured to collect an image of the foodstuff in the refrigerator;
    通信模块,用于将所述冰箱内食材的图像发送至所述服务器;和a communication module, configured to send an image of the foodstuff in the refrigerator to the server; and
    控制模块,用于分别对所述摄像模块和所述通信模块进行控制。And a control module, configured to separately control the camera module and the communication module.
  3. 如权利要求2所述的食材识别系统,其特征在于,所述图像采集装置还包括:The food material identification system of claim 2, wherein the image capture device further comprises:
    照明模块,用于对冰箱内食材所处环境进行照明。A lighting module for illuminating the environment in which the ingredients in the refrigerator are located.
  4. 如权利要求2所述的食材识别系统,其特征在于,所述控制模块,在接收到所述冰箱的控制器发出的关门信号之后,控制所述摄像模块采集所述冰箱内食材的图像,并控制所述通信模块将所述图像发送至所述服务器。The food material identification system according to claim 2, wherein the control module controls the camera module to collect an image of the foodstuff in the refrigerator after receiving a door closing signal from the controller of the refrigerator, and The communication module is controlled to send the image to the server.
  5. 如权利要求1所述的食材识别系统,其特征在于,所述食材模型通过以下方式训练来获得:The food material identification system according to claim 1, wherein the food material model is obtained by training in the following manner:
    以预先采集和标定的所述冰箱内食材的图像作为训练图像,基于所述训练图像根据所述深度学习算法确定模型参数以获得所述食材模型。An image of the foodstuff in the refrigerator pre-acquired and calibrated is used as a training image, and model parameters are determined according to the depth learning algorithm based on the training image to obtain the food material model.
  6. 如权利要求5所述的食材识别系统,其特征在于,所述食材模型包括卷积神经网络、递归神经网络和循环神经网络中的一种。The food material identification system according to claim 5, wherein the food material model comprises one of a convolutional neural network, a recurrent neural network, and a circulating neural network.
  7. 一种食材识别方法,其特征在于,包括以下步骤:A food material identification method, comprising the steps of:
    获得冰箱内食材的图像;Obtain an image of the ingredients in the refrigerator;
    根据食材模型对所述图像进行识别以获得食材信息,其中,所述食材模型为通过深度学习算法训练而获得的神经网络。The image is identified based on a food product model, wherein the food material model is a neural network obtained by training through a deep learning algorithm.
  8. 如权利要求7所述的食材识别方法,其特征在于,所述食材模型通过以下方式训练来获得:The food material identifying method according to claim 7, wherein the food material model is obtained by training in the following manner:
    以预先采集和标定的所述冰箱内食材的图像作为训练图像,基于所述训练图像根据所述深度学习算法确定模型参数以获得所述食材模型。An image of the foodstuff in the refrigerator pre-acquired and calibrated is used as a training image, and model parameters are determined according to the depth learning algorithm based on the training image to obtain the food material model.
  9. 如权利要求8所述的食材识别方法,其特征在于,所述食材模型包括卷积神经网络、递归神经网络和循环神经网络中的一种。The food material identifying method according to claim 8, wherein the food material model comprises one of a convolutional neural network, a recurrent neural network, and a circulating neural network.
  10. 一种食材模型训练方法,其特征在于,包括以下步骤: A food material model training method, comprising the steps of:
    获得预先采集和标定的冰箱内食材的图像作为训练图像;Obtaining an image of the foodstuff in the refrigerator that is pre-acquired and calibrated as a training image;
    根据深度学习算法对所述训练图像进行处理来确定模型参数以获得食材模型。The training image is processed according to a depth learning algorithm to determine model parameters to obtain a food product model.
  11. 如权利要求10所述的食材模型训练方法,其特征在于,所述食材模型包括卷积神经网络、递归神经网络和循环神经网络中的一种。The food material model training method according to claim 10, wherein the food material model comprises one of a convolutional neural network, a recurrent neural network, and a circulating neural network.
  12. 如权利要求11所述的食材模型训练方法,其特征在于,其中,所述卷积神经网络包括卷积层、池化层、激励层和全连接层,第一层的输入特征为所述训练图像,每层的输出特征作为下一层的输入特征,根据深度学习算法对所述训练图像进行处理进一步包括:The food material model training method according to claim 11, wherein the convolutional neural network comprises a convolution layer, a pooling layer, an excitation layer and a fully connected layer, and an input characteristic of the first layer is the training The image, the output feature of each layer is used as an input feature of the next layer, and processing the training image according to the depth learning algorithm further includes:
    卷积层对于输入特征通过卷积运算进行特征压缩;The convolution layer performs feature compression on the input features by convolution operations;
    池化层对输入特征进行池化处理;The pooling layer performs pooling processing on input features;
    激励层将输入特征通过激励函数得到输出特征,并对所述输出特征进行归一化处理;The excitation layer obtains an output characteristic by the input characteristic through an excitation function, and normalizes the output feature;
    全连接层的输入特征和输出特征所有结点之间有权重连接。The input features and output features of the fully connected layer have the right to be reconnected between all nodes.
  13. 如权利要求12所述的食材模型训练方法,其特征在于,所述卷积神经网络包括多个的卷积层、池化层和激励层,以及所述卷积神经网络尾部的全连接层,其中,卷积层的不同卷积核获取不同的输出特征。The food material model training method according to claim 12, wherein said convolutional neural network comprises a plurality of convolution layers, a pooling layer and an excitation layer, and a fully connected layer at the tail of said convolutional neural network, Among them, the different convolution kernels of the convolutional layer acquire different output characteristics.
  14. 如权利要求10所述的食材模型训练方法,其特征在于,还包括:根据实际食材信息对所述食材模型的模型参数进行调整。The food material model training method according to claim 10, further comprising: adjusting a model parameter of the food material model based on actual food material information.
  15. 一种冰箱,其特征在于,包括:A refrigerator, comprising:
    图像采集装置,所述图像采集装置用于采用冰箱内食材的图像,并将所述图像传输至服务器,以使所述服务器根据食材模型对所述图像进行识别以获得食材信息,其中,所述食材模型为通过深度学习算法训练而获得的神经网络。An image capture device for taking an image of foodstuffs in a refrigerator and transmitting the image to a server to cause the server to identify the image according to a food product model to obtain food material information, wherein The food material model is a neural network obtained by training through a deep learning algorithm.
  16. 如权利要求15所述的冰箱,其特征在于,所述图像采集装置包括:The refrigerator according to claim 15, wherein said image capture device comprises:
    摄像模块,用于采集所述冰箱内食材的图像;a camera module, configured to collect an image of the foodstuff in the refrigerator;
    通信模块,用于将所述冰箱内食材的图像发送至所述服务器;和a communication module, configured to send an image of the foodstuff in the refrigerator to the server; and
    控制模块,用于分别对所述摄像模块和所述通信模块进行控制。And a control module, configured to separately control the camera module and the communication module.
  17. 如权利要求16所述的冰箱,其特征在于,所述图像采集装置还包括:The refrigerator according to claim 16, wherein the image capture device further comprises:
    照明模块,用于对冰箱内食材所处环境进行照明。A lighting module for illuminating the environment in which the ingredients in the refrigerator are located.
  18. 如权利要求16所述的冰箱,其特征在于,所述控制模块,在接收到所述冰箱的控制器发出的关门信号之后,控制所述摄像模块采集所述冰箱内食材的图像,并控制所述通信模块将所述图像发送至所述服务器。The refrigerator according to claim 16, wherein the control module controls the camera module to collect an image of the foodstuff in the refrigerator after receiving the door closing signal from the controller of the refrigerator, and controls the The communication module transmits the image to the server.
  19. 一种服务器,其特征在于,所述服务器,获得所述冰箱内食材的图像,并根据食材模型对所述图像进行识别以获得食材信息,其中,所述食材模型为通过深度学习算法训练而获得的神经网络。 A server, wherein the server obtains an image of the foodstuff in the refrigerator, and identifies the image according to the food material model to obtain the food material information, wherein the food material model is obtained by training through a deep learning algorithm Neural network.
  20. 如权利要求19所述的服务器,其特征在于,所述食材模型通过以下方式训练来获得:The server according to claim 19, wherein said food material model is obtained by training in the following manner:
    以预先采集和标定的冰箱内食材的图像作为训练图像,基于所述训练图像根据所述深度学习算法确定模型参数以获得所述食材模型。An image of the foodstuff in the refrigerator pre-captured and calibrated is used as a training image, and model parameters are determined based on the training image according to the depth learning algorithm to obtain the food material model.
  21. 如权利要求19所述的服务器,其特征在于,所述食材模型包括卷积神经网络、递归神经网络和循环神经网络中的一种。 The server according to claim 19, wherein said food material model comprises one of a convolutional neural network, a recurrent neural network, and a circulating neural network.
PCT/CN2016/098120 2016-09-05 2016-09-05 System and method for food recognition, food model training method, refrigerator and server WO2018040105A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/098120 WO2018040105A1 (en) 2016-09-05 2016-09-05 System and method for food recognition, food model training method, refrigerator and server

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2016/098120 WO2018040105A1 (en) 2016-09-05 2016-09-05 System and method for food recognition, food model training method, refrigerator and server

Publications (1)

Publication Number Publication Date
WO2018040105A1 true WO2018040105A1 (en) 2018-03-08

Family

ID=61299729

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/098120 WO2018040105A1 (en) 2016-09-05 2016-09-05 System and method for food recognition, food model training method, refrigerator and server

Country Status (1)

Country Link
WO (1) WO2018040105A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110084244A (en) * 2019-03-14 2019-08-02 上海达显智能科技有限公司 Method, smart machine and application based on image recognition object
US20190384990A1 (en) * 2018-06-15 2019-12-19 Samsung Electronics Co., Ltd. Refrigerator, server and method of controlling thereof
CN111797719A (en) * 2020-06-17 2020-10-20 武汉大学 Food component identification method
DE102020207371A1 (en) 2020-06-15 2021-12-16 BSH Hausgeräte GmbH Detection of stored goods in household storage devices
CN114422689A (en) * 2021-12-03 2022-04-29 国网山西省电力公司超高压变电分公司 Hard pressing plate state recognition device and method based on edge intelligence
WO2023125491A1 (en) * 2021-12-30 2023-07-06 青岛海尔电冰箱有限公司 Specific-contour-food management method, storage medium, and refrigerator
US20240068742A1 (en) * 2022-08-24 2024-02-29 Haier Us Appliance Solutions, Inc. Gasket leak detection in a refrigerator appliance
CN117975444A (en) * 2024-03-28 2024-05-03 广东蛟龙电器有限公司 Food material image recognition method for food crusher

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104239433A (en) * 2014-08-27 2014-12-24 百度在线网络技术(北京)有限公司 Networking refrigerator and method and system for recommending information thereof
CN104482715A (en) * 2014-12-31 2015-04-01 合肥华凌股份有限公司 Method for managing food in refrigerator and refrigerator
CN105512676A (en) * 2015-11-30 2016-04-20 华南理工大学 Food recognition method at intelligent terminal
US20160165113A1 (en) * 2013-04-23 2016-06-09 Lg Electronics Inc. Refrigerator And Control Method For The Same
CN105701507A (en) * 2016-01-13 2016-06-22 吉林大学 Image classification method based on dynamic random pooling convolution neural network

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160165113A1 (en) * 2013-04-23 2016-06-09 Lg Electronics Inc. Refrigerator And Control Method For The Same
CN104239433A (en) * 2014-08-27 2014-12-24 百度在线网络技术(北京)有限公司 Networking refrigerator and method and system for recommending information thereof
CN104482715A (en) * 2014-12-31 2015-04-01 合肥华凌股份有限公司 Method for managing food in refrigerator and refrigerator
CN105512676A (en) * 2015-11-30 2016-04-20 华南理工大学 Food recognition method at intelligent terminal
CN105701507A (en) * 2016-01-13 2016-06-22 吉林大学 Image classification method based on dynamic random pooling convolution neural network

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11521391B2 (en) * 2018-06-15 2022-12-06 Samsung Electronics Co., Ltd. Refrigerator, server and method of controlling thereof
US20190384990A1 (en) * 2018-06-15 2019-12-19 Samsung Electronics Co., Ltd. Refrigerator, server and method of controlling thereof
EP4254285A3 (en) * 2018-06-15 2023-11-15 Samsung Electronics Co., Ltd. Refrigerator, server and method of controlling thereof
CN110084244B (en) * 2019-03-14 2023-05-30 上海达显智能科技有限公司 Method for identifying object based on image, intelligent device and application
CN110084244A (en) * 2019-03-14 2019-08-02 上海达显智能科技有限公司 Method, smart machine and application based on image recognition object
DE102020207371A1 (en) 2020-06-15 2021-12-16 BSH Hausgeräte GmbH Detection of stored goods in household storage devices
WO2021254740A1 (en) 2020-06-15 2021-12-23 BSH Hausgeräte GmbH Identifying stored products in domestic storage devices
CN111797719B (en) * 2020-06-17 2022-09-02 武汉大学 Food component identification method
CN111797719A (en) * 2020-06-17 2020-10-20 武汉大学 Food component identification method
CN114422689A (en) * 2021-12-03 2022-04-29 国网山西省电力公司超高压变电分公司 Hard pressing plate state recognition device and method based on edge intelligence
WO2023125491A1 (en) * 2021-12-30 2023-07-06 青岛海尔电冰箱有限公司 Specific-contour-food management method, storage medium, and refrigerator
US20240068742A1 (en) * 2022-08-24 2024-02-29 Haier Us Appliance Solutions, Inc. Gasket leak detection in a refrigerator appliance
CN117975444A (en) * 2024-03-28 2024-05-03 广东蛟龙电器有限公司 Food material image recognition method for food crusher

Similar Documents

Publication Publication Date Title
WO2018040105A1 (en) System and method for food recognition, food model training method, refrigerator and server
US20210227126A1 (en) Deep learning inference systems and methods for imaging systems
US11869227B2 (en) Image recognition method, apparatus, and system and storage medium
CN107798277A (en) Food materials identifying system and method, food materials model training method, refrigerator and server
US10599958B2 (en) Method and system for classifying an object-of-interest using an artificial neural network
US11741736B2 (en) Determining associations between objects and persons using machine learning models
WO2018157873A1 (en) Fine-grained object recognition in robotic systems
WO2018188453A1 (en) Method for determining human face area, storage medium, and computer device
WO2020125623A1 (en) Method and device for live body detection, storage medium, and electronic device
US20180157938A1 (en) Target detection method and apparatus
US20170124400A1 (en) Automatic video summarization
WO2019022798A1 (en) Image capture devices featuring intelligent use of lightweight hardware-generated statistics
US20120274755A1 (en) System and method for human detection and counting using background modeling, hog and haar features
US10726573B2 (en) Object detection method and system based on machine learning
CN113330450A (en) Method for identifying objects in an image
JP2018175226A5 (en)
KR102309111B1 (en) Ststem and method for detecting abnomalous behavior based deep learning
CN112082999A (en) Industrial product defect detection method and industrial intelligent camera
KR101944374B1 (en) Apparatus and method for detecting abnormal object and imaging device comprising the same
KR102274581B1 (en) Method for generating personalized hrtf
KR102135632B1 (en) Neural processing device and operating method thereof
KR20200139616A (en) Apparatus and method for detecting abnormal objects
KR20200009530A (en) System and method for detecting abnormal object
CN112329510A (en) Cross-domain metric learning system and method
CN110399782A (en) Children look after scene reply device and method automatically

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16914670

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16914670

Country of ref document: EP

Kind code of ref document: A1