CN108597582A - Method and apparatus for executing Faster R-CNN neural network operations - Google Patents

Method and apparatus for executing Faster R-CNN neural network operations

Info

Publication number
CN108597582A
CN108597582A (application CN201810352111.7A)
Authority
CN
China
Prior art keywords
food
cnn
volume
volume ratio
food item
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810352111.7A
Other languages
Chinese (zh)
Other versions
CN108597582B (en)
Inventor
张团
陈云霁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Computing Technology of CAS
Original Assignee
Institute of Computing Technology of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Computing Technology of CAS filed Critical Institute of Computing Technology of CAS
Priority to CN201810352111.7A priority Critical patent/CN108597582B/en
Publication of CN108597582A publication Critical patent/CN108597582A/en
Application granted granted Critical
Publication of CN108597582B publication Critical patent/CN108597582B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical


Classifications

    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16H HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H20/00 ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance
    • G16H20/60 ICT specially adapted for therapies or health-improving plans relating to nutrition control, e.g. diets
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/24 Classification techniques
    • G06F18/241 Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Nutrition Science (AREA)
  • Epidemiology (AREA)
  • Medical Informatics (AREA)
  • Primary Health Care (AREA)
  • Public Health (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Image Analysis (AREA)

Abstract

A method and apparatus for executing Faster R-CNN neural network operations. The method includes: obtaining multiple images of the same serving of food from different angles; determining recommended regions for target detection using an RPN; predicting the class and bounding box of each food item in the recommended regions using Fast R-CNN; predicting, from the predicted bounding boxes, the volume proportion occupied by each food item using Volume R-CNN; computing the volume proportion of each food class from the item classes and item volume proportions; multiplying the computed volume proportion of each food class by that class's density to obtain its mass proportion; multiplying each mass proportion by the total mass of the food to obtain the mass of each class; and multiplying each class's mass by the corresponding nutrient content to obtain the nutrient content of the food. The present invention can measure complex meals containing multiple food types, and by using artificial neural network technology and a dedicated chip it identifies food more accurately and quickly.

Description

Method and apparatus for executing Faster R-CNN neural network operations
Technical field
The present invention relates to the technical field of image processing, and in particular to a method and apparatus for executing Faster R-CNN neural network operations.
Background technology
With the quickening pace of modern life and rising living standards, people's expectations for their diet keep growing: the concern is no longer having enough to eat, but eating healthily. Many people, however, lack sufficient knowledge of dietary health, so a device that can intelligently measure the energy and nutrient content of food is needed to help people eat more reasonably.
One prior-art approach calculates food energy from weight. The measuring device mainly consists of a tray, a weight-measuring unit and a display screen. The weight-measuring unit measures the weight of the food and passes the weight information to a microprocessor; the processor calculates the food energy and shows the result on a liquid-crystal display.
The problem with this approach is its weak processing capability: it is only suitable for measuring the energy of a single food type. For a meal containing several food classes the measurement is inaccurate, and the corresponding nutrient contents cannot be calculated. It is also not extensible: food not yet entered into the system cannot be handled.
Another prior-art approach photographs a top view and a side view of the food with a mobile phone, identifies the food types with an artificial neural network, calculates the volume of each food from a formula, and then derives the nutrient contents from the food volumes.
The problems with this approach are that it is cumbersome to operate and places high demands on the input photos, and the side view is prone to occlusion between foods. Predicting parameters such as food length, width and height from the focal length may introduce deviations across different phones, and computing food volume from a fixed formula is unsuitable for irregularly shaped food, giving large errors.
Summary of the invention
To solve the problems in the prior art, in one aspect the present invention provides a method for executing Faster R-CNN neural network operations, including:
obtaining multiple images of the same serving of food from different angles;
determining recommended regions for target detection using an RPN;
predicting the class and bounding box of each food item in the recommended regions using Fast R-CNN;
predicting, from the predicted bounding boxes, the volume proportion occupied by each food item using Volume R-CNN;
computing the volume proportion of each food class from the item classes predicted by Fast R-CNN and the item volume proportions predicted by Volume R-CNN;
multiplying the computed volume proportion of each food class by that class's density to obtain the mass proportion of each class;
multiplying the mass proportion of each class by the total mass of the food to obtain the mass of each class;
multiplying the mass of each class by the corresponding nutrient content to obtain the nutrient content of the food;
wherein the RPN, Fast R-CNN and Volume R-CNN share convolutional layers.
Preferably, when determining the recommended regions, the RPN first performs multi-layer convolution on the input image to extract its feature maps, then convolves the feature maps with a sliding window, and then uses two branches, a classification loss function and a bounding-box regression loss function, to compute region classification and region regression, yielding the recommended regions.
Preferably, the Fast R-CNN maps the recommended regions onto the feature maps to obtain RoIs, pools each RoI into a feature map of uniform size, and then runs two fully connected network branches over the pooled RoIs to compute the food-item class of each recommended region and an accurately predicted bounding box.
Preferably, the Volume R-CNN maps the predicted bounding-box parameters onto the feature maps, pools each mapped region into a sample region of uniform size, and then applies multi-layer fully connected network operations to each sample region to compute the volume intermediate variable vi of each food item in the image, vi being a positive number; the volume intermediate variables are then transformed into the corresponding volume proportions fi by fi = vi / (v1 + v2 + ... + vn), where i = 1, 2, ..., n and n is the number of food items in the image.
Preferably, the method of mapping the predicted bounding-box parameters onto the feature maps is: multiply each coordinate value by the ratio of the feature-map size to the original image size.
Preferably, the loss function Volume loss of the Volume R-CNN is a function of the differences between the predicted volume proportion fi of each food item and its actual value fi*, the label data supplied at training time.
Preferably, during prediction the output of the neural network includes: an n-dimensional vector, computed by Volume R-CNN, giving the volume proportion occupied by each food item in the image, where each element lies in the interval (0, 1] and the elements sum to 1; an n*m matrix, computed by Fast R-CNN, giving the class of each food item in the image, where m is the number of recognizable food classes, each row of the matrix has exactly one element equal to 1 with the remaining m-1 elements equal to 0, and the column of the 1 element is the item's class; and an n*4 two-dimensional array giving the bounding box of each food item.
Preferably, the method further includes multiplying the n-dimensional vector of item volume proportions by the n*m two-dimensional array of item classes to obtain the per-class volume proportion vector, an m-dimensional vector in which each dimension corresponds to one food class and the value in each dimension is the volume proportion occupied by that class of food.
Preferably, the method further includes computing, for each image, an m-dimensional vector of per-class volume proportions, then summing all the m-dimensional vectors and dividing by the number of images, so that the average vector is taken as the final per-class volume proportion vector.
Preferably, the method further includes an adaptive training step, including:
Step 1: the RPN initializes its network parameters and computes the class label and region adjustment parameters of each detected region by forward propagation of the input image information; it then updates the relevant RPN parameters by back-propagation using stochastic gradient descent or the Adam algorithm, including both the RPN's exclusive parameters and the shared convolutional part, training until convergence;
Step 2: Fast R-CNN initializes its convolutional layer parameters with the shared convolutional layer parameters trained in step 1, trains using the recommended regions obtained in step 1 as the recommended regions of its computation, and updates its network parameters, including the shared convolutional network, until convergence;
Step 3: the RPN, using the shared convolutional network obtained in step 2, continues training and updates only its exclusive parameters, excluding the shared convolutional layer parameters;
Step 4: the Fast R-CNN network trains on the recommended regions obtained in step 3 and updates only its exclusive part, leaving the shared convolutional layer parameters unchanged;
Step 5: the Volume R-CNN network maps the food-item bounding boxes obtained in step 4 onto the last feature maps of the shared convolutional network and trains, updating its exclusive parameters until convergence;
the training operation of each step passes the input data through the network's forward computation to obtain each part's loss function, then back-propagates and updates the network parameters using stochastic gradient descent or the Adam algorithm;
the training process of the above five steps may be executed cyclically.
In another aspect, the present invention provides a device for executing Faster R-CNN neural network operations, including:
an information input component, which obtains multiple images of the same serving of food from different angles, the total mass of the food, and the density and nutrient content of each food class;
an information processing component, for processing the images and performing the computations;
wherein the information processing component includes:
a storage unit, for storing the images, total mass, densities and nutrient contents;
a recommended-region generation unit, which determines the recommended regions for target detection using the RPN;
a class and bounding-box prediction unit, which predicts the class and bounding box of each food item in the recommended regions using Fast R-CNN;
a food-item volume proportion prediction unit, which, from the predicted bounding boxes, predicts the volume proportion occupied by each food item in the image using Volume R-CNN;
a food-class volume proportion prediction unit, which computes the volume proportion occupied by each food class from the item classes predicted by Fast R-CNN and the item volume proportions predicted by Volume R-CNN, averaging the computed results over the different images;
a mass proportion prediction unit, which multiplies the computed volume proportion of each food class by that class's density to obtain the mass proportion of each class;
a food mass prediction unit, which multiplies the mass proportion of each class by the total mass of the food to obtain the mass of each class; and
a nutrient content prediction unit, which multiplies the mass of each class by the corresponding nutrient content to obtain the nutrient content of the food;
wherein the RPN, Fast R-CNN and Volume R-CNN share convolutional layers.
Preferably, the information input component includes an image input device and a mass input device.
Preferably, the information processing component further includes a data conversion unit, which converts the q-dimensional nutrient content vector output by the processing unit into the corresponding output.
Preferably, the device further includes an information output component, for receiving output information from the information processing component and displaying it.
Preferably, the device further includes a networking component, for uploading measurement data to a database in real time, and also for updating the latest parameter model from the cloud.
Preferably, the information processing component is a neural network chip.
Compared with the prior art, the present invention has the following advantages:
1) Compared with existing inventions, it can measure more complex meals with a greater variety of foods.
2) By using artificial neural network technology and a dedicated chip, food identification is more accurate and faster.
3) Using oblique top-view pictures effectively avoids occlusion between different foods while giving a more complete view of the object being measured.
4) Using artificial neural network technology and a chip to calculate food volume gives more accurate results, and prediction accuracy improves as the training data grows.
5) The artificial neural network chip has powerful computing capability and supports running the neural network offline: without the assistance of cloud-server computation, the user terminal/front end alone can detect food nutrient composition and perform the corresponding control offline. When the chip is networked and assisted by cloud-server computation, its computing capability is even greater.
6) The device is easy to operate and more intelligent, meeting people's daily needs.
7) It can provide more rational suggestions for people's diet and improve people's quality of life.
Description of the drawings
Fig. 1 is a structural diagram of the neural network in an embodiment of the present invention;
Fig. 2 illustrates the prediction of food-item classes and bounding boxes in an embodiment of the present invention;
Fig. 3 is the network structure of Volume R-CNN in an embodiment of the present invention;
Fig. 4 is a structural schematic diagram of the device in an embodiment of the present invention.
Detailed description of the embodiments
To make the objectives, technical solutions and advantages of the present invention clearer, the present invention is described in further detail below with reference to specific embodiments and the accompanying drawings.
The method of the present invention for executing Faster R-CNN neural network operations mainly includes the following steps: extracting and processing the key features of the images to identify the food types and the volume proportion of each food in the image; calculating the weight proportion of each food from its density; and finally having the processing component calculate the actual mass of each measured food from the weight proportions and the total weight, so that the energy and nutrient composition of the food can be obtained using the nutrient content tables of the various foods.
The input images comprise top-view photographs of the same serving of food taken from multiple different angles.
In the processing stage for a single image, the processor computes over the input image with an improved Faster Region Convolutional Neural Network (Faster R-CNN), marking the class of each food in the image, each food's bounding box, and each food's predicted volume proportion (volume), where volume is expressed as a decimal between 0 and 1.
The neural network structure in the present invention improves on the Faster R-CNN network by adding a part that predicts food volume proportions. The network structure is shown in Fig. 1:
The neural network can be divided into three parts: a Region Proposal Network (RPN) for predicting the recommended regions; a Fast R-CNN network for predicting the classes of objects in the image and refining the bounding boxes; and a Volume R-CNN network for predicting the volume proportion occupied by each food item in the image. The three networks share convolutional layers and form one unified overall network.
The bounding box of a food is the smallest rectangular box of the image that contains that food, as shown in Fig. 2, where the ellipses and irregular shapes represent foods of different shapes and the dashed borders are the food bounding boxes.
Fig. 3 shows the Volume R-CNN network structure, in which the convolutional layers (CNN) are the shared part. The bounding boxes needed for this part of the computation are obtained from the second part (Fast R-CNN). The loss function Volume loss is a function of the differences between the predicted volume proportion fi of each object and its actual value fi*, the label data supplied at training time.
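The source does not reproduce the formula of Volume loss (it appears only as an image in the original publication). Assuming a simple sum-of-squared-errors form over the predicted and labelled volume proportions, which is an assumption rather than the patent's stated formula, it could look like:

```python
import numpy as np

def volume_loss(f_pred, f_true):
    """Sum of squared differences between predicted and labelled
    volume proportions (assumed form, not given in the source)."""
    f_pred = np.asarray(f_pred, dtype=float)
    f_true = np.asarray(f_true, dtype=float)
    return float(np.sum((f_pred - f_true) ** 2))

# Three food items: predicted vs. labelled volume proportions.
print(round(volume_loss([0.5, 0.3, 0.2], [0.4, 0.4, 0.2]), 6))   # 0.02
```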
This neural network uses a Region Proposal Network (RPN) to determine the recommended regions for target detection. The RPN method first performs multi-layer convolution on the input image to extract its feature maps, then convolves the feature maps with a 3*3 sliding window, and afterwards uses two branches to compute region classification and region regression, yielding the recommended regions. Here region classification judges the probability that a predicted region belongs to the foreground or the background, and the parameters of the recommended regions are expressed relative to the original input image.
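The RPN head just described can be sketched in NumPy as a 3*3 sliding-window convolution over the shared feature maps followed by two 1*1 branches, one scoring foreground/background and one regressing region adjustments. Random weights, small sizes and a single anchor per position are used purely to show the data flow; this is a toy, not the patented network's actual configuration.

```python
import numpy as np

def conv2d(x, w):
    """Valid-mode 2-D convolution of a (C,H,W) map with (O,C,kh,kw) filters."""
    O, C, kh, kw = w.shape
    _, H, W = x.shape
    out = np.zeros((O, H - kh + 1, W - kw + 1))
    for o in range(O):
        for i in range(out.shape[1]):
            for j in range(out.shape[2]):
                out[o, i, j] = np.sum(x[:, i:i + kh, j:j + kw] * w[o])
    return out

rng = np.random.default_rng(0)
feat = rng.standard_normal((8, 16, 16))             # shared conv feature map (C=8)
w_slide = rng.standard_normal((16, 8, 3, 3)) * 0.1  # 3*3 sliding-window conv
w_cls = rng.standard_normal((2, 16, 1, 1)) * 0.1    # fg/bg scores (1 anchor/position)
w_reg = rng.standard_normal((4, 16, 1, 1)) * 0.1    # region adjustments (dx,dy,dw,dh)

h = np.maximum(conv2d(feat, w_slide), 0)   # ReLU after the sliding-window conv
cls = conv2d(h, w_cls)                     # classification branch scores
reg = conv2d(h, w_reg)                     # regression branch adjustments
print(cls.shape, reg.shape)                # (2, 14, 14) (4, 14, 14)
```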
To predict the food class of each recommended region and refine the food bounding boxes, the recommended regions are mapped onto the above feature maps to obtain RoIs (Regions of Interest); each RoI is then pooled into a feature map of uniform size. Two fully connected network branches can then be applied to the pooled RoIs to compute the food class of each region and an accurately predicted bounding box.
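The pooling step that converts RoIs of different sizes into feature maps of uniform size can be illustrated in NumPy as follows; the 2*2 output grid, the single-channel map and the integer RoI coordinates are simplifying assumptions.

```python
import numpy as np

def roi_pool(feat, roi, out_size=2):
    """Max-pool the RoI (x0, y0, x1, y1) of a (H,W) feature map
    into a fixed out_size x out_size grid."""
    x0, y0, x1, y1 = roi
    region = feat[y0:y1, x0:x1]
    H, W = region.shape
    ys = np.linspace(0, H, out_size + 1).astype(int)  # row bin edges
    xs = np.linspace(0, W, out_size + 1).astype(int)  # column bin edges
    out = np.zeros((out_size, out_size))
    for i in range(out_size):
        for j in range(out_size):
            out[i, j] = region[ys[i]:ys[i + 1], xs[j]:xs[j + 1]].max()
    return out

feat = np.arange(36, dtype=float).reshape(6, 6)
pooled = roi_pool(feat, (0, 0, 4, 4))   # a 4x4 RoI becomes a 2x2 grid of maxima
print(pooled)                           # [[ 7.  9.] [19. 21.]]
```

Whatever the RoI's size, the output grid always has the same shape, which is what lets the subsequent fully connected branches use fixed-size inputs.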
Finally, the bounding-box parameters predicted by the bounding-box branch are mapped onto the feature maps, the corresponding mapped regions are pooled into regions of uniform size, and multi-layer fully connected operations are applied to each target region to compute the volume intermediate variable of each food. The intermediate variable is a positive number and does not itself denote a food volume. Each target region contains one food and corresponds to one volume intermediate variable vi; the intermediate variables are then converted into the corresponding proportions fi by fi = vi / (v1 + v2 + ... + vn), where i = 1, 2, ..., n and n is the number of food items in the image. Since there is one fi per food item, the food volume proportion output is a vector of n elements. The food class prediction output is an n*m two-dimensional matrix, where m is the number of recognizable food classes; each row vector of the matrix has exactly one element equal to 1, with the remaining elements 0, and the column of the 1 element corresponds to the food's class. The output of the food bounding-box prediction branch is an n*4 two-dimensional matrix whose rows hold the center coordinates (x, y) and the height and width (h, w) of each box. Mapping a bounding box from the original image onto the feature map is done by multiplying each coordinate value by the ratio of the feature-map size to the original image size.
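The two operations just described, scaling box coordinates by the feature-map/original-image size ratio and normalizing the positive intermediate variables vi into proportions fi that sum to 1, can be sketched in a few lines; the concrete sizes and values are illustrative only.

```python
import numpy as np

def map_box_to_feature(box, img_size, feat_size):
    """Scale box coordinates by the ratio of feature-map size to image size."""
    scale = feat_size / img_size
    return tuple(int(round(c * scale)) for c in box)

def volumes_to_ratios(v):
    """Normalize positive intermediate variables v_i into ratios f_i summing to 1."""
    v = np.asarray(v, dtype=float)
    return v / v.sum()

box = (100, 60, 300, 220)                 # (x0, y0, x1, y1) on a 640-pixel image
print(map_box_to_feature(box, 640, 40))   # (6, 4, 19, 14) on a 40-pixel feature map

v = [2.0, 1.0, 1.0]                       # per-food intermediate variables
print(volumes_to_ratios(v))               # [0.5  0.25 0.25]
```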
During prediction, the output of the neural network is therefore: an n-dimensional vector of each food's volume proportion, with each element in the interval (0, 1] and the elements summing to 1; an n*m two-dimensional array of each food's class; and an n*4 two-dimensional array of each food's bounding box. The n-dimensional vector of food volume proportions is then multiplied by the two-dimensional array of food classes to obtain the per-class volume proportion vector, an m-dimensional vector in which each dimension corresponds to one food class and the value in each dimension is the volume proportion occupied by that class of food.
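With toy numbers, the product of the n-dimensional proportion vector and the n*m one-hot class matrix, followed by the averaging over a group of images described below, looks like this (three items, two classes; all values illustrative):

```python
import numpy as np

f = np.array([0.5, 0.3, 0.2])   # n=3 food items' volume proportions (sum to 1)
# n x m one-hot class matrix: items 0 and 2 are class 1, item 1 is class 0 (m=2)
C = np.array([[0, 1],
              [1, 0],
              [0, 1]])
class_ratio = f @ C             # m-dim per-class volume proportion vector
print(class_ratio)              # [0.3 0.7]

# Averaging the per-class vectors computed from each image of the same serving
per_image = np.array([[0.30, 0.70],
                      [0.35, 0.65]])
print(per_image.mean(axis=0))   # final per-class volume proportion vector
```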
The method of the present invention further includes computing, for each image in a group (photographs of the same serving of food from different angles), an m-dimensional vector of per-class volume proportions, then summing all the m-dimensional vectors and dividing by the number of images in the group, so that the average vector is taken as the final per-class volume proportion vector.
The method of the present invention further includes multiplying the computed per-class volume proportion vector elementwise by the per-class density vector to obtain the per-class mass proportion vector, which is then multiplied by the input total mass of the food to obtain the per-class mass vector, each element of which is the mass of the food of the corresponding class.
The method of the present invention further includes multiplying the m-dimensional per-class mass vector by the corresponding per-class nutrient content matrix to obtain the food nutrient content vector, each element of which is the content of one nutrient in the food. The per-class nutrient content matrix is an m*q two-dimensional matrix, where q is the number of nutrient types the system can detect; each row corresponds to one food class and each column to one nutrient, giving that nutrient's content per unit mass of that class of food. The resulting food nutrient content vector is a q-dimensional vector.
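A worked example of the chain from per-class volume proportions to nutrient contents, with invented densities, total mass and nutrient fractions. Note one explicit assumption: the volume-times-density products are renormalized so the mass proportions sum to 1, since the source states the multiplication but not the normalization.

```python
import numpy as np

class_volume = np.array([0.3, 0.7])    # m=2 per-class volume proportions
density = np.array([1.2, 0.8])         # g/cm^3 per class (illustrative values)
raw = class_volume * density           # elementwise product
mass_ratio = raw / raw.sum()           # renormalize so proportions sum to 1 (assumed step)
total_mass = 500.0                     # grams, from the mass-measuring device
mass = mass_ratio * total_mass         # per-class mass vector

# m x q nutrient content matrix: rows = food classes, cols = nutrient per unit mass
nutrients_per_g = np.array([[0.10, 0.02],   # hypothetical carbohydrate, protein fractions
                            [0.05, 0.15]])
nutrients = mass @ nutrients_per_g     # q-dim nutrient content vector (grams)
print(mass.round(2), nutrients.round(2))
```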
The method of the present invention further includes a method for adaptively training the information processing component.
The input data are images with label data; the label data for each image are the class of each food in the image (an n-dimensional vector), the bounding-box information of each food (an n*4 two-dimensional matrix), and the volume proportion occupied by each food (an n-dimensional vector), where n is the total number of food items in the image. The processing component first pre-processes the input data; for example, if the food class information is text, it is converted into the corresponding class number.
The training process is divided into five steps: cross-training of the RPN network, the Fast R-CNN network for food class detection and bounding-box regression, and the network for predicting food volume proportions.
Step 1: the RPN initializes its network parameters and computes the class label and region adjustment parameters of each detected region by forward propagation of the input image information; it then updates the relevant RPN parameters by back-propagation using stochastic gradient descent or the Adam algorithm, including both the RPN's exclusive parameters and the shared convolutional part. Training continues until convergence.
Step 2: Fast R-CNN initializes its convolutional layer parameters with the shared convolutional layer parameters trained in step 1, trains using the recommended regions obtained in step 1 as the recommended regions of its computation, and updates its network parameters, including the shared convolutional network, until convergence.
Step 3: the RPN, using the shared convolutional network obtained in step 2, continues training and updates only its exclusive parameters, excluding the shared convolutional layer parameters.
Step 4: the Fast R-CNN network trains on the recommended regions obtained in step 3 and updates only its exclusive part, leaving the shared convolutional layer parameters unchanged.
Step 5: the Volume R-CNN network maps the food bounding boxes obtained in step 4 onto the last feature maps of the shared convolutional network and trains, updating its exclusive parameters until convergence.
The training operation of each step passes the input data through the network's forward computation to obtain each part's loss function, then back-propagates and updates the network parameters using stochastic gradient descent or the Adam algorithm.
The training process of the above five steps may be executed cyclically.
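As a minimal sketch of the schedule above, the toy bookkeeping below records which parameter groups each of the five steps updates; the `train` function stands in for a full forward/backward pass with SGD or Adam, and the group names are assumptions for illustration.

```python
# Toy parameter bookkeeping for the five-step alternating training schedule.
params = {"shared_conv": 0, "rpn_only": 0, "fast_rcnn_only": 0, "volume_only": 0}

def train(groups):
    """Stand-in for forward/backward + SGD or Adam: bump each updated group's version."""
    for g in groups:
        params[g] += 1

train(["rpn_only", "shared_conv"])        # step 1: RPN, including shared conv layers
train(["fast_rcnn_only", "shared_conv"])  # step 2: Fast R-CNN with step-1 proposals
train(["rpn_only"])                       # step 3: RPN's own layers only, shared conv frozen
train(["fast_rcnn_only"])                 # step 4: Fast R-CNN's own layers only
train(["volume_only"])                    # step 5: Volume R-CNN head on mapped boxes
print(params)
```

Running the whole sequence again corresponds to the cyclic execution the method allows.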
The present invention also provides a device for executing Faster R-CNN neural network operations, including an information input component, an information processing component and an information output component, as shown in Fig. 4.
The information input component includes one or more cameras, for inputting a group of top-view food images from different angles, and a mass-measuring device, for measuring the mass of the food and transferring it to the processing component.
The information processing component includes a storage unit and a data processing unit. The storage unit receives and stores the input data, instructions and output data, where the input data comprise a group of images and a positive number (the mass of the food). The data processing unit first uses the neural network to extract and process the key features contained in the input data, generating for each image a vector representing the nutrient contents of the food; for each group of images it then computes the average of the corresponding vectors as the final nutrient content vector of the measured food.
The information processing component further includes a data conversion module, which converts the q-dimensional nutrient content vector output by the processing unit into the corresponding output; the output may take the form of a table or a pie chart.
The information output component includes a liquid-crystal display, which receives the output information from the information processing component and displays it.
The information processing component controls the output shown on the screen according to the predicted food nutrient content vector (a q-dimensional vector). The data conversion processor converts the q-dimensional vector into the corresponding stored information, in the format: nutrient name, content. The nutrient name is obtained from the positional index of each element of the q-dimensional vector; zero elements in the vector are ignored. In addition, the device can store, or obtain over the network, the recommended daily nutrient intake for each age group and assess the measured food, i.e. whether the contents of the various nutrients in the food are too high or too low compared with what the human body requires, and give rational dietary suggestions. Finally the nutrient contents of the food and the dietary suggestions are output on the display screen; the output form of the nutrient contents may be a table or a pie chart.
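The conversion of the q-dimensional vector into "nutrient name, content" records with zero elements ignored can be sketched as follows; the nutrient names and values are hypothetical placeholders, not taken from the patent.

```python
nutrient_names = ["energy_kcal", "protein_g", "fat_g", "carbohydrate_g"]  # hypothetical labels
q_vector = [350.0, 12.5, 0.0, 48.2]   # predicted q-dim nutrient content vector

# Format as "nutrient name, content" records; zero entries are ignored,
# and the name is looked up by the element's positional index.
records = [f"{name}, {value}" for name, value in zip(nutrient_names, q_vector) if value != 0]
print(records)   # ['energy_kcal, 350.0', 'protein_g, 12.5', 'carbohydrate_g, 48.2']
```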
The device can also include networked components, it may be connected to which measurement data is uploaded database by internet in real time, is expanded Big data quantity, while also most recent parameters model can be updated from high in the clouds, improve operation efficiency and precision.
Data processing unit uses neural network chip, is suitable for neural computing, computing capability is powerful.
The specific embodiments described above further explain the purpose, technical solution, and beneficial effects of the present invention in detail. It should be understood that the above are only specific embodiments of the present invention and are not intended to limit it; any modification, equivalent substitution, improvement, etc. made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims (16)

1. A method for executing Faster R-CNN neural network operations, comprising:
obtaining multiple images, taken from different angles, of a serving of food;
determining proposed regions for image detection using an RPN;
predicting the category and bounding box of the food item in each proposed region using Fast R-CNN;
predicting, according to the predicted bounding boxes of the food items, the volume proportion occupied by each food item using Volume R-CNN;
computing the volume proportion of each category of food according to the food item categories predicted by Fast R-CNN and the food item volume proportions predicted by Volume R-CNN;
multiplying the computed volume proportion of each category of food by the density of that category to obtain the mass proportion of each category of food;
multiplying the mass proportion of each category of food by the total mass of the food to obtain the mass of each category of food;
multiplying the mass of each category of food by the corresponding nutrient content to obtain the nutrient content of the food;
wherein the RPN, Fast R-CNN and Volume R-CNN share convolutional layers.
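The arithmetic of the last four steps of claim 1 reduces to element-wise products; a minimal sketch with hypothetical numbers (two food categories, volume proportions already computed). Note one assumption: the claim states "mass proportion = volume proportion × density", but that product is only proportional to mass, so the sketch adds a normalization step not stated in the claim:

```python
def nutrient_content(volume_ratios, densities, total_mass, nutrient_per_gram):
    """Volume proportion -> mass proportion -> mass -> nutrient content.

    volume_ratios: volume proportion of each food category
    densities: density of each category
    total_mass: total mass of the serving (the positive number input)
    nutrient_per_gram: nutrient content per unit mass of each category
    """
    raw = [v * d for v, d in zip(volume_ratios, densities)]
    # Assumption: normalize so the mass proportions sum to 1
    # (the claim's product v * rho is only proportional to mass).
    mass_ratios = [r / sum(raw) for r in raw]
    masses = [m * total_mass for m in mass_ratios]
    return [m * c for m, c in zip(masses, nutrient_per_gram)]

# e.g. two categories, equal volumes, densities 1.0 and 3.0, 400 g serving
content = nutrient_content([0.5, 0.5], [1.0, 3.0], 400.0, [0.5, 0.25])
```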
2. The method according to claim 1, wherein, when determining the proposed regions, the RPN first applies multi-layer convolution operations to the input picture to extract its feature map, then applies a sliding-window convolution operation over the feature map, and finally uses two branches, a classification loss function and a bounding-box regression loss function, to compute the region classification and the region regression, yielding the proposed regions.
3. The method according to claim 1, wherein Fast R-CNN maps the proposed regions onto the feature map to obtain RoIs, applies a pooling operation to each RoI to convert it into a feature map of uniform size, and then applies two fully connected network operations to the pooled RoIs to compute the food item category in each proposed region and to refine the bounding box.
4. The method according to claim 1, wherein Volume R-CNN maps the predicted bounding-box parameters onto the feature map, applies a pooling operation to each corresponding mapped area to produce sample areas of uniform size, and then applies a multi-layer fully connected network operation to each sample area to compute a volume intermediate variable v_i for each food item in the image, v_i being a positive number; the volume intermediate variables are then converted into the corresponding volume proportions f_i by the formula f_i = v_i / (v_1 + v_2 + ... + v_n), where i = 1, 2, ..., n and n is the number of food items in the image.
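The conversion in claim 4 from the positive intermediate variables v_i to volume proportions f_i is a plain normalization; a minimal sketch:

```python
def volume_ratios(v):
    """f_i = v_i / sum_j(v_j), where v holds the positive volume
    intermediate variables, one per food item detected in the image."""
    total = sum(v)
    return [vi / total for vi in v]
```

By construction the resulting proportions are positive and sum to 1, matching the output format described in claim 7.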
5. The method according to claim 1, wherein the method of mapping the predicted bounding-box parameters onto the feature map is: multiplying each coordinate by the ratio of the size of the feature map to the size of the original image.
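The coordinate scaling in claim 5 can be sketched as follows; separate x and y scale factors are assumed here in case the feature map does not shrink both axes by the same factor:

```python
def map_box_to_feature(box, img_w, img_h, feat_w, feat_h):
    """Scale a (x1, y1, x2, y2) box from original-image coordinates to
    feature-map coordinates by the feature-map/image size ratio."""
    sx, sy = feat_w / img_w, feat_h / img_h
    x1, y1, x2, y2 = box
    return (x1 * sx, y1 * sy, x2 * sx, y2 * sy)

# e.g. a 512x512 image whose feature map is 32x32 (a 1/16 scale)
feat_box = map_box_to_feature((16, 32, 256, 480), 512, 512, 32, 32)
```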
6. The method according to claim 1, wherein the loss function Volume loss in Volume R-CNN takes the form Volume loss = sum over i of (f_i - f_i*)^2, where f_i is the predicted volume proportion of each food item and f_i* is the ground-truth value, i.e. the labeled data supplied during training.
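Reading the Volume loss as a summed squared error between predicted and ground-truth volume proportions (the formula in the original text is garbled, so this reading is an assumption), a minimal sketch:

```python
def volume_loss(f_pred, f_true):
    """Assumed form: L = sum_i (f_i - f_i*)^2 over the n food items,
    where f_true holds the labeled volume proportions."""
    return sum((p - t) ** 2 for p, t in zip(f_pred, f_true))
```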
7. The method according to claim 1, wherein during prediction the output of the neural network includes: an n-dimensional vector, computed by Volume R-CNN, representing the volume proportion occupied by each food item in the image, each element lying in the interval (0, 1] and the elements summing to 1; an n*m matrix, computed by Fast R-CNN, representing the category to which each food item in the image belongs, where m is the number of recognizable food item categories, each row of the matrix has exactly one element equal to 1 and the remaining m-1 elements equal to 0, and the column of the 1 element is the category of the food item; and an n*4 two-dimensional array representing the bounding box of each food item.
8. The method according to claim 1, wherein the method further comprises multiplying the n-dimensional vector representing each food item's volume proportion by the n*m two-dimensional array representing each food item's category, to obtain the volume proportion vector of each category of food: an m-dimensional vector in which each dimension corresponds to one category of food, and the value in each dimension represents the volume proportion occupied by food of the corresponding category.
9. The method according to claim 1, wherein the method further comprises computing, for each image, an m-dimensional vector representing the volume proportion of each category of food, then summing all the m-dimensional vectors and dividing by the number of images, and taking the resulting average vector as the final volume proportion vector of each category of food.
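Claims 8 and 9 amount to a vector-matrix product followed by an average over images; a numpy sketch with hypothetical values (n = 3 food items, m = 2 categories):

```python
import numpy as np

def class_volume_ratios(item_ratios, one_hot):
    """n-vector of item volume proportions x (n x m) one-hot category
    matrix -> m-vector of per-category volume proportions (claim 8)."""
    return item_ratios @ one_hot

def average_over_images(per_image_vectors):
    """Average the m-vectors of all images of the same serving (claim 9)."""
    return np.mean(per_image_vectors, axis=0)

ratios = np.array([0.2, 0.3, 0.5])           # three food items
onehot = np.array([[1, 0], [0, 1], [0, 1]])  # items 2 and 3 share a category
v = class_volume_ratios(ratios, onehot)      # per-category proportions
```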
10. The method according to claim 1, wherein the method further comprises an adaptive training step, comprising:
step 1: the RPN initializes its network parameters and computes, by forward propagation of the input image information, the class label and region adjustment parameters of each detected region; then, by backpropagation, the relevant parameters of the RPN, including the RPN-specific parameters and the parameters of the shared convolutional part, are updated using the stochastic gradient descent algorithm or the Adam algorithm, and training continues until convergence;
step 2: Fast R-CNN initializes its convolutional layer parameters with the shared convolutional layer parameters trained in step 1, takes the proposed regions obtained in step 1 as the proposed regions in the neural network computation for training, and updates the network parameters, including the shared convolutional network, until the network converges;
step 3: the RPN network uses the shared convolutional network obtained in step 2 and continues training, updating only the RPN-specific parameters, excluding the shared convolutional layer parameters;
step 4: the Fast R-CNN network trains using the proposed regions obtained in step 3, updating only the Fast R-CNN-specific part while keeping the shared convolutional layer parameters unchanged;
step 5: the Volume R-CNN network maps the food item bounding boxes obtained in step 4 onto the feature map of the last layer of the shared convolutional network, and trains and updates its own specific parameters until the network converges;
in the training of each step, the input data are passed through the network by forward computation to obtain the loss function of each part, followed by backpropagation, in which the network parameters are updated using stochastic gradient descent or the Adam algorithm;
wherein the above five-step training process may be executed in a loop.
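Each of the five training steps updates only a chosen subset of parameters by forward loss computation and backpropagation; with plain stochastic gradient descent (Adam being the stated alternative) the per-parameter update is theta <- theta - lr * gradient. A minimal sketch:

```python
def sgd_step(params, grads, lr):
    """One stochastic-gradient-descent update: theta <- theta - lr * grad.
    Only the parameters passed in are updated; this is how steps 3-5 keep
    the shared convolutional layers frozen: their parameters are simply
    excluded from `params`."""
    return [p - lr * g for p, g in zip(params, grads)]

updated = sgd_step([1.0, 2.0], [0.5, -1.0], 0.5)
```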
11. A device for executing Faster R-CNN neural network operations, comprising:
an information input part, which obtains multiple images, taken from different angles, of a serving of food, the total mass of the food, and the density and nutrient content of each category of food;
an information processing part for processing the images and performing the computation;
wherein the information processing part comprises:
a storage unit for storing the images, the total mass, the densities, and the nutrient contents;
a proposed-region generation unit, which determines the proposed regions for image detection using an RPN;
a category and bounding-box prediction unit, which predicts the category and bounding box of the food item in each proposed region using Fast R-CNN;
a food item volume proportion prediction unit, which, according to the predicted bounding boxes of the food items, predicts the volume proportion occupied by each food item in the image using Volume R-CNN;
a food category volume proportion prediction unit, which computes the volume proportion occupied by each category of food according to the food item categories predicted by Fast R-CNN and the food item volume proportions predicted by Volume R-CNN, and averages the results computed over the different images;
a mass proportion prediction unit, which multiplies the computed volume proportion of each category of food by the density of that category to obtain the mass proportion of each category of food;
a food mass prediction unit, which multiplies the mass proportion of each category of food by the total mass of the food to obtain the mass of each category of food; and
a nutrient content prediction unit, which multiplies the mass of each category of food by the corresponding nutrient content to obtain the nutrient content of the food;
wherein the RPN, Fast R-CNN and Volume R-CNN share convolutional layers.
12. The device according to claim 11, wherein the information input part comprises an image input device and a mass input device.
13. The device according to claim 11, wherein the information processing part further comprises a data conversion unit for converting the q-dimensional nutrient content vector output by the processing unit into a corresponding output.
14. The device according to claim 11, wherein the device further comprises an information output part for receiving the output information from the information processing part and displaying the information.
15. The device according to claim 11, wherein the device further comprises a networking component for uploading measurement data to a database in real time and for updating the latest parameter model from the cloud.
16. The device according to claim 11, wherein the information processing part is a neural network chip.
CN201810352111.7A 2018-04-18 2018-04-18 Method and device for executing fast R-CNN neural network operation Active CN108597582B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810352111.7A CN108597582B (en) 2018-04-18 2018-04-18 Method and device for executing fast R-CNN neural network operation


Publications (2)

Publication Number Publication Date
CN108597582A true CN108597582A (en) 2018-09-28
CN108597582B CN108597582B (en) 2021-02-12

Family

ID=63613739


Country Status (1)

Country Link
CN (1) CN108597582B (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109846303A (en) * 2018-11-30 2019-06-07 广州富港万嘉智能科技有限公司 Service plate surplus automatic testing method, system, electronic equipment and storage medium
CN110174399A (en) * 2019-04-10 2019-08-27 晋江双龙制罐有限公司 Solid content qualification detection method and its detection system in a kind of transparent can
CN110569759A (en) * 2019-08-26 2019-12-13 王睿琪 Method, system, server and front end for acquiring individual eating data
CN111008626A (en) * 2018-10-04 2020-04-14 斯特拉德视觉公司 Method and device for detecting object based on R-CNN
CN111564200A (en) * 2020-05-08 2020-08-21 深圳市万佳安人工智能数据技术有限公司 Old people diet feature extraction device and method based on rapid random gradient descent
CN111696151A (en) * 2019-03-15 2020-09-22 青岛海尔智能技术研发有限公司 Method and device for identifying volume of food material in oven and computer readable storage medium
CN112257761A (en) * 2020-10-10 2021-01-22 天津大学 Method for analyzing food nutrient components in image based on machine learning
CN113111925A (en) * 2021-03-29 2021-07-13 宁夏新大众机械有限公司 Feed qualification classification method based on deep learning
CN113539427A (en) * 2020-04-22 2021-10-22 深圳市前海高新国际医疗管理有限公司 Convolutional neural network-based nutrition intervention analysis system and analysis method
WO2022052021A1 (en) * 2020-09-11 2022-03-17 京东方科技集团股份有限公司 Joint model training method, object information processing method, apparatus, and system
US20220399098A1 (en) * 2020-12-25 2022-12-15 Boe Technology Group Co., Ltd. Diet recommendation method, device, storage medium and electronic device

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103162627A (en) * 2013-03-28 2013-06-19 广西工学院鹿山学院 Method for estimating fruit size by citrus fruit peel mirror reflection
CN106709525A (en) * 2017-01-05 2017-05-24 北京大学 Method for measuring food nutritional component by means of camera


Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
K. A. Forbes et al., "Estimating Fruit Volume from Digital Images", 1999 IEEE AFRICON, 5th AFRICON Conference in Africa (Cat. No. 99CH36342) *
Shaoqing Ren et al., "Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks", Advances in Neural Information Processing Systems 28 (NIPS 2015) *
Takumi Ege et al., "Estimating Food Calories for Multiple-dish Food Photos", 2017 4th IAPR Asian Conference on Pattern Recognition (ACPR) *
Yanchao Liang et al., "Computer Vision-Based Food Calorie Estimation: Dataset, Method, and Experiment", Computer Vision and Pattern Recognition *



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant