CN110378952A - A kind of image processing method and device - Google Patents

A kind of image processing method and device Download PDF

Info

Publication number
CN110378952A
CN110378952A CN201910619862.5A CN201910619862A CN110378952A CN 110378952 A CN110378952 A CN 110378952A CN 201910619862 A CN201910619862 A CN 201910619862A CN 110378952 A CN110378952 A CN 110378952A
Authority
CN
China
Prior art keywords
image
under test
object under
pixel
target area
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910619862.5A
Other languages
Chinese (zh)
Inventor
吴子凡
张潮宇
黄阳琨
梁议丹
魏征
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WeBank Co Ltd
Original Assignee
WeBank Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WeBank Co Ltd filed Critical WeBank Co Ltd
Priority to CN201910619862.5A priority Critical patent/CN110378952A/en
Publication of CN110378952A publication Critical patent/CN110378952A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/60Analysis of geometric attributes
    • G06T7/62Analysis of geometric attributes of area, perimeter, diameter or volume
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Geometry (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)

Abstract

The embodiment of the invention discloses a kind of image processing method and devices, wherein method includes: the first image and the second image for obtaining photographic device shooting object under test and obtaining by the scene where object under test when predeterminated position, the corresponding target area of object under test is determined from the first image, and according in the first image target area and the corresponding depth information of the second objective area in image, determine the volume of object under test.In the embodiment of the present invention, by being measured in real time under motion state, so that the dependence to environment is smaller, the object volume under complex environment is detected so as to realize, and no setting is required measuring station and fixed test position, so as to improve the efficiency of deposit;In addition, shooting to obtain a color image by photographic device and a depth image can determine the volume of object under test, it is not necessarily to staff, and no setting is required that multiple photographic devices are easy to implement so that cost is relatively low, and easy to operate.

Description

A kind of image processing method and device
Technical field
The present invention relates to financial technology (Fintech) technical field more particularly to a kind of image processing methods and device.
Background technique
With the development of computer technology, more and more technical applications are in financial field, and traditional financial industry is also Gradually change to financial technology (Fintech), however, due to the safety of financial industry, requirement of real-time, so that financial technology To technology, higher requirements are also raised in field.Financial technology field generally involves the Auto-Memory problem of cargo, such as Collateral security is stored in front of warehouse, it usually needs the volume (and/or quantity) of detection collateral security in advance is pledged so as to basis The volume of product is the shelf that collateral security distributes corresponding volume, improves the flexibility of collateral security Auto-Memory.In that case, if The accuracy of the volume detection of collateral security is lower, may will affect the efficiency of collateral security Auto-Memory, in some instances it may even be possible to can make certainly The wasting of resources of the process check or warehouse of dynamic deposit.
In a kind of existing implementation, measuring station, detection can be set in a certain position on route of depositing in advance Fixed test position is provided in standing, and can in each orientation (such as front, rear, the left and right) around fixed test position Image of multiple camera is respectively set.In specific implementation, before collateral security is stored in warehouse, it can be controlled and be pitched by staff Vehicle carries collateral security and moves to the fixed test position in measuring station, in this way, the photographic device around fixed test position can be shot Collateral security obtains plurality of pictures, can determine the volume of collateral security by analyzing multiple images i.e..However, this kind of implementation due to Needing to guarantee that collateral security remains static can shoot, thus the process of depositing may need to take a substantial amount of time, manpower at Originally with material resources cost, cause the efficiency of deposit lower;And this kind of implementation to the condition depended around fixed test position compared with Greatly, it cannot achieve and the object volume under complex environment detected.
To sum up, a kind of image processing method is needed at present, and the object volume under complex environment is detected to realize, And improve the efficiency of deposit.
Summary of the invention
The embodiment of the present invention provides a kind of image processing method and device, to realize to the physical quantities under complex environment And/or volume is detected, and improves the efficiency of deposit.
In a first aspect, a kind of image processing method provided in an embodiment of the present invention, comprising:
Obtain that scene when photographic device shooting object under test passes through predeterminated position where the object under test obtains the One image and the second image, scene corresponding color image of the first image where the object under test, described second Image is the corresponding depth image of scene where the object under test;Further, according to pixel each in the first image Characteristic value, the corresponding target area of the object under test is determined from the first image, and then according to the target area And the corresponding depth information in target area described in second image, obtain the volume of the object under test.
In above-mentioned design, by realizing measurement process in real time under motion state, so that above-mentioned design is to environment It relies on smaller, the object volume under complex environment is detected so as to realize, and no setting is required measuring station and fixed inspection Location, so as to improve the efficiency of deposit;In addition, above-mentioned design shoots to obtain a color image and one by photographic device Depth image is the volume that can determine object under test, is not necessarily to staff, and no setting is required multiple photographic devices, thus cost It is lower and easy to operate, it is easy to implement.
It is described that the corresponding target area of the object under test is obtained from the first image in a kind of possible design Domain, comprising: the first image is inputted into the first preset model, so that the first preset model is according to each in the first image The first image is divided into one or more subgraphs by the characteristic value of pixel;It is every in one or more of subgraphs A subgraph includes at least one object;Further, obtain from one or more of subgraphs includes the determinand The target subgraph of body, and the target subgraph is inputted into the second preset model, obtain the corresponding target of the object under test Region.
In above-mentioned design, simple model analysis is carried out to the first image by the first preset model, it can be from multiple areas Obtain the rough region where object under test in domain, so by the rough region where the second preset model measuring targets into The analysis of row accurate model, the precise region where available object under test;It follows that above-mentioned design is pre- by setting first If model and the second preset model, it may not need and accurate model analysis is carried out to entire first image, so as to realize mould The efficiency of model analysis is improved while the accuracy of type analysis.
It is described that the target subgraph is inputted into the second preset model in a kind of possible design, it obtains described to be measured The corresponding target area of object, comprising: it is down-sampled to target subgraph progress, the first subgraph is obtained, and described in determination Classification belonging to multiple first pixels that first subgraph includes, and then first subgraph is up-sampled to obtain Two subgraphs;Second subgraph includes multiple second pixels, the first pixel of each of the multiple first pixel At least one corresponding second pixel, classification belonging at least one corresponding second pixel of each first pixel with Classification belonging to each first pixel is identical;Further, it is pre- for classification being chosen from the multiple second pixel If the second pixel of one or more targets of classification, and according to one or more of the second pixels of target obtain it is described to Survey the corresponding target area of object;The pre-set categories are classification belonging to the object under test.
In above-mentioned design, lesser first son of resolution ratio is converted by target subgraph by using down-sampled mode Image, so that image procossing can be reduced by analyzing to obtain classification belonging to pixel based on lesser first subgraph of resolution ratio Complexity improves the efficiency of image procossing;And it is larger by using the mode of up-sampling by the first subgraph to convert resolution ratio The second subgraph, the pixel of block of pixels and the first subgraph on the second subgraph can be made mutually to map, in this way, can To determine the classification of each pixel in the second subgraph according to mapping relations, the accuracy of image procossing thereby may be ensured that.
In a kind of possible design, the target area according to the target area and second image Corresponding depth information determines the volume of the object under test, comprising: the target area is divided into one or more nets Lattice, according to the corresponding depth letter of one or more grid described in one or more of grids and second image Breath determines the volume of one or more of grids, sums up to obtain to the volume of one or more of grids described to be measured The volume of object.
It, can be according to the depth information of multiple grids by the way that target area is divided into multiple grids in above-mentioned design The volume of multiple grids is determined respectively, to sum up the body that object under test can be obtained by the volume to multiple grids Product;The process obtains the volume of object under test by unit of grid, can make the volume of determining obtained object under test more Accurately, and realize that logic is relatively simple, so as to improve the efficiency of processing.
It is described that the corresponding target area of the object under test is determined from the first image in a kind of possible design Before, also the first image is filtered.
In above-mentioned design, by being filtered in advance to the first image before handling the first image, it can be adapted for What the scene of the first image dynamic fuzzy, i.e. the first image can obtain for shooting in motion process;In this way, above-mentioned design can be with It executes during measuring targets are deposited, (for example is needed in stationary state without changing original work flow Execute, or need to move to measuring station execution), so as to improve the efficiency of image procossing.
Second aspect, a kind of image processing apparatus provided in an embodiment of the present invention, described device include:
Module is obtained, when for obtaining photographic device shooting object under test by predeterminated position where the object under test The first image and the second image that scene obtains, the first image are the corresponding cromogram of scene where the object under test Picture, second image are the corresponding depth image of scene where the object under test;
Determining module determines institute for the characteristic value according to pixel each in the first image from the first image State the corresponding target area of object under test;
Processing module is used for the corresponding depth in target area according to the target area and second image Information obtains the volume of the object under test.
In a kind of possible design, the determining module is specifically used for: the first image is inputted the first default mould Type, so that the first image is divided into one according to the characteristic value of pixel each in the first image by the first preset model A or multiple subgraphs;Each subgraph in one or more of subgraphs includes at least one object;Further, from The target subgraph including the object under test is obtained in one or more of subgraphs, and the target subgraph is inputted Second preset model obtains the corresponding target area of the object under test.
In a kind of possible design, the determining module is specifically used for: it is down-sampled to target subgraph progress, it obtains To the first subgraph, and determine classification belonging to multiple first pixels that first subgraph includes, and then to described the One subgraph is up-sampled to obtain the second subgraph;Second subgraph includes multiple second pixels, and the multiple The first pixel of each of one pixel corresponds at least one second pixel, each first pixel corresponding at least one Classification belonging to a second pixel is identical as classification belonging to each first pixel;Further, from the multiple The second pixel of one or more targets that classification is pre-set categories is chosen in second pixel, and according to one or more of The second pixel of target obtains the corresponding target area of the object under test;The pre-set categories are belonging to the object under test Classification.
In a kind of possible design, the processing module is specifically used for: the target area is divided into one or more A grid, according to the corresponding depth of one or more grid described in one or more of grids and second image Degree information determines the volume of one or more of grids, sums up to obtain to the volume of one or more of grids described The volume of object under test.
In a kind of possible design, the determining module determines that the object under test is corresponding from the first image Before target area, the determining module is also filtered the first image.
The third aspect, a kind of computer readable storage medium provided in an embodiment of the present invention, including instruction, when it is being calculated When being run on the processor of machine, so that the processor of computer executes the image processing method as described in above-mentioned first aspect is any Method.
Fourth aspect, a kind of computer program product provided in an embodiment of the present invention make when run on a computer Obtain image processing method of the computer execution as described in above-mentioned first aspect is any.
The aspects of the invention or other aspects can more straightforwards in the following description.
Detailed description of the invention
To describe the technical solutions in the embodiments of the present invention more clearly, make required in being described below to embodiment Attached drawing is briefly introduced, it should be apparent that, drawings in the following description are only some embodiments of the invention, for this For the those of ordinary skill in field, without any creative labor, it can also be obtained according to these attached drawings His attached drawing.
Fig. 1 is a kind of corresponding configuration diagram of Auto-Memory system provided in an embodiment of the present invention;
Fig. 2 is a kind of corresponding flow diagram of image processing method provided in an embodiment of the present invention;
Fig. 3 is the schematic diagram that a kind of first preset model provided in an embodiment of the present invention handles the first image;
Fig. 4 is a kind of structural schematic diagram of image processing apparatus provided in an embodiment of the present invention.
Specific embodiment
To make the objectives, technical solutions, and advantages of the present invention clearer, below in conjunction with attached drawing to the present invention make into It is described in detail to one step, it is clear that described embodiments are only a part of the embodiments of the present invention, rather than whole implementation Example.Based on the embodiments of the present invention, obtained by those of ordinary skill in the art without making creative efforts All other embodiment, shall fall within the protection scope of the present invention.
Financial technology (Fintech), which refers to, behind information technology involvement financial field, to be that financial field bring is a kind of new Creative Science and Technology Co. Ltd, assist realizing that financial operation, transaction execute and financial system is improved by using advanced information technology, can To promote treatment effeciency, the business scale of financial system, and cost and financial risks can be reduced.In general, financial technology Field would generally be related to largely transaction and the friendship between data, such as user and the transaction of trade company and data, different trade companies How easy and data etc. are traded from a large amount of using the means of science and technology and excavate feature required for financial field in data, The always target of financial technology field pursuit.
Fig. 1 is a kind of corresponding configuration diagram of Auto-Memory system provided in an embodiment of the present invention, as shown in Figure 1, should System may include warehouse 100, deposit route 110 and carrier 120, and can be set on carrier 120 placed side by side has a row or more Collateral security 130 is arranged, the quantity of any two rows of collateral securities in a row or multi-row collateral security 130 can be identical or or not Together, a row or multi-row collateral security 130 can be placed in sequence, or can also arbitrarily place, and specifically be not construed as limiting;Accordingly Multiple cabinets have can be set in warehouse 100 in ground, and multiple cabinets can be used for storing collateral security.
In the embodiment of the present invention, binocular camera, binocular camera can be set on the predeterminated position for route 110 of depositing In can be set there are two camera, such as camera shown in FIG. 1 141 and camera 142;Wherein, the position of photographic device It can be rule of thumb configured, or can also be configured according to actual scene by those skilled in the art, do not made specifically It limits.In the embodiment of the present invention, predeterminated position can on deposit route 110 close to warehouse 100 position (such as position a), To which camera 141 and camera 142 also can be set in position a;In this way, when carrier 120 carries a row or multi-row collateral security 130 when moving across position a along deposit route 110, and camera 141 and camera 142 can shoot to obtain a row or multi-row The corresponding image of scene where collateral security 130.In one example, camera 141 can be optical camera, camera 142 can be depth camera;In this way, being moved when carrier 120 carries a row or multi-row collateral security 130 along deposit route 110 When by position a, camera 141 can shoot to obtain the corresponding color image of scene where a row or multi-row collateral security 130, Camera 142 can shoot to obtain the corresponding depth image of scene where a row or multi-row collateral security 130.
In one possible implementation, camera 141 and camera 142 can (Fig. 1 not be carried out with server respectively Signal) connection, server can obtain the color image that camera 141 is shot according to predetermined period and camera 142 is shot Depth image, and then the volume of a row or multi-row collateral security 130 can be determined based on color image and depth image.Further Ground, also can be set photographic device in warehouse 100, and server can be according to the photographic device in warehouse 100 from multiple cabinets In select the spare cabinet being in idle condition, and then can be according to the volume of a row or multi-row collateral security 130 from guest machine Selection target cabinet in cabinet;In this way, server can send control instruction to carrier 120 (or staff on carrier), So that carrier 120 carries a row or multi-row collateral security 130 and moves to the position where target cabinet, and pledged a row or multi-row Product 130 are stored on target cabinet.
Based on system architecture illustrated in Figure 1, Fig. 2 is that a kind of image processing method provided in an embodiment of the present invention is corresponding Flow diagram, the executing subject of this method can be server.This method comprises:
Step 201, scene when obtaining photographic device shooting object under test by predeterminated position where the object under test Obtained the first image and the second image.
In one example, camera 141 and camera 142 can according to the scene around a of predetermined period camera site, In this way, if a certain moment carrier 120 carries a row or multi-row collateral security 130 by position a, camera 141 and camera 142 The corresponding color image of scene and the depth image around a row or multi-row collateral security 130 can be shot to obtain respectively.Wherein, in advance If the period can be rule of thumb configured by those skilled in the art, or can also be configured according to actual needs, tool Body is not construed as limiting.
In another example, induction coil can be set at the position a of deposit route 110, if a certain moment carrier 120 carry a row or multi-row collateral security 130 by position a, then induction coil can report induction information to server;In this way, Server can control the scene around 142 camera site a of camera 141 and camera, to obtain a row or multi-row pledge The corresponding color image of scene and depth image around product 130.
It should be noted that carrier 120 can not stop after deposit line-of-road movement to position a in the embodiment of the present invention Only, in this way, camera 141 and camera 142 can shoot the field around a row or multi-row collateral security 130 being kept in motion Scape obtains the color image and depth image under motion state;It, can be with by realizing measurement process in real time under motion state Improve the efficiency of deposit.
Step 202, according to the characteristic value of pixel each in color image, object under test is determined from the color image Corresponding target area.
In one possible implementation, if server gets color image and depth map under being kept in motion Picture then can be used default filtering mode and be filtered to color image;Wherein, default filtering mode can be by art technology Personnel are rule of thumb configured, and are specifically not construed as limiting.The embodiment of the present invention can using Wiener filter to color image into Row filtering;Specifically, color image can be inputted Wiener filter by server, since color image is useful signal and nothing With the sum of noise, therefore, Wiener filter can parse that color image obtains the corresponding matrix of image respectively and noise is corresponding Matrix, further, Wiener filter can determine the part of image according to the corresponding matrix of image and the corresponding matrix of noise Variance, and the filtering parameter of filter output can be adjusted according to the local variance of image, guarantee the output letter of Wiener filter Number and desired signal difference mean-square value it is minimum, in this way, being filtered using the filtering parameter of output to color image, can obtain To preferable filter effect.
In the embodiment of the present invention, by being filtered to the color image under being kept in motion, it can remove in fortune Dynamic fuzzy in the color image of dynamic state, so as to improve the essence for executing image procossing based on filtered color image Exactness;That is, the image processing method in the embodiment of the present invention can execute during carrier 120 moves, and nothing The original work flow of carrier 120 need to be changed, so as to improve the efficiency of image procossing.
In one example, server in the color image obtained under remaining static or can obtain filtered After color image, alignment operation can be carried out to color image and depth image, such as can be by color image and depth image On same object be aligned;Further, server, can be according to cromogram after alignment color image and depth image The characteristic value of each pixel as in determines the corresponding target area of object under test from the color image.The present invention is implemented In example, by alignment operation, so that each pixel on color image can correspond to a depth on depth image Information, i.e. color image and depth image can have one-to-one relationship.
In one possible implementation, server can according to the mode of deep learning from color image determine to The corresponding target area of object is surveyed, for example, color image directly can be inputted trained target image language in advance by server Justice segmentation (such as target U-NET3) model, and target U-NET3 model can be identified on color image not with different colours Same object, in this way, server can (this mode be known as first and shows using the corresponding color region of object under test as target area Example);Alternatively, color image first can be inputted trained target detection (such as target YOLO2) model in advance by server, and Target YOLO2 model can identify the region where the different objects on color image with different block diagrams, further, service Device can be by area image (for ease of description, the referred to as corresponding son of object under test where the corresponding block diagram of object under test Image) trained image, semantic divides (such as target U-NET3) model in advance for input, so that target U-NET3 model can be with The different objects on the corresponding subgraph of object under test are identified with different colours, in this way, server can be by object under test pair The corresponding color region of object under test is as target area on the subgraph answered (this mode is known as the second example);Alternatively, service Device can also determine the corresponding target area of object under test using other deep learning modes from color image, not limit specifically It is fixed.
In the embodiment of the present invention, simple model analysis, Ke Yicong are carried out to color image by using target YOLO2 model The rough region where object under test is obtained in color image, and then by using where target U-NET3 model measuring targets Rough region carry out accurate model analysis, the precise region where available object under test;It follows that the present invention is implemented Example may not need by setting target YOLO2 model and target U-NET3 model and carry out accurate model point to entire color image Analysis, so as to analyzed in implementation model accuracy while improve model analysis efficiency.
The specific implementation process of the corresponding target area of determining object under test is described by taking the second example as an example below.
In the embodiment of the present invention, target YOLO2 model can be obtained for server according to the training of multiple first sample images 's.Specifically, the scene of the available camera 141 of server and camera 142 around different moments camera site a obtains The quantity of the multiple first sample images arrived, collateral security included by each first sample image can be different, and each first Marked block diagram, the block diagram where carrier and the block diagram where staff where collateral security out on sample image.Into one Step ground, server can be used multiple first sample image training YOLO2 models, obtain target YOLO2 model, target YOLO2 Model can be to the image tagged block diagram of input target YOLO2 model.
Correspondingly, target U-NET3 model can obtain for server according to the training of multiple second sample images.Specifically Say that server can be from corresponding multiple second sample graphs of block diagram where obtaining collateral security in multiple first sample images in ground Picture may include color different collateral security region and background area on each second sample image, or also may include face The quantity in color difference carrier region and/or staff region, collateral security included by each second sample image can be different, And in multiple second sample images collateral security region color it is identical.Further, multiple second samples can be used in server Image trains U-NET3 model, obtains target U-NET3 model, and target U-NET3 model can be to input target U-NET3 model Image on collateral security region, carrier region, staff region, background area mark different colors respectively.
In specific implementation, color image can be inputted target YOLO2 model by server, in this way, target YOLO2 model can To mark multiple block diagrams on color image according to the one or more objects for including on color image;Wherein, any two frame Included object can be different in figure.For example, as shown in figure 3, if on color image include " automobile ", " bicycle " and " dog " three objects, then target YOLO2 model can mark block diagram a on color image1, block diagram a2With block diagram a3, wherein frame Scheme a1In may include " automobile ", block diagram a2In may include " bicycle ", block diagram a3In may include " dog ".
In one example, it can determine which block diagram is the pixel on color image belong to using sliding window mode.Specifically Ground says, can be directed to each pixel and 9 windows are arranged, by by other pictures in each pixel and 9 windows Vegetarian refreshments combines, and can analyze and determines classification belonging to each pixel.In this example, picture is determined by using sliding window mode The classification of vegetarian refreshments can make pixel correspond to multiple classifications, so, it is ensured that the integrality of the block diagram where object.
It should be noted that above-mentioned is only a kind of illustrative simple declaration, cited by window quantity be only for Convenient for illustrating scheme, the restriction to scheme is not constituted, in specific implementation, the quantity of window can be greater than 9, such as can Think 16, or might be less that 9, for example can be 4, is specifically not construed as limiting.
It at least may include that collateral security is corresponding on the color image of target YOLO2 model output in the embodiment of the present invention The corresponding block diagram of block diagram, carrier, the corresponding block diagram of staff and background, further, server can be from color image The target subgraph where the corresponding block diagram of collateral security is obtained, and target subgraph can be inputted into target U-NET3 model, such as This, the one or more object tags for including on target subgraph can be different colors by target U-NET3 model;Wherein, The color of any two object can be different.For example, collateral security is corresponding on the target subgraph of target U-NET3 model output It can be yellow, the corresponding region of staff can be that black, background are corresponding that region, which can be red, the corresponding region of carrier, Region can for blue.
In one example, target U-NET3 model is on target subgraph to the inside of different object tag different colours Realization process can be as shown in step a~step c:
Step a, it is down-sampled to the progress of target subgraph, obtain the first image.
Specifically, if the resolution ratio of target subgraph be greater than the first default resolution ratio, can to target subgraph into Row is down-sampled to obtain the first image, so that the resolution ratio of the first image is less than or equal to the first default resolution ratio.For example, can incite somebody to action Down-sampled the first image for being 28*28 of the target subgraph that resolution ratio is 128*128;Wherein, every on the first image of 28*28 A pixel can correspond to a block of pixels on the target subgraph of 128*128.Correspondingly, if the resolution of target subgraph Rate is less than or equal to the first default resolution ratio, then can not carry out to target subgraph down-sampled.
Step b determines classification belonging to each pixel on the first image.
In specific implementation, any pixel point (such as first pixel) being directed on the first image of 28*28 can be obtained The characteristic value of the first pixel is taken, if the collateral security characteristic value stored in the characteristic value of the first pixel and target U-NET3 model Matching can then determine that classification belonging to the pixel is collateral security;Alternatively, if the characteristic value of the first pixel and target U- The carrier features value matching stored in NET3 model can then determine that classification belonging to the pixel is carrier;Alternatively, if first The staff's characteristic value stored in the characteristic value of pixel and target U-NET3 model matches, then can determine the pixel institute The classification of category is staff;Alternatively, if the background characteristics stored in the characteristic value of the first pixel and target U-NET3 model Value matching can then determine that classification belonging to the pixel is background.
Step c, according to classification belonging to pixel each on the first image, to the different object tags on target subgraph Different colours.
It,, can be from the first figure in specific implementation for marking the corresponding region of collateral security in a kind of possible situation Classification belonging to determining in multiple pixels of picture is one or more target pixel points of collateral security, and then from target subgraph In find one or more target pixel blocks corresponding with one or more target pixel points, and can be by one or more targets Block of pixels is labeled as red.
In alternatively possible situation, determine that affiliated classification is to pledge in multiple pixels from the first image After one or more target pixel points of product, the first image can also be up-sampled to obtain the second image, the second image Size is identical as the size of target subgraph, and each pixel on the first image can correspond to a picture on the second image Plain block;In this way, one or more object pixels corresponding with one or more target pixel points can be found from the second image Block, and can be by one or more target pixel blocks labeled as red.
In the embodiment of the present invention, resolution ratio lesser first is converted by target subgraph by using down-sampled mode Subgraph improves figure so that the complexity of image procossing can be reduced by carrying out analysis based on lesser first subgraph of resolution ratio As the efficiency of processing;And by the way that the pixel of block of pixels and the first subgraph on target subgraph is mapped, Ke Yibao Demonstrate,prove the accuracy of image procossing.
Step 203, believed according to the corresponding target area of object under test and the corresponding depth of the second objective area in image Breath, determines the volume of object under test.
In one possible implementation, server determines the corresponding target area of collateral security in color image Afterwards, depth areas corresponding with target area can be intercepted from depth image, and then can be carried out a cloud to depth areas and be turned It changes;In this way, depth areas can be converted into depth information corresponding with target area.It is shown in Figure 1, due to camera 141 Being arranged above collateral security with camera 142, i.e., color image and depth image are to obtain at the top of shooting collateral security, because This, depth information can identify the top of collateral security at a distance from camera 142.
In the embodiment of the present invention, target area can be divided into one or more grids by server;Wherein, each grid It may include a pixel, or also may include multiple pixels, for example, each grid may include 1*1 pixel, or Person also may include 2*2 pixel, or can also include 3*2 pixel, specifically be not construed as limiting.In the embodiment of the present invention, it is Guarantee the accuracy of measurement volume, each grid can be set includes 1*1 pixel.
Further, server can obtain the corresponding depth information of each grid, each grid pair from depth areas The depth information answered can identify the top of the corresponding collateral security of each grid (h as shown in figure 1 at a distance from camera 1421); In this way, passing through 120 loading end of carrier measured in advance (h as shown in figure 1 at a distance from camera 1422) and each net The top of the corresponding collateral security of lattice (i.e. h at a distance from camera 1421), the height of the corresponding collateral security of available each grid Spend (i.e. h1-h2)。
Further, server can be according to pair of the length and width of the length of color image, width and actual scene It should be related to, determine the ratio of color image, and then can be according to the ratio of color image by the length of grid each in target area The length and width that the corresponding collateral security of each grid is converted into width is spent, so as to pledge according to each grid is corresponding Height (the i.e. h of the length and width of product and the corresponding collateral security of each grid1-h2), determine the volume of each grid.In this way, Server can sum it up the volume of one or more grids included by the corresponding target area of collateral security, obtain collateral security Volume.It, can be according to the depth information of multiple grids point by the way that target area is divided into multiple grids in the embodiment of the present invention The volume of multiple grids is not determined, to sum up the volume that object under test can be obtained by the volume to multiple grids; The process obtains the volume of object under test by unit of grid, can make the volume for determining obtained object under test compared with subject to Really, and realize that logic is relatively simple, so as to improve the efficiency of processing.
In the above embodiment of the present invention, obtains photographic device shooting object under test and pass through determinand when predeterminated position The first image and the second image that scene where body obtains, the first image are that the scene where the object under test is corresponding Color image, second image be the object under test where the corresponding depth image of scene;Further, according to institute The characteristic value for stating each pixel in the first image determines the corresponding target area of the object under test from the first image Domain, and the corresponding depth information in target area according to the target area and second image, determine it is described to Survey the volume of object.In the embodiment of the present invention, by realizing measurement process in real time under motion state so as to environment according to Rely smaller, the object volume under complex environment is detected so as to realize, and no setting is required measuring station and fixed test Position, so as to improve the efficiency of deposit;In addition, shooting to obtain a color image and a depth image by photographic device It can determine the volume of object under test, be not necessarily to staff, and no setting is required multiple photographic devices, so that cost is relatively low, and grasp Make simply, to be easy to implement.
For above method process, the embodiment of the present invention also provides a kind of image processing apparatus, the particular content of the device It is referred to above method implementation.
Fig. 4 is a kind of structural schematic diagram of image processing apparatus provided in an embodiment of the present invention, comprising:
Module 401 is obtained, object under test institute when for obtaining photographic device shooting object under test by predeterminated position Scene obtained the first image and the second image, the first image is the corresponding coloured silk of scene where the object under test Chromatic graph picture, second image are the corresponding depth image of scene where the object under test;
Determining module 402, for the characteristic value according to pixel each in the first image, from the first image really Determine the corresponding target area of the object under test;
Processing module 403, it is corresponding for the target area according to the target area and second image Depth information determines the volume of the object under test.
Optionally, the determining module 402 is specifically used for:
The first image is inputted into the first preset model, so that the first preset model is according to each in the first image The first image is divided into one or more subgraphs by the characteristic value of pixel;It is every in one or more of subgraphs A subgraph includes at least one object;
Obtained from one or more of subgraphs include the object under test target subgraph, and by the target Subgraph inputs the second preset model, obtains the corresponding target area of the object under test.
Optionally, the determining module 402 is specifically used for:
It is down-sampled to target subgraph progress, the first subgraph is obtained, and determine that first subgraph includes Classification belonging to multiple first pixels, and then first subgraph is up-sampled to obtain the second subgraph;Described Two subgraphs include multiple second pixels, the first pixel of each of the multiple first pixel correspond at least one second Pixel, classification and each first pixel belonging at least one corresponding second pixel of each first pixel Classification belonging to point is identical;
The second pixel of one or more targets that classification is pre-set categories is chosen from the multiple second pixel, and The corresponding target area of the object under test is obtained according to one or more of the second pixels of target;The pre-set categories are Classification belonging to the object under test.
Optionally, the processing module 403 is specifically used for:
The target area is divided into one or more grids, according to one or more of grids and described second The corresponding depth information of one or more grid described in image determines the volume of one or more of grids, to described The volume of one or more grids sums up to obtain the volume of the object under test.
Optionally, the determining module 402 determines the corresponding target area of the object under test from the first image Before, the determining module 402 is also used to:
The first image is filtered.
It can be seen from the above: in the above embodiment of the present invention, where acquisition photographic device measuring targets The first image and the second image that scene is shot, the first image are that the scene where the object under test is corresponding Color image, second image be the object under test where the corresponding depth image of scene;Further, according to institute The characteristic value for stating each pixel in the first image determines the corresponding target area of the object under test from the first image Domain, and the corresponding depth information in target area according to the target area and second image, determine it is described to Survey the volume of object.In the embodiment of the present invention, by realizing measurement process in real time under motion state so as to environment according to Rely smaller, the object volume under complex environment is detected so as to realize, and no setting is required measuring station and fixed test Position, so as to improve the efficiency of deposit;In addition, shooting to obtain a color image and a depth image by photographic device It can determine the volume of object under test, be not necessarily to staff, and no setting is required multiple photographic devices, so that cost is relatively low, and grasp Make simply, to be easy to implement.
Based on the same inventive concept, the embodiment of the invention also provides a kind of computer readable storage mediums, including instruct, When it runs on the processor of computer, so that the processor of computer executes at the image as described in Fig. 1 or Fig. 1 is any Reason method.
Based on the same inventive concept, the embodiment of the invention also provides a kind of computer program products, when it is in computer When upper operation, so that computer executes the image processing method as described in Fig. 1 or Fig. 1 is any.
It should be understood by those skilled in the art that, the embodiment of the present invention can provide as method or computer program product. Therefore, complete hardware embodiment, complete software embodiment or embodiment combining software and hardware aspects can be used in the present invention Form.It is deposited moreover, the present invention can be used to can be used in the computer that one or more wherein includes computer usable program code The shape for the computer program product implemented on storage media (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) Formula.
The present invention be referring to according to the method for the embodiment of the present invention, the process of equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
Although preferred embodiments of the present invention have been described, it is created once a person skilled in the art knows basic Property concept, then additional changes and modifications may be made to these embodiments.So it includes excellent that the following claims are intended to be interpreted as It selects embodiment and falls into all change and modification of the scope of the invention.
Obviously, various changes and modifications can be made to the invention without departing from essence of the invention by those skilled in the art Mind and range.In this way, if these modifications and changes of the present invention belongs to the range of the claims in the present invention and its equivalent technologies Within, then the present invention is also intended to include these modifications and variations.

Claims (12)

1. a kind of image processing method, which is characterized in that the described method includes:
Obtain the first figure that scene when photographic device shooting object under test passes through predeterminated position where the object under test obtains Picture and the second image, the first image are the corresponding color image of scene where the object under test, second image For the corresponding depth image of scene where the object under test;
According to the characteristic value of pixel each in the first image, the corresponding mesh of the object under test is determined from the first image Mark region;
According to the corresponding depth information in target area described in the target area and second image, obtain described to be measured The volume of object.
2. the method according to claim 1, wherein the feature according to pixel each in the first image Value determines the corresponding target area of the object under test from the first image, comprising:
The first image is inputted into the first preset model, so that the first preset model is according to pixel each in the first image The first image is divided into one or more subgraphs by the characteristic value of point;Every height in one or more of subgraphs Image includes at least one object;
Obtained from one or more of subgraphs include the object under test target subgraph, and by the target subgraph As the second preset model of input, the corresponding target area of the object under test is obtained.
3. according to the method described in claim 2, it is characterized in that, described input the second default mould for the target subgraph Type obtains the corresponding target area of the object under test, comprising:
It is down-sampled to target subgraph progress, the first subgraph is obtained, and determine that first subgraph includes multiple Classification belonging to first pixel, and then first subgraph is up-sampled to obtain the second subgraph;Second son Image includes multiple second pixels, and the first pixel of each of the multiple first pixel corresponds at least one second pixel Point, classification and each first pixel institute belonging at least one corresponding second pixel of each first pixel The classification of category is identical;
Classification is chosen from the multiple second pixel as the second pixel of one or more targets of pre-set categories, and according to One or more of the second pixels of target obtain the corresponding target area of the object under test;The pre-set categories are described Classification belonging to object under test.
4. the method according to claim 1, wherein described according to the target area and second image Described in the corresponding depth information in target area, determine the volume of the object under test, comprising:
The target area is divided into one or more grids, according to one or more of grids and second image Described in the corresponding depth information of one or more grid determine the volumes of one or more of grids, to one Or the volume of multiple grids sums up to obtain the volume of the object under test.
5. method according to claim 1 to 4, which is characterized in that described to be determined from the first image Before the corresponding target area of the object under test, further includes:
The first image is filtered.
6. a kind of image processing apparatus, which is characterized in that described device includes:
Module is obtained, scene when for obtaining photographic device shooting object under test by predeterminated position where the object under test Obtained the first image and the second image, the first image are the corresponding color image of scene where the object under test, Second image is the corresponding depth image of scene where the object under test;
Determining module, for the characteristic value according to pixel each in the first image, determined from the first image it is described to Survey the corresponding target area of object;
Processing module, for the corresponding depth letter in the target area according to the target area and second image Breath, obtains the volume of the object under test.
7. device according to claim 6, which is characterized in that the determining module is specifically used for:
The first image is inputted into the first preset model, so that the first preset model is according to pixel each in the first image The first image is divided into one or more subgraphs by the characteristic value of point;Every height in one or more of subgraphs Image includes at least one object;
Obtained from one or more of subgraphs include the object under test target subgraph, and by the target subgraph As the second preset model of input, the corresponding target area of the object under test is obtained.
8. device according to claim 7, which is characterized in that the determining module is specifically used for:
It is down-sampled to target subgraph progress, the first subgraph is obtained, and determine that first subgraph includes multiple Classification belonging to first pixel, and then first subgraph is up-sampled to obtain the second subgraph;Second son Image includes multiple second pixels, and the first pixel of each of the multiple first pixel corresponds at least one second pixel Point, classification and each first pixel institute belonging at least one corresponding second pixel of each first pixel The classification of category is identical;
Classification is chosen from the multiple second pixel as the second pixel of one or more targets of pre-set categories, and according to One or more of the second pixels of target obtain the corresponding target area of the object under test;The pre-set categories are described Classification belonging to object under test.
9. device according to claim 6, which is characterized in that the processing module is specifically used for:
The target area is divided into one or more grids, according to one or more of grids and second image Described in the corresponding depth information of one or more grid determine the volumes of one or more of grids, to one Or the volume of multiple grids sums up to obtain the volume of the object under test.
10. device according to any one of claims 6 to 9, which is characterized in that the determining module is from first figure Before determining the corresponding target area of the object under test as in, the determining module is also used to:
The first image is filtered.
11. a kind of computer readable storage medium, which is characterized in that including instruction, when it runs on the processor of computer When, so that the processor of computer executes such as method described in any one of claim 1 to 5.
12. a kind of computer program product, which is characterized in that when run on a computer, so that computer is executed as weighed Benefit requires 1 to 5 described in any item methods.
CN201910619862.5A 2019-07-10 2019-07-10 A kind of image processing method and device Pending CN110378952A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910619862.5A CN110378952A (en) 2019-07-10 2019-07-10 A kind of image processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910619862.5A CN110378952A (en) 2019-07-10 2019-07-10 A kind of image processing method and device

Publications (1)

Publication Number Publication Date
CN110378952A true CN110378952A (en) 2019-10-25

Family

ID=68250875

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910619862.5A Pending CN110378952A (en) 2019-07-10 2019-07-10 A kind of image processing method and device

Country Status (1)

Country Link
CN (1) CN110378952A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111524180A (en) * 2020-04-23 2020-08-11 Oppo广东移动通信有限公司 Object volume calculation method and device, electronic equipment and storage medium
CN114088207A (en) * 2020-07-17 2022-02-25 北京京东尚科信息技术有限公司 Temperature detection method and system
WO2023124442A1 (en) * 2021-12-29 2023-07-06 京东方科技集团股份有限公司 Method and device for measuring depth of accumulated water

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120001902A1 (en) * 2010-07-02 2012-01-05 Samsung Electronics Co., Ltd. Apparatus and method for bidirectionally inpainting occlusion area based on predicted volume
CN107067431A (en) * 2017-01-16 2017-08-18 河海大学常州校区 A kind of object volume computational methods based on Kinect
CN108876236A (en) * 2018-05-31 2018-11-23 清华大学 Logistics system of returning goods of receiving based on UAV Intelligent express delivery cabinet
CN109147216A (en) * 2018-07-25 2019-01-04 上海与德科技有限公司 Intelligent express delivery cabinet, express delivery are sent with charge free and collection methods, device, server and medium
CN109978939A (en) * 2019-05-21 2019-07-05 芜湖智久机器人有限公司 A kind of dimension of object estimating system and method, warehouse capacity estimating system and method

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120001902A1 (en) * 2010-07-02 2012-01-05 Samsung Electronics Co., Ltd. Apparatus and method for bidirectionally inpainting occlusion area based on predicted volume
CN107067431A (en) * 2017-01-16 2017-08-18 河海大学常州校区 A kind of object volume computational methods based on Kinect
CN108876236A (en) * 2018-05-31 2018-11-23 清华大学 Logistics system of returning goods of receiving based on UAV Intelligent express delivery cabinet
CN109147216A (en) * 2018-07-25 2019-01-04 上海与德科技有限公司 Intelligent express delivery cabinet, express delivery are sent with charge free and collection methods, device, server and medium
CN109978939A (en) * 2019-05-21 2019-07-05 芜湖智久机器人有限公司 A kind of dimension of object estimating system and method, warehouse capacity estimating system and method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
LIANG HAN: "Livestock detection in aerial images using a fully convolutional network", 《COMPUTATIONAL VISUAL MEDIA》 *
杨顺辽等: "《数字信号处理实现与实践》", 30 September 2011 *
马伟等: "新型乳腺磁共振增强图像肿瘤区域的自动分割模型", 《中国生物医学工程学报》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111524180A (en) * 2020-04-23 2020-08-11 Oppo广东移动通信有限公司 Object volume calculation method and device, electronic equipment and storage medium
CN114088207A (en) * 2020-07-17 2022-02-25 北京京东尚科信息技术有限公司 Temperature detection method and system
WO2023124442A1 (en) * 2021-12-29 2023-07-06 京东方科技集团股份有限公司 Method and device for measuring depth of accumulated water

Similar Documents

Publication Publication Date Title
CN110264468B (en) Point cloud data mark, parted pattern determination, object detection method and relevant device
CN108154168B (en) Comprehensive cargo inspection system and method
CN110023994B (en) Histopathological image analysis
CN106707293B (en) Obstacle recognition method and device for vehicle
CN108875600A (en) A kind of information of vehicles detection and tracking method, apparatus and computer storage medium based on YOLO
CN108154105B (en) Underwater biological detection and identification method and device, server and terminal equipment
CN109447169A (en) The training method of image processing method and its model, device and electronic system
CN110378952A (en) A kind of image processing method and device
CN110443969A (en) A kind of fire point detecting method, device, electronic equipment and storage medium
CN108898109A (en) The determination methods, devices and systems of article attention rate
CN111242852A (en) Boundary aware object removal and content filling
CN107358182A (en) Pedestrian detection method and terminal device
CN110443862A (en) Lithologic map filling method and system based on unmanned aerial vehicle and electronic equipment
CN108961547A (en) A kind of commodity recognition method, self-service machine and computer readable storage medium
CN109508583A (en) A kind of acquisition methods and device of distribution trend
CN109840503A (en) A kind of method and device of determining information
CN109154938A (en) Using discrete non-trace location data by the entity classification in digitized map
CN109740609A (en) A kind of gauge detection method and device
CN109684910A (en) A kind of method and system of network detection transmission line of electricity ground surface environment variation
CN113191204A (en) Multi-scale blocking pedestrian detection method and system
CN109543617A (en) The detection method of intelligent vehicle movement traffic information based on YOLO target detection technique
Yadav et al. An improved deep learning-based optimal object detection system from images
CN110490058A (en) Training method, device, system and the computer-readable medium of pedestrian detection model
Nassar et al. Assessing data quality of annotations with Krippendorff alpha for applications in computer vision
CN109492573A (en) A kind of pointer read method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination