US11120556B2 - Iterative method for salient foreground detection and multi-object segmentation - Google Patents

Iterative method for salient foreground detection and multi-object segmentation Download PDF

Info

Publication number
US11120556B2
US11120556B2 US16/880,505 US202016880505A US11120556B2 US 11120556 B2 US11120556 B2 US 11120556B2 US 202016880505 A US202016880505 A US 202016880505A US 11120556 B2 US11120556 B2 US 11120556B2
Authority
US
United States
Prior art keywords
image
saliency
score
graph
superpixel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
US16/880,505
Other versions
US20200286239A1 (en
Inventor
Alexander C. Loui
David Kloosterman
Michal KUCER
Nathan Cahill
David Messinger
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Kodak Alaris Inc
Original Assignee
Kodak Alaris Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Kodak Alaris Inc filed Critical Kodak Alaris Inc
Assigned to KODAK ALARIS INC. reassignment KODAK ALARIS INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: LOUI, ALEXANDER, CAHILL, NATHAN, MESSINGER, DAVID, KLOOSTERMAN, DAVID, KUCER, MICHAL
Priority to US16/880,505 priority Critical patent/US11120556B2/en
Publication of US20200286239A1 publication Critical patent/US20200286239A1/en
Assigned to KPP (NO. 2) TRUSTEES LIMITED reassignment KPP (NO. 2) TRUSTEES LIMITED SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: KODAK ALARIS INC.
Publication of US11120556B2 publication Critical patent/US11120556B2/en
Application granted granted Critical
Assigned to THE BOARD OF THE PENSION PROTECTION FUND reassignment THE BOARD OF THE PENSION PROTECTION FUND ASSIGNMENT OF SECURITY INTEREST Assignors: KPP (NO. 2) TRUSTEES LIMITED
Assigned to THE BOARD OF THE PENSION PROTECTION FUND reassignment THE BOARD OF THE PENSION PROTECTION FUND IP SECURITY AGREEMENT SUPPLEMENT (FISCAL YEAR 2022) Assignors: KODAK ALARIS INC.
Assigned to FGI WORLDWIDE LLC reassignment FGI WORLDWIDE LLC SECURITY AGREEMENT Assignors: KODAK ALARIS INC.
Assigned to KODAK ALARIS INC. reassignment KODAK ALARIS INC. RELEASE BY SECURED PARTY (SEE DOCUMENT FOR DETAILS). Assignors: THE BOARD OF THE PENSION PROTECTION FUND
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/11Region-based segmentation
    • G06K9/4671
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/12Edge-based segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/136Segmentation; Edge detection involving thresholding
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/162Segmentation; Edge detection involving graph-based methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/194Segmentation; Edge detection involving foreground-background segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10004Still image; Photographic image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20072Graph-based image processing

Definitions

  • a system and method overcomes the deficiencies in prior art systems and methods by employing an iterative approach so that a wider array of images, including images with multiple subjects, can be analyzed for salient foreground objects.
  • the present invention is directed to a system and method for iterative foreground detection and multi-object segmentation.
  • a new “background prior” is introduced to improve the foreground segmentation results.
  • three complimentary embodiments are presented and demonstrated to detect and segment foregrounds containing multiple objects.
  • the first embodiment performs an iterative segmentation of the image to “pull out” the various salient objects in the image.
  • a higher dimensional embedding of the image graph is used to estimate the saliency score and extract multiple salient objects.
  • a newly proposed metric is used to automatically pick the number of eigenvectors to consider in an alternative method to iteratively compute the image saliency map.
  • Experimental results show that the proposed methods succeed in extracting multiple foreground objects from an image with a much better accuracy than previous methods.
  • FIG. 1 shows a comparison of the saliency maps after an application of the method of the present invention, specifically showing an original image and corresponding saliency maps according to prior art methods and an improved saliency map according to the present invention
  • FIG. 2 shows an original image having a plurality of separate objects and a plurality of saliency maps with non-zero eigenvectors according to the present invention
  • FIG. 3 shows an original image having a single object and a plurality of saliency maps with non-zero eigenvectors according to the present invention
  • FIG. 4 shows a flowchart corresponding to a first embodiment of the invention
  • FIG. 5 shows an example of the progression of the method of the present invention, specifically showing an original image with a plurality of separate objects, and a selection of an optimum saliency map associated with a number of iterations of the method of the present invention
  • FIG. 6 shows an example of the progression of the method of the present invention, specifically showing an original image with a single object, and a selection of an optimum saliency map associated with a number of iterations of the method of the present invention
  • FIG. 7 shows an example of the progression of the method of the present invention, specifically showing an original image with four separate objects, and a selection of an optimum saliency map associated with a number of iterations of the method of the present invention
  • FIG. 8 shows a flowchart corresponding to a second embodiment of the invention.
  • FIG. 9 shows another example of the progression of the method of the present invention, wherein the total number of eigenvectors is three, and the best saliency map for an image with a single object corresponds to one iteration, and the best saliency map for an image with four objects corresponds to three iterations;
  • FIG. 10 shows a flowchart corresponding to a third embodiment of the invention.
  • FIG. 11 shows another example of the progression of the method of the present invention for an image with multiple salient subjects and corresponding saliency maps for a total of six iterations of the method of the present invention
  • FIG. 12 shows another example of the progression of the method of the present invention for an image with a single salient subject wherein only one iteration is performed;
  • FIG. 13 shows examples of saliency maps obtained according to the third method of the present invention.
  • FIG. 14 shows examples of improved performance in saliency maps using a higher dimensional node embedding according to the method of the present invention.
  • the present invention is directed to a system and method for automatically detecting salient foreground objects and iteratively segmenting these objects in the scene of an image.
  • RAG Image Region Adjacency Graph
  • each vertex v ⁇ V represents a superpixel from SLIC and is assigned a value of the mean Lab color of the superpixel.
  • the edge set E consists of the edges connecting vertices i and j, if their corresponding superpixels share a border in the segmented image. Each edge is assigned a weighed that is proportional to the Lab color difference between neighboring superpixels:
  • the graph G can be augmented with a background node b, which is assigned the mean color of the boundary, and a set of edges that connects the background node and the superpixels on the edge of the image with weights computed by equation (1).
  • Embodiments of the present invention are directed to augmenting the “background prior,” which will be described in more detail.
  • the background is often very cluttered, and thus computing the edge weights by considering the average background color will fail to capture the background prior effectively by computing very small weights, since the average background color will be sufficiently different from each of the border superpixels and thus resulting in an unsatisfying saliency map.
  • a set of colors representing the background is assigned to the background node.
  • a K-Means clustering of the border colors is performed, and then the K-Means cluster centers, ⁇ c 1 b , . . . , c k b ⁇ , are used to represent the background prior in the node.
  • the maximum of the weights are computed between region i and each of the k cluster center colors:
  • w i , b max j ⁇ ⁇ 1 , ... ⁇ , k ⁇ ⁇ 1 ⁇ c i - c j b ⁇ 2 + ⁇ ( 4 )
  • FIG. 1 shows a comparison of the saliency maps after such enforcement. Specifically, FIG. 1 shows the original image 101 and its saliency map ground truth 104 , as well as the saliency map 102 produced according to the present invention, which is much better than the saliency map 103 produced by Perazzi et al.
  • Embodiments of the present invention are also directed to detecting multiple objects, which will be described in more detail.
  • the foreground segmentation method allows for detecting multiple salient subjects in the image by using the following schemes: (1) an iterative foreground segmentation scheme, and (2) two alternative multi-object foreground segmentation schemes, which use the eigenvector of the Image Region Adjacency Graph (“RAG”) as an embedding for the nodes and analysis of the presence of additional objects. This embedding is then used to calculate an alternative saliency score.
  • RAG Image Region Adjacency Graph
  • Both schemes use a metric to determine the ideal foreground segmentation.
  • the metric used for picking the best saliency map, and the Silhouette score which is a main component of the metric, is described.
  • the Silhouette score is now described in further detail.
  • the K-Means clustering to cluster the saliency score into two (Foreground/Background) clusters is used, and then a metric is computed known as the “Silhouette score”, first introduced by Rousseeuw (P. Rousseeuw, “Silhouettes: A graphical aid to the interpretation and validation of cluster analysis,” Journal of Computational and Applied Mathematics, 20:53-65, 1987).
  • the Silhouette score is one of the possible metrics that is used in interpretation and validation of cluster analysis.
  • s ⁇ ( i ) b ⁇ ( i ) - a ⁇ ( i ) max ⁇ ⁇ ⁇ a ⁇ ( i ) , b ⁇ ( i ) ⁇ ( 5 ) which is then combined into a final score f sil for the image by taking the average of s(i) for all of the superpixels.
  • Stopping criterion/metric is now described. Both of the above multi-object segmentation schemes detailed in the next section relies on some sort of stopping criterion or metric, which would determine either the ideal number of iterations or eigenvectors to consider when computing the saliency map for images with multiple objects. In order to determine the ideal number of iterations or number of eigenvectors, a metric that combines the Silhouette score, f sil , and mean image saliency of the image is used:
  • FIG. 2 shows a first original image 201 a , including a plurality of objects, a saliency map 202 a from a first non-zero eigenvector, a saliency map 203 a from a second non-zero eigenvector, a saliency map 204 a from a third non-zero eigenvector, and a final saliency map 205 a .
  • FIG. 2 also shows a second original image 201 b , including a plurality of objects, a saliency map 202 b from a first non-zero eigenvector, a saliency map 203 b from a second non-zero eigenvector, a saliency map 204 b from a third non-zero eigenvector, and a final saliency map 205 b .
  • the same cannot be said of many of the images that only contain a single salient object, as can be seen in FIG. 3 .
  • the fielder vector picks out the most salient object in the image and the subsequent eigenvector (at times several) contains redundant information regarding the object. Shown in FIG.
  • FIG. 3 is a first original image including a single salient object 301 a , a corresponding saliency map from a first non-zero eigenvector 302 a , and a corresponding saliency map from a second non-zero eigenvector 303 a . Also shown in FIG. 3 is a second original image including a single salient object 301 b , a corresponding saliency map from a first non-zero eigenvector 302 b , and a corresponding saliency map from a second non-zero eigenvector 303 b.
  • Stopping criterion based on the eigenvalue difference is now described.
  • a different Stopping criterion is based on the percentage eigenvalue difference between subsequent dimensions.
  • ⁇ i ⁇ i + 1 - ⁇ i ⁇ i + 1 ( 7 )
  • ⁇ i the i th eigenvalue.
  • Multi-object segmentation schemes according to embodiments of the present invention are now described.
  • a first iterative foreground segmentation scheme is described below:
  • FIG. 4 shows a flowchart that corresponds to the first scheme described above.
  • step 402 decide the number of iterations, n, to consider in choosing the best Saliency map.
  • step 406 compute the score image for iteration i.
  • step 410 find the set, £, of nodes or superpixels for which the saliency score is greater than a threshold S th .
  • step 412 cut out the nodes from the RAG that belongs to the set E. Compute saliency scores for the reduced graph as previously described.
  • step 414 combine the Saliency scores of the smaller region with the scores for the nodes from the set E.
  • step 416 compute the Saliency map using the new saliency scores, and return to step 406 .
  • FIG. 5 shows an example embodiment of the progression of the method of the present invention, wherein the best saliency map is chosen having either three or four iterations.
  • FIG. 5 shows an example embodiment of the progression of the method of the present invention, wherein the best saliency map is chosen having either three or four iterations.
  • saliency map 506 a is chosen
  • FIG. 6 shows the original image 601 of a scene with one salient object and the corresponding saliency maps as the number of eigenvectors is varied for superpixel embedding: one eigenvector 602 , two eigenvectors 603 , and three eigenvectors 604 .
  • the saliency map 602 with one eigenvector was selected to be the best according to the score.
  • FIG. 6 shows the original image 601 of a scene with one salient object and the corresponding saliency maps as the number of eigenvectors is varied for superpixel embedding: one eigenvector 602 , two eigenvectors 603 , and three eigenvectors 604 .
  • the saliency map 602 with one eigenvector was selected to be the best according to the score.
  • FIG. 7 shows the original image 701 of a scene with multiple salient objects and the corresponding saliency maps as the number of eigenvectors is varied for superpixel embedding: one eigenvector 702 , two eigenvectors 703 , and three eigenvectors 704 .
  • the saliency map 704 with three eigenvectors was selected to be the best according to the score.
  • FIG. 8 shows the flowchart that corresponds to the second method of the present invention.
  • the total number of iterations, n, to consider in choosing the best Saliency map is decided.
  • the RAG of the image as described in Perazzi et al. is constructed and augmented with the improved background node.
  • a Laplacian matrix of the image RAG is constructed and its decomposition is computed, and k is set equal to 1.
  • the k smallest eigenvectors corresponding to k smallest nonzero eigenvalues are considered and are used as k-dimensional embedding of the graph nodes.
  • the k-dimensional embedding is a numerical representation of each of the nodes of the image RAG.
  • the embedding includes k numerical descriptors that are obtained from the k eigenvectors in consideration (i.e., the component of each eigenvector that corresponds to a particular node is used, e.g., if the node called i is represented by the m th component of an eigenvector, the k-dimensional embedding of node i includes the m th components of each eigenvector.)
  • the distance between the k-dimensional embedding of the background node and node i is calculated.
  • step 812 all of the distances are rescaled to lie in the range between [0,1], which gives the relevant saliency scores S.
  • step 814 the saliency map and the new saliency scores are computed.
  • step 816 the score image for iteration i is computed.
  • FIG. 9 shows an example of the progression of a embodiment of the present invention, when the total number of eigenvectors to consider to be chosen is three.
  • the number of iterations, n is selected.
  • n is set equal to three.
  • FIG. 9 shows an original image 901 a , which includes a single object.
  • FIG. 9 also shows a second original image 901 b , which includes four objects.
  • An alternative embodiment of the present invention is directed to a method comprising extracting multiple salient objects.
  • the method first computes the desired number of eigenvectors to consider and subsequently constructs the saliency map.
  • an adaptive way is used to calculate a threshold.
  • the adaptive threshold was proposed in “Frequency-tuned Salient Region Detection,” by R. Achanta, S. Hemami, F. Estrada and S. Süsstrunk, IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1597-1604 (2009).
  • the adaptive threshold is defined as twice the mean image saliency:
  • FIG. 10 shows a flowchart that corresponds to this method.
  • the number of iterations, n, to consider in choosing the best Saliency map is decided.
  • the RAG of the image as described in Perazzi et al. is constructed, and augmented with an improved background node.
  • the image threshold T k for dimension k is computed.
  • the new vector of the Saliency score S k for each superpixel i is computed as set forth above.
  • Step 1014 asks if k is equal to n. If yes, then the method terminates at step 1016 . If no, then k is set to k+1, and the method continues at step 1008 .
  • FIG. 11 shows an example of the progression of the method illustrated in FIG. 10 and described above.
  • FIG. 11 includes an original image 1100 , which includes multiple salient objects.
  • the best dimension (which is six in this case) is chosen according to equation (8), as is shown in graph 1107 .
  • FIG. 12 shows an example of the progression of the method illustrated in FIG. 10 and described above.
  • FIG. 12 includes an original image 1200 , which includes a single salient object.
  • the best dimension (which is one in this case) is chosen according to equation (8) as is shown in graph 1202 .
  • FIG. 13 shows an example of the saliency maps as obtained by the method illustrated in FIG. 10 and described above. Specifically, FIG. 13 shows example plots ( 1305 , 1306 )—one of the Eigenvalue Function Difference as defined by equation (7) for a multi-subject image (plot 1305 ), and one of the Eigenvalue Function Difference for a single subject image (plot 1306 ).
  • Plot 1305 corresponds to the original image 1301 , which contains multiple salient objects.
  • the final saliency map for original image 1301 is shown as 1302 .
  • Plot 1306 corresponds to the original image 1302 , which contains just a single salient object.
  • the final saliency map for original image 1303 is shown as 1304 .
  • Segmentation results of the present invention are described below.
  • the background node By assigning the background node a set of most frequent colors, in the case where the image has a “complicated” background or multiple colors in the image, the resulting graph will have higher weights on the edges connecting the border to the background node, which often produces good foreground detection results.
  • an embodiment of the present invention iteratively detects the most salient objects in the foreground. As can be seen from the example output depicted in FIG. 14 , improved results in detecting multiple salient subjects as compared to the prior art methods are obtained.
  • FIG. 14 shows three original images ( 1401 a , 1401 b , and 1401 c ), their corresponding saliency maps as obtained pursuant to the prior art method described by Perazzi et al. ( 1402 a , 1402 b , and 1402 c ), and corresponding saliency maps obtained pursuant to the present invention described herein ( 1403 a , 1403 b , and 1403 c ).
  • the method of the present invention provides:
  • w i , b max j ⁇ ⁇ 1 , ... ⁇ , k ⁇ ⁇ 1 ⁇ c i - c j b ⁇ 2 + ⁇ .

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)
  • Image Processing (AREA)

Abstract

A system and method that performs iterative foreground detection and multi-object segmentation in an image is disclosed herein. A new background prior is introduced to improve the foreground segmentation results. Three complimentary methods detect and segment foregrounds containing multiple objects. The first method performs an iterative segmentation of the image to pull out the salient objects in the image. In a second method, a higher dimensional embedding of the image graph is used to estimate the saliency score and extract multiple salient objects. A third method uses a metric to automatically pick the number of eigenvectors to consider in an alternative method to iteratively compute the image saliency map. Experimental results show that these methods succeed in accurately extracting multiple foreground objects from an image.

Description

CROSS REFERENCE TO RELATED APPLICATIONS
This is a divisional of U.S. application Ser. No. 15/847,050, filed on Dec. 19, 2017, which is a non-provisional of and claims priority to U.S. provisional patent application No. 62/436,803, filed Dec. 20, 2016. The disclosures of the above-referenced applications are hereby incorporated by reference in their entirety.
BACKGROUND OF THE INVENTION
With continuous miniaturization of silicon technology and proliferation of consumer and cell-phone cameras, there has been an exponential increase in the number of images that are captured. Whether the images are stored on personal computers or reside on social networks (e.g. Instagram, Flickr), the sheer number of images calls for methods to determine various image properties, such as object presence or appeal for the purpose of automatic image management. One of the central problems in consumer photography centers on determining the aesthetic appeal of the image. The problem itself is challenging, because the overall aesthetic value of an image is dependent on its technical quality, composition, emotional value, and the like. In order to combine all of these aspects, sophisticated systems must be built to take into account all of the aspects of image aesthetics. One such aspect is the prediction of the presence and possible segmentation of a salient object in the image, which could inform the system about the features that the system should consider in determining the aesthetics appeal.
An efficient method for the computation of salient foreground in consumer quality images has recently been proposed in F. Perazzi, O. Sorkine-Hornung, A. Sorkine-Hornung, “Efficient Salient Foreground Detection for Images and Video using Fiedler Vectors,” Eurographics Workshop on Intelligent Cinematography and Editing, Zurich, Switzerland, May 5, 2015. However, this method is not able to deal with an image that contains multiple objects in the scene effectively. Specifically, in the case of multiple disconnected objects, this method can only correctly detect a single salient object in the scene. What is desired is a system and method that can overcome this major deficiency of the prior art method.
SUMMARY OF THE INVENTION
According to the present invention, a system and method overcomes the deficiencies in prior art systems and methods by employing an iterative approach so that a wider array of images, including images with multiple subjects, can be analyzed for salient foreground objects.
The present invention is directed to a system and method for iterative foreground detection and multi-object segmentation. A new “background prior” is introduced to improve the foreground segmentation results. Furthermore, three complimentary embodiments are presented and demonstrated to detect and segment foregrounds containing multiple objects. The first embodiment performs an iterative segmentation of the image to “pull out” the various salient objects in the image. In the second embodiment, a higher dimensional embedding of the image graph is used to estimate the saliency score and extract multiple salient objects. In the third embodiment, a newly proposed metric is used to automatically pick the number of eigenvectors to consider in an alternative method to iteratively compute the image saliency map. Experimental results show that the proposed methods succeed in extracting multiple foreground objects from an image with a much better accuracy than previous methods.
BRIEF DESCRIPTION OF THE DRAWINGS
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee. These and other aspects of the invention will be described in detail with reference to the accompanying drawings, wherein:
FIG. 1 shows a comparison of the saliency maps after an application of the method of the present invention, specifically showing an original image and corresponding saliency maps according to prior art methods and an improved saliency map according to the present invention;
FIG. 2 shows an original image having a plurality of separate objects and a plurality of saliency maps with non-zero eigenvectors according to the present invention;
FIG. 3 shows an original image having a single object and a plurality of saliency maps with non-zero eigenvectors according to the present invention;
FIG. 4 shows a flowchart corresponding to a first embodiment of the invention;
FIG. 5 shows an example of the progression of the method of the present invention, specifically showing an original image with a plurality of separate objects, and a selection of an optimum saliency map associated with a number of iterations of the method of the present invention;
FIG. 6 shows an example of the progression of the method of the present invention, specifically showing an original image with a single object, and a selection of an optimum saliency map associated with a number of iterations of the method of the present invention;
FIG. 7 shows an example of the progression of the method of the present invention, specifically showing an original image with four separate objects, and a selection of an optimum saliency map associated with a number of iterations of the method of the present invention;
FIG. 8 shows a flowchart corresponding to a second embodiment of the invention;
FIG. 9 shows another example of the progression of the method of the present invention, wherein the total number of eigenvectors is three, and the best saliency map for an image with a single object corresponds to one iteration, and the best saliency map for an image with four objects corresponds to three iterations;
FIG. 10 shows a flowchart corresponding to a third embodiment of the invention;
FIG. 11 shows another example of the progression of the method of the present invention for an image with multiple salient subjects and corresponding saliency maps for a total of six iterations of the method of the present invention;
FIG. 12 shows another example of the progression of the method of the present invention for an image with a single salient subject wherein only one iteration is performed;
FIG. 13 shows examples of saliency maps obtained according to the third method of the present invention; and
FIG. 14 shows examples of improved performance in saliency maps using a higher dimensional node embedding according to the method of the present invention.
DETAILED DESCRIPTION
The present invention is directed to a system and method for automatically detecting salient foreground objects and iteratively segmenting these objects in the scene of an image.
To efficiently represent the image, Perazzi et al. use a modified version of the SLIC (Simple Linear Iterative Clustering) Superpixel segmentation algorithm proposed in the paper “Saliency filters: Contrast based filtering for salient region detection,” by F. Perazzi, P. Krahenbuhl, Y. Pritch, and A. Hornung, proceedings of the 2012 Computer Vision and Pattern Recognition conference, pages 733-740. In this case, the image is segmented into superpixels using K-means clustering in the Color-XY space (it uses CIELab color space instead of the traditional RGB space). After Superpixel segmentation, the image is represented as a Graph G={V, E} also known as the Image Region Adjacency Graph (“RAG”), where each vertex v∈V represents a superpixel from SLIC and is assigned a value of the mean Lab color of the superpixel. To model the local relationships in the image, the edge set E consists of the edges connecting vertices i and j, if their corresponding superpixels share a border in the segmented image. Each edge is assigned a weighed that is proportional to the Lab color difference between neighboring superpixels:
w i , j = 1 c i - c j 2 + ɛ ( 1 )
where ci is a mean Lab color of the ith superpixel and e is a small constant to ensure the numerical stability of the algorithm (e.g., e=10−4). In order to represent the assumption that most of the border pixels belong to the background, the graph G can be augmented with a background node b, which is assigned the mean color of the boundary, and a set of edges that connects the background node and the superpixels on the edge of the image with weights computed by equation (1).
In order to assign a saliency score to each of the superpixels of the image, Perazzi et al. compute the Eigen decomposition of the graph Laplacian matrix L of the Image RAG. Then the Fiedler vector, the second smallest eigenvector is used to compute the saliency scores. Given the Fiedler vector f, the saliency score S is computed as:
S=−sign(f bf  (2)
And S is then scaled to the range [0,1], where fb represents the entry of the Fiedler vector corresponding to the background node.
Because an embodiment of the present invention uses a high dimensional embedding/representation, saliency scores are computed in the following way:
S(i)=∥f i −f b∥  (3)
where S(i) is the ith component of the vector S and the saliency score for the ith superpixel and fi and fb are the embedding of the ith and background superpixels.
Embodiments of the present invention are directed to augmenting the “background prior,” which will be described in more detail. There are images in which the background is often very cluttered, and thus computing the edge weights by considering the average background color will fail to capture the background prior effectively by computing very small weights, since the average background color will be sufficiently different from each of the border superpixels and thus resulting in an unsatisfying saliency map. To correct for such a pitfall, instead of assigning to the image background node the average border background color (average color of the border superpixels), a set of colors representing the background is assigned to the background node. A K-Means clustering of the border colors is performed, and then the K-Means cluster centers, {c1 b, . . . , ck b}, are used to represent the background prior in the node. To compute the edge weight between the background node and the border regions, the maximum of the weights are computed between region i and each of the k cluster center colors:
w i , b = max j { 1 , , k } 1 c i - c j b 2 + ɛ ( 4 )
When augmenting a background prior with multiple “colors,” the background prior is better enforced, as can be seen in FIG. 1. FIG. 1 shows a comparison of the saliency maps after such enforcement. Specifically, FIG. 1 shows the original image 101 and its saliency map ground truth 104, as well as the saliency map 102 produced according to the present invention, which is much better than the saliency map 103 produced by Perazzi et al.
Embodiments of the present invention are also directed to detecting multiple objects, which will be described in more detail. The foreground segmentation method according to embodiments of the present invention allows for detecting multiple salient subjects in the image by using the following schemes: (1) an iterative foreground segmentation scheme, and (2) two alternative multi-object foreground segmentation schemes, which use the eigenvector of the Image Region Adjacency Graph (“RAG”) as an embedding for the nodes and analysis of the presence of additional objects. This embedding is then used to calculate an alternative saliency score. Both schemes use a metric to determine the ideal foreground segmentation. Next, the metric used for picking the best saliency map, and the Silhouette score, which is a main component of the metric, is described.
The Silhouette score is now described in further detail. In order to judge the quality of the foreground segmentation, the K-Means clustering to cluster the saliency score into two (Foreground/Background) clusters is used, and then a metric is computed known as the “Silhouette score”, first introduced by Rousseeuw (P. Rousseeuw, “Silhouettes: A graphical aid to the interpretation and validation of cluster analysis,” Journal of Computational and Applied Mathematics, 20:53-65, 1987). The Silhouette score is one of the possible metrics that is used in interpretation and validation of cluster analysis.
To compute the Silhouette score, the resulting clustering and the matrix of distances (or dissimilarities as used by Rousseeuw) between the different points (e.g., superpixels and the Saliency score assigned to them in our algorithm) is needed. For each point i the method of the present invention computes:
    • a(i): average distance to the points in the same cluster as i (label that cluster A)
    • D(i, C): average distance to the points in cluster C
    • b(i)=minC·AD(i, C): by choosing a minimum of D(i, C), we compute the distance to the next best cluster assignment for i.
      The final Silhouette score for point i is computed as:
s ( i ) = b ( i ) - a ( i ) max { a ( i ) , b ( i ) } ( 5 )
which is then combined into a final score fsil for the image by taking the average of s(i) for all of the superpixels.
The Stopping criterion/metric is now described. Both of the above multi-object segmentation schemes detailed in the next section relies on some sort of stopping criterion or metric, which would determine either the ideal number of iterations or eigenvectors to consider when computing the saliency map for images with multiple objects. In order to determine the ideal number of iterations or number of eigenvectors, a metric that combines the Silhouette score, fsil, and mean image saliency of the image is used:
s c o r e i m a g e = f sil · x = 1 m y = 1 n S ( x , y ) A ( I ) ( 6 )
where S(x, y) is the image saliency score at location (x, y) and A(I) represents the area of the image, and the mean image saliency is the summation of the image saliency score at each location (x, y) divided by the area of the image A(I). Then, in order to pick the final saliency map, the map with the highest overall image saliency score defined in equation (6) is chosen.
Presence of objects in eigenvectors is now described. It is important to note the presence of multiple salient objects embedded in higher dimensions of the RAG Laplacian matrix eigen decomposition. This can be seen in FIG. 2, where the plot of the image and its eigenvectors (we compute the saliency of an eigenvector by computing the scaled distance of each superpixel to the background node) is shown. For example, FIG. 2 shows a first original image 201 a, including a plurality of objects, a saliency map 202 a from a first non-zero eigenvector, a saliency map 203 a from a second non-zero eigenvector, a saliency map 204 a from a third non-zero eigenvector, and a final saliency map 205 a. FIG. 2 also shows a second original image 201 b, including a plurality of objects, a saliency map 202 b from a first non-zero eigenvector, a saliency map 203 b from a second non-zero eigenvector, a saliency map 204 b from a third non-zero eigenvector, and a final saliency map 205 b. However, the same cannot be said of many of the images that only contain a single salient object, as can be seen in FIG. 3. As can be seen, the fielder vector picks out the most salient object in the image and the subsequent eigenvector (at times several) contains redundant information regarding the object. Shown in FIG. 3 is a first original image including a single salient object 301 a, a corresponding saliency map from a first non-zero eigenvector 302 a, and a corresponding saliency map from a second non-zero eigenvector 303 a. Also shown in FIG. 3 is a second original image including a single salient object 301 b, a corresponding saliency map from a first non-zero eigenvector 302 b, and a corresponding saliency map from a second non-zero eigenvector 303 b.
The Stopping criterion based on the eigenvalue difference is now described. A different Stopping criterion is based on the percentage eigenvalue difference between subsequent dimensions. First the full Eigen decomposition of the augmented Region Adjacency Graph is computed. Then a subset of the first k non-zero eigenvalues is taken, and the percentage difference between the subsequent dimensions is computed:
Δ i = λ i + 1 - λ i λ i + 1 ( 7 )
where λi is the ith eigenvalue.
Then, in order to get the ideal dimension n, the dimension that produces the largest difference is chosen:
n=argmax1≤i<ki}  (8)
Multi-object segmentation schemes according to embodiments of the present invention are now described. A first iterative foreground segmentation scheme is described below:
    • Perform an initial foreground segmentation as described in Perazzi et al. with the improved background model introduced earlier, and compute the scoreimage for this map.
    • Now, iteratively perform the following steps:
      • 1. Find the set, £, of nodes or superpixels for which the saliency score Si is greater than a threshold Sth.
      • 2. Modify the Image RAG by cutting out the nodes that belong to the set £ (store the saliency scores of these nodes for later processing).
      • 3. Find new saliency scores for the region which remained in RAG by computing the Fiedler Vector of the new graph and computing and modifying it the same way described in Perazzi et al.
      • 4. Combine the Saliency scores of the smaller region with the scores for the nodes from the set £, to obtain the new saliency image and compute its scoreimage.
      • 5. Repeat for predetermined number of iterations.
      • 6. Choose the segmentation map with highest scoreimage.
FIG. 4 shows a flowchart that corresponds to the first scheme described above. At step 402, decide the number of iterations, n, to consider in choosing the best Saliency map. At step 404, compute the initial Saliency scores S (also Si) as described previously (consider this i=1 iteration). At step 406, compute the scoreimage for iteration i. At step 408, ask if i<n. If yes, then i=i+1. If no, then, at step 418, choose the map with the best scoreimage. At step 410, find the set, £, of nodes or superpixels for which the saliency score is greater than a threshold Sth. At step 412, cut out the nodes from the RAG that belongs to the set E. Compute saliency scores for the reduced graph as previously described. At step 414, combine the Saliency scores of the smaller region with the scores for the nodes from the set E. At step 416, compute the Saliency map using the new saliency scores, and return to step 406.
FIG. 5 shows an example embodiment of the progression of the method of the present invention, wherein the best saliency map is chosen having either three or four iterations. For example, in FIG. 5, an original image 501 a has corresponding saliency maps 502 a, 503 a, 504 a, and 505 a for iterations k=1, k=2, k=3, and k=4. Notice that saliency map 506 a is chosen as the best saliency map, which happens to correspond to the saliency map after three iterations (i.e., k=3). FIG. 5 also shows an original image 501 b having corresponding saliency maps 502 b, 503 b, 504 b, and 505 b for iterations k=1, k=2, k=3, and k=4. Again, saliency map 506 b is chosen as the best saliency map, but in this instance, that corresponds to the saliency map after four iterations (i.e., k=4).
An alternative scheme for foreground segmentation according to an embodiment of the present invention proceeds as follows:
    • Construct the RAG of the image as described in Perazzi et al. and augmented with the improved background node.
    • Construct the Laplacian matrix of the Image RAG.
    • Consider the k smallest eigenvectors corresponding to nonzero eigenvalues and use them as a k-dimensional embedding of the graph nodes.
    • Calculate the new saliency score by:
      • 1. Calculate the distance between the k-dimensional embedding of the background node and node i.
      • 2. Rescale all the distances to lie in the range between [0, 1], which will give us the relevant saliency scores S.
    • Compute a metric (such as the one described above) for maps created by considering projections with varying number of eigenvectors (we consider up to four eigenvectors for the embedding of our graph) and choose the map with highest score achieved by the metric (i.e., highest scoreimage if using the above metric).
Now consider the images and the corresponding sequences of saliency maps depicted in FIG. 6 and FIG. 7. FIG. 6 shows the original image 601 of a scene with one salient object and the corresponding saliency maps as the number of eigenvectors is varied for superpixel embedding: one eigenvector 602, two eigenvectors 603, and three eigenvectors 604. The saliency map 602 with one eigenvector was selected to be the best according to the score. FIG. 7 shows the original image 701 of a scene with multiple salient objects and the corresponding saliency maps as the number of eigenvectors is varied for superpixel embedding: one eigenvector 702, two eigenvectors 703, and three eigenvectors 704. In this case, the saliency map 704 with three eigenvectors was selected to be the best according to the score.
FIG. 8 shows the flowchart that corresponds to the second method of the present invention. At step 802 the total number of iterations, n, to consider in choosing the best Saliency map is decided. At step 804, the RAG of the image as described in Perazzi et al. is constructed and augmented with the improved background node. At step 806, a Laplacian matrix of the image RAG is constructed and its decomposition is computed, and k is set equal to 1. At step 808, the k smallest eigenvectors corresponding to k smallest nonzero eigenvalues are considered and are used as k-dimensional embedding of the graph nodes. The k-dimensional embedding is a numerical representation of each of the nodes of the image RAG. The embedding includes k numerical descriptors that are obtained from the k eigenvectors in consideration (i.e., the component of each eigenvector that corresponds to a particular node is used, e.g., if the node called i is represented by the mth component of an eigenvector, the k-dimensional embedding of node i includes the mth components of each eigenvector.) At step 810, the distance between the k-dimensional embedding of the background node and node i is calculated. At step 812, all of the distances are rescaled to lie in the range between [0,1], which gives the relevant saliency scores S. At step 814, the saliency map and the new saliency scores are computed. At step 816, the scoreimage for iteration i is computed. At step 818, ask if k<n? If yes, k=k+1, and the method continues at step 808. If no, the method terminates at step 820 and the saliency map with the best scoreimage is chosen.
FIG. 9 shows an example of the progression of a embodiment of the present invention, when the total number of eigenvectors to consider to be chosen is three. In other words, in a first step, the number of iterations, n, is selected. In the embodiment shown in FIG. 9, n is set equal to three. FIG. 9 shows an original image 901 a, which includes a single object. FIG. 9 also shows a second original image 901 b, which includes four objects. Images 902 a and 902 b correspond to an iteration with k=1, and a saliency map with one eigenvector. Images 903 a and 903 b correspond to an iteration with k=2, and a saliency map with two eigenvectors. Images 904 a and 904 b correspond to an iteration with k=3, and a saliency map with three eigenvectors. Note that for original image 901 a (which contains just one object), the best map is image 902 a with one iteration. For original image 901 b (which contains four objects), the best map is image 904 b with three iterations.
An alternative embodiment of the present invention is directed to a method comprising extracting multiple salient objects. According to this embodiment, the method first computes the desired number of eigenvectors to consider and subsequently constructs the saliency map. As part of this method, an adaptive way is used to calculate a threshold. The adaptive threshold was proposed in “Frequency-tuned Salient Region Detection,” by R. Achanta, S. Hemami, F. Estrada and S. Süsstrunk, IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1597-1604 (2009). The adaptive threshold is defined as twice the mean image saliency:
T a = 2 W * H x = 1 m y = 1 n S ( x , y ) ( 9 )
This embodiment involves a method comprising the following steps:
    • First, pre-compute the number, n, of eigenvectors to consider.
    • Compute the vector of Saliency scores, S, for the superpixels using the improved background prior.
    • If the n=1, then the method is completed. Otherwise repeat the following procedure for n≥2. Assume the saliency scores for the first k, k<n dimensions, which we will call Sk, have been computed. To incorporate the k+1th dimension in the computation of the final saliency scores S, proceed as follows:
      • Compute the saliency scores for the k+1th dimension, Sk+1, by computing the distance of each superpixel to the background node and rescaling the score between [0;1].
      • Compute the threshold Ta k+1 based on Sk+1 and extract the set of superpixels i for which it is true that Si k+1≥Ta k+1 and call the set N.
      • For i∈N, let Si k+1:=max{Si k+1,Si k}, otherwise Si k+1:=Si k.
      • If k+1<n, then repeat the procedure, else construct the image saliency map.
FIG. 10 shows a flowchart that corresponds to this method. At step 1002, the number of iterations, n, to consider in choosing the best Saliency map is decided. At step 1004, the RAG of the image as described in Perazzi et al. is constructed, and augmented with an improved background node. Step 1006 asks if n>1? If no, then the method is terminated at step 1016 by computing the saliency map from the saliency scores obtained by computing second smallest eigenvector from the Laplacian matrix of the Image Adjacency Graph constructed from the image using the augmented background prior. If yes, the method continues at step 1008. Starting with k=2, the Saliency scores of the current dimension K are computed. At step 1010, the image threshold Tk, for dimension k is computed. At step 1012, the new vector of the Saliency score Sk for each superpixel i is computed as set forth above. Step 1014 asks if k is equal to n. If yes, then the method terminates at step 1016. If no, then k is set to k+1, and the method continues at step 1008.
FIG. 11 shows an example of the progression of the method illustrated in FIG. 10 and described above. FIG. 11 includes an original image 1100, which includes multiple salient objects. A dimension saliency map 1101 a and a total saliency map 1101 b are shown for a first iteration (k=1). A dimension saliency map 1102 a and a total saliency map 1102 b are shown for a second iteration (k=2). A dimension saliency map 1103 a and a total saliency map 1103 b are shown for a third iteration (k=3). A dimension saliency map 1104 a and a total saliency map 1104 b are shown for a fourth iteration (k=4). A dimension saliency map 1105 a and a total saliency map 1105 b are shown for a fifth iteration (k=5). Finally, a dimension saliency map 1106 a and a total saliency map 1106 b are shown for a sixth iteration (k=6). The best dimension (which is six in this case) is chosen according to equation (8), as is shown in graph 1107.
FIG. 12 shows an example of the progression of the method illustrated in FIG. 10 and described above. FIG. 12 includes an original image 1200, which includes a single salient object. A dimension saliency map 1201 a and a total saliency map 1201 b are shown for a first iteration (k=1). The best dimension (which is one in this case) is chosen according to equation (8) as is shown in graph 1202.
FIG. 13 shows an example of the saliency maps as obtained by the method illustrated in FIG. 10 and described above. Specifically, FIG. 13 shows example plots (1305, 1306)—one of the Eigenvalue Function Difference as defined by equation (7) for a multi-subject image (plot 1305), and one of the Eigenvalue Function Difference for a single subject image (plot 1306). Plot 1305 corresponds to the original image 1301, which contains multiple salient objects. The final saliency map for original image 1301 is shown as 1302. Plot 1306 corresponds to the original image 1302, which contains just a single salient object. The final saliency map for original image 1303 is shown as 1304.
The Computational performance of the method of the present invention is described below. From experience, the K-Means clustering of the colors incurs little performance penalty to the overall algorithm due to the fact that the clustering is done on the average-superpixel colors and the number, of superpixels in the border is much smaller than the pixels in the border.
Segmentation results of the present invention are described below. By assigning the background node a set of most frequent colors, in the case where the image has a “complicated” background or multiple colors in the image, the resulting graph will have higher weights on the edges connecting the border to the background node, which often produces good foreground detection results.
When the saliency maps of the original images are analyzed, it can be seen that the method of the present invention tends to pick out a single subject with a distinctive color. Therefore, in order to detect multiple foreground subjects, an embodiment of the present invention iteratively detects the most salient objects in the foreground. As can be seen from the example output depicted in FIG. 14, improved results in detecting multiple salient subjects as compared to the prior art methods are obtained.
The last three sets of images in FIG. 14 show the results of the foreground segmentation in the case when a higher dimensional node embedding is used (as opposed to the using only the Fiedler vector as in Perazzi et al.). FIG. 14 shows three original images (1401 a, 1401 b, and 1401 c), their corresponding saliency maps as obtained pursuant to the prior art method described by Perazzi et al. (1402 a, 1402 b, and 1402 c), and corresponding saliency maps obtained pursuant to the present invention described herein (1403 a, 1403 b, and 1403 c). In summary, the method of the present invention provides:
    • 1. Modification of the image prior: instead of assigning to the image “background” node the average border background color (average color of the border superpixels), the method first performs a K-Means clustering of the colors. Then the method attaches to the background node a set of colors that represent the cluster centers. To compute the edge weight between the background node and the border regions, the maximum of the weights computed between region i and each of the k cluster center colors is taken:
w i , b = max j { 1 , , k } 1 c i - c j b 2 + ɛ .
    • 2. An iterative segmentation scheme, which extends the foreground segmentation to allow for the presence of multiple salient subjects in the image.
    • 3. Alternative multi-object foreground segmentation, which uses the eigenvector of the Image RAG as an embedding for the nodes. This embedding is then used to calculate an alternative saliency score.
    • 4. A new stopping criterion and metric for multi-object segmentation is used.
While the foregoing written description describes exemplary embodiments of the present invention, persons of ordinary skill in the art will appreciate that the inventors have contemplated the existence of variations and combinations of these embodiments. The invention is therefore not to be limited strictly by any exemplary embodiments described herein. Alterations, modifications, and deviations from the embodiments may be made and still achieve the advantages of the invention without departing from the spirit or scope of the invention. Such alterations, modifications, and deviations should be understood by persons of ordinary skill in the art to be covered by the appended claims.

Claims (11)

The invention claimed is:
1. A method for detecting and segmenting multiple foreground objects, comprising:
(a) constructing an image adjacency graph for an image by performing superpixel image segmentation of the image with an augmented background model;
(b) constructing a Laplacian matrix of the image adjacency graph;
(c) embedding the k smallest eigenvectors corresponding to nonzero eigenvalues as a k-dimensional embedding of graph nodes in the image adjacency graph;
(d) calculating a new saliency score by:
(i) calculating a distance between a k-dimensional embedding of a background node and a node i;
(ii) renormalizing all of the distances to lie in the range between [0, 1] to generate relevant saliency scores;
(e) computing an overall image saliency score for a saliency map generated by the relevant saliency scores;
(f) repeating steps (c) to (e) for a different k ranging from one to a predetermined number; and
(g) choosing the saliency map with highest overall image saliency score.
2. The method of claim 1 wherein the image adjacency graph comprises a reduced image representation of the image, in which a group of pixels are represented by an average color of pixels in a corresponding superpixel and represented by a node in a graph, and local relationships in the image are represented by connecting two regions in the graph if the corresponding superpixels share a border in the original image.
3. The method of claim 1, wherein the augmented background model comprises a multi-color background model obtained by clustering the superpixel colors represented in the Lab color space.
4. The method of claim 1 wherein the k-dimensional embedding comprises a numerical representation for each node of the image region adjacency graph, where each embedding consists of k numerical descriptors corresponding to a particular node that are obtained from the k eigenvectors in consideration.
5. The method of claim 1 wherein the overall image saliency score is computed by combining a silhouette score and a mean image saliency.
6. The method of claim 1 wherein the overall image saliency map is computed by creating a new greyscale image of a size identical to the image and assigning a saliency score to each pixel that corresponds to the saliency score of the superpixel to which the pixel belongs.
7. A method for detecting and segmenting multiple foreground objects in an image, comprising;
(a) computing a number of iterations, n, which represents the largest percentage difference between dimensions of two subsequent eigenvalues;
(b) constructing an image adjacency graph by performing superpixel image segmentation of the image with an augmented background model;
(c) computing a set of saliency scores from the image adjacency graph for a current dimension k when n does not equal 1 and beginning with dimension k equal to 2;
(d) extracting a set of superpixels having a saliency score larger than a predetermined threshold;
(e) computing a new saliency score for each superpixel in the set of extracted superpixels;
(f) repeat steps (c) to (e) until dimension k is equal to n;
(g) computing an image saliency map based on a set of newest saliency scores.
8. The method of claim 7 wherein the image adjacency graph comprises a reduced image representation of the image, in which a group of pixels are represented by an average color of the pixels in a corresponding superpixel and represented by a node in a graph, and local relationships in the image are represented by connecting two regions in the graph if the corresponding superpixels share a border in the original image.
9. The method of claim 7, wherein the augmented background model comprises a multi-color background model obtained by clustering superpixel colors represented in the Lab color space.
10. The method of claim 7 wherein all saliency scores are computed by using the Fiedler vector and rescaling the resulting scores between [0; 1].
11. The method of claim 7, wherein the threshold is computed as a function of mean image saliency score.
US16/880,505 2016-12-20 2020-05-21 Iterative method for salient foreground detection and multi-object segmentation Active US11120556B2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US16/880,505 US11120556B2 (en) 2016-12-20 2020-05-21 Iterative method for salient foreground detection and multi-object segmentation

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201662436803P 2016-12-20 2016-12-20
US15/847,050 US10706549B2 (en) 2016-12-20 2017-12-19 Iterative method for salient foreground detection and multi-object segmentation
US16/880,505 US11120556B2 (en) 2016-12-20 2020-05-21 Iterative method for salient foreground detection and multi-object segmentation

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US15/847,050 Division US10706549B2 (en) 2016-12-20 2017-12-19 Iterative method for salient foreground detection and multi-object segmentation

Publications (2)

Publication Number Publication Date
US20200286239A1 US20200286239A1 (en) 2020-09-10
US11120556B2 true US11120556B2 (en) 2021-09-14

Family

ID=60991573

Family Applications (2)

Application Number Title Priority Date Filing Date
US15/847,050 Active 2038-08-27 US10706549B2 (en) 2016-12-20 2017-12-19 Iterative method for salient foreground detection and multi-object segmentation
US16/880,505 Active US11120556B2 (en) 2016-12-20 2020-05-21 Iterative method for salient foreground detection and multi-object segmentation

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US15/847,050 Active 2038-08-27 US10706549B2 (en) 2016-12-20 2017-12-19 Iterative method for salient foreground detection and multi-object segmentation

Country Status (4)

Country Link
US (2) US10706549B2 (en)
EP (1) EP3559906B1 (en)
CN (1) CN110088805B (en)
WO (1) WO2018118914A2 (en)

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018142496A1 (en) * 2017-02-01 2018-08-09 株式会社日立製作所 Three-dimensional measuring device
CN109359654B (en) * 2018-09-18 2021-02-12 北京工商大学 Image segmentation method and system based on frequency tuning global saliency and deep learning
CN110111338B (en) * 2019-04-24 2023-03-31 广东技术师范大学 Visual tracking method based on superpixel space-time saliency segmentation
JP7475959B2 (en) * 2020-05-20 2024-04-30 キヤノン株式会社 Image processing device, image processing method, and program
CN111815582B (en) * 2020-06-28 2024-01-26 江苏科技大学 Two-dimensional code region detection method for improving background priori and foreground priori
CN112200826B (en) * 2020-10-15 2023-11-28 北京科技大学 Industrial weak defect segmentation method
CN112163589B (en) * 2020-11-10 2022-05-27 中国科学院长春光学精密机械与物理研究所 Image processing method, device, equipment and storage medium
CN112418218B (en) * 2020-11-24 2023-02-28 中国地质大学(武汉) Target area detection method, device, equipment and storage medium
CN112991361B (en) * 2021-03-11 2023-06-13 温州大学激光与光电智能制造研究院 Image segmentation method based on local graph structure similarity
CN113160251B (en) * 2021-05-24 2023-06-09 北京邮电大学 Automatic image segmentation method based on saliency priori
CN113705579B (en) * 2021-08-27 2024-03-15 河海大学 Automatic image labeling method driven by visual saliency
CN114998320B (en) * 2022-07-18 2022-12-16 银江技术股份有限公司 Method, system, electronic device and storage medium for visual saliency detection
CN115631208B (en) * 2022-10-13 2023-06-16 中国矿业大学 Unmanned aerial vehicle image mining area ground crack extraction method based on improved active contour model
CN116703939B (en) * 2023-05-16 2024-08-20 绿萌科技股份有限公司 Color fruit image segmentation method based on color difference condition of super pixel region

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090254236A1 (en) * 2005-10-11 2009-10-08 Peters Ii Richard A System and method for image mapping and visual attention
US20100226564A1 (en) * 2009-03-09 2010-09-09 Xerox Corporation Framework for image thumbnailing based on visual similarity
US20110295515A1 (en) * 2010-05-18 2011-12-01 Siemens Corporation Methods and systems for fast automatic brain matching via spectral correspondence
US20120275701A1 (en) * 2011-04-26 2012-11-01 Minwoo Park Identifying high saliency regions in digital images
US20160239981A1 (en) * 2013-08-28 2016-08-18 Aselsan Elektronik Sanayi Ve Ticaret Anonim Sirketi A semi automatic target initialization method based on visual saliency
US20160379055A1 (en) * 2015-06-25 2016-12-29 Kodak Alaris Inc. Graph-based framework for video object segmentation and extraction in feature space
US20170337711A1 (en) * 2011-03-29 2017-11-23 Lyrical Labs Video Compression Technology, LLC Video processing and encoding
US20180295375A1 (en) * 2017-04-05 2018-10-11 Lyrical Labs Video Compression Technology, LLC Video processing and encoding
US10198629B2 (en) * 2015-06-22 2019-02-05 Photomyne Ltd. System and method for detecting objects in an image
US20190139282A1 (en) * 2017-11-09 2019-05-09 Adobe Inc. Saliency-Based Collage Generation using Digital Images

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8929636B2 (en) * 2012-02-02 2015-01-06 Peter Yim Method and system for image segmentation
US9147255B1 (en) * 2013-03-14 2015-09-29 Hrl Laboratories, Llc Rapid object detection by combining structural information from image segmentation with bio-inspired attentional mechanisms
EP3028256A4 (en) * 2013-07-31 2016-10-19 Microsoft Technology Licensing Llc Geodesic saliency using background priors
US9330334B2 (en) * 2013-10-24 2016-05-03 Adobe Systems Incorporated Iterative saliency map estimation
CN104809729B (en) * 2015-04-29 2018-08-28 山东大学 A kind of saliency region automatic division method of robust
CN105760886B (en) * 2016-02-23 2019-04-12 北京联合大学 A kind of more object segmentation methods of image scene based on target identification and conspicuousness detection

Patent Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7835820B2 (en) * 2005-10-11 2010-11-16 Vanderbilt University System and method for image mapping and visual attention
US20110082871A1 (en) * 2005-10-11 2011-04-07 Vanderbilt University System and method for image mapping and visual attention
US8060272B2 (en) * 2005-10-11 2011-11-15 Vanderbilt University System and method for image mapping and visual attention
US20090254236A1 (en) * 2005-10-11 2009-10-08 Peters Ii Richard A System and method for image mapping and visual attention
US20100226564A1 (en) * 2009-03-09 2010-09-09 Xerox Corporation Framework for image thumbnailing based on visual similarity
US20110295515A1 (en) * 2010-05-18 2011-12-01 Siemens Corporation Methods and systems for fast automatic brain matching via spectral correspondence
US20170337711A1 (en) * 2011-03-29 2017-11-23 Lyrical Labs Video Compression Technology, LLC Video processing and encoding
US20120275701A1 (en) * 2011-04-26 2012-11-01 Minwoo Park Identifying high saliency regions in digital images
US8401292B2 (en) * 2011-04-26 2013-03-19 Eastman Kodak Company Identifying high saliency regions in digital images
US20160239981A1 (en) * 2013-08-28 2016-08-18 Aselsan Elektronik Sanayi Ve Ticaret Anonim Sirketi A semi automatic target initialization method based on visual saliency
US9595114B2 (en) * 2013-08-28 2017-03-14 Aselsan Elektronik Sanayi Ve Ticaret Anonim Sirketi Semi automatic target initialization method based on visual saliency
US10198629B2 (en) * 2015-06-22 2019-02-05 Photomyne Ltd. System and method for detecting objects in an image
US20160379055A1 (en) * 2015-06-25 2016-12-29 Kodak Alaris Inc. Graph-based framework for video object segmentation and extraction in feature space
US10192117B2 (en) * 2015-06-25 2019-01-29 Kodak Alaris Inc. Graph-based framework for video object segmentation and extraction in feature space
US20180295375A1 (en) * 2017-04-05 2018-10-11 Lyrical Labs Video Compression Technology, LLC Video processing and encoding
US20190139282A1 (en) * 2017-11-09 2019-05-09 Adobe Inc. Saliency-Based Collage Generation using Digital Images

Also Published As

Publication number Publication date
WO2018118914A2 (en) 2018-06-28
EP3559906B1 (en) 2024-02-21
US20200286239A1 (en) 2020-09-10
US20180174301A1 (en) 2018-06-21
CN110088805A (en) 2019-08-02
EP3559906A2 (en) 2019-10-30
CN110088805B (en) 2023-06-06
US10706549B2 (en) 2020-07-07
WO2018118914A3 (en) 2018-07-26

Similar Documents

Publication Publication Date Title
US11120556B2 (en) Iterative method for salient foreground detection and multi-object segmentation
JP4979033B2 (en) Saliency estimation of object-based visual attention model
US7711146B2 (en) Method and system for performing image re-identification
US12002259B2 (en) Image processing apparatus, training apparatus, image processing method, training method, and storage medium
JP4699298B2 (en) Human body region extraction method, apparatus, and program
US9501837B2 (en) Method and system for unsupervised image segmentation using a trained quality metric
US20060039587A1 (en) Person tracking method and apparatus using robot
JP4098021B2 (en) Scene identification method, apparatus, and program
US9349194B2 (en) Method for superpixel life cycle management
EP3073443B1 (en) 3d saliency map
JP5939056B2 (en) Method and apparatus for positioning a text region in an image
CN112837344A (en) Target tracking method for generating twin network based on conditional confrontation
Chi Self‐organizing map‐based color image segmentation with k‐means clustering and saliency map
Palou et al. Occlusion-based depth ordering on monocular images with binary partition tree
Porikli et al. Automatic video object segmentation using volume growing and hierarchical clustering
CN109785367B (en) Method and device for filtering foreign points in three-dimensional model tracking
Jia et al. Dense interpolation of 3d points based on surface and color
Sima et al. An extension of the Felzenszwalb-Huttenlocher segmentation to 3D point clouds
Liang et al. KmsGC: An Unsupervised Color Image Segmentation Algorithm Based on K‐Means Clustering and Graph Cut
Thinh et al. Depth-aware salient object segmentation
Haindl et al. Unsupervised hierarchical weighted multi-segmenter
Khelifi et al. A new multi-criteria fusion model for color textured image segmentation
Kucer et al. Augmenting salient foreground detection using fiedler vector for multi-object segmentation
CN113538256A (en) Visual saliency model establishment method based on multiple regional characteristics
Hirzer et al. An automatic hybrid segmentation approach for aligned face portrait images

Legal Events

Date Code Title Description
AS Assignment

Owner name: KODAK ALARIS INC., NEW YORK

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:LOUI, ALEXANDER;KLOOSTERMAN, DAVID;KUCER, MICHAL;AND OTHERS;SIGNING DATES FROM 20170413 TO 20170418;REEL/FRAME:052728/0582

FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STPP Information on status: patent application and granting procedure in general

Free format text: APPLICATION DISPATCHED FROM PREEXAM, NOT YET DOCKETED

AS Assignment

Owner name: KPP (NO. 2) TRUSTEES LIMITED, NORTHERN IRELAND

Free format text: SECURITY INTEREST;ASSIGNOR:KODAK ALARIS INC.;REEL/FRAME:053993/0454

Effective date: 20200930

FEPP Fee payment procedure

Free format text: PETITION RELATED TO MAINTENANCE FEES GRANTED (ORIGINAL EVENT CODE: PTGR); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

STPP Information on status: patent application and granting procedure in general

Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS

STPP Information on status: patent application and granting procedure in general

Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED

STCF Information on status: patent grant

Free format text: PATENTED CASE

AS Assignment

Owner name: THE BOARD OF THE PENSION PROTECTION FUND, UNITED KINGDOM

Free format text: ASSIGNMENT OF SECURITY INTEREST;ASSIGNOR:KPP (NO. 2) TRUSTEES LIMITED;REEL/FRAME:058175/0651

Effective date: 20211031

AS Assignment

Owner name: THE BOARD OF THE PENSION PROTECTION FUND, UNITED KINGDOM

Free format text: IP SECURITY AGREEMENT SUPPLEMENT (FISCAL YEAR 2022);ASSIGNOR:KODAK ALARIS INC.;REEL/FRAME:061504/0900

Effective date: 20220906

AS Assignment

Owner name: FGI WORLDWIDE LLC, NEW YORK

Free format text: SECURITY AGREEMENT;ASSIGNOR:KODAK ALARIS INC.;REEL/FRAME:068325/0938

Effective date: 20240801

AS Assignment

Owner name: KODAK ALARIS INC., NEW YORK

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:THE BOARD OF THE PENSION PROTECTION FUND;REEL/FRAME:068481/0300

Effective date: 20240801