GB2589478A

GB2589478A - Segmenting irregular shapes in images using deep region growing

Info

Publication number: GB2589478A
Application number: GB2019774.5A
Authority: GB
Inventors: Dufort Paul
Original assignee: International Business Machines Corp
Current assignee: International Business Machines Corp
Priority date: 2018-06-21
Filing date: 2019-05-13
Publication date: 2021-06-02
Anticipated expiration: 2039-05-13
Also published as: WO2019243910A1; GB2589478B; DE112019001959T5; GB202019774D0; CN112189217A; JP2021527859A

Abstract

A system for determining a region of interest in an image. The system includes a memory and an electronic processor. The electronic processor included in the system is connected to the memory and is configured to initialize internal states of nodes of a spatial lattice. Each node of the spatial lattice corresponds to a pixel of the image and is connected to at least one node representing a neighboring pixel of the image. The electronic processor is also configured to iteratively update, using a neural network, the internal states of each nodes in the spatial lattice using spatially gated propagation and identify the region of interest within the image based on the internal states of the nodes at a convergence of the spatial lattice. The electronic processor is configured to creating an image pyramid for the image.

Claims

1. A method for identifying an object of interest in a medical image, the method comprising: initializing internal states of nodes of a spatial lattice, wherein each node corresponds to a pixel of the medical image and is connected to at least one node representing a neighboring pixel of the medical image; iteratively updating, using a neural network, the internal states of the nodes in the spatial lattice using spatially gated propagation, wherein at each iteration each node updates its internal state based on at least one selected from the group consisting of a value of the node from a previous iteration, a value of a neighboring node from the previous iteration, and a new value of the node; and identifying the object of interest within the medical image based on the values of the nodes at a convergence of the spatial lattice.

2. The method according to claim 1, wherein iteratively updating, using a neural network, the internal states of the nodes includes updating a value in a vector of values associated with the internal states of the nodes.

3. The method according to claim 2, wherein the values in the vector of values include a value representing the brightness of the pixel corresponding to the node and a value representing the internal state of the node.

4. The method according to claim 1 , wherein a convolution involving previous internal states of the nodes is performed for each iteration.

5. The method according to claim 1, wherein the method further includes performing, in a first iteration, convolutions on each value representing a brightness of each pixel.

6. The method according to claim 1, wherein identifying an object of interest within the medical image based on the values of the nodes at a convergence of the spatial lattice includes using a final layer of the neural network to calculate a probability that each pixel is included in the object of interest based on a value included in a vector of values associated with each pixel; and determining, for each pixel, if the calculated probability is above a predetermined threshold.

7. The method according to claim 1, wherein each node updates its internal state based on at least one selected from the group consisting of a value of the node from a previous iteration, a value of a neighboring node from the previous iteration, and a new value of the node using a squashing function.

8. The method according to claim 1, wherein the neighboring node is one selected from a group consisting of a node that represents a pixel that is directly above, directly below, to the right of, and to the left of a pixel represented by the node.

9. The method according to claim 1, the method further comprising generating an image pyramid with a plurality of layers, wherein each successive layer represents the medical image with fewer values.

10 The method according to claim 9, the method further comprising concatenating values from a plurality of layers of the image pyramid in each iteration.

11. A system for determining a region of interest in an image, the system comprising a memory; and an electronic processor, connected to the memory and configured to initialize internal states of nodes of a spatial lattice, wherein each node corresponds to a pixel of the image and is connected to at least one node representing a neighboring pixel of the image, iteratively update, using a neural network, the internal states of each nodes in the spatial lattice using spatially gated propagation; and identify the region of interest within the image based on the internal states of the nodes at a convergence of the spatial lattice.

12. The system according to claim 11, wherein the electronic processor is configured to update the internal states of the nodes by, at each iteration, updating the internal state based on at least one selected from the group consisting of a value of the node from a previous iteration, a value of a neighboring node from the previous iteration, or a new value of the node.

13. The system according to claim 11, wherein the electronic processor is configured to iteratively update, using a neural network, the internal states of the nodes by updating a value in a vector of values associated with the internal states of the nodes.

14. The system according to claim 13, wherein the values in the vector of values include a value representing the brightness of a pixel corresponding to the node and a value representing the internal state of the node.

15. The system according to claim 11, wherein the electronic processor is further configured to perform, in each iteration, a convolution involving previous internal states of the nodes.

16. The system according to claim 11, wherein the electronic processor is further configured to perform, in the first iteration, convolutions on each value representing a brightness of each pixel.

17. The system according to claim 11, wherein the electronic processor is configured to identify an object of interest within the image based on the values of the nodes at a convergence of the spatial lattice by using a final layer of the neural network to calculate a probability that each pixel is included in the object of interest based on the vector associated with each pixel, and determining, for each pixel, if the calculated probability is above a predetermined threshold.

18. The system according to claim 12, wherein the electronic processor is configured to update the internal state based on at least one selected from the group consisting of a value of the node from a previous iteration, a value of a neighboring node from the previous iteration, or a new value of the node by using a squashing function.

19. The system according to claim 12, wherein the neighboring node is one selected from a group consisting of a node that represents a pixel that is directly above, directly below, to the right of, and to the left of the pixel represented by the node.

20. Non-transitory computer-readable medium storing instructions that, when executed with an electronic processor, perform a set of functions, the set of functions comprising: initializing internal states of nodes of a spatial lattice, wherein each node represents a pixel of an image and is connected to at least one neighboring pixel of the image; iteratively updating, using a neural network, the internal states of the nodes in the spatial lattice using spatially gated propagation, wherein at each iteration each node updates its internal state based on at least one selected from the group consisting of a value of the node from a previous iteration, a value of a neighboring node from the previous iteration, or a new value of the node; and identifying an object of interest within the image based on the values of the nodes at a convergence of the spatial lattice.

21. The non-transitory computer-readable medium according to claim 20, wherein iteratively updating, using a neural network, the internal states of the nodes includes updating a value in a vector of values associated with the internal states of the nodes.

22. The non-transitory computer-readable medium according to claim 20, wherein identifying an object of interest within the image based on the values of the nodes at a convergence of the spatial lattice includes using a final layer in the neural network to calculate a probability that each pixel is included in the object of interest based on the vector associated with each pixel; and determining, for each pixel, if the calculated probability is above a predetermined threshold.

23. A method for identifying an object of interest in a medical image, the method comprising: creating an image pyramid for the medical image, wherein the image pyramid includes a plurality of layers, each layer includes a plurality of values, each value represents a block of one or more pixels in the medical image, and each successive layer includes fewer values than a most previous layer; for each layer of the image pyramid; initializing internal states of nodes of a spatial lattice, wherein each node in the spatial lattice represents a block of one or more pixels in the medical image and is connected to at least one node representing a neighboring block of one or more pixels in the medical image; and iteratively updating, using a neural network, the internal states of the nodes in the spatial lattice using spatially gated propagation, wherein at each iteration each node updates its internal state based on at least one selected from the group consisting of a value of the node from a previous iteration, a value of a neighboring node from the previous iteration, and a new value of the node; and identifying the object of interest within the medical image based on the values of the nodes at a convergence of the spatial lattice having nodes representing the values included in a first layer of the image pyramid.

24. The method according to claim 23, wherein iteratively updating, using a neural network, the internal states of the nodes includes updating a value in a vector of values associated with the internal states of the nodes.

25. The method according to claim 23, the method further comprising performing, at each iteration for each layer of the image pyramid, a first convolution involving a first concatenation of previous internal states of the nodes representing the values included in a layer of the image pyramid and the values included in the layer of the image pyramid, and storing results of performing the first convolution.

26. The method according to claim 25, the method further comprising performing, at each iteration for each layer of the image pyramid, a second convolution involving a second concatenation of results of performing the first convolution for a current layer of the image pyramid, a layer of the image pyramid directly above the current layer of the image pyramid, and a layer of the image pyramid directly below the current layer of the image pyramid.

27. The method according to claim 23, wherein creating the image pyramid includes performing convolutions on each value representing a brightness of each block of one or more pixels in the medical image, wherein each convolution involving a reduction of dimensions of input medical image data produces values that are used to represent the medical image in a next layer of the image pyramid.

28. The method according to claim 23, wherein each value representing the medical image in the first layer of the image pyramid corresponds to a pixel in the medical image.

29. The method according to claim 28, wherein identifying the object of interest within the medical image based on the values of the nodes at a convergence of the spatial lattice having nodes representing the values included in a first layer of the image pyramid includes using a final layer of the neural network to calculate a probability that each pixel in the medical image is included in the object of interest based on a value included in each vector of values associated with a node representing the values included in a first layer of the image pyramid; and determining, for each pixel, if the calculated probability is above a predetermined threshold.

30. The method according to claim 26, wherein each node updates its internal state based on at least one selected from the group consisting of a value of the node from a previous iteration, a value of a neighboring node from the previous iteration, and a new value of the node includes using a squashing function and results of performing the second convolution.

31. The method according to claim 23, wherein the neighboring node is one selected from a group consisting of a node that represents a block of one or more pixels that is directly above, directly below, to the right of, and to the left of a block of one or more pixels represented by the node.

32. The method according to claim 23, wherein representing the medical image with fewer values creates a medical image with a lower resolution.

33. A system for determining a region of interest in an image, the system comprising a memory; and an electronic processor, connected to the memory and configured to: create an image pyramid for the image, the image pyramid including a plurality of layers, for each layer of the image pyramid, initialize internal states of nodes of a spatial lattice, wherein each node represents a block of one or more pixels in the image and is connected to at least one node representing a neighboring block of one or more pixels in the image, and iteratively update, using a neural network, the internal states of the nodes in the spatial lattice using spatially gated propagation; and identify the region of interest within the image based on the internal states of the nodes at a convergence of the spatial lattice having nodes representing values included in a first layer of the image pyramid.

34. The system according to claim 33, wherein each successive layer of the plurality of layers included in the image pyramid represents the image at a lower resolution than an image represented in a most previous layer of the image pyramid.

35. The system according to claim 34, wherein the electronic processor is configured to represent the image at a lower resolution by representing the image with fewer values.

36. The system according to claim 33, wherein the electronic processor is configured to update the internal states of the nodes by, at each iteration, deciding for each node whether to maintain a value of the node from a previous iteration, to set a value of the node to a value of a neighboring node from a previous iteration, or set a new value of the node.

37. The system according to claim 33, wherein the electronic processor is configured to iteratively update, using a neural network, the internal states of the nodes by updating a value in a vector of values associated with the internal states of the nodes.

38. The system according to claim 35, wherein the electronic processor is configured to perform, at each iteration for each layer of the image pyramid, a first convolution involving a first concatenation of previous internal states of the nodes representing the values included in the layer of the image pyramid and the values included in the layer of the image pyramid, and store results of performing the first convolution.

39. The system according to claim 38, wherein the electronic processor is configured to perform, at each iteration for each layer of the image pyramid, a second convolution involving a second concatenation of results of performing the first convolution for a current layer of the image pyramid, a layer of the image pyramid directly above the current layer of the image pyramid, and a layer of the image pyramid directly below the current layer of the image pyramid.

40. The system according to claim 34, wherein the electronic processor is further configured to perform, in the first iteration, convolutions on each value representing a brightness of each block of one or more pixels in the image, wherein each convolution involving a reduction of dimensions of input image data produces values that are used to represent the image in a next layer of the image pyramid.

41. The system according to claim 33, wherein the electronic processor is configured to identify an object of interest within the image based on the values of the nodes at a convergence of the spatial lattice having nodes representing values included in a first layer of the image pyramid by using a final layer of the neural network to calculate a probability that each pixel in the image is included in the object of interest based on each vector associated with a node representing the values included in a first layer of the image pyramid, and determining, for each pixel, if the calculated probability is above a predetermined threshold.

42. The system according to claim 39, wherein the electronic processor is configured to update the internal state based on at least one selected from the group consisting of a value of the node from a previous iteration, a value of a neighboring node from the previous iteration, or a new value of the node by using a squashing function and results of performing the second convolution.

43. The system according to claim 36, wherein the neighboring node is one selected from the group consisting of a node that represents a block of one or more pixels in the image that is directly above, directly below, to the right of, and to the left of the block of one or more pixels in the image represented by the node.

44. Non-transitory computer-readable medium storing instructions that, when executed with an electronic processor, perform a set of functions, the set of functions comprising: creating an image pyramid for an image, wherein the image pyramid includes a plurality of layers, each layer includes a plurality of values, each value represents a block of one or more pixels in the image, and each successive layer includes fewer values than a most previous layer; for each layer of the image pyramid; initializing internal states of nodes of a spatial lattice, wherein each node represents a block of one or more pixels in the image and is connected to at least one node representing a neighboring block of one or more pixels in of the image; and iteratively updating, using a neural network, the internal states of the nodes in the spatial lattice using spatially gated propagation, wherein at each iteration each node updates its internal state based on at least one selected from the group consisting of a value of the node from a previous iteration, a value of a neighboring node from the previous iteration, or a new value of the node; and identifying an object of interest within the image based on the values of the nodes at a convergence of the spatial lattice having nodes representing the values included in a first layer of the image pyramid.

45. The non-transitory computer-readable medium according to claim 44, wherein iteratively updating, using a neural network, the internal states of the nodes includes updating a value in a vector of values associated with the internal states of the nodes.

46. The non-transitory computer-readable medium according to claim 44, wherein identifying an object of interest within the image based on the values of the nodes at a convergence of the spatial lattice having nodes representing the values included in a first layer of the image pyramid includes using a final layer in the neural network to calculate a probability that each pixel in the image is included in the object of interest based on the vector associated with a node representing the values included in a first layer of the image pyramid; and determining, for each pixel, if the calculated probability is above a predetermined threshold.