-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add the Chan-Vese Segmentation for Gray #84
base: master
Are you sure you want to change the base?
Changes from 3 commits
96ceac3
982cfc4
9996e09
7e3828d
9e07ebc
6395478
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change | ||||
---|---|---|---|---|---|---|
|
@@ -5,6 +5,7 @@ import Base: show | |||||
using LinearAlgebra, Statistics | ||||||
using DataStructures, StaticArrays, ImageCore, ImageFiltering, ImageMorphology, LightGraphs, SimpleWeightedGraphs, RegionTrees, Distances, StaticArrays, Clustering, MetaGraphs | ||||||
using ImageCore.ColorVectorSpace: MathTypes | ||||||
using ImageBase.ImageCore: GenericGrayImage, GenericImage | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. We should either remove
Suggested change
|
||||||
import Clustering: kmeans, fuzzy_cmeans | ||||||
|
||||||
const PairOrTuple{K,V} = Union{Pair{K,V},Tuple{K,V}} | ||||||
|
@@ -21,6 +22,7 @@ include("meanshift.jl") | |||||
include("clustering.jl") | ||||||
include("merge_segments.jl") | ||||||
include("deprecations.jl") | ||||||
include("chan_vese.jl") | ||||||
|
||||||
export | ||||||
#accessor methods | ||||||
|
@@ -49,6 +51,7 @@ export | |||||
kmeans, | ||||||
fuzzy_cmeans, | ||||||
merge_segments, | ||||||
chan_vese, | ||||||
|
||||||
# types | ||||||
SegmentedImage, | ||||||
|
Original file line number | Diff line number | Diff line change | ||||
---|---|---|---|---|---|---|
@@ -0,0 +1,258 @@ | ||||||
""" | ||||||
chan_vese(img; μ, λ₁, λ₂, tol, max_iter, Δt, reinitial_flag) | ||||||
johnnychen94 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
|
||||||
Segments image `img` by evolving a level set. An active contour model | ||||||
which can be used to segment objects without clearly defined boundaries. | ||||||
Comment on lines
+4
to
+5
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Might be worth documenting that this algorithm supports 3d images. If you don't plan to add |
||||||
|
||||||
# output | ||||||
Return a `BitMatrix`. | ||||||
|
||||||
# Details | ||||||
|
||||||
Chan-Vese algorithm deals quite well even with images which are quite | ||||||
difficult to segment. Since CV algorithm relies on global properties, | ||||||
rather than just taking local properties under consideration, such as | ||||||
gradient. Better robustness for noise is one of the main advantages of | ||||||
this algorithm. See [1], [2], [3] for more details. | ||||||
|
||||||
# Options | ||||||
|
||||||
The function argument is described in detail below. | ||||||
|
||||||
Denote the edge set curve with 𝐶 in the following part. | ||||||
|
||||||
## `μ::Float64` | ||||||
|
||||||
The argument `μ` is a weight controlling the penalty on the total length | ||||||
of the curve 𝐶; | ||||||
|
||||||
For example, if the boundaries of the image are quite smooth, a larger `μ` | ||||||
can prevent 𝐶 from being a complex curve. | ||||||
|
||||||
Default: 0.25 | ||||||
|
||||||
## `λ₁::Float64`, `λ₂::Float64` | ||||||
|
||||||
The argument `λ₁` and `λ₂` affect the desired uniformity inside 𝐶 and | ||||||
outside 𝐶, respectively. | ||||||
|
||||||
For example, if set `λ₁` < `λ₂`, we are more possible to get result with | ||||||
quite uniform background and varying grayscale objects in the foreground. | ||||||
|
||||||
Default: λ₁ = 1.0 | ||||||
λ₂ = 1.0 | ||||||
|
||||||
## `tol::Float64` | ||||||
|
||||||
The argument `tol` controls the level set variation tolerance between | ||||||
iteration. If the L2 norm difference between two level sets of adjacent | ||||||
iterations is below `tol`, then the solution will be assumed to be reached. | ||||||
|
||||||
Default: 1e-3 | ||||||
|
||||||
## `max_iter::Int64` | ||||||
johnnychen94 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
|
||||||
The argument `max_iter` controls the maximum of iteration number. | ||||||
|
||||||
Default: 500 | ||||||
|
||||||
## `Δt::Float64` | ||||||
|
||||||
The argument `Δt` is a multiplication factor applied at calculations | ||||||
for each step, serves to accelerate the algorithm. Although larger `Δt` | ||||||
can speed up the algorithm, it might prevent algorithm from converging to | ||||||
the solution. | ||||||
|
||||||
Default: 0.5 | ||||||
|
||||||
## reinitial_flag::Bool | ||||||
|
||||||
The arguement `reinitial_flag` controls whether to reinitialize the | ||||||
level set in each step. | ||||||
|
||||||
Default: false | ||||||
|
||||||
# Examples | ||||||
|
||||||
```julia | ||||||
using TestImages | ||||||
using ImageSegmentation | ||||||
|
||||||
img = testimage("cameraman") | ||||||
|
||||||
cv_result = chan_vese(img, μ=0.25, λ₁=1.0, λ₂=1.0, tol=1e-3, max_iter=200, Δt=0.5, reinitial_flag=false) | ||||||
johnnychen94 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
``` | ||||||
|
||||||
# References | ||||||
|
||||||
[1] An Active Contour Model without Edges, Tony Chan and Luminita Vese, | ||||||
Scale-Space Theories in Computer Vision, 1999, :DOI:`10.1007/3-540-48236-9_13` | ||||||
[2] Chan-Vese Segmentation, Pascal Getreuer Image Processing On Line, 2 (2012), | ||||||
pp. 214-224, :DOI:`10.5201/ipol.2012.g-cv` | ||||||
[3] The Chan-Vese Algorithm - Project Report, Rami Cohen, 2011 :arXiv:`1107.2782` | ||||||
""" | ||||||
function chan_vese(img::GenericGrayImage; | ||||||
μ::Float64=0.25, | ||||||
johnnychen94 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
λ₁::Float64=1.0, | ||||||
λ₂::Float64=1.0, | ||||||
tol::Float64=1e-3, | ||||||
max_iter::Int64=500, | ||||||
johnnychen94 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
Δt::Float64=0.5, | ||||||
reinitial_flag::Bool=false) | ||||||
# Signs used in the codes and comments mainly follow paper[3] in the References. | ||||||
img = float64.(channelview(img)) | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Now I see where you are getting the 3d handling from: it's not actually 3d, it's your color channel. This is almost never needed in JuliaImages; just use the image as provided, and if you've written everything generically it should "just work." That is, for a grayscale image Keep in mind that Python and Julia have reversed "fast axes" for their arrays (Python has its fastest axis last, Julia first), but that in fact the memory layout is the same. Consequently, in Julia if you do call There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. If you want a raw numerical array, then just:
Suggested change
But generally, we encourage directly handling There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I would recommend deleting this line and the |
||||||
iter = 0 | ||||||
h = 1.0 | ||||||
del = tol + 1 | ||||||
img .= img .- minimum(img) | ||||||
|
||||||
if maximum(img) != 0 | ||||||
img .= img ./ maximum(img) | ||||||
end | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Why is this standardization needed? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. About standardization, I think a standardized image may contribute to a segmentation algorithm since it makes the gray levels of image more dispersive. If the grey levels of image range from 0.2 to 0.8, it seems better to standardize image so that the grey levels will range from 0.0 to 1.0. julia> using TestImages, Images
julia> img_512 = testimage("cameraman");
julia> maximum(img_512)
Gray{N0f8}(1.0)
julia> minimum(img_512)
Gray{N0f8}(0.0)
julia> img_64 = imresize(testimage("cameraman"), (64, 64));
julia> maximum(img)
Gray{N0f8}(0.988)
julia> minimum(img)
Gray{N0f8}(0.02) Test image shows as following: julia> res_no_std = chan_vese(img, μ=0.1, tol=1e-2, max_iter=200);
julia> sum(res_no_std)
1075
julia> colorview(Gray, res_no_std)
julia> res_std = chan_vese(img, μ=0.1, tol=1e-2, max_iter=200);
julia> sum(res_std)
1064
julia> colorview(Gray, res_std)
We can find that there are some difference. Without standardization, the part between the man's leg can't be segmented well. So maybe we have to maintain the standardization? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Maybe we can introduce a keyword By hardcoding the standardization in the implementation we lose some possibilities. In many similar cases, we prefer to let the caller rather than the callee do the work. See also https://docs.julialang.org/en/v1/manual/style-guide/#Handle-excess-argument-diversity-in-the-caller There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Presumably that is easily compensated by changing the values of What does standardization even mean for an image with RGB pixels? Do you take the min & max across channels too? Why does the magnitude of the blue channel scale the magnitude of the red one? Aren't these independent pieces of information? Conversely, if my raw image has almost no diversity in the green channel, I'll be really surprised if standardization on the green channel converts something that is perceptually insignificant into something that drives the segmentation. Basically, there isn't an answer to such questions. So it's better to the let the user be in charge. |
||||||
|
||||||
# Precalculation of some constants which helps simplify some integration | ||||||
area = length(img) # area = ∫H𝚽 + ∫H𝚽ⁱ | ||||||
∫u₀ = sum(img) # ∫u₀ = ∫u₀H𝚽 + ∫u₀H𝚽ⁱ | ||||||
|
||||||
# Initialize the level set | ||||||
𝚽ⁿ = initial_level_set(size(img)) | ||||||
|
||||||
# Preallocation and initializtion | ||||||
H𝚽 = trues(size(img)...) | ||||||
𝚽ⁿ⁺¹ = similar(𝚽ⁿ) | ||||||
johnnychen94 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
|
||||||
# The upper bounds of 𝚽ⁿ's coordinates is `m` and `n` | ||||||
s, t = first(CartesianIndices(𝚽ⁿ))[1], first(CartesianIndices(𝚽ⁿ))[2] | ||||||
m, n = last(CartesianIndices(𝚽ⁿ))[1], last(CartesianIndices(𝚽ⁿ))[2] | ||||||
|
||||||
while (del > tol) & (iter < max_iter) | ||||||
ϵ = 1e-8 | ||||||
diff = 0 | ||||||
|
||||||
# Calculate the average intensities | ||||||
@. H𝚽 = 𝚽ⁿ > 0 # Heaviside function | ||||||
c₁, c₂ = calculate_averages(img, H𝚽, area, ∫u₀) # Compute c₁(𝚽ⁿ), c₂(𝚽ⁿ) | ||||||
|
||||||
# Calculate the variation of level set 𝚽ⁿ | ||||||
for idx in CartesianIndices(𝚽ⁿ) # Denote idx = (x, y) | ||||||
johnnychen94 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
# i₊ ≔ i₊(x, y), denotes 𝚽ⁿ(x, y + 1)'s CartesianIndex | ||||||
# j₊ ≔ j₊(x, y), denotes 𝚽ⁿ(x + 1, y)'s CartesianIndex | ||||||
# i₋ ≔ i₋(x, y), denotes 𝚽ⁿ(x, y - 1)'s CartesianIndex | ||||||
# j₋ ≔ j₋(x, y), denotes 𝚽ⁿ(x - 1, y)'s CartesianIndex | ||||||
# Taking notice that if 𝚽ⁿ(x, y) is the boundary of 𝚽ⁿ, than 𝚽ⁿ(x ± 1, y), 𝚽ⁿ(x, y ± 1) might be out of bound. | ||||||
# So the pixel values of these outbounded terms are equal to 𝚽ⁿ(x, y) | ||||||
i₊ = idx[2] != n ? idx + CartesianIndex(0, 1) : idx | ||||||
johnnychen94 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
j₊ = idx[1] != m ? idx + CartesianIndex(1, 0) : idx | ||||||
i₋ = idx[2] != t ? idx - CartesianIndex(0, 1) : idx | ||||||
j₋ = idx[1] != s ? idx - CartesianIndex(1, 0) : idx | ||||||
|
||||||
𝚽₀ = 𝚽ⁿ[idx] # 𝚽ⁿ(x, y) | ||||||
u₀ = img[idx] # u₀(x, y) | ||||||
𝚽ᵢ₊ = 𝚽ⁿ[i₊] # 𝚽ⁿ(x, y + 1) | ||||||
𝚽ⱼ₊ = 𝚽ⁿ[j₊] # 𝚽ⁿ(x + 1, y) | ||||||
𝚽ᵢ₋ = 𝚽ⁿ[i₋] # 𝚽ⁿ(x, y - 1) | ||||||
𝚽ⱼ₋ = 𝚽ⁿ[j₋] # 𝚽ⁿ(x - 1, y) | ||||||
JKay0327 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
|
||||||
# Solve the PDE of equation 9 in paper[3] | ||||||
C₁ = 1. / sqrt(ϵ + (𝚽ᵢ₊ - 𝚽₀)^2 + (𝚽ⱼ₊ - 𝚽ⱼ₋)^2 / 4.) | ||||||
C₂ = 1. / sqrt(ϵ + (𝚽₀ - 𝚽ᵢ₋)^2 + (𝚽ⱼ₊ - 𝚽ⱼ₋)^2 / 4.) | ||||||
C₃ = 1. / sqrt(ϵ + (𝚽ᵢ₊ - 𝚽ᵢ₋)^2 / 4. + (𝚽ⱼ₊ - 𝚽₀)^2) | ||||||
C₄ = 1. / sqrt(ϵ + (𝚽ᵢ₊ - 𝚽ᵢ₋)^2 / 4. + (𝚽₀ - 𝚽ⱼ₋)^2) | ||||||
|
||||||
K = 𝚽ᵢ₊ * C₁ + 𝚽ᵢ₋ * C₂ + 𝚽ⱼ₊ * C₃ + 𝚽ⱼ₋ * C₄ | ||||||
δₕ = h / (h^2 + 𝚽₀^2) # Regularised Dirac function | ||||||
difference_from_average = - λ₁ * (u₀ - c₁) ^ 2 + λ₂ * (u₀ - c₂) ^ 2 | ||||||
johnnychen94 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
|
||||||
𝚽ⁿ⁺¹[idx] = 𝚽 = (𝚽₀ + Δt * δₕ * (μ * K + difference_from_average)) / (1. + μ * Δt * δₕ * (C₁ + C₂ + C₃ + C₄)) | ||||||
diff += (𝚽 - 𝚽₀)^2 | ||||||
end | ||||||
|
||||||
del = sqrt(diff / area) | ||||||
|
||||||
if reinitial_flag | ||||||
# Reinitialize 𝚽 to be the signed distance function to its zero level set | ||||||
reinitialize(𝚽ⁿ⁺¹, 𝚽ⁿ, Δt, h) | ||||||
else | ||||||
𝚽ⁿ .= 𝚽ⁿ⁺¹ | ||||||
end | ||||||
|
||||||
iter += 1 | ||||||
end | ||||||
|
||||||
return 𝚽ⁿ .> 0 | ||||||
end | ||||||
|
||||||
function initial_level_set(shape::Tuple) | ||||||
x₀ = reshape(collect(0:shape[begin]-1), shape[begin], 1) | ||||||
y₀ = reshape(collect(0:shape[begin+1]-1), 1, shape[begin+1]) | ||||||
𝚽₀ = @. sin(pi / 5 * x₀) * sin(pi / 5 * y₀) | ||||||
end | ||||||
|
||||||
function calculate_averages(img::AbstractArray{T, N}, H𝚽::AbstractArray{S, N}, area::Int64, ∫u₀::Float64) where {T<:Real, S<:Bool, N} | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
Putting these types in doesn't help you, and only prevents you from using, e.g., |
||||||
∫u₀H𝚽 = 0 | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
Suggested change
This will make you generic. |
||||||
∫H𝚽 = 0 | ||||||
for i in eachindex(img) | ||||||
if H𝚽[i] | ||||||
∫u₀H𝚽 += img[i] | ||||||
∫H𝚽 += 1 | ||||||
end | ||||||
end | ||||||
johnnychen94 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
∫H𝚽ⁱ = area - ∫H𝚽 | ||||||
∫u₀H𝚽ⁱ = ∫u₀ - ∫u₀H𝚽 | ||||||
c₁ = ∫u₀H𝚽 / max(1, ∫H𝚽) | ||||||
c₂ = ∫u₀H𝚽ⁱ / max(1, ∫H𝚽ⁱ) | ||||||
|
||||||
return c₁, c₂ | ||||||
end | ||||||
|
||||||
function calculate_reinitial(𝚽::AbstractArray{T, M}, 𝚿::AbstractArray{T, M}, Δt::Float64, h::Float64) where {T<:Real, M} | ||||||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Comments and test-coverage would be appreciated here. I'm not quite sure what this is about (I haven't yet seen this in the paper but I haven't read the whole thing carefully, just skimmed it). |
||||||
ϵ = 1e-8 | ||||||
|
||||||
s, t = first(CartesianIndices(𝚽))[1], first(CartesianIndices(𝚽))[2] | ||||||
m, n = last(CartesianIndices(𝚽))[1], last(CartesianIndices(𝚽))[2] | ||||||
|
||||||
for idx in CartesianIndices(𝚽) | ||||||
i₊ = idx[2] != n ? idx + CartesianIndex(0, 1) : idx | ||||||
j₊ = idx[1] != m ? idx + CartesianIndex(1, 0) : idx | ||||||
i₋ = idx[2] != t ? idx - CartesianIndex(0, 1) : idx | ||||||
j₋ = idx[1] != s ? idx - CartesianIndex(1, 0) : idx | ||||||
𝚽₀ = 𝚽[idx] # 𝚽(i, j) | ||||||
𝚽ᵢ₊ = 𝚽[i₊] # 𝚽(i + 1, j) | ||||||
𝚽ⱼ₊ = 𝚽[j₊] # 𝚽(i, j + 1) | ||||||
𝚽ᵢ₋ = 𝚽[i₋] # 𝚽(i - 1, j) | ||||||
𝚽ⱼ₋ = 𝚽[j₋] # 𝚽(i, j - 1) | ||||||
|
||||||
a = (𝚽₀ - 𝚽ᵢ₋) / h | ||||||
b = (𝚽ᵢ₊ - 𝚽₀) / h | ||||||
c = (𝚽₀ - 𝚽ⱼ₋) / h | ||||||
d = (𝚽ⱼ₊ - 𝚽₀) / h | ||||||
|
||||||
a⁺ = max(a, 0) | ||||||
a⁻ = min(a, 0) | ||||||
b⁺ = max(b, 0) | ||||||
b⁻ = min(b, 0) | ||||||
c⁺ = max(c, 0) | ||||||
c⁻ = min(c, 0) | ||||||
d⁺ = max(d, 0) | ||||||
d⁻ = min(d, 0) | ||||||
|
||||||
G = 0 | ||||||
if 𝚽₀ > 0 | ||||||
G += sqrt(max(a⁺^2, b⁻^2) + max(c⁺^2, d⁻^2)) - 1 | ||||||
elseif 𝚽₀ < 0 | ||||||
G += sqrt(max(a⁻^2, b⁺^2) + max(c⁻^2, d⁺^2)) - 1 | ||||||
end | ||||||
sign𝚽 = 𝚽₀ / sqrt(𝚽₀^2 + ϵ) | ||||||
𝚿[idx] = 𝚽₀ - Δt * sign𝚽 * G | ||||||
end | ||||||
|
||||||
return 𝚿 | ||||||
end | ||||||
|
||||||
function reinitialize(𝚽::AbstractArray{T, M}, 𝚿::AbstractArray{T, M}, Δt::Float64, h::Float64, max_reiter::Int64=5) where {T<:Real, M} | ||||||
JKay0327 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
iter = 0 | ||||||
while iter < max_reiter | ||||||
𝚽 .= calculate_reinitial(𝚽, 𝚿, Δt, h) | ||||||
iter += 1 | ||||||
johnnychen94 marked this conversation as resolved.
Show resolved
Hide resolved
|
||||||
end | ||||||
end |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,13 @@ | ||
@testset "chan_vese" begin | ||
@info "Test: Chan Vese Segmentation" | ||
|
||
@testset "Gray Image Chan-Vese Segmentation Reference Test" begin | ||
img_gray = imresize(testimage("cameraman"), (64, 64)) | ||
ref = load("references/Chan_Vese_Gray.png") | ||
ref = ref .> 0 | ||
out = chan_vese(img_gray, μ=0.1, λ₁=1.0, λ₂=1.0, tol=1e-2, max_iter=200, Δt=0.5, reinitial_flag=false) | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. For test coverage, also need to test |
||
@test eltype(out) == Bool | ||
@test sum(out) == sum(ref) | ||
@test out == ref | ||
end | ||
end |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Every dependency requires a corresponding
compat
entry.