Beruflich Dokumente
Kultur Dokumente
894903
j o u r n a l h o m e p a g e : w w w.i n t l . e l s e v i e r h e a l t h . c o m / j o u r n a l s / c m p b
Research Lab for Medical Imaging and Digital Surgery, Shenzhen Institutes of Advanced Technology, 1068 Xueyuan
Boulevard, University Town of Shenzhen, Shenzhen 518055, PR China
b
School of Design Communication and IT, The University of Newcastle, Callaghan, NSW 2308, Australia
c
Nanfang Medical University, 1838 Guangzhou Avenue, Guangzhou 510515, PR China
a r t i c l e
i n f o
a b s t r a c t
Article history:
Digital X-ray images are the most frequent modality for both screening and diagnosis in
hos- pitals. To facilitate subsequent analysis such as quantication and computer aided
30 October 2013
segmenta- tion method was proposed to segment background of X-ray images. The
Keywords:
extraction. One hundred clin- ical direct radiograph X-ray images were used to validate the
Computer-aided diagnosis
method. Manual thresholding and multiscale gradient based watershed method were
Direct radiography
X-ray image
proposed method yielded a dice coefcient of 0.964 0.069, which was better than that of
the manual thresholding (0.937 0.119) and that of multiscale gradient based watershed
method (0.942 0.098). Special means were adopted to decrease the computational cost,
Watershed segmentation
including getting rid of few pixels with highest grayscale via percentile, calculation of
gradi- ent magnitude through simple operations, decreasing the number of markers by
appropriate thresholding, and merging regions based on simple grayscale statistics. As
a result, the
processing time was at most 6 s even for a 3072 3072 image on a Pentium 4 PC with 2.4
GHz
CPU (4 cores) and 2G RAM, which was more than one time faster than that of the multiscale gradient based watershed method. The proposed method could be a potential tool
for diagnosis and quantication of X-ray images.
2014 Elsevier Ireland Ltd. All rights reserved.
1.
Introduction
of medical experts and augment diagnosis. With the invention of direct radiographs (DR) to have a spatial resolution
being comparable to or even better than computed tomography images (as high as 0.14 mm) and good contrast, its value
in screening and diagnosis is increasingly recognized [5].
With the increased availability and spatial resolution of DR
images, more skills and more manpower are needed to
interpret the DR images. Due to complexity and variability of
DR imaging, DR
Corresponding author. Tel.: +86 0755 86392214; fax: +86 0755 86392299.
E-mail address: qm.hu@siat.ac.cn (Q. Hu).
0169-2607/$ see front matter 2014 Elsevier Ireland Ltd. All rights reserved.
http://dx.doi.org/10.1016/j.cmpb.2013.12.025
895
Fig. 1 An X-ray image that could not be segmented with grayscale thresholding.
2.
2.1.
Materials
2.2.
Methods
2.2.1.
Image preprocessing
896
c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 1 3 ( 2 0 1 4 ) 894903
Fig. 2 Typical images of the hand, chest and leg, and their corresponding histograms with 0100 percentiles and 10
90 percentiles.
T1 = {min(i)|
j=0
HI (j) 1 }
(1)
2.2.2.
(2)
(3)
(4)
c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 1 3 ( 2 0 1 4 ) 894903
897
HG (j) 1 }
T2 = {min(i)|
2.2.3.
The traditional watershed method takes every local minimum pixel as a seed point to grow by merging surrounding
non-seed pixels to result in the over-segmentation problem. Although region merging could be employed, it is hard
to derive appropriate rules for merging. Another means to
get around over-segmentation is marker-based watershed
method. Unfortunately, there is no general way to produce
seed pixels or markers.
As is shown in Fig. 4, gradient magnitudes of most pixels
distribute in the lower value range while a small proportion
which corresponds to the sharp edges located in foreground
contour have large gradient magnitudes. It reveals that
gradi- ent magnitudes of most contour pixels are greater
than those within background and foreground. Based on
these obser- vations, we propose to derive markers through
thresholding gradient magnitudes.
Denote the histogram of gradient image as HG . The
thresh- old T2 is obtained by formula (5)
(5)
j=0
2.2.4.
898
c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 1 3 ( 2 0 1 4 ) 894903
|Rgray Sgray |
Rgray
2.2.5.
Region merging
<T
(6)
2.2.6.
After region merging, some large regions are left inside the
foreground and background. These regions need to be
classi- ed as either background or foreground. Here it is
supposed that the average grayscale of the foreground is
lower than that of the background. A grayscale threshold can
be used to divide these regions according to their average
grayscale. First nd the largest region with its average
grayscale Bmax larger than the average grayscale of the
Fig. 5 Principle of watershed segmentation from markers.
image, and the largest region with its average grayscale Fmax
smaller than the average grayscale
c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 1 3 ( 2 0 1 4 ) 894903
899
3.
Experiments
Dice = 2|A B|
|A| + |B|
(8)
3.1.
Parameters determination
(7)
900
c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 1 3 ( 2 0 1 4 ) 894903
values.
Dice coefcient
0.001
0.005
0.01
0.05
0.1
0.2
0.3
0.4
0.5
0.979
0.979
0.979
0.979
0.979
0.977
0.977
0.977
0.973
Dice coefcient
0.5
0.6
0.7
0.8
0.9
0.979
0.979
0.979
0.979
0.977
3.2.
Number of
regions
Computation
time (ms)
Dice coefcient
3.3.
0.01
0.05
0.10
0.20
0.30
0.40
0.50
1
80
253
337
615
615
615
297
343
359
375
390
406
390
0
0.988
0.986
0.979
0.945
0.945
0.945
0.60
1468
640
0.590
MG(I) =
337
337
337
337
337
1
23
43
118
181
0
0.984
0.979
0.979
0.979
i=1
[(I Bi I Bi ) Bi1 ]
(9)
where and denote dilation and erosion and Bi is structuring element with different sizes. The size of structuring
element Bi is (2 i + 1) (2 i + 1) and B0 is a structuring element containing only 1 pixel. The number of structuring
elements n was set as 3 in our experiments. Then a
reconstruc- tion by erosion was performed to eliminate
minimums with contrast lower than a constant h. The
marker image used in reconstruction was generated by
adding this constant h to the gradient image. In the
experiments, h was set to 3. For a fair comparison, the
regions produced by [14] were merged using the same
procedure (with = 0.005, T3 = 0.3 and = 0.8) to yield
an average dice coefcient of 0.942 0.098. The comparative
results were listed in Table 5. The proposed method could
yield
statistically
segmentation
accuracy time
than [14] (p =
In addition,
we better
also compared
the execution
0.0094)
based onmethod
the paired
the images
100 dice might
coefcients.
between
the
proposed
andt-test
[14].of As
have
different sizes, one image with size of 3072 3072 was
selected and resampled to have 2048 2048 and 1024 1024
sizes, to
compare the time efciency (Table 6).
c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 1 3 ( 2 0 1 4 ) 894903
901
Fig. 9 (a) Initial segmentation. (b) Region merging and (c) nal segmentation.
Average Dice
Coefcient
Standard Deviation
Proposed method
Thresholding
0.964
0.937
0.069
0.119
Multiscale gradient
based watershed
0.942
0.098
4.
Discussion
3072 3072
watershed segmentation, meaning that the foreground contour has been broken so that the foreground and
background are merged. In order to retain continuity of the
foreground contour, should be large enough such that the minimum
gradient magnitude of foreground contour pixels is still
larger than the
gradient threshold. However, increasing will generate
more connected regions in the initial segmentation to make
merging more difcult and more time consuming. According to
our experiments, a value between 0.1 and 0.4 is appropriate
to balance between region merging and minimum gradient
magnitude of foreground contour pixels.
The parameter T3 is used to dene the similarity threshold
for merging regions. If T3 is too large as 0.5 in Table 3, it is
highly possible that all regions will be merged. On the other
hand, too small T3 may yield too many regions after
merging. So a value around 0.3 seems a balanced choice.
The parameter is to differentiate the background
and foreground according to their differences in average
grayscales. From Table 4 it can be seen that the
segmentation remains stable when the parameter is in the
range of 0.5 and
0.9.
Due to the grayscale overlap between the background
and foreground of X-ray DR images, grayscale thresholding
(even with manual derivation of optimum threshold) will
yield sta- tistically lower accuracy as expected (Table 5).
Watershed segmentation can overcome the grayscale
overlap problem as it is based on catchment basins instead of
grayscales. However, the usual way to derive catchment
basins is via local minimum of
grayscale gradient
magnitudes, which in turn will have over-segmentation due
to too many undesirable local mini- mums. Multiscale
gradient based watershed method [14] is an attempt to
suppress local minimums, but its application to X- ray DR
images is not as good as the proposed method (Table 5) due to
the great variability of local minimums that could not be
suppressed. Marker based watershed method sounds a way
to derive catchment basins, but generally there does not
exist a structure either in background or foreground that
can be used as markers (such as the case of white matter as
a marker in brain segmentation). A natural way to get the
markers is to explore gradient threshold as is done in this
paper, which employs the fact that gradient magnitudes
within the back- ground/foreground are generally smaller.
To the best of our
577
1856
5382
1419
5429
11,950
knowledge, this is
the rst trial to
explore gradient
Special means have been adopted to decrease the computational cost. Firstly, a percentile is introduced to ignore
902
c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 1 3 ( 2 0 1 4 ) 894903
Fig. 10 Comparison of the segmentation results between the proposed method and the thresholding method, (a) the
original image with substantial grayscale overlap between the background and foreground, (b) segmentation from
manual thresholding, and (c) segmentation based on the proposed method.
pixels with highest grayscales for further processing, which
are likely noisy pixels and would yield high gradient magnitudes if not excluded. Excluding these pixels by percentile
instead of ltering has two advantages: low computational
cost and decreasing grayscale range for subsequent
processing without changing grayscales of foreground.
Secondly, gradient magnitudes are calculated through
additions and subtrac- tions, avoiding multiplication and
square root operations. Experiments show the simplied
calculation of
gradient magnitudes could provide the
necessary information for sub- sequent
watershed
segmentation. Thirdly, the number of markers has been
decreased substantially by thresholding gra- dient
magnitudes, which will decrease the number of regions to be
merged to decrease the computational cost (Table 2).
Finally, regions are merged based on simple grayscale statistics (average) instead of other measures that need complex
computation such as texture. As a result, the processing
time
is at most 6 s even for a 3072 3072 image on a Pentium 4 PC
with 2.4 GHz CPU (4 cores) and 2G RAM which is more than
one time faster than that of the multiscale gradient based
watershed method [14].
The present study is not without limitations. Firstly, it is
assumed that the background has a larger average
grayscale than that of the background. This assumption
may be vio- lated for extreme cases as shown in Fig. 11. This
problem could only be solved during imaging such as
controlling eld of view. Secondly, to achieve real-time image
processing, one way is to decrease the computational cost
while the other way is to use complex hardware such as
graphics processing unit to speed up. We only addressed the
rst way and will leave the second way for further study.
5.
Conclusion
A marker-based watershed segmentation method was proposed to segment background of X-ray images. The method
consisted of six parts: image preprocessing, gradient computation, marker extraction, watershed segmentation, regions
merging and background extraction. One
hundred DR
images were
used
to
quantify
the
segmentation
accuracy. Man- ual thresholding and multiscale gradient
based watershed method [14] were implemented for
comparison. The proposed
method yielded a dice coefcient of 0.964 0.069, which was
better than that of the manual thresholding (0.937 0.119)
and that of multiscale gradient based watershed method
(0.942 0.098). Special means have been adopted to decrease
the computational cost, including getting rid of few pixels
with highest grayscale via percentile, calculation of gradient
mag- nitude through additions, decreasing the number of
markers by appropriate thresholding, and merging regions
based on simple grayscale statistics. As a result, the
processing time
was at most 6 s even for a 3072 3072 image on a Pentium 4
PC with 2.4 GHz CPU (4 cores) and 2G RAM which was more
than one time faster than that of the multiscale gradient
based watershed method. Experiments on tolerance to
param- eters showed that the method was not sensitive to
the varied parameters. The proposed method could be a
potential tool for both diagnosis and computer aided
diagnosis of X-ray images.
Acknowledgements
c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 1 3 ( 2 0 1 4 ) 894903
903
Natural Science Foundation (No. S2011010001820). Authors
would like to thank the Beijing Aerospace Zhongxing Medical
Systems Co, Ltd for providing the clinical data.
r ef er en ce
s