Beruflich Dokumente
Kultur Dokumente
article
info
Article history:
Received 30 April 2009
Received in revised form
3 February 2010
Accepted 3 February 2010
Available online 12 March 2010
Keywords:
Digital Elevation Model
Building footprint
Energetic modeling
Marked point processes
RJMCMC
abstract
In the past two decades, building detection and reconstruction from remotely sensed data has been
an active research topic in the photogrammetric and remote sensing communities. Recently, effective
high level approaches have been developed, i.e., the ones involving the minimization of an energetic
formulation. Yet, their efficiency has to be balanced by the amount of processing power required to obtain
good results.
In this paper, we introduce an original energetic model for building footprint extraction from high
resolution digital elevation models (1 m) in urban areas. Our goal is to formulate the energy in an
efficient way, easy to parametrize and fast to compute, in order to get an effective process still providing
good results.
Our work is based on stochastic geometry, and in particular on marked point processes of rectangles.
We therefore try to obtain a reliable object configuration described by a collection of rectangular building
footprints. To do so, an energy function made up of two terms is defined: the first term measures the
adequacy of the objects with respect to the data and the second one has the ability to favour or penalize
some footprint configurations based on prior knowledge (alignment, overlapping, . . . ). To minimize the
global energy, we use a Reversible Jump Monte Carlo Markov Chain (RJMCMC) sampler coupled with
a simulated annealing algorithm, leading to an optimal configuration of objects. Various results from
different areas and resolutions are presented and evaluated. Our work is also compared with an already
existing methodology based on the same mathematical framework that uses a much more complex
energy function. We show how we obtain similarly good results with a high computational efficiency
(between 50 and 100 times faster) using a simplified energy that requires a single data-independent
parameter, compared to more than 20 inter-related and hard-to-tune parameters.
2010 International Society for Photogrammetry and Remote Sensing, Inc. (ISPRS). Published by
Elsevier B.V. All rights reserved.
1. Introduction
Building extraction and reconstruction from remotely sensed
images has been a motivating topic for many researchers in the
past years. Indeed, 3D urban models can be very useful in various
applications: virtual tourism, models of wave propagation for
telecommunication operators or realistic environments for video
games among others. However, before providing a volumetric
representation of the buildings, most automatic methods need to
focus on their 2D outlines (Haala et al., 1998; Brunn and Weidner,
1997; Lafarge et al., 2008). This is the problem we propose to
address in this paper, without any prior knowledge on the building
location and only loose constraints on shape.
0924-2716/$ see front matter 2010 International Society for Photogrammetry and Remote Sensing, Inc. (ISPRS). Published by Elsevier B.V. All rights reserved.
doi:10.1016/j.isprsjprs.2010.02.002
318
O. Tournaire et al. / ISPRS Journal of Photogrammetry and Remote Sensing 65 (2010) 317327
O. Tournaire et al. / ISPRS Journal of Photogrammetry and Remote Sensing 65 (2010) 317327
319
where Z =
eU(Xt ) ensures normalization. Moreover, the
Xt
energy U (.) can be expressed as a weighted sum of an internal
energy measuring the spatial quality of the collection of objects
(prior or internal field) and an external energy which qualifies the
quality of the objects with respect to the underlying DEM (external
field). The unnormalized formulation is then:
described by its center ci = (xi , yi ), its semi-major axis vi and its aspect ratio ri =
li
Li
li
Li
M=
xmax xmax
ymax ymax
,
,
]0, 1] .
| {z }
2
2
2
2
|
{z
}
ri
(1)
vi
1 U(Xt )
e
(2)
(3)
With probability
(1 )
set Xt +1 = Xnew
set Xt +1 = Xt
until convergence;
320
O. Tournaire et al. / ISPRS Journal of Photogrammetry and Remote Sensing 65 (2010) 317327
Uprior ui , uj = .Sinter ui , uj .
(6)
Uprior (Xt ) =
Uprior ui , uj .
(7)
i6=j
u Xt ,
N (u, Xt ) = {v Xt | u v, u 6= v}.
(4)
(5)
Udata (a, b) =
kabk
Z
0
!!
ab
n a+t
dt
kabk
(8)
ab
kabk
Udata (a, b) = ab
Z
a + t ab dt
(9)
O. Tournaire et al. / ISPRS Journal of Photogrammetry and Remote Sensing 65 (2010) 317327
j+1
rectangle. The grid stands for the DEM pixels; the segment [ ,
j+1
ends at pi
j+1
pi
] of a given
] begins at pji ,
and crosses 8 pixels of the DEM (represented with a dark gray outline).
Udata (a, b) = ab
n
X
(tk+1 tk ) k
(10)
k=0
Udata (a, b) is the dot product of the edge vector ab and the average
gradient along the edge . It is thus linear with the edge length,
and with the mean gradient magnitude (corresponding to the
average faade height), yielding a measure of a faade surface.
Note that the integral formulation and the discretized (tk+1 tk )
weighting ensures that the measure is isotropic. It further takes
into account that the orientations of the vectors to only measure
the gradient outgoing the rectangle edges. It thus favours rectangle
edges alignment with large height variations in the DEM. For an
object ui , its external energy is the sum of the external energies of
j
j +1
the four individual segments [pi , pi ] subtracted from a constant
wdata :
3
X
j +1
max 0, Udata pi , pi
(11)
j =0
Udata (Xt ) =
Udata (ui ) .
(12)
ui Xt
321
322
O. Tournaire et al. / ISPRS Journal of Photogrammetry and Remote Sensing 65 (2010) 317327
p'2
p2
p1
p2
p3
p2
p'3
p0
p3
p'1= p1
p3
p0
p0
v from the center to the middle of one of its smaller sides and its
aspect ratio r (i.e. the ratio of the length l of the side orthogonal to
corners = c v r v
and
area = r v 2 .
(c +
w + 0 r
w ,
v +
w , r)
r
Edge translation
c + (t 1) v , t v ,
0
Rotation/scaling
=1
0 =1
(c + (t 1)r
v ,
v , rt ).
(13)
(14)
(15)
Note that translating an edge using one of the equations above may
yield an aspect ratio r > 1. This is simply fixed by using the alter
O. Tournaire et al. / ISPRS Journal of Photogrammetry and Remote Sensing 65 (2010) 317327
323
Table 1
Cumulative operation counts, assuming (vi2 ) and (ri vi2 ) values are cached, which
saves 2 additions and 6 multiplications. The intersection test returns as soon as a
separating axis is found, so that the exact operation count among the last 4 columns
depends on the which axis, if any, is found to be a separating axis.
, ,
c12
v1 ?
v2 ?
v1 ?
v2 ?
| | absolute values
+ additions
multiplications
2
4
4
3
7
7
4
10
10
5
13
13
6
16
16
[c1 +
v1 r1
v1 , c1 +
v1 + r1
v1 ] opposite to R1 if and only if
(, 0 ) {1, 1}2 :
c2 + v2 + 0 r2 v2 v1 > c1 + v1 v1
35000
Energy
Samples of the energy
30000
25000
20000
15000
10000
5000
0
-5000
-10000
1e+06
2e+06
3e+06 4e+06
Iterations
5e+06
6e+06
7e+06
Fig. 9. The curve shows the evolution of the energy. The samples denoted by dots
correspond to the superimposed configurations.
(16)
v2 :
Intersects(R1 , R2 ) = c12 v1 v12 + + r2
& c12 v2 v22 + + r1
(17)
& c12 v1 r1 v12 + + r2
Energy
Operation counts
4
X
area(R1 Ti ).
(18)
i =1
324
O. Tournaire et al. / ISPRS Journal of Photogrammetry and Remote Sensing 65 (2010) 317327
Fig. 10. Comparison of the results obtain with our strategy (left) and with Ortners strategy (middle). The right image is a direct comparison: in dark gray our results alone,
in medium gray Ortners results alone, in light gray the common part of both results (see also Table 2 for a quantitative study).
Fig. 11. Comparison of the results obtain with our approach on a satellite (left) and LiDAR (middle) DEMs. The right image is a direct comparison: in dark gray our results
alone, in medium gray the LiDAR results alone, in light gray the common part of both results (see also Table 3 for a quantitative study).
Table 2
Quantitative comparison of building/non-building classification with both strategies. Percentages express the rate of pixel in each category w.r.t. the total number
of pixels.
Table 3
Quantitative comparison of building/non-building classification with satellite and
LiDAR DEMs. Percentages express the rate of pixel in each category w.r.t. the total
number of pixels.
Our methodology
Ortners
Methodology
Building
Not building
Total
Satellite DEM
Building
Not building
Total
35.5%
4.7%
41.3%
11.2%
48.6%
59.8%
46.7%
53.3%
100%
LiDAR DEM
Building
Not building
Total
Building
Not building
Total
34.3%
5.8%
41.1%
5.6%
54.3%
58.9%
39.9%
60.1%
100%
O. Tournaire et al. / ISPRS Journal of Photogrammetry and Remote Sensing 65 (2010) 317327
325
Table 4
Quantitative comparison between our work and Ortners approach on the result
presented in Fig. 13. Results are clearly equivalent.
Rates (%)
Detection
False alarm
Quality
Our methodology
Ortners methodology
86.1
88.3
34.9
45.8
63.8
60.5
83.8
14.3
73.3
Fig. 12. Comparison of both approaches on a complex building (left: ours; right:
Ortners).
2009, IGN
Fig. 14. Result obtained with our approach on a 20 cm aerial DEM. See also Table 4
2009, IGN
TP
2 detection rate = TP , false alarm rate = FP and quality rate =
,
total
total
FP +total
where TP denotes pixel-wise true positive, TP false positive and total is the number
of building pixels in the reference.
326
O. Tournaire et al. / ISPRS Journal of Photogrammetry and Remote Sensing 65 (2010) 317327
Table 5
Computation times of the compared approaches. Our methodology provides a gain factor varying from 80 to 100.
Image size
] objects
'100
' 90
30
11
Computation times
] iterations
Our methodology
Ortners methodology
Our methodology
'120 s
' 60 s
'100 s
'100 s
'4 h
Fig. 15. Influence of parameter (left, = 10; right, = 100). Results are strictly
equivalent in terms of energy, even if the layout is not the same.
N/A
'3 h
'3 h
Fig. 16. Influence of wdata parameter. From left to right and top to bottom, wdata value is 25, 100, 250 and 1000 m2 . For too small a value, too many rectangles are detected
(noise in the DEM), and, as the value increase, the level of generalization becomes higher.
O. Tournaire et al. / ISPRS Journal of Photogrammetry and Remote Sensing 65 (2010) 317327
References
Alharthy, A., Bethel, J., 2002. Heuristic filtering and 3D feature extraction from LiDAR
data. International Archives of Photogrammetry, Remote Sensing and Spatial
Information Science 34 (Part 3), 2934.
Brunn, A., Weidner, U., 1997. Extracting buildings from digital surface models.
International Archives of Photogrammetry and Remote Sensing 32 (Part
3/4W2), 2734.
Cressie, N.A.C., 1993. Statistics for spatial data. In: Probability and Mathematical
Statistics. Wiley-Interscience, New-York, USA.
Frdricque, B., Daniel, S., Bdard, Y., Paparoditis, N., 2008. Populating a building
multi-representation data base with photogrammetric tools: Recent progress.
ISPRS Journal of Photogrammetry and Remote Sensing 64 (4), 441460.
Geyer, C.J., Mller, J., 1994. Simulation and likelihood inference for spatial point
processes. Scandinavian Journal of Statistics 21 (4), 359373.
Green, P.J., 1995. Reversible Jump Markov Chain Monte Carlo computation and
Bayesian model determination. Biometrika 82 (4), 711732.
Haala, N., Brenner, C., Anders, K.H., 1998. 3D urban GIS from laser altimeter and 2D
map data. International Archives of Photogrammetry and Remote Sensing 32
(Part 3), 339346.
Haithcoat, T.L., Song, W., Hipple, J.D., 2001. Building footprint extraction and
3D reconstruction from LiDAR data. In: Proceedings of the Remote Sensing
and Data Fusion over Urban Areas. 89 November. IEEE/ISPRS, Roma, Italy,
pp. 7478.
Hastings, W.K., 1970. Monte Carlo sampling using Markov chains and their
applications. Biometrica 57 (1), 97109.
Lacoste, C., Descombes, X., Zerubia, J., 2005. Point processes for unsupervised line
network extraction in remote sensing. IEEE Transactions on Pattern Analysis
and Machine Intelligence 27 (10), 15681579.
Lafarge, F., Descombes, X., Zerubia, J., Pierrot-Deseilligny, M., 2006. A structural
approach for 3D building reconstruction. Research Report 6048, INRIA.
Lafarge, F., Descombes, X., Zerubia, J., Pierrot-Deseilligny, M., 2008. Building
reconstruction from a single DEM. In: Proceedings of the Conference on
Computer Vision and Pattern Recognition. 2426 June. IEEE, Anchorage, AK,
USA.
327
Ortner, M., Descombes, X., Zerubia, J., 2007. Building outline extraction from
digital elevation models using marked point processes. International Journal of
Computer Vision 72 (2), 107132.
Perrin, G., Descombes, X., Zerubia, J., 2006. 2D and 3D vegetation resource
parameters assessment using marked point processes. In: Proceedings of the
International Conference on Pattern Recognition. 2024 August. IAPR, HongKong, China.
Pierrot-Deseilligny, M., Paparoditis, N., 2006. A multiresolution and optimizationbased image matching approach: An application to surface reconstruction
from SPOT5-HRS stereo imagery. In: International Archives of Photogrammetry,
Remote Sensing and Spatial Information Sciences 36 (Part 1/W41) (on CD-ROM).
Robert, C.P., Casella, G., 2004. Monte Carlo Statistical Methods, 2nd Edition.
In: Springer Texts in Statistics, Springer.
Roy, S., Cox, I.J., 1998. A maximum-flow formulation of the N-camera stereo
correspondence problem. In: Proceedings of the International Conference on
Computer Vision. 47 January. IEEE, Bombay, India, pp. 492499.
Salamon, P., Sibani, P., Frost, R., 2002. Facts, Conjectures and Improvements
for Simulated Annealing. Society for Industrial and Applied Mathematics,
Philadelphia, USA.
Shackelford, A.K., Davis, C.H., Xiangyun Wang, X., 2004. Automated 2D building
footprint extraction from high-resolution satellite multispectral imagery.
In: Proceedings of the Geoscience and Remote Sensing Symposium, 2004,
vol. 3. 2024 September. IEEE, Anchorage, AK, USA, pp. 19961999.
Stoyan, D., Kendall, W.S., Mecke, J., 1996. Stochastic Geometry and its Applications,
2nd Edition. Wiley-Interscience.
van Lieshout, M.N.M., 2000. Markov Point Processes and their Applications. Imperial
College Press, London, England.
Wang, O., Lodha, S.K., Helmbold, D.P., 2006. A Bayesian approach to building
footprint extraction from aerial LiDAR data. In: Proceedings of the International
Symposium on 3D Data Processing, Visualization, and Transmission. 1416
June. IEEE, Chapel Hill, USA, pp. 192199.
Winkler, G., 2003. Image Analysis, Random Fields and Markov Chain Monte Carlo
Methods: A Mathematical Introduction, 2nd Edition. Springer-Verlag.