Abstract
A frame decimation scheme is proposed that makes automatic extraction of Structure and Motion (SaM) from handheld sequences more practical. Decimation of the number of frames used for the actual SaM calculations keeps the size of the problem manageable, regardless of the input frame rate. The proposed preprocessor is based upon global motion estimation between frames and a sharpness measure. With these tools, shot boundary detection is first performed followed by the removal of redundant frames. The frame decimation makes it feasible to feed the system with a high frame rate, which in turn avoids loss of connectivity due to matching difficulties. A high input frame rate also enables robust automatic detection of shot boundaries. The development of the preprocessor was prompted by experience with a number of test sequences, acquired directly from a handheld camera. The preprocessor was tested on this material together with a SaM algorithm. The scheme is conceptually simple and still has clear benefits.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
P. Beardsley, P. Torr, A. Zisserman, 3D model acquisition from extended image sequences, Proc. ECCV 96, pp. 683–695.
P. Beardsley, A. Zisserman, D. Murray, Sequential updating of projective and affine structure from motion, IJCV, 23(3), pp. 235–259, 1997.
D. Capel, A. Zisserman, Automated mosaicing with super-resolution zoom, Proc. CVPR 98, pp. 885–891.
R. Cipolla, E. G. Boyer, 3D model acquisition from uncalibrated images, Proc IAPR Workshop on Machine Vision Applications, Chiba Japan, pp. 559–568, November 1998.
P. Debevec, C. Taylor, J. Malik, Modeling and rendering architecture from photographs: A hybrid geometry-and image-based approach, Proc. SIGGRAPH’96, pp. 11–20.
O. Faugeras, What can be seen in three dimensions with an uncalibrated stereo rig?, Proc.ECCV 92, pp. 563–578.
P. Fua, Reconstructing complex surfaces from multiple stereo views, Proc. ICCV 95, pp. 1078–1085.
A. W. Fitzgibbon, A. Zisserman, Automatic camera recovery for closed or open image sequences, Proc. ECCV 98, pp. 311–326.
K. Hanna, N. Okamoto, Combining stereo and motion for direct estimation of scene structure, Proc. ICCV 93, pp. 357–365.
R. Hartley, Euclidean reconstruction from uncalibrated views, Applications of Invariance in Computer Vision, LNCS 825, pp. 237–256, Springer-Verlag, 1994.
A. Heyden, K. Åström, Euclidean reconstruction from image sequences with varying and unknown focal length and principal point, Proc. CVPR 97, pp. 438–443.
M. Irani, P. Anandan, M. Cohen, Direct recovery of planar-parallax from multiple frames, Proc. Vision Algorithms Workshop (ICCV 99), Corfu Greece, pp. 1–8, September 1999.
K. Kanatani, N. Ohta, Accuracy bounds and optimal computation of homography for image mosaicing applications, Proc. ICCV 99, pp. 73–78.
P. McLauchlan, D. Murray, A unifying framework for structure from motion recovery from image sequences, Proc. ICCV 95, pp. 314–320.
R. Mohr, F. Veillon, L. Quan, Relative 3D reconstruction using multiple uncalibrated images, Proc. CVPR 93, pp. 543–548.
D. Nistér, Reconstruction from uncalibrated sequences with a hierarchy of trifocal tensors, Accepted to ECCV 2000.
S. Peleg, J. Herman, Panoramic mosaics by manifold projection, Proc. CVPR 97, pp. 338–343.
M. Pollefeys, R. Koch, L. Van Gool, Self-calibration and metric reconstruction in spite of varying and unknown internal camera parameters, IJCV, 32(1), pp. 7–26, Aug, 1999.
W. Press, S. Teukolsky, W. Vetterling, B. Flannery, Numerical recipes in C, ISBN 0-521-43108-5, Cambridge University Press, 1988.
L. Robert, O. Faugeras, Relative 3D positioning and 3D convex hull computation from a weakly calibrated stereo pair, Proc. ICCV 93, pp. 540–544.
H. Sawhney, S. Hsu, R. Kumar, “Robust video mosaicing through topology inference and local to global alignment”, Proc. ECCV 98, pp.103–119.
A. Shashua, Trilinearity in visual recognition by alignment, Proc. ECCV 94, pp. 479–484.
M. Spetsakis, J. Aloimonos, Structure from motion using line correspondences, IJCV, pp. 171–183, 1990.
P. Sturm, W. Triggs, A factorization based algorithm for multi-image projective structure and motion, Proc. ECCV 96, pp. 709–720.
R. Szeliski, H.-Y. Shum, Creating full view panoramic image mosaics and environment maps, Proc. SIGGRAPH’97, pp. 251–258.
C. Tomasi, T. Kanade, Shape and motion from image streams under orthography: a factorization approach, IJCV, 9(2), pp. 137–154. November 1992.
L. Van Gool, A. Zisserman, Automatic 3D model building from video sequences, Proc. ECMAST 96, pp. 563–582.
References
B. Triggs. Plane + Parallax, Tensors and Factorization. In Proc. European Conference on Computer Vision, pages 522–538, 2000.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nistér, D. (2001). Frame Decimation for Structure and Motion. In: Pollefeys, M., Van Gool, L., Zisserman, A., Fitzgibbon, A. (eds) 3D Structure from Images — SMILE 2000. SMILE 2000. Lecture Notes in Computer Science, vol 2018. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45296-6_2
Download citation
DOI: https://doi.org/10.1007/3-540-45296-6_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41845-0
Online ISBN: 978-3-540-45296-6
eBook Packages: Springer Book Archive