Abstract
Similarity search structures for metric spaces have different performance characteristics depending on the properties of the data, construction cost, and space consumption. Nonetheless, recent experiments seem to favor vantage points-based methods such as LAESA, Spaghettis, and FQA if they are allowed to use enough pivots. By using more pre-processing time, these methods can produce superior query performance in terms of distance computations. Unfortunately this also causes them to use more space and CPU time than other structures. In this paper we explore ways to organize the basic structure according to distance relations between database objects, pivots and query objects. We introduce the priority vantage points method, which reduces the CPU overhead without adding extra space requirements. The Kvp structure is also introduced as an improvement, which stores less distance values than other vantage points algorithms. Kvp needs one sequential scan over the index data, making it very suitable to be stored on disk. We show that Kvp is superior to the other methods given same amount of storage.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
C. Traina Jr., Agma J. M. Traina, B. Seeger, C. Faloutsos. Slim-Trees: High Performance Metric Trees Minimizing Overlap Between Nodes. EDBT: 51–65. 2000.
E. Chávez, J. L. Marroquín, R. A. Baeza-Yates. Spaghettis: An Array Based Algorithm for Similarity Queries in Metric Spaces. SPIRE/CRIWG: 38–46. 1999.
E. Chávez, G. Navarro, R. Baeza-Yates, J. L. Marroquín. Proximity Searching in Metric Spaces. ACM Computing Surveys 33(3):273–321. 2001a.
E. Chávez, J. L. Marroquín, G. Navarro. Fixed Queries Array: A Fast and Economical Data Structure for Proximity Searching. Multimedia Tools and Applications 14(2): 113–135. 2001b.
E. Vidal. An Algorithm for Finding Nearest Neighbors in (approximately) Constant Average Time. Pattern Recognition Letters 4:145. 1986.
J. Uhlmann. Satisfying General Proximity/Similarity Queries with Metric Trees. Information Processing Letters 40:175–179. 1991.
L. Mico, J. Oncina, E. Vidal. A New Version of the Nearest-neighbor Approximating and Eliminating Search (AESA) with Linear Preprocessing Time and Memory Requirements. Pattern Recognition Letters 17:731–739. 1996.
P. Ciaccia, M. Patella, and P. Zezula, M-tree: An Efficient Access Method for Similarity Search in Metric Spaces. Proceedings of the 23rd VLDB. 1997.
R.F. Sproull. Refinements to Nearest-Neighbor Searching in k-Dimensional Trees. Algorithmica 6:579–589. 1991.
S. Brin. Near Neighbor Search in Large Metric Spaces. Proc. VLDB 574–584. 1995.
S. Nene, S. Nayar. A Simple Algorithm for Nearest Neighbor Search in High Dimensions. IEEE Transactions on Pattern Analysis and Machine Intelligence 19(9):989–1003. 1997.
T. Bozkaya, M. Ozsoyoglu. Distance-based Indexing for High-dimensional Metric Spaces. SIGMOD 357–368. 1997.
W. Burkhard, R. Keller. Some Approaches to Best-Match File Searching. CACM 16(4): 230–236. 1973.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Celik, C. (2002). Priority Vantage Points Structures for Similarity Queries in Metric Spaces. In: Shafazand, H., Tjoa, A.M. (eds) EurAsia-ICT 2002: Information and Communication Technology. EurAsia-ICT 2002. Lecture Notes in Computer Science, vol 2510. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36087-5_30
Download citation
DOI: https://doi.org/10.1007/3-540-36087-5_30
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00028-0
Online ISBN: 978-3-540-36087-2
eBook Packages: Springer Book Archive