Center selection techniques for metric indexes

The metric spaces model formalizes the similarity search concept in nontraditional databases. The goal is to build an index designed to save distance computations when answering similarity queries later. A large class of algorithms to build the index are based on partitioning the space in zones as c...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Mendoza Alric, Cristian, Herrera, Norma Edith
Formato: Articulo
Lenguaje:Inglés
Publicado: 2007
Materias:
Acceso en línea:http://sedici.unlp.edu.ar/handle/10915/9535
http://journal.info.unlp.edu.ar/wp-content/uploads/JCST-Mar07-16.pdf
Aporte de:
Descripción
Sumario:The metric spaces model formalizes the similarity search concept in nontraditional databases. The goal is to build an index designed to save distance computations when answering similarity queries later. A large class of algorithms to build the index are based on partitioning the space in zones as compact as possible. Each zone stores a representative point, called center, and a few extra data that allow to discard the entire zone at query time without measuring the actual distance between the elements of the zone and the query object. The way in which the centers are selected affects the performance of the algorithm. In this paper, we introduce two new center selection techniques for compact partition based indexes. These techniques were evaluated using the Geometric Near-neighbor Access Tree (GNAT). We experimentally showed that they achieve good performance.