Using Big Data Analysis to Improve Cache Performance in Search Engines

Web Search Engines process huge amounts of data to support search but must run under strong performance requirements (to answer a query in a fraction of a second). To meet that performance they implement different optimization techniques such as caching, that may be implemented at several levels. On...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Tolosa, Gabriel Hernán, Feuerstein, Esteban
Formato: Objeto de conferencia
Lenguaje:Español
Publicado: 2015
Materias:
Acceso en línea:http://sedici.unlp.edu.ar/handle/10915/51952
http://44jaiio.sadio.org.ar/sites/default/files/agranda7-10.pdf
Aporte de:
id I19-R120-10915-51952
record_format dspace
institution Universidad Nacional de La Plata
institution_str I-19
repository_str R-120
collection SEDICI (UNLP)
language Español
topic Ciencias Informáticas
big data
Web Search Engines (WSE)
intersection caching
Search process
spellingShingle Ciencias Informáticas
big data
Web Search Engines (WSE)
intersection caching
Search process
Tolosa, Gabriel Hernán
Feuerstein, Esteban
Using Big Data Analysis to Improve Cache Performance in Search Engines
topic_facet Ciencias Informáticas
big data
Web Search Engines (WSE)
intersection caching
Search process
description Web Search Engines process huge amounts of data to support search but must run under strong performance requirements (to answer a query in a fraction of a second). To meet that performance they implement different optimization techniques such as caching, that may be implemented at several levels. One of these caching levels is the intersection cache, that attempts to exploit frequently occurring pairs of terms by keeping in the memory of the search node the results of intersecting the corresponding inverted lists. In this work we propose an optimization step to decide which items should be cached and which not by introducing the usage of data mining techniques. Our preliminary results show that it is possible to achieve extra cost savings in this already hyper-optimized field.
format Objeto de conferencia
Objeto de conferencia
author Tolosa, Gabriel Hernán
Feuerstein, Esteban
author_facet Tolosa, Gabriel Hernán
Feuerstein, Esteban
author_sort Tolosa, Gabriel Hernán
title Using Big Data Analysis to Improve Cache Performance in Search Engines
title_short Using Big Data Analysis to Improve Cache Performance in Search Engines
title_full Using Big Data Analysis to Improve Cache Performance in Search Engines
title_fullStr Using Big Data Analysis to Improve Cache Performance in Search Engines
title_full_unstemmed Using Big Data Analysis to Improve Cache Performance in Search Engines
title_sort using big data analysis to improve cache performance in search engines
publishDate 2015
url http://sedici.unlp.edu.ar/handle/10915/51952
http://44jaiio.sadio.org.ar/sites/default/files/agranda7-10.pdf
work_keys_str_mv AT tolosagabrielhernan usingbigdataanalysistoimprovecacheperformanceinsearchengines
AT feuersteinesteban usingbigdataanalysistoimprovecacheperformanceinsearchengines
bdutipo_str Repositorios
_version_ 1764820476316614659