Bilkent University
Department of Computer Engineering
MS Thesis Presentation

 

Efficient Result Caching Mechanisms in Search Engines

 

Fethi Burak Sazoðlu
MS Student
(Supervisor: Prof. Dr. Özgür Ulusoy)
Computer Engineering Department
Bilkent University

The performance of a search engine depends on its components such as crawler, indexer and processor. The query latency, accuracy and recency of the results play crucial role in determining the performance.

This can be pro- vided with powerful hardware in the datacenter, but keeping the operational costs restrained is mandatory for search engines for commercial durability. Therefore, this thesis focuses on techniques to boost the performance of search engines by means of reducing both number of queries issued to the backend and cost to process a query stream. This can be accomplished by taking advantage of the temporal locality of the queries. Caching the result for a recently issued query removes the need to reprocess this query when it is issued again by the same or different user. Therefore, making use of query result cache increases the processing power of available resources by decreasing their load. The main objective of this thesis is to improve search engine performance by enhancing productivity of result cache. This is done by endeavoring to maximize the cache hit rate and minimizing the processing cost by using the per query statistics such as frequency, timestamp and cost. While providing high hit rates and low processing costs im- prove performance, the freshness of the queries in the cache has to be considered as well for user satisfaction.

Therefore, a variety of techniques are examined in this thesis to bound the staleness of cache results without blasting the backend with refresh queries. The offered techniques are demonstrated to be efficient and effective by using real query log data from a commercial search engine.

 

DATE: 03 September, 2014, Wednesday @ 13:30
PLACE: EA-409