Exploiting Semantic Relationships for the Scalability and Efficiency of Web Search Engines

Goal The goal of this project is to devise new methods for index pruning and caching by exploiting the semantic relationships among Web pages and queries, and thereby to build more efficient and scalable search engines.

Sponsor Scientific and Technical Research Council of Turkey - TÜBİTAK (Grant No: 108E008)

Duration 2008-2010

Budget 128,452 YTL (~$100,000)

People

Principal Investigator Prof. Özgür Ulusoy
Graduate Students Ismail Sengor Altingovde, Rifat Ozcan, Duygu Atilgan
Senior Project Students (2008-2009) Seher Acer, Elif Demirli, Şadiye Kaptanoğlu, Özlem B. İskender, M. Fatih Bulut, Koray Bayram, Kerem Yasin Oktay
Research Course Students (2008-2009) Işık Ayrancı, Şadiye Kaptanoğlu, Levent Koç, İbrahim Uysal
Volunteer Students Aslı Deniz Güven, Burak Özek

Related student projects

  • Bilkent media tracking system (Özlem Başak İskender, Elif Demirli, Seher Acer, Şadiye Kaptanoğlu)
  • Cluster-based patent retrieval (M. Fatih Bulut, Koray Bayram, Kerim Yasin Oktay)
  • Effectiveness of static index pruning techniques (Levent Koç)
  • Efficiency of static index pruning techniques (İbrahim Uysal)
  • Cluster-based static pruning of inverted index files (Işık Ayrancı)
  • Search efficiency in Web directories (Şadiye Kaptanoğlu)

Publications

  1. R. Ozcan, I. S. Altingovde, Ö. Ulusoy, Exploiting navigational queries for result presentation and caching in Web search engines, Journal of the American Society for Information Science and Technology, Volume 62, Issue 4, pages 714-726, April 2011. (pdf)

     

  2. R. Ozcan, I. S. Altingovde, B. B. Cambazoglu, F. P. Junqueira, and Ö. Ulusoy, Five-level static cache architecture for web search engines, Information Processing & Management, in press. (pdf)

     

  3. R. Ozcan, I. S. Altingovde, Ö. Ulusoy, Cost-Aware Strategies for Query Result Caching in Web Search Engines, ACM Transactions on the Web, Vol. 5, No. 2, Article 9, 2011. (pdf)

     

  4. I. S. Altingovde, D. Atilgan, Ö. Ulusoy, XML Retrieval using Pruned Element-Index Files, Proceedings of the 32nd European Conference on Information Retrieval (ECIR), pp. 306–318, Milton Keynes, UK, March 2010. (pdf)

     

  5. I. S. Altingovde, D. Atilgan, Ö. Ulusoy, Exploiting Index Pruning Methods for Clustering XML Collections, Proceedings of the 8th International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX), pp. 379–386, Brisbane, Australia, December 2009. (pdf)

     

  6. I. S. Altingovde, R. Ozcan, Ö. Ulusoy, Exploiting Query Views for Static Index Pruning in Web Search Engines, Proceedings of the 18th ACM Conference on Information and Knowledge Management (CIKM), pp. 1951–1954, Hong Kong, China, November 2009. (pdf)

     

  7. I. S. Altingovde, R. Ozcan, Ö. Ulusoy, A Practitioner’s Guide for Static Index Pruning, Proceedings of the 31st European Conference on Information Retrieval (ECIR), pp. 675–679, Toulouse, France, April 2009. (Best Poster Award) (pdf)

     

  8. I. S. Altingovde, R. Ozcan, Ö. Ulusoy, A Cost-Aware Strategy for Query Result Caching in Web Search Engines, Proceedings of the 31st European Conference on Information Retrieval (ECIR), pp. 628–636, Toulouse, France, April 2009. (pdf)

     

  9. R. Ozcan, I. S. Altingovde, Ö. Ulusoy, Utilization of Navigational Queries for Result Presentation and Caching in Search Engines, ACM 17th Conference on Information and Knowledge Management (CIKM'08), Napa Valley, California, USA, October 2008. (pdf copy)

     

  10. R. Ozcan, I. S. Altingovde, Ö. Ulusoy, Space efficient caching of query results in search engines, International Symposium on Computer and Information Sciences (ISCIS'08), Istanbul, Turkey, October 2008. (pdf copy)

     

  11. I. S. Altingovde, E. Demir, F. Can, Ö. Ulusoy, Site-based Dynamic Pruning for Query Processing in Search Engines, 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'08), Singapore, 2008. (pdf copy)