||Search Engine Project
||Weber, Rene A.
||Bille, Philip (Algorithms and Logic, Department of Informatics and Mathematical Modeling, Technical University of Denmark, DTU, DK-2800 Kgs. Lyngby, Denmark)
Gørtz, Inge Li (Algorithms and Logic, Department of Informatics and Mathematical Modeling, Technical University of Denmark, DTU, DK-2800 Kgs. Lyngby, Denmark)
||Technical University of Denmark, DTU, DK-2800 Kgs. Lyngby, Denmark
||The task was to develop a scalable search engine, where the main focus is on the algorithmic
challenges in efficiently representing large data sets while supporting fast searches.
Using an incremental developing model the report explains how a search engine based on the
publication database ”The DBLP Computer Science Bibliography” is developed.
The project consist of a basic part and an advanced part:
• Basic part
The basic part consist of a series of steps, which provides an initial data structure. This part
starts out as a linked list and ends up as a hash table which stores the authors.
• Advanced part
In this part of the project the main focus has been on finding and evaluating data structures
which efficiently supports the search functionality auto-suggest and implementing the
solution in a web application.
The first update in the advanced part was to extend the data structure to support searches on
the publications. The next increment consisted of making searching for keywords possible
and afterwards prefix searches on the keywords. In the following update the memory usage
was reduced, such that the complete data file could be loaded into the data structure. Then
boolean searches was added and in the last version, the program and was implemented as a
web application, providing a web based GUI.
||Technical University of Denmark (DTU) : Kgs. Lyngby, Denmark
Creation date: 2010-12-20
Update date: 2010-12-20