Search Engine Project

New data files

Posted in Uncategorized by philipbille on June 25, 2011

The search engine project is now officially updated with the data files for Wikipedia. The old version of the project can be found in “archive 2009-2011”.

\Philip

Spring round up.

Posted in Uncategorized by philipbille on June 24, 2011

Almost 30 students distributed in 12 groups did the search engine project this spring semester. Most of these were students from the “software technology project” course, where the project was explicitly offered as a “standard-project”. I won’t be teaching the “software technology project” course, and therefore  The search engine project will (with high probability) not be offered as such again. If you are interested in doing a search engine for this course you will have to arrange it with the new course teachers.

\Philip

 

Preview of new data files.

Posted in Uncategorized by philipbille on March 2, 2011

We decided that the data for next semesters search engine project will be a snapshot of WikiPedia. Lots of new and interesting challenges! A preview of the upcoming search engine main page with the new data files is available for the interested. See here

\Philip

Forthcoming change of input data

Posted in Uncategorized by philipbille on February 9, 2011

I the not too distant future (= next semester with high probability) we will change the input data used for the search engine project. Why? For the fun of it and because we want even bigger and more challenging data sets to be a direct part of the search engine project. Our top candidate data sets is the project gutenberg containing about  35K books each in a separate text file. If some of the current 11 groups (!!) working on the search engine project would like to try out their techniques on this data set please let us know. We would be happy to assist in the setup and we would of course love to see it work. Also if you have some other suggestion for a challenging data set please tell us.

\Philip

 

Fall wrap up.

Posted in Uncategorized by philipbille on February 6, 2011

This semester 2 Bsc. project and 1 B. Eng. project were completed.

\Philip

Fall 2010

Posted in Uncategorized by philipbille on August 31, 2010

This semester a number of groups will do a search engine project. In addition to Bsc. students we have decided to also open up for B. Eng. students (“diplomingeniører”). The first B. Eng. student will start next monday on the search engine project.

Spring Wrap Up.

Posted in Uncategorized by philipbille on July 6, 2010

The search engine project succesfully completed Spring 2010 with 2 Bsc. projects and 5 software technology projects. Many interesting ideas popped up along the way. We will use these to make the project even better in the future.

Source Code Typo

Posted in Uncategorized by philipbille on February 9, 2010

I noticed a small typo in the listed source code for Index1: In main Index0 should of course be Index1.

Data Structures Refresh

Posted in Uncategorized by philipbille on February 9, 2010

In two short lectures/meetings/discussions I’ll refresh the key data structures needed for the basic part of the search engine project:

Linked lists: February 10, 13.00-14.00 (tomorrow!)

Hashing: March 3, 13.00-14.00

Both take place in bldg. 308, aud. 11.

Software Technology Project Course.

Posted in Uncategorized by philipbille on January 8, 2010

The search engine project will also be available as a standard project in the software technology course in Spring 2010.