Reading List for Final Exam
- Class notes on the web.
- Chakrabarti, chaps. 1 and 2; chap 3 through sec 3.2.2; chap. 4
through sec. 4.2 and sec. 4.5; sec. 7.1, 7.2, 7.6.
Mercator: A Scalable, Extensible Web Crawler by Allan Heydon and Marc
Scaling Question Answering to the Web
Cody Kwok, Oren Etzioni, and Daniel Weld.
Unsupervised Named-Entity Extraction from the Web. Oren Etzioni et al.
Web Search for a Planet: The Google Cluster Architecture
by Luiz Andre Barroso et al., IEEE Micro, 2003, pp. 22-28.
The PageRank Citation Ranking: Bringing Order to the Web
by Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd.
Graph Structure in the Web Andrei Broder et al. 2000
Structured Databases on the Web: Observations and Implications
Kevin Chang et. al
Crawling the Hidden Web
(Sriram Raghavan, Hector Garcia-Molina)
DEADLINER: Building a New Niche Search Engine