Google

Wednesday, October 10, 2007

A buzzword – Web spiders

Spider
- Search engine special software robots
- Download webpages

Crawler
- Look for "links"
- Crawling the downloaded pages by spider
- Decide where the spider should go to next based on links

Indexer
- Rips apart a page into it's various components and analyzes them.
- Components e.g titles, headings, links, text, bold, italic etc.

No comments: