You are here: Webwiki >

Thebana­nat­ - Building a Search Engine (No review yet)

Goto Thebana­nat­

Language: english

Spidering the Internet using a couple of Robots, Perl and Postgres

Keywords: Robots Spiders Postgres Perl Harvesting URL's Database Distributed

Category: Computer and Technology

Reviews and ratings of

There are no reviews yet.

Content and keywords

Harry Jackson is specified as the websites creator.

Important and popular websites

The website with the homepage "The Banana Tree" provides content on the pages Vector Space Search Engine, Postgres and Building A Spider. In the following table you'll find the 10 most important pages of

# Description URL of the website
1. The Bana­na Tree /
2. Vector Spa­ce Search En­gi­ne /vector_space/tuto­rial.html
3. Postgres /postgres/postgres.html
4. Buil­ding a Spi­der /vector_space/buil­ding_a_spider.html
5. Prog­ram­ming /met­hodo­logy.html
6. Vector Spa­ce /vector_space/vector_space.html
7. In­de­xing the In­ter­net /vector_space/in­de­xing_the_in­ter­net.html
8. What is a ro­bots.txt file /robots.html
9. Stop List /stop­list.html
10. Road Map /road_map.html

Technical information

The web server with the IP-address used by is owned by Linode and is located in Houston, USA. This web server runs a few other websites, mostly in the english language.

The websites of are served by a Apache server. The markup language of the website is XHTML 1.0 Strict. The website does not specify details about the inclusion of its content in search engines. For this reason the content will be included by search engines. In order to display ads the Google Adsense advertising network is used.

Information about the server of the website

IP address:
Server provider:Linode
Number of websites:4 - more websites using this IP address
Language distribution:100% of the websites are english

Technical information about the technology of the website

Webserver software: Apache
Load time: 0.27 seconds (faster than 85 % of all websites)
HTML version:XHTML 1.0 Strict
Filesize:7.65 KB (674 recognized words in text)

Safety and classification

The website doesn't contain questionable content. It can be used by kids and is safe for work.

Attribute Classification
Google Safebrowsing
Safe for children
Safe for work
Webwiki rating
No ratings
Server location
 USA, Houston
Trustworthy 85%
Disclaimer: The classification is based on the automatic analysis of public information, ratings and customer reviews. All information is provided without warranty.
For webmasters:
Add a Webwiki button with the current rating to your website!