Website Crawler

Hello,

Our company needs a site that will crawl different sites to get hotel prices and store them in a database.
Please find below the description of what we call the “hotel pricing system.”

The goal of this document is to explain how we would like the hotelsniff.com website to work.

The design of the site hotelsniff.com will be very basic.
There will be an admin and different users.
Admin can add/edit/remove/suspend users.

Each user will be a hotel or a group of hotels.

Each user will have a login and password.

Each hotel will have a name and 3 URLs :
– The official URL of the hotel
– The Expedia URL of the hotel
– The booking.com URL of the hotel

Each hotel will have a list of competitors.

Each competitor will have a name and 3 URLs :
– The official URL of the competitor
– The Expedia URL of the competitor
– The booking.com URL of the competitor

Let’s take an example :

A user has access to one hotel :
– Hotel Notre Dame

So for this hotel, we have 3 URLs :
– Hotel Notre Dame
– Official site : http://hotelnotredameparis.com/
– Expedia site : http://www.expedia.fr/Paris-Hotel-Le-Notre-Dame.h110086.Hotel-Information
– Booking.com site : http://www.booking.com/hotel/fr/le-notre-dame.en.html

This hotel has 4 competitors :
– Hotel lumens : http://www.hotel-lumenparis.com/fr/
– Hotel Cluny : http://www.hotel-cluny.fr/
– Select Hotel : http://www.selecthotel.fr/fr_content/hotel.php
– Hotel de notre dame : http://www.hotel-paris-notredame.com/

Each competitor has 3 URLs :
– Hotel lumens
– Official site : http://www.hotel-lumenparis.com/fr/
– Expedia site : http://www.expedia.fr/Paris-Hotel-Hotel-Lumen-Paris-Louvre.h1682992.Hotel-Information
– Booking.com site : http://www.booking.com/hotel/fr/lumen-paris-louvre.en.html

– Hotel Cluny
– Official site : http://www.hotel-cluny.fr/
– Expedia site : http://www.expedia.fr/Paris-Hotel-Cluny-Sorbonne.h455105.Hotel-Information
– Booking.com site : No site

– Select Hotel :
– Official site : http://www.selecthotel.fr/fr_content/hotel.php
– Expedia site : http://www.expedia.fr/Paris-Hotel-Select-Hotel-Rive-Gauche.h1169045.Hotel-Information
– Booking.com site : http://www.booking.com/hotel/fr/select-paris.html

– Hotel de notre dame :
– Official site : http://www.hotel-paris-notredame.com
– Expedia site : http://www.expedia.fr/Paris-Hotel-De-Notre-Dame.h63307.Hotel-Information
– Booking.com site : http://www.booking.com/hotel/fr/de-notre-dame-paris.html
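
To make the structure above concrete, here is a minimal sketch of how a hotel and its competitors could be modeled. The class name `Property`, its field names and the helper `crawl_targets` are only assumptions for illustration, not a required design. A missing URL (the "No site" case of Hotel Cluny) is stored as `None` and skipped when listing the pages to crawl. Only one competitor is included to keep the sketch short.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Property:
    """A hotel or a competitor: a name plus up to three URLs (hypothetical model)."""
    name: str
    official_url: Optional[str] = None
    expedia_url: Optional[str] = None
    booking_url: Optional[str] = None  # None models "No site"
    competitors: List["Property"] = field(default_factory=list)


def crawl_targets(hotel: Property) -> List[Tuple[str, str, str]]:
    """Return (property name, source, url) for the hotel and each competitor,
    skipping any URL that is missing."""
    targets = []
    for prop in [hotel, *hotel.competitors]:
        for source, url in (
            ("official", prop.official_url),
            ("expedia", prop.expedia_url),
            ("booking", prop.booking_url),
        ):
            if url:
                targets.append((prop.name, source, url))
    return targets


# URLs copied from the example above.
cluny = Property(
    name="Hotel Cluny",
    official_url="http://www.hotel-cluny.fr/",
    expedia_url="http://www.expedia.fr/Paris-Hotel-Cluny-Sorbonne.h455105.Hotel-Information",
)
notre_dame = Property(
    name="Hotel Notre Dame",
    official_url="http://hotelnotredameparis.com/",
    expedia_url="http://www.expedia.fr/Paris-Hotel-Le-Notre-Dame.h110086.Hotel-Information",
    booking_url="http://www.booking.com/hotel/fr/le-notre-dame.en.html",
    competitors=[cluny],
)

targets = crawl_targets(notre_dame)
for name, source, url in targets:
    print(name, source, url)
```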

=> What we want :

We would like you to get the price for a double room for today and the next 60 days and store this data for each hotel and each competitor.
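
As a rough sketch of what "today and the next 60 days" means for storage: each crawl covers 61 stay dates, and every price should be saved together with the day it was recorded so that it can be compared later. The table layout, the column names and the use of SQLite below are assumptions for illustration only; the year is arbitrary and the price is left empty because nothing has been fetched.

```python
import sqlite3
from datetime import date, timedelta


def stay_dates(today, days_ahead=60):
    """Return today plus the next `days_ahead` days (61 dates by default)."""
    return [today + timedelta(days=i) for i in range(days_ahead + 1)]


def create_schema(conn):
    """Create a hypothetical price table: one row per hotel, source, crawl day and stay date."""
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS price (
            hotel       TEXT NOT NULL,
            source      TEXT NOT NULL,   -- 'official', 'expedia' or 'booking'
            recorded_on TEXT NOT NULL,   -- day the crawl ran (ISO date)
            stay_date   TEXT NOT NULL,   -- night the price applies to (ISO date)
            price       REAL,            -- NULL when no price was found
            PRIMARY KEY (hotel, source, recorded_on, stay_date)
        )
        """
    )


def store_price(conn, hotel, source, recorded_on, stay_date, price):
    """Insert one price, replacing any earlier value from the same crawl day."""
    conn.execute(
        "INSERT OR REPLACE INTO price VALUES (?, ?, ?, ?, ?)",
        (hotel, source, recorded_on.isoformat(), stay_date.isoformat(), price),
    )


conn = sqlite3.connect(":memory:")
create_schema(conn)
today = date(2024, 10, 1)  # arbitrary year, for illustration
for night in stay_dates(today):
    # None stands in for a price that the crawler has not fetched yet.
    store_price(conn, "Hotel Notre Dame", "expedia", today, night, None)
row_count = conn.execute("SELECT COUNT(*) FROM price").fetchone()[0]
print(row_count)
```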

We would like to allow the user to view at a glance the price of a double room for any day or set of days he chooses, for his hotel and its competitors.

Ex : Today is October 1st. user1 wants to see the prices for his hotel and its competitors from October 10th to October 13th.
You will show him a grid with the price of each hotel per night (always a double room).
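
The grid could be built from the stored records roughly as follows. This is only a sketch: the record layout `(hotel, source, recorded_on, stay_date, price)`, the function name `build_grid` and the choice of one grid per source are assumptions, and the prices are invented placeholders.

```python
from datetime import date, timedelta


def build_grid(records, start, end, recorded_on, source):
    """Return (nights, grid) where grid maps hotel -> list of prices, one per night.

    A cell is None when no price was recorded for that night.
    """
    nights = [start + timedelta(days=i) for i in range((end - start).days + 1)]
    grid = {}
    for hotel, src, rec, stay, price in records:
        if src == source and rec == recorded_on and start <= stay <= end:
            row = grid.setdefault(hotel, [None] * len(nights))
            row[nights.index(stay)] = price
    return nights, grid


# Placeholder records; the prices are invented for illustration.
snapshot = date(2024, 10, 1)
records = [
    ("Hotel Notre Dame", "expedia", snapshot, date(2024, 10, 10), 100),
    ("Hotel Notre Dame", "expedia", snapshot, date(2024, 10, 13), 120),
    ("Hotel Cluny", "expedia", snapshot, date(2024, 10, 11), 90),
    # Older snapshot: ignored when building the October 1st grid.
    ("Hotel Cluny", "expedia", date(2024, 9, 1), date(2024, 10, 11), 80),
]
nights, grid = build_grid(
    records, date(2024, 10, 10), date(2024, 10, 13), snapshot, "expedia"
)
for hotel, row in grid.items():
    print(hotel, row)
```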

user1 will also want to see whether prices change over time. We will offer him a compare tool.

Ex : Today is October 1st. user1 wants to see the prices for his hotel and its competitors from October 10th to October 13th as recorded today (October 1st), and wants another panel to compare them with the prices recorded on September 1st for the same dates (October 10th to 13th).

You will show him 2 grids with the price of each hotel per night.
grid1 as it is on the 1st of October
grid2 as it was on the 1st of September.
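
Besides showing the two grids side by side, the compare tool could also compute the change per cell. The sketch below assumes each grid is a mapping from hotel name to a list of nightly prices (with `None` for a missing price); the function name `compare_grids` and the sample prices are invented for illustration.

```python
def compare_grids(grid_then, grid_now):
    """Return {hotel: [now - then per night]}, with None where either price is missing."""
    diff = {}
    for hotel in sorted(set(grid_then) | set(grid_now)):
        then = grid_then.get(hotel, [])
        now = grid_now.get(hotel, [])
        # Pad the shorter row so both snapshots cover the same number of nights.
        width = max(len(then), len(now))
        then = then + [None] * (width - len(then))
        now = now + [None] * (width - len(now))
        diff[hotel] = [
            None if a is None or b is None else b - a
            for a, b in zip(then, now)
        ]
    return diff


# Invented prices for October 10th to 13th, as recorded on two different days.
grid_sept_1 = {"Hotel Notre Dame": [100, 100, 110, 110]}
grid_oct_1 = {"Hotel Notre Dame": [120, 100, None, 130], "Hotel Cluny": [90, 90, 90, 90]}

changes = compare_grids(grid_sept_1, grid_oct_1)
for hotel, row in changes.items():
    print(hotel, row)
```

A hotel that appears in only one snapshot gets a row of `None`, since there is nothing to compare it with.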

=> Comments :

Please bid for the overall system and send me by PMB the price you would charge per hotel.
As you can see, each hotel has at least 5 competitors. This means that every night you will have to get the data for (5 competitors + 1 hotel) x 3 sites = 18 prices.
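
The arithmetic above, spelled out as a short sketch. The second figure is an assumption on our side: it supposes that all 61 stay dates (today plus the next 60 days) are refreshed on every nightly crawl, which the 18-price count does not include.

```python
competitors = 5
sites = 3        # official site, Expedia, Booking.com
stay_dates = 61  # today plus the next 60 days

# Prices for one stay date: (5 competitors + 1 hotel) x 3 sites.
prices_per_stay_date = (competitors + 1) * sites

# Assumption: every stay date is re-crawled each night.
prices_per_crawl = prices_per_stay_date * stay_dates

print(prices_per_stay_date, prices_per_crawl)
```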
In terms of programming, Expedia and Booking.com will be easy: once you have developed the code for one hotel, the other hotels will be very similar. The official hotel sites are a bit more complicated.

I had someone work on this site a few months ago, but he no longer replies. Nevertheless, he started working on the attached Word document.

Thx and good luck.
