Scraping Business Categories
We need the business categories and their related business categories scraped from www.hotfroguk.co.uk
and stored in a MySQL database like this:
<category a..z> : <related category>,…, <related category>
<category a..z> : <related category>,…, <related category>
<category a..z> : <related category>,…, <related category>
The categories are listed in the site footer as A-Z links.
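As a rough idea of the storage layout described above, here is a minimal sketch. It makes several assumptions: it uses Python's built-in sqlite3 as a stand-in for MySQL so that it runs without a server, and the table names, column names and the helpers `save_category` and `load_related` are hypothetical, not something we require.

```python
import sqlite3

# Assumption: sqlite3 stands in for MySQL; table and column names are hypothetical.
SCHEMA = """
CREATE TABLE IF NOT EXISTS category (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE IF NOT EXISTS related_category (
    category_id INTEGER NOT NULL REFERENCES category(id),
    related     TEXT NOT NULL,
    UNIQUE (category_id, related)
);
"""

def save_category(conn, name, related):
    """Insert one category with its related categories; return the category id."""
    conn.execute("INSERT OR IGNORE INTO category (name) VALUES (?)", (name,))
    (cat_id,) = conn.execute(
        "SELECT id FROM category WHERE name = ?", (name,)
    ).fetchone()
    conn.executemany(
        "INSERT OR IGNORE INTO related_category (category_id, related) VALUES (?, ?)",
        [(cat_id, r) for r in related],
    )
    conn.commit()
    return cat_id

def load_related(conn, name):
    """Return the related categories stored for `name`, in insertion order."""
    rows = conn.execute(
        "SELECT r.related FROM related_category r "
        "JOIN category c ON c.id = r.category_id "
        "WHERE c.name = ? ORDER BY r.rowid",
        (name,),
    )
    return [row[0] for row in rows]

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
save_category(conn, "ARCHITECT", ["Architects", "Drafting Service"])
print(load_related(conn, "ARCHITECT"))
```

Two tables avoid packing a comma-separated list into one column, and the UNIQUE constraints mean the scraper can be re-run without creating duplicate rows. A single table with one text column for the related list would also match the format above.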
For example, take the Architectural Consultants category here:
http://www.hotfroguk.co.uk/Products/ARCHITECT
On the left of the page you can see a Similar Categories list:
architectural services
Architects
Drafting Service
architecture
housing construction
builder
building construction
Accommodation
architectural
self catering apartments
So here you’d have:
id1, ARCHITECT :with: architectural services, Architects, Drafting Service, architecture, housing construction, builder, building construction, Accommodation, architectural, self catering apartments
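To show how a page's Similar Categories list could turn into a row like the one above, here is a minimal sketch using only Python's standard library. The markup in `SAMPLE` is an assumption (a heading followed by a list of links). The real page has to be inspected first, and the names `SimilarCategoriesParser`, `similar_categories` and `format_row` are hypothetical. With Scrapy the same logic would sit inside a spider callback.

```python
from html.parser import HTMLParser

class SimilarCategoriesParser(HTMLParser):
    """Collect link texts from the first <ul> after a 'Similar Categories' heading.

    Assumption: this page structure is hypothetical, not taken from the real site.
    """

    def __init__(self):
        super().__init__()
        self.armed = False     # the heading text has been seen
        self.in_list = False   # inside the <ul> that follows the heading
        self.in_link = False   # inside an <a> within that list
        self.done = False      # the list has been closed; ignore the rest
        self.related = []

    def handle_starttag(self, tag, attrs):
        if self.done:
            return
        if tag == "ul" and self.armed:
            self.in_list = True
        elif tag == "a" and self.in_list:
            self.in_link = True
            self.related.append("")

    def handle_endtag(self, tag):
        if tag == "a":
            self.in_link = False
        elif tag == "ul" and self.in_list:
            self.in_list = False
            self.done = True

    def handle_data(self, data):
        if self.in_link:
            self.related[-1] += data
        elif not self.armed and "Similar Categories" in data:
            self.armed = True

def similar_categories(html):
    """Return the related category names found in `html`, whitespace-normalised."""
    parser = SimilarCategoriesParser()
    parser.feed(html)
    parser.close()
    return [" ".join(name.split()) for name in parser.related if name.strip()]

def format_row(row_id, category, related):
    """Render one row in the 'id, CATEGORY :with: a, b, c' form used above."""
    return f"id{row_id}, {category} :with: {', '.join(related)}"

# Hypothetical markup for illustration only.
SAMPLE = """
<div><h3>Similar Categories</h3>
<ul>
  <li><a href="#">architectural services</a></li>
  <li><a href="#"> Architects </a></li>
  <li><a href="#">Drafting Service</a></li>
</ul>
<ul><li><a href="#">unrelated footer link</a></li></ul></div>
"""
print(format_row(1, "ARCHITECT", similar_categories(SAMPLE)))
```

The parser stops at the end of the first list after the heading, so links elsewhere on the page (such as the A-Z footer) are not picked up as related categories.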
This should be a relatively straightforward scraper project,
ideal for programmers who want to start off at ScriptLance
and get a review quickly.
It’s even possible to use one of the various online scraper/mashup services
to complete this task.
We’d prefer it to be done in Python with a scraper framework like
http://scrapy.org/, as this makes the code easier to write,
extend and maintain.
But you can use PHP or whatever you are comfortable with.
Budget: $50