Scrape Website Data

I need the following data scraped and entered into a csv file from this website;

http://filext.com (Alaphabetical list of all pages available here: http://filext.com/alphalist.php )

Extension
Information
Primary association:
Primary association URL:
Company:
Company URL:
Program ID:
File classification:

Here is an example taken from: http://filext.com/alphalist.php?extstart=%5EN

Extension: NPL
Information: Nokia Audio File (Nokia)

The following is taken from: http://filext.com/file-extension/NPL

Primary association: Nokia
Primary association URL: http://www.nokia.com/
Company: Nokia
Company URL: http://www.nokia.com/
Program ID:
File classification: Audio

Note. Not all the above fields are present in every page.

Unsure of how many pages in all – estimate there are between 20,000-30,000.

Leave a Reply

Your email address will not be published. Required fields are marked *