I need the following data scraped and entered into a csv file from this website;
http://filext.com (Alaphabetical list of all pages available here: http://filext.com/alphalist.php )
Extension
Information
Primary association:
Primary association URL:
Company:
Company URL:
Program ID:
File classification:
Here is an example taken from: http://filext.com/alphalist.php?extstart=%5EN
Extension: NPL
Information: Nokia Audio File (Nokia)
The following is taken from: http://filext.com/file-extension/NPL
Primary association: Nokia
Primary association URL: http://www.nokia.com/
Company: Nokia
Company URL: http://www.nokia.com/
Program ID:
File classification: Audio
Note. Not all the above fields are present in every page.
Unsure of how many pages in all – estimate there are between 20,000-30,000.
