I would like to have an application, module, software, or some kind of code written to extract organized unstructured data from a website.
There are approximately 50 sites. However, they will be done over a time period and not all at once. Perhaps, 5 to 10 at a time. I can provide access to one of them for review.
The requirements include the following:
1. Needs to extract organized unstructured data from any website
2. Needs to login to the site
3. Needs to run at a certain time or times throughout the day
4. Needs to write to a MySQL database
5. Needs to compare and modify data in another table
6. Needs to email a comprehensive report that shows # of websites polled, data extracted, errors, duplicates, date of extraction, plus other useful information, etc. for all service areas.
7. Needs to do steps 1 – 6 automatically