Data Scraper & Data Export

I need a data scraper that can run via cron as well as manually, for URLs I enter into it.

The script should scrape the URL I give it (for example http://www.oobg.com/matchups/?page=cfb/daily) and then visit each “report” link listed under the “info” column.
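A rough sketch of the link-collection step, using only the Python standard library. It assumes each link's visible text is literally "report" (in any letter case) and does not try to locate the "info" column, since the page markup is not described here. The names `ReportLinkParser` and `find_report_links` are made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class ReportLinkParser(HTMLParser):
    """Collect the href of every <a> whose visible text is "report"."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self._href = None  # href of the <a> currently open, if any
        self._text = []    # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Assumption: the link text is exactly "report", ignoring case.
            if "".join(self._text).strip().lower() == "report":
                # Resolve relative links against the page they came from.
                self.links.append(urljoin(self.base_url, self._href))
            self._href = None


def find_report_links(html, base_url):
    """Return absolute URLs of all "report" links found in the HTML."""
    parser = ReportLinkParser(base_url)
    parser.feed(html)
    parser.close()
    return parser.links
```

The HTML itself could be fetched with `urllib.request.urlopen` before being passed to `find_report_links`.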

When visiting a “report” link, it needs to check whether there is a star image (http://statfeed.statfox.com/images/star.gif) under the “edge” section of the power rating section. See the attached image as well as http://www.oobg.com/matchups/?page=cfb/expanded&gameid=20100925NORTHWESTERN&sit=1&back=1 for an example.

If there is a star image, the script should gather the team name (in this example, “C MICHIGAN”) as well as the start time of the game (“Saturday at 12:00 PM”).

If there is no star image, the script ignores the page and moves on to the next one on the list from http://www.oobg.com/matchups/?page=cfb/daily
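The star check could be sketched roughly as follows. This is a simplification: it looks for the star image anywhere on the page rather than only under the "edge" section, because the markup of that section is not described here. Pulling out the team name and start time is left out for the same reason. `StarFinder` and `has_star` are hypothetical names.

```python
from html.parser import HTMLParser

# Image URL given in the request above.
STAR_SRC = "http://statfeed.statfox.com/images/star.gif"


class StarFinder(HTMLParser):
    """Set `found` to True if any <img> on the page points at the star image."""

    def __init__(self):
        super().__init__()
        self.found = False

    def handle_starttag(self, tag, attrs):
        # Assumption: the src attribute matches the star URL exactly.
        if tag == "img" and dict(attrs).get("src") == STAR_SRC:
            self.found = True


def has_star(html):
    """Return True when the report page contains the star image."""
    finder = StarFinder()
    finder.feed(html)
    finder.close()
    return finder.found
```

A page for which `has_star` returns `False` would simply be skipped before moving on to the next report link.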

When the script is executed, it should display the results via web and email like this, with one line for each page that has a star image:

12:00 PM – C MICHIGAN
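One way the report lines might be built from the gathered values is sketched below. It assumes the start time always arrives in the form "Saturday at 12:00 PM" and that only the part after " at " belongs in the report, as in the sample line above. `format_line` and `build_report` are hypothetical helper names.

```python
def format_line(start_time, team):
    """Turn ("Saturday at 12:00 PM", "C MICHIGAN") into one report line."""
    # Keep only the text after " at "; strings without it pass through as-is.
    time_only = start_time.split(" at ", 1)[-1]
    return f"{time_only} – {team}"


def build_report(games):
    """Join one line per starred game; `games` is a list of (start_time, team)."""
    return "\n".join(format_line(start, team) for start, team in games)
```

The same text could be printed to the web page and used as the email body.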

I should be able to add email addresses to a config file (or something similar) that will receive a copy of the report when the script runs.
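A minimal sketch of the config idea, assuming an INI-style file with an `[email]` section and a comma-separated `recipients` key. The file layout, the example addresses, the subject line, and the function names are all assumptions made for illustration. The message is only built here, not sent.

```python
import configparser
from email.message import EmailMessage

# Hypothetical config contents; in practice this would live in a file.
SAMPLE_CONFIG = """
[email]
sender = scraper@example.com
recipients = alice@example.com, bob@example.com
"""


def load_recipients(config_text):
    """Return the list of addresses under [email] recipients."""
    parser = configparser.ConfigParser()
    parser.read_string(config_text)
    raw = parser.get("email", "recipients", fallback="")
    # Split on commas and drop empty entries and stray whitespace.
    return [addr.strip() for addr in raw.split(",") if addr.strip()]


def build_message(sender, recipients, body):
    """Build (but do not send) the report email."""
    msg = EmailMessage()
    msg["Subject"] = "Star report"  # placeholder subject
    msg["From"] = sender
    msg["To"] = ", ".join(recipients)
    msg.set_content(body)
    return msg
```

The resulting message could then be handed to `smtplib.SMTP(...).send_message(msg)`, with the mail server details depending on the hosting setup.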

Please ask any questions you may have.

This should be an easy task, and I would like it to be completed today 🙂
