I need a wordpress plugin that lets me scrape data from magiccards.info.
I need a plugin that has an admin panel and asks for 2 things.
1 – The URL for the ‘set’ of cards that it should scrape data from.
1a – http://magiccards.info/roe/en.html would be an example URL that I would give it.
2 – The category that it should make these posts.
____________________________
The specific data I want scraped can be seen here. . .
http://www.magickards.com/zendikar/misty-rainforest/
And this is the magiccards.info website for that card
http://magiccards.info/zen/en/220.html
_____________________________
I need the text from ‘Land’ all the way down to the ‘illustrator’. It also needs to grab the image, and put it at the top of the post(centered). Then the text should go below the image, and then the ‘wordpress more tag’ should go below that.
This is the only data that needs to be scraped(along with the name of the card). The ‘card type’ won’t always be ‘Land’ it will sometimes be ‘Creature’ ‘Aura’ etc. but it’s that text area that I need to scrape.
_____________________________
So when I give it this URL
http://magiccards.info/roe/en.html
or another URL for a ‘set of cards’ like
http://magiccards.info/wwk/en.html
and tell the plugin what Category those posts should go into. I want the plugin to scrape 1 card every 10 minutes(to keep the load down on my shared hosting account). Once it scrapes the image and text for 1 card, and posts it, it’ll take 10 minutes and then go to the next card in the list(from the URL). And continue until there is no more card in that set.
Please look at ALL links before bidding, and ask any questions will help you with this project.
Note – I would like to use WordPress’ cron function so I don’t have to manually setup a cronjob.