I’m helping a friend to migrate their blog from a “homemade” platform to WordPress and the original developers have not been helpful in extracting the content so I’m trying to figure out how to scrape the site. I’m using PHP & Puppeteer (PuPHPEteer).
https://www.conversationwithachef.com/
The home page has a “load more” button at the bottom to load more blog posts – I know I’ll need to create a loop to keep “clicking” and loading blog posts until there are no more, then loop through each blog post. However, they do not seem to use any Ids or Classes (everything is inline css) so I’m not sure how I can easily identify and “click” the button. The button itself is actually divs so no easy identification there either.
Any thoughts on how I can go about this? Maybe something along the lines of finding the “Load more” then getting the parent div? I’m not super familiar with Puppeteer so not sure if this is even possible?