How to Resolve a 403 Forbidden Error When Scraping a Website Using PHP Simple HTML DOM Parser?

I am trying to scrape data from a website using the PHP Simple HTML DOM Parser. However, every time I attempt to fetch the HTML content of the page, I encounter a 403 Forbidden error.

To troubleshoot, I tried setting custom headers, including a User-Agent, using Guzzle PHP to mimic a browser request. Despite this, the issue persists, and I am unable to retrieve the webpage content.

// using Simple HTML DOM Parser
require '../simple_html_dom.php';

// file_get_html() downloads the page and returns a DOM object
$html = file_get_html('https://www.mywebsite.com');
$title = $html->find('title', 0);
$image = $html->find('img', 0);

echo $title->plaintext . "<br>\n";
echo $image->src;

// using guzzle
require '../../vendor/autoload.php';

use GuzzleHttp\Client;

$url = "https://www.mywebsite.com";
$client = new Client();

try {
    $response = $client->request('GET', $url, [
        'headers' => [
            'User-Agent' => 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.124 Safari/537.36',
            'Accept-Language' => 'en-US,en;q=0.9',
            'Accept-Encoding' => 'gzip, deflate, br',
            'Accept' => 'text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.9',
            'Referer' => 'https://www.mywebsite.com',
        ]
    ]);

    if ($response->getStatusCode() === 200) {
        $html = $response->getBody()->getContents();
        echo "Fetched HTML (first 500 characters):n" . substr($html, 0, 500) . "nn";

        // Continue with DOM parsing...
    } else {
        echo "Failed to fetch the URL. HTTP Status Code: " . $response->getStatusCode() . "n";
    }
} catch (Exception $e) {
    echo "An error occurred: " . $e->getMessage() . "n";
}
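
For completeness, once the body is fetched successfully I plan to hand it to Simple HTML DOM with str_get_html() instead of file_get_html(), roughly like this (just a sketch; the null checks are only defensive):

// sketch: parse the HTML string returned by Guzzle with Simple HTML DOM
require '../simple_html_dom.php';

$dom = str_get_html($html); // $html is the body fetched above

if ($dom) {
    $title = $dom->find('title', 0);
    $image = $dom->find('img', 0);

    echo ($title ? $title->plaintext : 'no title') . "\n";
    echo ($image ? $image->src : 'no image') . "\n";
}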

I suspect the server has additional protections, such as IP blocking, anti-bot detection, or a required session cookie, that are causing the 403 error.
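
If the problem is a missing session cookie, I assume something along these lines with Guzzle's CookieJar would be needed to persist cookies between requests (untested sketch; the "/some-page" path is only a placeholder):

use GuzzleHttp\Client;
use GuzzleHttp\Cookie\CookieJar;

// sketch: reuse one cookie jar so any cookies set by the first
// response are sent back automatically on later requests
$jar = new CookieJar();
$client = new Client(['cookies' => $jar]);

// first request may pick up a session cookie
$client->request('GET', 'https://www.mywebsite.com', [
    'headers' => ['User-Agent' => 'Mozilla/5.0 ...'], // same UA string as above
]);

// follow-up request carries the stored cookies
$response = $client->request('GET', 'https://www.mywebsite.com/some-page', [
    'headers' => ['User-Agent' => 'Mozilla/5.0 ...'],
]);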

  • Are there other headers or configurations I should include to bypass
    the 403 Forbidden error?
  • Is there an alternative approach or library that might work better
    for scraping websites with such restrictions?

Any guidance on resolving this issue would be greatly appreciated!