PHP Web Media Grabber (Images and Media)

If you want to grab, retrieve or download media files (images, video, audio, Flash files, documents, JavaScript sources, CSS stylesheets etc.) from a public web site and use them locally within your own site, PHP Web Media Grabber is a tool built for that.
Extract and download media from any publicly reachable web page to your own server, then use the downloaded files locally on your website without external hotlinking.

Product Description

WiseLoop PHP Web Media Grabber is a set of PHP classes designed to extract and optionally download media resources from the web.
This package allows complex media extraction in a flexible manner, using only a few lines of code.
The WiseLoop PHP Web Media Grabber suite consists of a complete built-in set of specialized grabbing engines, each implemented in a separate PHP class:

  • PHP Web Image Grabber: image extraction engine;
  • PHP Web Video Grabber: video extraction engine;
  • PHP Web Audio Grabber: audio extraction engine;
  • PHP Flash / Flex Grabber: Flash / Flex applications extraction engine;
  • PHP JavaScript Grabber: JavaScript source files extraction engine;
  • PHP CSS Grabber: CSS stylesheet files extraction engine;
  • PHP Web Content-Type Grabber: extraction engine based on the Content-Type HTTP response header returned by the web resources;
  • PHP Web File Grabber: file / document extraction engine based on file name or file extension of the web resources.

Thanks to its set of specialized component classes, the WiseLoop PHP Web Media Grabber suite can extract and download almost any web resource file. Extraction starts from any given web URL whose page contains or refers to media files through web links (a href tags), various tags and attributes (src, embed, param, movie etc.) or even inline CSS styling attributes (such as background images). The grabbing engine also identifies more than the obvious media resources that carry the most common file extensions: it finds media generated dynamically by the server, as well as media files with invalid extensions or no extension at all (such as images generated at runtime). In those cases the identification is made by checking the HTTP response headers returned when the candidate resource is pinged.
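The kind of tag and attribute scanning described above can be sketched with PHP's built-in DOM extension. This is only an illustration under stated assumptions: the function name collectCandidateUrls is hypothetical and is not part of the WiseLoop API, only a subset of tags is covered, and the code assumes PHP 7.0 or later with the DOM extension enabled.

```php
<?php
// Illustrative helper only; the name and behaviour are assumptions,
// not part of the WiseLoop API.
function collectCandidateUrls(string $html): array
{
    if (trim($html) === '') {
        return [];
    }

    $doc = new DOMDocument();
    // Real-world HTML is rarely well formed, so collect libxml warnings quietly.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    // Tag => attribute pairs that commonly point at external resources.
    $sources = [
        'a'      => 'href',
        'img'    => 'src',
        'embed'  => 'src',
        'script' => 'src',
        'link'   => 'href',
    ];

    $urls = [];
    foreach ($sources as $tag => $attribute) {
        foreach ($doc->getElementsByTagName($tag) as $node) {
            $value = trim($node->getAttribute($attribute));
            if ($value !== '') {
                $urls[] = $value;
            }
        }
    }

    // Inline CSS: pull every url(...) out of style attributes.
    $xpath = new DOMXPath($doc);
    foreach ($xpath->query('//*[@style]') as $node) {
        $style = $node->getAttribute('style');
        if (preg_match_all('/url\(\s*[\'"]?([^\'")]+)[\'"]?\s*\)/i', $style, $matches)) {
            foreach ($matches[1] as $url) {
                $urls[] = trim($url);
            }
        }
    }

    // Remove duplicates while keeping first-seen order.
    return array_values(array_unique($urls));
}

$html = '<div style="background-image: url(\'bg.png\')">'
      . '<a href="full.jpg"><img src="thumb.jpg"></a></div>';
print_r(collectCandidateUrls($html));
```

The returned values are raw attribute values; a real grabber would still have to resolve relative URLs against the page address and decide which candidates are actually media.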
The smart caching feature stores useful information about the media it has found (HTTP headers, width and height for images etc.), which improves grabbing speed, saves bandwidth and avoids pinging the same media again.
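To make the caching idea concrete, here is a minimal sketch of a file-based metadata cache keyed by media URL. The class name MetadataCache, the JSON file layout and the metadata keys are all assumptions chosen for illustration; the source does not describe how the WiseLoop cache is actually stored. The code assumes PHP 7.1 or later.

```php
<?php
// Hypothetical cache class for illustration; not the WiseLoop implementation.
final class MetadataCache
{
    /** @var string Directory that holds one JSON file per cached URL. */
    private $directory;

    public function __construct(string $directory)
    {
        if (!is_dir($directory) && !mkdir($directory, 0777, true) && !is_dir($directory)) {
            throw new RuntimeException("Cannot create cache directory: $directory");
        }
        $this->directory = rtrim($directory, '/\\');
    }

    // Hash the URL so it is always a safe file name.
    private function pathFor(string $url): string
    {
        return $this->directory . DIRECTORY_SEPARATOR . sha1($url) . '.json';
    }

    // Returns the stored metadata, or null when the URL was never cached.
    public function get(string $url): ?array
    {
        $path = $this->pathFor($url);
        if (!is_file($path)) {
            return null;
        }
        $data = json_decode((string) file_get_contents($path), true);
        return is_array($data) ? $data : null;
    }

    public function set(string $url, array $metadata): void
    {
        file_put_contents($this->pathFor($url), json_encode($metadata), LOCK_EX);
    }
}

$cache = new MetadataCache(sys_get_temp_dir() . '/media-cache-demo');
$url = 'http://example.com/photo.png';
if ($cache->get($url) === null) {
    // A real grabber would ping the resource here; these values are made up.
    $cache->set($url, ['content_type' => 'image/png', 'width' => 640, 'height' => 480]);
}
print_r($cache->get($url));
```

On a second run the lookup succeeds immediately, which is where the saved request comes from.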
The chosen programming model makes it easy to extend the grabber library, thanks to its simple yet smart HTML DOM parser and processor.

Main Features

  • all formats image grabbing and downloading;
  • all formats video grabbing and downloading;
  • all formats audio grabbing and downloading;
  • Flash / Flex applications grabbing and downloading;
  • JavaScript source files grabbing and downloading;
  • CSS stylesheet files grabbing and downloading;
  • grabbing and downloading any web media resource based on its content-type;
  • grabbing and downloading any file or document based on its name or extension;
  • custom filtering depending on the grabbed media type (such as image width and height for image grabber);
  • downloading capability: the WiseLoop PHP Media Grabber can download the grabbed media to the local server, so those media files can later be referenced or used as local resources;
  • media URL name (filename) filtering: only media files whose URL names contain the specified strings will be included in the grabbing results;
  • media size filtering: only those media files having the specified size (in bytes) will be included in the grabbing results;
  • media count limiter: the number of grabbed media files will be limited to a specified value;
  • HTML area searching: the grabbing engine can search for media only inside a designated HTML area specified by a tag; by narrowing the full target page to the content of that tag, you skip unwanted media from the start; an incomplete tag (tag slice) can also be specified, and it will be autocompleted from the surrounding HTML content;
  • simple yet smart HTML DOM parser and processor;
  • smart caching for fast processing;
  • easy extension of the grabbers library;
  • lightweight due to the Autoloader feature;
  • exhaustive documentation (tutorials included);
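The URL name, size and count filters listed above can be pictured as one pass over a list of candidates. The sketch below is an assumption-laden illustration: filterMedia and containsAny are hypothetical names, each candidate is assumed to be an array with 'url' and 'size' keys, and the size filter is interpreted as an optional minimum and maximum in bytes. None of this is the WiseLoop API. It assumes PHP 7.1 or later.

```php
<?php
// Hypothetical helpers for illustration; not part of the WiseLoop API.

// True when $haystack contains at least one of the (non-empty) $needles.
function containsAny(string $haystack, array $needles): bool
{
    foreach ($needles as $needle) {
        if (stripos($haystack, $needle) !== false) {
            return true;
        }
    }
    return false;
}

// Keeps items whose URL matches a fragment and whose size is within range,
// stopping as soon as $limit items have been kept.
function filterMedia(
    array $items,
    array $nameFragments = [],
    ?int $minBytes = null,
    ?int $maxBytes = null,
    ?int $limit = null
): array {
    $kept = [];
    foreach ($items as $item) {
        if ($limit !== null && count($kept) >= $limit) {
            break;
        }
        if ($nameFragments !== [] && !containsAny($item['url'], $nameFragments)) {
            continue;
        }
        if ($minBytes !== null && $item['size'] < $minBytes) {
            continue;
        }
        if ($maxBytes !== null && $item['size'] > $maxBytes) {
            continue;
        }
        $kept[] = $item;
    }
    return $kept;
}

$items = [
    ['url' => 'http://example.com/gallery/a.jpg', 'size' => 500],
    ['url' => 'http://example.com/logo.png',      'size' => 2000],
    ['url' => 'http://example.com/gallery/b.jpg', 'size' => 3000],
    ['url' => 'http://example.com/gallery/c.jpg', 'size' => 4000],
];
// Gallery files of at least 1000 bytes, at most one result.
print_r(filterMedia($items, ['gallery'], 1000, null, 1));
```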

PHP Web Image Grabber Features

  • smart image recognition (all formats and extensions, all locations: in the img src attribute, behind an a href link, in an inline CSS attribute or by content-type);
  • default native support for most common web image extensions (jpg, jpeg, gif, png, bmp, tif, tiff, yuv, ai, eps, ps, svg, drw, ief, jfif, cod, ras, cmx, ico, pnm, pbm, pgm, rgb, xbm, xpm, xwd);
  • a href link following: the grabbing engine can follow a href links that hide other images behind them; this is a powerful feature for grabbing entire image galleries (thumbs and full size images) where the starting page displays only the thumbs and each thumb is linked through an a href tag to the real full size image;
  • parent/child image relations: when grabbing image galleries with a href link following enabled, each followed thumb is set as the parent of the full size images found behind it; this way you know, for every grabbed thumb, the corresponding full size image and vice versa;
  • inline CSS background image recognition: the grabbing engine can identify images that are referred to inline in the CSS background or background-image attributes;
  • image search and identification by the HTTP Content-Type response header: the grabbing engine identifies more than the obvious image resources that carry the most common image file extensions; it also finds images generated dynamically by the server and images with invalid extensions or no extension at all; the identification is made by checking the server response header when pinging the tested media resource;
  • image extension filtering: only those images having the specified extensions will be included in the grabbing results;
  • image dimensions filtering: only those images having the specified dimensions (width / height) will be included in the grabbing results;
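The parent/child relation between thumbs and full size images can be illustrated for the simplest case, where the link points directly at an image file. In this sketch pairThumbsWithFullSize is a hypothetical name, not a WiseLoop function, and it only inspects the href extension instead of following the link as the engine described above does. It assumes PHP 7.0 or later with the DOM extension.

```php
<?php
// Hypothetical helper for illustration; not part of the WiseLoop API.
function pairThumbsWithFullSize(
    string $html,
    array $imageExtensions = ['jpg', 'jpeg', 'gif', 'png']
): array {
    if (trim($html) === '') {
        return [];
    }

    $doc = new DOMDocument();
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    $pairs = [];
    foreach ($xpath->query('//a[@href]') as $anchor) {
        $href = trim($anchor->getAttribute('href'));
        // Look only at the path part, so query strings do not hide the extension.
        $path = parse_url($href, PHP_URL_PATH);
        if (!is_string($path)) {
            continue;
        }
        $extension = strtolower(pathinfo($path, PATHINFO_EXTENSION));
        if (!in_array($extension, $imageExtensions, true)) {
            continue; // The link does not point directly at an image.
        }
        // Every image nested inside the link is treated as a thumb (parent).
        foreach ($xpath->query('.//img[@src]', $anchor) as $img) {
            $pairs[] = ['thumb' => trim($img->getAttribute('src')), 'full' => $href];
        }
    }
    return $pairs;
}

$html = '<a href="/img/full1.jpg"><img src="/img/t1.jpg"></a>'
      . '<a href="/about.html"><img src="/img/logo.png"></a>';
print_r(pairThumbsWithFullSize($html));
```

Links whose target has no image extension are skipped here; covering them would require fetching the target and checking its response headers.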

PHP Web Video Grabber Features

  • smart video recognition (common web formats and extensions, locations: under a href links or by content-type);
  • default native support for most common web movie extensions (wmv, mpg, mpg2, mpeg, mp4, flv, avi, mov, qt, rm, 3g2, 3gp, 3gp2, 3gpp, 3p2, lsf, lsx, asr, movie, asf, asx, bik, bix, bsf, divx, dv, dvr-ms, dvdmedia, evo, flc, flh, fli, m2v, m4v, mj2, mjp, mp21, ogm, ogv, scm, swf, vob, xvid, yuv);
  • video search and identification by the HTTP Content-Type response header: the grabbing engine identifies more than the obvious video resources that carry the most common video file extensions; it also finds videos with invalid extensions or no extension at all; the identification is made by checking the server response header when pinging the tested media resource;
  • video files extension filtering: only those videos having the specified extensions will be included in the grabbing results;

PHP Web Audio Grabber Features

  • smart audio recognition (common web formats and extensions, locations: under a href links or by content-type);
  • default native support for most common web audio extensions (au, snd, mid, rmi, mp3, aif, aifc, aiff, m3u, ra, ram, wav);
  • audio search and identification by the HTTP Content-Type response header: the grabbing engine identifies more than the obvious audio resources that carry the most common audio file extensions; it also finds audio files with invalid extensions or no extension at all; the identification is made by checking the server response header when pinging the tested media resource;
  • audio files extension filtering: only those audio files having the specified extensions will be included in the grabbing results;

PHP Flash Grabber Features

  • grab Flash or Flex applications embedded in the targeted URL page;
  • Flash (Flex) files search and identification by the HTTP Content-Type response header: the grabbing engine looks beyond the obvious Flash resources with the .swf file extension; it also finds Flash files that have a different extension or no extension at all; the identification is made by checking the server response header when pinging the tested Flash / Flex file resource;

PHP JavaScript Grabber Features

  • grab external JavaScript source code files included by the targeted URL page;
  • JavaScript files search and identification by the HTTP Content-Type response header: the grabbing engine looks beyond the obvious JavaScript resources with the .js file extension; it also finds JavaScript files that have a different extension or no extension at all; the identification is made by checking the server response header when pinging the tested JavaScript file resource;

PHP CSS Grabber Features

  • grab external CSS stylesheet files referred to by the targeted URL page;
  • CSS stylesheet files search and identification by the HTTP Content-Type response header: the grabbing engine looks beyond the obvious CSS resources with the .css file extension; it also finds CSS files that have a different extension or no extension at all; the identification is made by checking the server response header when pinging the tested CSS file resource;

PHP Web Content-Type Grabber Features

  • grab anything embedded in or referred to by the targeted URL page, depending on the Content-Type HTTP response headers;
  • HTTP headers Content-Type filtering: only those files having the specified content-type will be included in the grabbing results;
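Identification by Content-Type comes down to mapping a header value to a media kind. The sketch below shows one way such a mapping could look; mediaKindFromContentType is a hypothetical name and the list of MIME types is a small, assumed subset, not the set the WiseLoop engines actually recognise. It assumes PHP 7.1 or later.

```php
<?php
// Hypothetical helper for illustration; not part of the WiseLoop API.
// Maps a Content-Type header value to a coarse media kind, or null if unknown.
function mediaKindFromContentType(string $contentType): ?string
{
    // Drop parameters such as "; charset=UTF-8" and normalise the case.
    $mime = strtolower(trim(explode(';', $contentType)[0]));

    if ($mime === 'application/x-shockwave-flash') {
        return 'flash';
    }
    $scriptTypes = ['application/javascript', 'text/javascript', 'application/x-javascript'];
    if (in_array($mime, $scriptTypes, true)) {
        return 'javascript';
    }
    if ($mime === 'text/css') {
        return 'css';
    }
    // image/*, video/* and audio/* are recognised by their top-level type.
    foreach (['image', 'video', 'audio'] as $kind) {
        if (strpos($mime, $kind . '/') === 0) {
            return $kind;
        }
    }
    return null;
}

var_dump(mediaKindFromContentType('image/png'));
var_dump(mediaKindFromContentType('text/css; charset=utf-8'));
var_dump(mediaKindFromContentType('text/html'));
```

The header value itself could be obtained with PHP's built-in get_headers() function called on the candidate URL, which is one way to "ping" a resource without relying on its file extension.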

PHP Web File Grabber Features

  • grab any document or file embedded in or referred to by the targeted URL page, depending on its type;
  • file name filtering: only those files containing the specified strings or extensions will be included in the grabbing results;

Requirements

  • Apache Web Server 2.0.0 or above
  • PHP 5.0.0 or above

Installation Instructions

  • Step 1: create a folder named /php-web-media-grabber on your web server;
  • Step 2: copy the entire /bin directory to the newly created /php-web-media-grabber folder;
  • Step 3: create the /download directory under the newly created /php-web-media-grabber folder;
  • Step 4: make sure that the /php-web-media-grabber/download directory has full access rights (chmod 0777 on Linux);
  • Step 5: include /php-web-media-grabber/bin/wlWmg.php in your application.
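Steps 3 to 5 can also be scripted. The following is a minimal sketch under stated assumptions: prepareGrabberFolders is a hypothetical helper, the web root location is whatever you pass in, and the /bin directory is assumed to have been copied by hand as in Step 2. Only the folder names and the wlWmg.php path come from the steps above. It assumes PHP 7.0 or later.

```php
<?php
// Hypothetical setup helper for illustration; not shipped with the product.
// Creates <webRoot>/php-web-media-grabber/download and opens its permissions.
function prepareGrabberFolders(string $webRoot): string
{
    $download = rtrim($webRoot, '/\\') . '/php-web-media-grabber/download';
    if (!is_dir($download) && !mkdir($download, 0777, true) && !is_dir($download)) {
        throw new RuntimeException("Could not create $download");
    }
    // mkdir() is subject to the process umask, so set the mode explicitly (Step 4).
    chmod($download, 0777);
    return $download;
}

// Step 5: include the entry point once the /bin directory is in place.
// The location relative to this script is an assumption.
$entryPoint = __DIR__ . '/php-web-media-grabber/bin/wlWmg.php';
if (is_file($entryPoint)) {
    require_once $entryPoint;
}
```

Calling prepareGrabberFolders() with your web root returns the full path of the download directory, which is convenient to keep in a variable for later use.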

Information

Note

WiseLoop assumes no responsibility for any abusive use of this software product and/or violation of any terms of usage of the grabbed web pages.
If you decide to use this software product, use it responsibly and make sure that you are allowed to grab media from the desired web page by checking its terms of usage.


