Hello everyone,
My name is Jonathan Paul 🙂
Thanks for taking the time read this project description.
If you have any concerns about this task or myself, then
please check my feedback. I will not waste your time 🙂
This project will suit someone with CGI or PHP experience.
I have an existing piece of software that can parse results
from Google based on a search query. I can provide this
software with a keyword – like “dogs” and it will ask
Google for the results for this keyword.
However – Google only returns a maximum of 1,000 results per
search engine query. This is a limit that Google has created.
This limit applies to all search engine queries.
For example … even though the following query produces
millions of results – the user is only allowed to view a
maximum of 1,000 results.
http://www.google.com/search?q=dogs&ie=utf-8&oe=utf-8&aq=t&rls=org.mozilla:en-GB:official&client=firefox-a
We are restricted to 1,000 results total. There is however,
a way to collect more than 1,000 results cumulatively.
We can do this, by limiting the results for a single query,
by date. Google actually allows us to restrict results using
an advanced operator daterange.
We can ask Google for the results for the query “dogs” and get
a get a maximum of 1,000 results for a certain date range, and
then we can ask Google again for another list of results, from
a different date range.
The daterange functions works like this …
If I want to find pages (about a certain topic) indexed by
Google during the month of May 2003, I would type in the
following Google search query:
“dogs” daterange: 2452774-2452803
(This is a Julian date range)
Now …
Your job, is to create a PHP script, that allows me to provide
a keyword phrase, and a starting and ending date. The script
will then create a series of search engine querys, like the
example shown above, for each day between the date ranges
tat I specify.
For example …
The script would show me a list of search engine
queries based on a certain keyword for:
– The 1st of May
– The 2nd of May
– The 3rd of May
and so on …
The idea is simple.
Ask Google for the results for a keyword – but only,
a single day’s worth of results at a time, across a
daterange that I input to the script.
If I ask for 2 months of results, then the 60 search
queries that are generated by the script would be in
the following format:
“keyword” daterange: 2452774-2452803
and there would be 60 such queries outputted representing
each day, across a date range I specify.
Hopefully that makes sense.
My goal, is to end up with thousands of queries like the example
shown above, so I can feed these queries into my other software,
and parse the results for each day, based on my keyword.
For example … I might ask the script to produce a year’s
worth of queries. The script would produce 365 queries in the
following format
“keyword” daterange: 2452774-2452803
Each query, representing a single day across the entire year.
You will need to create a script that asks for 3 things:
Thanks for taking the time to read this project description.
Good luck and I look forward to reviewing your bids 🙂
Cheers,
Jonathan
