View Single Post
Old 2005-10-02, 04:24 AM   #2
Mr. Stiff
Trying is the first step towards failure
 
Mr. Stiff's Avatar
 
Join Date: Sep 2005
Location: The Netherlands
Posts: 120
Send a message via ICQ to Mr. Stiff
Hi,

Curl is a good program for getting webpages, headers, etc. It's installed on most (good) hosting servers.

Here's how I use it:

- Column 'lastspider' on my gallery table
- Query table, getting URL's not spidered the last xxx days/hours/weeks/whatever
- Use curl extension to connect to URL.
- You can choose only to download headers, which is much faster than downloading the full page
- Check header respons (must be 200). If it's 404 -> page not found, 301 or 302 -> redirect)
- Update your table!
Mr. Stiff is offline   Reply With Quote