Greenguy's Board


Go Back   Greenguy's Board > Search Engines
Register FAQ Calendar Today's Posts

Reply
 
Thread Tools Search this Thread Rate Thread Display Modes
Old 2006-04-12, 01:11 AM   #1
Useless
Certified Nice Person
 
Useless's Avatar
 
Join Date: Oct 2003
Location: Dirty Undies, NY
Posts: 11,268
Send a message via ICQ to Useless
HTML Validation

How much, do you imagine, does the HTML validation of your pages effect SE bots and indexing. Or do you think that it plays no factor at all? I've hit a couple of small link lists to day where the little validation thing-a-ma-jig on my browser reported that it pretty much gave up due to so many page errors. I'd think shit like that would have some level effect on the bots.
__________________
Click here to purchase a bridge I'm selling.
Useless is offline   Reply With Quote
Old 2006-04-12, 02:21 AM   #2
Maj. Stress
Progress rarely comes in buckets, it normally comes in teaspoons
 
Maj. Stress's Avatar
 
Join Date: Jun 2005
Location: Dark Side Of Naboo
Posts: 1,289
It can have a lot of effect. Especially on a page as bad as the one you described. It's bad enough to screw yourself on the seo end but the chances of it being viewed as intended dramatically decrease if you do get any traffic.
Maj. Stress is offline   Reply With Quote
Old 2006-04-12, 06:58 AM   #3
ladydesigner
~Serving Up Sinful Sex ~
 
ladydesigner's Avatar
 
Join Date: Apr 2003
Location: Missouri City, Texas
Posts: 1,928
Quote:
Originally Posted by Useless Warrior
I've hit a couple of small link lists to day where the little validation thing-a-ma-jig on my browser reported that it pretty much gave up due to so many page errors.
UW, What "validation thing-a-ma-jig" do you use? I'd like to have something like that on my browser.
__________________
Porn Pixie XxX ~ Looking for Link Trades
ladydesigner is offline   Reply With Quote
Old 2006-04-12, 07:50 AM   #4
Useless
Certified Nice Person
 
Useless's Avatar
 
Join Date: Oct 2003
Location: Dirty Undies, NY
Posts: 11,268
Send a message via ICQ to Useless
Quote:
Originally Posted by ladydesigner
UW, What "validation thing-a-ma-jig" do you use? I'd like to have something like that on my browser.
I use two firefox extensions - Html Validator 0.7.9 and HTML Validator (based on CSE HTML Validator) 1.2.2. I use the first one primarily. It will tell you to make changes like changing <br> to <br /> and things like that. Very hand tool to clean up your own pages.
__________________
Click here to purchase a bridge I'm selling.
Useless is offline   Reply With Quote
Old 2006-04-12, 09:26 AM   #5
ladydesigner
~Serving Up Sinful Sex ~
 
ladydesigner's Avatar
 
Join Date: Apr 2003
Location: Missouri City, Texas
Posts: 1,928
Thanks!
__________________
Porn Pixie XxX ~ Looking for Link Trades
ladydesigner is offline   Reply With Quote
Old 2006-04-12, 11:10 AM   #6
MrYum
Arghhhh...submit yer sites ya ruddy swabs!
 
MrYum's Avatar
 
Join Date: May 2004
Location: Sunny Florida!
Posts: 5,108
Send a message via ICQ to MrYum
Yep, I validate every page and every css with the FF webdeveloper extension...very useful tool
MrYum is offline   Reply With Quote
Old 2006-04-12, 12:32 PM   #7
cd34
a.k.a. Sparky
 
cd34's Avatar
 
Join Date: Sep 2004
Location: West Palm Beach, FL, USA
Posts: 2,396
It is my belief that extremely broken html will cause extreme problems.

What the browser does, and how a google bot reads a page are totally different. Google uses Python, so, we'll assume that they use Python for their bot. Python has an sgml parser which takes your page, dissects it into a tree structure, then goes to work on that.

Things like

Code:
<a href="page.html>hi there</a>
Will sometimes properly be rendered in browsers. Hanging table cells, improperly nested cells, etc -- all go towards making an automated process have problems. It used to be that you didn't need to close the <td>, <tr>, <b>, etc, however, those all break automatic parsing.

While I don't follow every recommendation that the validators give -- I do make sure that the html isn't broken. There are pages I have that have incorrect html, bgimage, bgcolor, etc as attributes on html that isn't in the standard, but, I'll let that slide. That won't break a parser.

However, improperly nested content can sometimes cause problems.

Code:
<a href="page.html"><h1>hi there</a></h1>
An automated process will get confused with the above. Depending on how they are parsing, I would suspect you might lose the effect of the <h1>. Now, google probably goes to all lengths to make sure they can spider the web to the best of their ability, but, why gamble on that?
__________________
SnapReplay.com a different way to share photos - iPhone & Android
cd34 is offline   Reply With Quote
Old 2006-04-12, 01:07 PM   #8
MeatPounder
Women might be able to fake orgasms But men can fake whole relationships
 
MeatPounder's Avatar
 
Join Date: Oct 2003
Location: Fort Lauderdale, Fl
Posts: 2,408
Fot you IE users this is a great tool, I turn it off and on as I need it
Internet Explorer Developer Toolbar
-- Explore and modify the document object model (DOM) of a Web page.
-- Locate and select specific elements on a Web page through a variety of techniques.
-- Selectively disable Internet Explorer settings.
-- View HTML object class names, ID's, and details such as link paths, tab index values, and access keys.
-- Outline tables, table cells, images, or selected tags.
-- Validate HTML, CSS, WAI, and RSS Web feed links.
-- Display image dimensions, file sizes, path information, and alternate (ALT) text.
-- Immediately resize the browser window to a new resolution.
-- Selectively clear the browser cache and saved cookies. Choose from all objects or those associated with a given domain.
-- Choose direct links to W3C specification references, the Internet Explorer team weblog (blog), and other resources.
-- Display a fully featured design ruler to help accurately align and measure objects on your pages.
-- You can now selectively enable and disable CSS parsing.
-- The Misc menu contains a color picker.
-- Several link reports are available.
-- When you select an element in the DOM element tree list, the selected element scrolls into view if it is not already visible in the browser window.
MeatPounder is offline   Reply With Quote
Old 2006-04-12, 01:28 PM   #9
ladydesigner
~Serving Up Sinful Sex ~
 
ladydesigner's Avatar
 
Join Date: Apr 2003
Location: Missouri City, Texas
Posts: 1,928
Outstanding! Thanks MeatPounder.
__________________
Porn Pixie XxX ~ Looking for Link Trades
ladydesigner is offline   Reply With Quote
Old 2006-04-13, 05:55 AM   #10
Halfdeck
You can now put whatever you want in this space :)
 
Halfdeck's Avatar
 
Join Date: Oct 2004
Location: New Haven, CT
Posts: 985
Send a message via ICQ to Halfdeck
Quote:
How much, do you imagine, does the HTML validation of your pages effect SE bots and indexing. Or do you think that it plays no factor at all?
We can only guess. We don't know the extent to which the SE bots can correctly spider a page with broken HTML. The only thing I assume is the major SEs have no problems crawling/indexing a WC3 Valid HTML page -- but even that's a big assumption. I've written a few parsers using regexp and I'll say its not hard for a parser to choke on some random /unexpected stuff on a page. But the final answer to your question really is "we don't know."

If you're asking SHOULD I validate my HTML pages, my answer is yeah why chance it?
__________________
Success is going from failure to failure without a loss of enthusiasm.
Halfdeck is offline   Reply With Quote
Old 2006-04-13, 06:26 PM   #11
Linkster
NO! Im not a female - but being a dragon, I do eat them.
 
Linkster's Avatar
 
Join Date: Mar 2003
Location: Sex Delta
Posts: 5,084
Send a message via ICQ to Linkster
I do know that in the old days (LOL a few years ago) the original googlebot would definitely fall short if you had bad html like CD34 posted above - and validation was sometimes the difference between the ranking positions for some keywords.

Since that time, Google has two new bots that dont seem to be "as affected" but for the minimum time it takes to validate (something Im real bad about) it would seem like a silly thing not to do - let alone making sure that the html will render well for a firefox user vs an IE user
__________________
Pussy Chompers
Porn Links
NSCash
Linkster is offline   Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT -4. The time now is 06:52 AM.


Mark Read
Powered by vBulletin® Version 3.8.1
Copyright ©2000 - 2025, Jelsoft Enterprises Ltd.
© Greenguy Marketing Inc