|
2005-10-06, 09:22 AM | #1 |
Shut up brain, or I'll stab you with a Q-tip!
|
Robots.txt & freesites ?
What's your opinion on this one?
I was checking broken links today and found out that the site below has a robots.txt file that disallows SE spiders from seeing these pages. http://www.titfuckingxxx.com/free/hu...y-fucking/xxx/ Personally, I would like to see freesites submitted to my sites listed in SEs as well, so I deleted this one. |
2005-10-06, 09:28 AM | #2 |
NO! Im not a female - but being a dragon, I do eat them.
|
I guess you just have to decide if you are running a Linklist or you are trying to use people to get rankings in search engines - since they are two different things
|
2005-10-06, 09:29 AM | #3 |
Took the hint.
|
Some people do that to avoid duplicate page problems. People who build doorways are getting wise to the fact that Google doesn't like them (making me feel better about asking to always be on an index.html page and not on index9898978.html). Obviously a robots.txt that blocks SEs from spidering the page limits your likely long-term traffic, and that does change the balance between incoming and outgoing traffic potential.
I can understand why they might do it, and I can understand you removing them from your list. Alex |
2005-10-06, 01:36 PM | #4 |
Banned
Join Date: Oct 2003
Location: About to be evicted!!!!
Posts: 4,082
|
Wolfie, what RawAlex says about why robots.txt excludes these gateways is true. However, what he does not point out is that it is to your advantage if they do this: if the site gets flagged as SE spam and a link to you is on one of the flagged pages, you are likely to lose PR, and if you are found on a lot of pages the SEs think are spam, you may even get blacklisted by the engines.
But WRT: "...so I deleted this one" - Did you email the submitter first and ask him why he did this? Because if you did and he did not reply, then fine, but if you didn't then you are a cheater, stealing bandwidth from your submitters, plain and simple. Also, what the hell were you doing poking around in someone's server looking at non-public files (like robots.txt)? That is hacking, friend, and about as acceptable as fucking your kid sister. (In the UK it is an imprisonable offence, and I think the same applies to the US.) |
2005-10-06, 05:24 PM | #5 |
Well you know boys, a nuclear reactor is a lot like women. You just have to read the manual and press the right button
Join Date: Nov 2003
Posts: 157
|
Hi wolfie, nice initials. As has already been pointed out in this thread, I disallowed those mirror pages to prevent Google, and other search engines, from indexing them.
Generally, I don't list mirror pages on my own link list and I wanted to see who else sent me any traffic, so that I may give them recips on my future free sites. You weren't one on the list. I'll make sure not to submit my sites to porn-hawk in the future. JK
__________________
To alcohol! The cause of, and solution to, all of life’s problems |
2005-10-06, 05:58 PM | #6 |
Trying is the first step towards failure
|
I have a question, and please pardon my ignorance...
Why not just use a noindex in the META tag on mirror pages? Ben |
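For readers unfamiliar with the option Ben mentions: the robots META tag is a per-page directive in the HTML head, and a crawler has to fetch the page to see it, whereas robots.txt asks crawlers not to fetch the page at all. A rough Python sketch of how one might check a page for it follows; the class name `RobotsMetaParser`, the helper `has_noindex`, and the sample HTML are made up for illustration and are not anything posted in this thread.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag and attribute names in lowercase.
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() != "robots":
            return
        for directive in attr.get("content", "").split(","):
            directive = directive.strip().lower()
            if directive:
                self.directives.add(directive)


def has_noindex(html: str) -> bool:
    """Return True if the page carries a robots META tag with noindex."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return "noindex" in parser.directives


if __name__ == "__main__":
    # Hypothetical mirror page, not taken from any site in this thread.
    page = '<html><head><meta name="robots" content="noindex, follow"></head></html>'
    print(has_noindex(page))
```

The tag has to be added to every mirror page individually, which is the trade-off the next reply picks up on.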
2005-10-06, 06:37 PM | #7 |
Well you know boys, a nuclear reactor is a lot like women. You just have to read the manual and press the right button
Join Date: Nov 2003
Posts: 157
|
Why, when I can disallow the entire folder with one file?
I don't really see the difference, or the relevance.
__________________
To alcohol! The cause of, and solution to, all of life’s problems |
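A minimal sketch of the "one file for the whole folder" approach described above, assuming a made-up folder name `/mirrors/` and the placeholder domain `example.com` (the actual file from this thread is not shown anywhere in it). It uses Python's standard `urllib.robotparser` to show which URLs such a rule would block for a well-behaved crawler.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: a single rule keeps all crawlers out of one folder.
ROBOTS_TXT = """\
User-agent: *
Disallow: /mirrors/
"""


def is_allowed(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # A page inside the disallowed folder is blocked...
    print(is_allowed(ROBOTS_TXT, "http://example.com/mirrors/index2.html", "Googlebot"))
    # ...while pages elsewhere on the site stay crawlable.
    print(is_allowed(ROBOTS_TXT, "http://example.com/index.html", "Googlebot"))
```

Note that robots.txt is only a request: it relies on the crawler choosing to honor it, and it does nothing to stop a person or a badly behaved bot from requesting the pages.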
2005-10-06, 07:26 PM | #8 |
Heh Heh Heh! Lisa! Vampires are make believe, just like elves and gremlins and eskimos!
Join Date: Mar 2005
Posts: 70
|
Google won't even list duplicated sites if they are on the same domain on the same host, so you're not losing traffic |viking|
__________________
Magnoody |
2005-10-07, 01:42 AM | #9 |
You can now put whatever you want in this space :)
|
I personally never thought of it before, but the robots.txt thing is a nice idea. Like Magnoody said, with duplicate content pages, you won't be getting SE traffic or PR even if there was no robots.txt there. If more LLs allowed people to link multiple index.htmls to one main.html, I wonder if this would ever be a problem.
__________________
Success is going from failure to failure without a loss of enthusiasm. |
2005-10-07, 01:46 AM | #10 | |
I'm the only guy in the world who has to wake up to have a nightmare
Join Date: Feb 2004
Location: London, United Kingdom
Posts: 1,895
|
Quote:
|
|
2006-02-19, 03:03 PM | #11 |
Shut up brain, or I'll stab you with a Q-tip!
|
Thanks to the board search feature, I found that people had replied to this.
Yeah, I see things a different way now. Thanks for replying and being open-minded. Link popularity was the thing I was thinking about back then, but like Linkster said, it is not everything. Content will keep surfers coming back and attract new surfers, so I will also list mirror pages, as I always have. If I remember correctly, I was looking at that domain more closely because the bot couldn't connect to it. To be honest, I had no idea that looking at a robots.txt file is a crime, and I'm not that sure about it now either. If SE bots can read it, why would a human eye not be allowed to, on a site whose link was submitted... Anyway, this was the only time I did it. I haven't had any reason to do it again. ecchi, thanks for your input also, but people decline freesites for very different reasons, including if they simply don't like the site. So would that make those people cheaters..? And I'm not the only one who doesn't always send decline messages or questions. So, to answer your question, I didn't send an email about this. JUST A NOTE! Neither Porn-Hawk.com reviewers nor I "fuck with your servers", cheat, or steal anything. Every company I have done business with has said only good things and been happy. I respect my business partners, and that's probably why they like me too. |
2006-02-19, 03:24 PM | #12 | |
I'm normally not a praying man, but if you're up there, please save me Superman!
|
Quote:
__________________
The tendency is to push it as far as you can -- Fear and Loathing In Las Vegas |
|
2006-02-19, 03:58 PM | #13 |
Vagabond
|
We need to get a new little smiley
A Roman Maroni on a big horse with a sword in his hand looking all big and mighty. |
2006-02-19, 04:07 PM | #14 |
Shut up brain, or I'll stab you with a Q-tip!
|
I feel more like..
lol |
2006-02-19, 10:51 PM | #15 |
Banned
Join Date: Nov 2005
Location: ARIZONA - INDIANA
Posts: 101
|
Some FYI - (from hard learned experience)
Any file on any server is Public Domain (US). That doesn't mean you or your server needs to actually exist physically in the US. It means if your files can be accessed from the US, your files are 'fair game' to US users. I have no idea about Canada, UK, Aus, etc. laws. It's sorta like a newspaper article. If it's published in the public domain then it is truly public. Including any supporting documentation (files). That could mean .js or .css or .txt or whatever. |
2006-02-19, 11:57 PM | #16 | ||
Banned
Join Date: Oct 2003
Location: About to be evicted!!!!
Posts: 4,082
|
Quote:
So either: It says in your rules that robots.txt files must not be used to stop indexing. Or you did list this site and were only spoofing us when you said "so I deleted this one". Or yes, you are a cheater. Quote:
|
||
2006-02-20, 12:07 AM | #17 | |
Banned
Join Date: Oct 2003
Location: About to be evicted!!!!
Posts: 4,082
|
Quote:
2. Even if it were not, saying "That doesn't mean you or your server needs to actually exist physically in the US" is also bollocks. If you do something that is illegal in the country I or my server exists in, I can sue you in that country. If I can convince the police to take an interest in that country, then they can apply for your extradition to stand trial in the country that I or my server exists in. In cases like this the US usually agree to extradition. 3. If you have an opinion on a legal matter like this, please always check your facts before making a post that someone might believe, and end up getting arrested for following. It is alright posting bollocks if it does not matter, but it is not alright if you are going to get someone even greener than you into trouble. |
|
2006-02-20, 12:14 AM | #18 |
Took the hint.
|
Ecchi, sorry, but robots.txt is an open and public file. No hack or trespass occurs in accessing the file.
There is a difference, however, between "public domain" and "free to be viewed by the public". People often confuse the terms. Any file not requiring a password on your server that can be accessed in a normal manner is open for public view. However, it isn't public domain, which would suggest that others could use it for commercial purposes without permission, resell it, etc. That isn't the case here. Alex |
2006-02-20, 04:35 AM | #19 | ||
Shut up brain, or I'll stab you with a Q-tip!
|
I see your point, but LL owners can't list all possible rules. Using common sense is mostly enough to get listed. And reviewers also make errors.
Besides, I hadn't thought about the way robots.txt can be used before I read this thread. Now I see a clear reason why it was used. We don't all get everything from mom's milk! Quote:
Quote:
|
||
2006-02-20, 08:06 AM | #20 | |
You can now put whatever you want in this space :)
|
Quote:
After considering the robots.txt idea a ways back, I kind of thought that people who weren't really up on its reason could get freaked out, so I passed. BUT, I have begun using your second idea and haven't had any bad feedback. Everybody wants an index page recip, so just give it to them. Obviously everybody can't be with penisbot or jays, so I've arranged the folders to refer back to a "pages" folder, all within the same particular free site sub folder. I'm very careful to change keywords, titles, content tags, and the warning page text on each index page so no duplicates exist. This way I can watch traffic and see which recips really deliver the best, and it gives me the chance to keep people together who like to be together. Maybe I'll get more listings, and I understand that getting listings and PR in Google is important for every linklist too, not just direct traffic, so I've done this so that I and they can benefit too. IMHO, as a new linklist owner and a long-time self-employed person, I think it is just downright impolite (I can't say unprofessional because it is too common) not to notify a submitter of your successful relationship. I think unless you are one of the top 20 LLs it also helps to build goodwill with the submitters. Re: Hacking. I'm not a lawyer of course, but I'm pretty sure that to qualify as hacking the owner has to make an effort to put some barrier or security on the file, or at minimum perhaps provide a notice, and the user has to make an effort to circumvent this barrier (the barrier indicates intention of privacy). Something that is freely accessible by typing in a URL like www.domain/robots.txt probably wouldn't rise to that level. I may be wrong though.
__________________
Submit Free Sites, Blogs, Movies, TGP's, Triple XXX Info |
|
2006-02-20, 08:30 AM | #21 | |
The Original Greenguy (Est'd 1996) & AVN HOF Member - I Crop Pics For Thumbs In My Sleep
|
Quote:
|
|
2006-02-20, 08:40 AM | #22 | |
That which does not kill us, will try, try again.
|
Coming Out
Quote:
My name is Simon ... and evidently I am a bad man. No, I never had a kid sister, or an older sister for that matter, so it's hard to be sure what kind of a relationship we might have had. But .. whenever I see something interesting on a website, I almost always hit the command keys to pop the source code to see how it was done. Often, if it was done by Javascript or CSS, I'll look for the link to the external JS or CSS file, and open that in my browser too. I've looked at all kinds of readily available supporting files and page source codes this way. I've learned a lot by reading these Javascript, CSS, robots.txt, and other files. Mostly I've used the information to code better pages of my own. So, you may as well mark me down on the list of people who "peep" behind the scenes of sites. Yes, I do it, I've done it for a long time, and I'll most likely keep on doing it. Simon -- a lawyer's job is to protect his client from other lawyers
__________________
"If you're happy and you know it, think again." -- Guru Pitka |
|
2006-02-20, 01:02 PM | #23 | ||
Banned
Join Date: Oct 2003
Location: About to be evicted!!!!
Posts: 4,082
|
Quote:
Quote:
If someone goes out and leaves their front door open, that does not mean you are allowed to wander in. That is still trespassing. It is the same with computers: just because "the door is open", you cannot just wander in and poke around. Personally I have no problem with anyone looking into files on an open server if they want to take the risk. But I do have a problem with people posting on any open board that it is OK to do this. Because some green newbie will read the post, believe it, do that, and end up in jail. It is simply not fair to post misinformation if that misinformation is likely to get someone else in trouble. |
||
2006-02-20, 01:16 PM | #24 | |
The Original Greenguy (Est'd 1996) & AVN HOF Member - I Crop Pics For Thumbs In My Sleep
|
Quote:
Maybe you should tell Google: http://www.google.com/search?sourcei...q=robots%2Etxt |
|
2006-02-20, 01:49 PM | #25 | |
Banned
Join Date: Oct 2003
Location: About to be evicted!!!!
Posts: 4,082
|
Quote:
A few years ago Google were successfully sued for giving links to a non-passworded but non-public HTML page. If someone as big as Google, with the money for real legal muscle, could not win, what hope is there for the rest of us? |
|
|
|