Computer Webmaster Gaming Console Graphics Forum

Welcome to the Computer Webmaster Gaming Console Graphics Forum forums.

You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community you will have access to post topics, communicate privately with other members (PM), respond to polls, upload content and access many other special features. Registration is fast, simple and absolutely free so please, join our community today!

If you have any problems with the registration process or your account login, please contact contact us.

MK PitStop Main Earn $25 Earn Money Posting Extras Members Blogs Image Hosting User Pages
Go Back   Computer Webmaster Gaming Console Graphics Forum > Webmaster Forum > Website Coding > HTML
Register FAQ/Rules Become A V.I.P. Member Search Today's Posts Mark Forums Read

HTML HTML problems, maybe you are trying to code for MySpace. Get all the HTML coding help here for your website.

Google
Closed Thread
 
LinkBack Thread Tools Display Modes
Old 07-01-2007, 1:46 PM   #1
David Graham
 
David Graham's Avatar
 
Posts: n/a
My Photos: (0)

Banked:
MK Cash: $

I am Worth:
MK Cash: $
Donate

Recent Blog: None

Default robot.txt

Hi
I have a folder on my site that I use to practice on, I don't want robots
indexing this folder. I believe the meta tag is not as good as a robot.txt
file. I would like to use a robot.txt file but...

1. What is the syntax of the line that I write to prevent access to a folder
(the folder is called 'sefriendly' and it lives off the root folder which is
called 'www'

2. In which folder is the robot.txt file stored?

thanks

David


 
Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Advertisements
Old 07-01-2007, 1:46 PM   #2
PeterMcC
 
PeterMcC's Avatar
 
Posts: n/a
My Photos: (0)

Banked:
MK Cash: $

I am Worth:
MK Cash: $
Donate

Recent Blog: None

Default robot.txt

David Graham wrote:
> Hi
> I have a folder on my site that I use to practice on, I don't want
> robots indexing this folder. I believe the meta tag is not as good as
> a robot.txt file. I would like to use a robot.txt file but...
>
> 1. What is the syntax of the line that I write to prevent access to a
> folder (the folder is called 'sefriendly' and it lives off the root
> folder which is called 'www'


User-agent: *
Disallow: /sefriendly/

> 2. In which folder is the robot.txt file stored?

in your root - in your case, www - folder

There's lots of info at:
http://www.searchengineworld.com/cgi-bin/robotcheck.cgi
And a script that checks your robot.txt file

--
PeterMcC
If you feel that any of the above is incorrect,
inappropriate or offensive in any way,
please ignore it and accept my apologies.
 
Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Old 07-01-2007, 1:47 PM   #3
David Graham
 
David Graham's Avatar
 
Posts: n/a
My Photos: (0)

Banked:
MK Cash: $

I am Worth:
MK Cash: $
Donate

Recent Blog: None

Default robot.txt


"PeterMcC" <peter@mccourt.org.uk> wrote in message
news:uweLa.44927$xd5.2251967@stones.force9.net...
> David Graham wrote:
> > Hi
> > I have a folder on my site that I use to practice on, I don't want
> > robots indexing this folder. I believe the meta tag is not as good as
> > a robot.txt file. I would like to use a robot.txt file but...
> >
> > 1. What is the syntax of the line that I write to prevent access to a
> > folder (the folder is called 'sefriendly' and it lives off the root
> > folder which is called 'www'

>
> User-agent: *
> Disallow: /sefriendly/
>
> > 2. In which folder is the robot.txt file stored?

> in your root - in your case, www - folder
>
> There's lots of info at:
> http://www.searchengineworld.com/cgi-bin/robotcheck.cgi
> And a script that checks your robot.txt file


Thanks for the link

David


 
Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Old 07-01-2007, 1:47 PM   #4
David Graham
 
David Graham's Avatar
 
Posts: n/a
My Photos: (0)

Banked:
MK Cash: $

I am Worth:
MK Cash: $
Donate

Recent Blog: None

Default robot.txt


"PeterMcC" <peter@mccourt.org.uk> wrote in message
news:uweLa.44927$xd5.2251967@stones.force9.net...
> David Graham wrote:
> > Hi
> > I have a folder on my site that I use to practice on, I don't want
> > robots indexing this folder. I believe the meta tag is not as good as
> > a robot.txt file. I would like to use a robot.txt file but...
> >
> > 1. What is the syntax of the line that I write to prevent access to a
> > folder (the folder is called 'sefriendly' and it lives off the root
> > folder which is called 'www'

>
> User-agent: *
> Disallow: /sefriendly/
>


I put the robot.txt file into the www folder containing the two lines above
(exactly as you indicate i.e. on two lines) but I can still visit the site
using IE6. I thought those two lines ban access from all UA's. I have
cleared out my browsers cache in case that was what I was viewing, but that
made no difference. I will read up on this subject, but could you point out
were my thinking is a bit off here. Does the robot.txt file just ban spiders
and not browsers?

TIA
David


 
Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Old 07-01-2007, 1:47 PM   #5
PeterMcC
 
PeterMcC's Avatar
 
Posts: n/a
My Photos: (0)

Banked:
MK Cash: $

I am Worth:
MK Cash: $
Donate

Recent Blog: None

Default robot.txt

David Graham wrote:
> "PeterMcC" <peter@mccourt.org.uk> wrote in message
> news:uweLa.44927$xd5.2251967@stones.force9.net...
>> David Graham wrote:
>>> Hi
>>> I have a folder on my site that I use to practice on, I don't want
>>> robots indexing this folder. I believe the meta tag is not as good
>>> as a robot.txt file. I would like to use a robot.txt file but...
>>>
>>> 1. What is the syntax of the line that I write to prevent access to
>>> a folder (the folder is called 'sefriendly' and it lives off the
>>> root folder which is called 'www'

>>
>> User-agent: *
>> Disallow: /sefriendly/
>>

>
> I put the robot.txt file into the www folder containing the two lines
> above (exactly as you indicate i.e. on two lines) but I can still
> visit the site using IE6. I thought those two lines ban access from
> all UA's. I have cleared out my browsers cache in case that was what
> I was viewing, but that made no difference. I will read up on this
> subject, but could you point out were my thinking is a bit off here.
> Does the robot.txt file just ban spiders and not browsers?


Just spiders.

--
PeterMcC
If you feel that any of the above is incorrect,
inappropriate or offensive in any way,
please ignore it and accept my apologies.
 
Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Old 07-01-2007, 1:47 PM   #6
PeterMcC
 
PeterMcC's Avatar
 
Posts: n/a
My Photos: (0)

Banked:
MK Cash: $

I am Worth:
MK Cash: $
Donate

Recent Blog: None

Default robot.txt

PeterMcC wrote:
> David Graham wrote:

<snip>
>> I put the robot.txt file into the www folder containing the two lines
>> above (exactly as you indicate i.e. on two lines) but I can still
>> visit the site using IE6. I thought those two lines ban access from
>> all UA's. I have cleared out my browsers cache in case that was what
>> I was viewing, but that made no difference. I will read up on this
>> subject, but could you point out were my thinking is a bit off here.
>> Does the robot.txt file just ban spiders and not browsers?

>
> Just spiders.


BTW - if you don't have a link to a page, it won't get spidered because the
spider only follows links.

If you want to have links to the page but don't want it spidering or seeing
by others, use .htaccess to password protect the directory that holds the
page.

HTH
--
PeterMcC
If you feel that any of the above is incorrect,
inappropriate or offensive in any way,
please ignore it and accept my apologies.

 
Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Old 07-01-2007, 1:47 PM   #7
Jacqui or (maybe) Pete
 
Jacqui or (maybe) Pete's Avatar
 
Posts: n/a
My Photos: (0)

Banked:
MK Cash: $

I am Worth:
MK Cash: $
Donate

Recent Blog: None

Default robot.txt

In article <U%gLa.1981$sh4.272115@newsfep2-win.server.ntli.net>,
david.graham16@ntlworld.com says...
>
> "PeterMcC" <peter@mccourt.org.uk> wrote in message
> news:uweLa.44927$xd5.2251967@stones.force9.net...
> > David Graham wrote:


> > > I have a folder on my site that I use to practice on, I don't want
> > > robots indexing this folder. I believe the meta tag is not as good as
> > > a robot.txt file. I would like to use a robot.txt file but...

....
> > User-agent: *
> > Disallow: /sefriendly/
> >

....
> Does the robot.txt file just ban spiders
> and not browsers?
>

Correct.
 
Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Old 07-01-2007, 1:47 PM   #8
David Graham
 
David Graham's Avatar
 
Posts: n/a
My Photos: (0)

Banked:
MK Cash: $

I am Worth:
MK Cash: $
Donate

Recent Blog: None

Default robot.txt


"PeterMcC" <peter@mccourt.org.uk> wrote in message
newsBhLa.44961$xd5.2257654@stones.force9.net...
> PeterMcC wrote:
> > David Graham wrote:

> <snip>
> >> I put the robot.txt file into the www folder containing the two lines
> >> above (exactly as you indicate i.e. on two lines) but I can still
> >> visit the site using IE6. I thought those two lines ban access from
> >> all UA's. I have cleared out my browsers cache in case that was what
> >> I was viewing, but that made no difference. I will read up on this
> >> subject, but could you point out were my thinking is a bit off here.
> >> Does the robot.txt file just ban spiders and not browsers?

> >
> > Just spiders.

>
> BTW - if you don't have a link to a page, it won't get spidered because

the
> spider only follows links.
>
> If you want to have links to the page but don't want it spidering or

seeing
> by others, use .htaccess to password protect the directory that holds the
> page.
>
> HTH
> --
> PeterMcC
> If you feel that any of the above is incorrect,
> inappropriate or offensive in any way,
> please ignore it and accept my apologies.


Thanks for the help. I have one more question. Google indexed one of my
practice sites, before I had a chance to use a robot.txt file. Do you know
how long it will be before Google deletes the cached version of this site
which I never intended to be indexed. The reason I ask is because the
unwanted site is competing in the search results with the site which I want
to be indexed (the unwanted site is doing better than the wanted site - I
have not yet got round to making my main site more optimised for search
engines)

TIA
David


 
Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Old 07-01-2007, 1:47 PM   #9
Denise Enck
 
Denise Enck's Avatar
 
Posts: n/a
My Photos: (0)

Banked:
MK Cash: $

I am Worth:
MK Cash: $
Donate

Recent Blog: None

Default robot.txt

"David Graham" <david.graham16@ntlworld.com> wrote in message
news:n6eLa.339$n51.25637@newsfep2-gui.server.ntli.net...
> Hi
> I have a folder on my site that I use to practice on, I don't want robots
> indexing this folder. I believe the meta tag is not as good as a robot.txt
> file. I would like to use a robot.txt file but...
>
> 1. What is the syntax of the line that I write to prevent access to a

folder
> (the folder is called 'sefriendly' and it lives off the root folder which

is
> called 'www'
>
> 2. In which folder is the robot.txt file stored?
>
> thanks
>
> David
>



the file should be called robots.txt rather than robot.txt else it won't
keep any spiders out ~

Denise


 
Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Old 07-01-2007, 1:47 PM   #10
David Graham
 
David Graham's Avatar
 
Posts: n/a
My Photos: (0)

Banked:
MK Cash: $

I am Worth:
MK Cash: $
Donate

Recent Blog: None

Default robot.txt


"Denise Enck" <encksofeverett@earthlink.net> wrote in message
news:tQiLa.69023$Io.6473682@newsread2.prod.itd.ear thlink.net...
> "David Graham" <david.graham16@ntlworld.com> wrote in message
> news:n6eLa.339$n51.25637@newsfep2-gui.server.ntli.net...
> > Hi
> > I have a folder on my site that I use to practice on, I don't want

robots
> > indexing this folder. I believe the meta tag is not as good as a

robot.txt
> > file. I would like to use a robot.txt file but...
> >
> > 1. What is the syntax of the line that I write to prevent access to a

> folder
> > (the folder is called 'sefriendly' and it lives off the root folder

which
> is
> > called 'www'
> >
> > 2. In which folder is the robot.txt file stored?
> >
> > thanks
> >
> > David
> >

>
>
> the file should be called robots.txt rather than robot.txt else it won't
> keep any spiders out ~
>
> Denise
>

Thanks loads - didn't know it had to have the the 's' on the name

David


 
Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Featured Websites
Free Space
Free Space
Free Space Free Space
Closed Thread
Tags:




Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On

Similar Threads
Thread Thread Starter Forum Replies Last Post
Robot computers these are crazy MadKad Computer Systems 6 04-15-2008 3:29 PM
Climbatron Window Climbing Robot - New in Box Tom Martinello Ebay Technical Questions 0 05-31-2007 1:13 AM


Featured Websites




All times are GMT +1. The time now is 6:13 PM.


Powered by: vBulletin Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
LinkBacks Enabled by vBSEO 3.0.0
Cheap Computers
MK PitStop Copyright 2005 - 2008

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98