![]() |
|
Welcome to the Computer Webmaster Gaming Console Graphics Forum forums. You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community you will have access to post topics, communicate privately with other members (PM), respond to polls, upload content and access many other special features. Registration is fast, simple and absolutely free so please, join our community today! If you have any problems with the registration process or your account login, please contact contact us. |
| |||||||
| HTML HTML problems, maybe you are trying to code for MySpace. Get all the HTML coding help here for your website. |
![]() |
| | LinkBack | Thread Tools | Display Modes |
| | #1 | ||
| Hi I have a folder on my site that I use to practice on, I don't want robots indexing this folder. I believe the meta tag is not as good as a robot.txt file. I would like to use a robot.txt file but... 1. What is the syntax of the line that I write to prevent access to a folder (the folder is called 'sefriendly' and it lives off the root folder which is called 'www' 2. In which folder is the robot.txt file stored? thanks David | |||
| Advertisements |
| | #2 | ||
| David Graham wrote: > Hi > I have a folder on my site that I use to practice on, I don't want > robots indexing this folder. I believe the meta tag is not as good as > a robot.txt file. I would like to use a robot.txt file but... > > 1. What is the syntax of the line that I write to prevent access to a > folder (the folder is called 'sefriendly' and it lives off the root > folder which is called 'www' User-agent: * Disallow: /sefriendly/ > 2. In which folder is the robot.txt file stored? in your root - in your case, www - folder There's lots of info at: http://www.searchengineworld.com/cgi-bin/robotcheck.cgi And a script that checks your robot.txt file -- PeterMcC If you feel that any of the above is incorrect, inappropriate or offensive in any way, please ignore it and accept my apologies. | |||
| | #3 | ||
| "PeterMcC" <peter@mccourt.org.uk> wrote in message news:uweLa.44927$xd5.2251967@stones.force9.net... > David Graham wrote: > > Hi > > I have a folder on my site that I use to practice on, I don't want > > robots indexing this folder. I believe the meta tag is not as good as > > a robot.txt file. I would like to use a robot.txt file but... > > > > 1. What is the syntax of the line that I write to prevent access to a > > folder (the folder is called 'sefriendly' and it lives off the root > > folder which is called 'www' > > User-agent: * > Disallow: /sefriendly/ > > > 2. In which folder is the robot.txt file stored? > in your root - in your case, www - folder > > There's lots of info at: > http://www.searchengineworld.com/cgi-bin/robotcheck.cgi > And a script that checks your robot.txt file Thanks for the link David | |||
| | #4 | ||
| "PeterMcC" <peter@mccourt.org.uk> wrote in message news:uweLa.44927$xd5.2251967@stones.force9.net... > David Graham wrote: > > Hi > > I have a folder on my site that I use to practice on, I don't want > > robots indexing this folder. I believe the meta tag is not as good as > > a robot.txt file. I would like to use a robot.txt file but... > > > > 1. What is the syntax of the line that I write to prevent access to a > > folder (the folder is called 'sefriendly' and it lives off the root > > folder which is called 'www' > > User-agent: * > Disallow: /sefriendly/ > I put the robot.txt file into the www folder containing the two lines above (exactly as you indicate i.e. on two lines) but I can still visit the site using IE6. I thought those two lines ban access from all UA's. I have cleared out my browsers cache in case that was what I was viewing, but that made no difference. I will read up on this subject, but could you point out were my thinking is a bit off here. Does the robot.txt file just ban spiders and not browsers? TIA David | |||
| | #5 | ||
| David Graham wrote: > "PeterMcC" <peter@mccourt.org.uk> wrote in message > news:uweLa.44927$xd5.2251967@stones.force9.net... >> David Graham wrote: >>> Hi >>> I have a folder on my site that I use to practice on, I don't want >>> robots indexing this folder. I believe the meta tag is not as good >>> as a robot.txt file. I would like to use a robot.txt file but... >>> >>> 1. What is the syntax of the line that I write to prevent access to >>> a folder (the folder is called 'sefriendly' and it lives off the >>> root folder which is called 'www' >> >> User-agent: * >> Disallow: /sefriendly/ >> > > I put the robot.txt file into the www folder containing the two lines > above (exactly as you indicate i.e. on two lines) but I can still > visit the site using IE6. I thought those two lines ban access from > all UA's. I have cleared out my browsers cache in case that was what > I was viewing, but that made no difference. I will read up on this > subject, but could you point out were my thinking is a bit off here. > Does the robot.txt file just ban spiders and not browsers? Just spiders. -- PeterMcC If you feel that any of the above is incorrect, inappropriate or offensive in any way, please ignore it and accept my apologies. | |||
| | #6 | ||
| PeterMcC wrote: > David Graham wrote: <snip> >> I put the robot.txt file into the www folder containing the two lines >> above (exactly as you indicate i.e. on two lines) but I can still >> visit the site using IE6. I thought those two lines ban access from >> all UA's. I have cleared out my browsers cache in case that was what >> I was viewing, but that made no difference. I will read up on this >> subject, but could you point out were my thinking is a bit off here. >> Does the robot.txt file just ban spiders and not browsers? > > Just spiders. BTW - if you don't have a link to a page, it won't get spidered because the spider only follows links. If you want to have links to the page but don't want it spidering or seeing by others, use .htaccess to password protect the directory that holds the page. HTH -- PeterMcC If you feel that any of the above is incorrect, inappropriate or offensive in any way, please ignore it and accept my apologies. | |||
| | #7 | ||
| In article <U%gLa.1981$sh4.272115@newsfep2-win.server.ntli.net>, david.graham16@ntlworld.com says... > > "PeterMcC" <peter@mccourt.org.uk> wrote in message > news:uweLa.44927$xd5.2251967@stones.force9.net... > > David Graham wrote: > > > I have a folder on my site that I use to practice on, I don't want > > > robots indexing this folder. I believe the meta tag is not as good as > > > a robot.txt file. I would like to use a robot.txt file but... .... > > User-agent: * > > Disallow: /sefriendly/ > > .... > Does the robot.txt file just ban spiders > and not browsers? > Correct. | |||
| | #8 | ||
| "PeterMcC" <peter@mccourt.org.uk> wrote in message news BhLa.44961$xd5.2257654@stones.force9.net...> PeterMcC wrote: > > David Graham wrote: > <snip> > >> I put the robot.txt file into the www folder containing the two lines > >> above (exactly as you indicate i.e. on two lines) but I can still > >> visit the site using IE6. I thought those two lines ban access from > >> all UA's. I have cleared out my browsers cache in case that was what > >> I was viewing, but that made no difference. I will read up on this > >> subject, but could you point out were my thinking is a bit off here. > >> Does the robot.txt file just ban spiders and not browsers? > > > > Just spiders. > > BTW - if you don't have a link to a page, it won't get spidered because the > spider only follows links. > > If you want to have links to the page but don't want it spidering or seeing > by others, use .htaccess to password protect the directory that holds the > page. > > HTH > -- > PeterMcC > If you feel that any of the above is incorrect, > inappropriate or offensive in any way, > please ignore it and accept my apologies. Thanks for the help. I have one more question. Google indexed one of my practice sites, before I had a chance to use a robot.txt file. Do you know how long it will be before Google deletes the cached version of this site which I never intended to be indexed. The reason I ask is because the unwanted site is competing in the search results with the site which I want to be indexed (the unwanted site is doing better than the wanted site - I have not yet got round to making my main site more optimised for search engines) TIA David | |||
| | #9 | ||
| "David Graham" <david.graham16@ntlworld.com> wrote in message news:n6eLa.339$n51.25637@newsfep2-gui.server.ntli.net... > Hi > I have a folder on my site that I use to practice on, I don't want robots > indexing this folder. I believe the meta tag is not as good as a robot.txt > file. I would like to use a robot.txt file but... > > 1. What is the syntax of the line that I write to prevent access to a folder > (the folder is called 'sefriendly' and it lives off the root folder which is > called 'www' > > 2. In which folder is the robot.txt file stored? > > thanks > > David > the file should be called robots.txt rather than robot.txt else it won't keep any spiders out ~ Denise | |||
| | #10 | ||
| "Denise Enck" <encksofeverett@earthlink.net> wrote in message news:tQiLa.69023$Io.6473682@newsread2.prod.itd.ear thlink.net... > "David Graham" <david.graham16@ntlworld.com> wrote in message > news:n6eLa.339$n51.25637@newsfep2-gui.server.ntli.net... > > Hi > > I have a folder on my site that I use to practice on, I don't want robots > > indexing this folder. I believe the meta tag is not as good as a robot.txt > > file. I would like to use a robot.txt file but... > > > > 1. What is the syntax of the line that I write to prevent access to a > folder > > (the folder is called 'sefriendly' and it lives off the root folder which > is > > called 'www' > > > > 2. In which folder is the robot.txt file stored? > > > > thanks > > > > David > > > > > the file should be called robots.txt rather than robot.txt else it won't > keep any spiders out ~ > > Denise > Thanks loads - didn't know it had to have the the 's' on the name David | |||
| Featured Websites | ||||
|
![]() |
| Tags: robottxt |
| Currently Active Users Viewing This Thread: 1 (0 members and 1 guests) | |
| Thread Tools | |
| Display Modes | |
| |
Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Robot computers these are crazy | MadKad | Computer Systems | 6 | 04-15-2008 3:29 PM |
| Climbatron Window Climbing Robot - New in Box | Tom Martinello | Ebay Technical Questions | 0 | 05-31-2007 1:13 AM |
| Featured Websites | ||||
|