navigation
a webmaster learning community
     Home    Register     Search      Help      Login    
Sponsors

Shopping Cart Software
Ecommerce software integrated into Frontpage, Dreamweaver and Golive templates. No monthly fees and available in ASP and PHP versions.

Website Templates
We also have a wide selection of Dreamweaver, Expression Web and Frontpage templates as well as webmaster tools and CSS layouts.

Frontpage website templates
Creative Website Templates for FrontPage, Dreamweaver, Flash, SwishMax

Search Forums
 

Advanced search
Recent Posts

 Todays Posts
 Most Active posts
 Posts since last visit
 My Recent Posts
 Mark posts read

Microsoft MVP

 

Robot.txt file

 
View related threads: (in this forum | in all forums)

Logged in as: Guest
Users viewing this topic: none
Printable Version 

All Forums >> Web Development >> Search Engine Optimization and Web Business >> Robot.txt file
Page: [1]
 
oraclewiz

 

Posts: 39
Joined: 10/31/2003
Status: offline

 
Robot.txt file - 2/28/2006 15:35:56   
Alexa tells me that you should definitely have a robot.txt file in your root directory. They specify:
When a search engine crawler comes to your site, it will look for a special file on your site. That file is called robots.txt and it tells the search engine spider, which Web pages of your site should be indexed and which Web pages should be ignored. If its missing you may be not indexed at all.

The robots.txt file is a simple text file.(no HTML),
( User-agent: *
Disallow:)
that must be placed in your root directory, for example:

http://www.yourwebsite.com/robots.txt
Since I am newbe and file illierate, I'm not sure what this means. Is this my root directory file C:/MyWebs/oraclewiz
and does if have to be the first file in the hierarchy??
Using FP 2000 should I create another page and put this text in it and upload to GoDaddy my provider. And does if have to be the first file in the hierarchy??


Reflect

 

Posts: 4769
From: USA
Status: offline

 
RE: Robot.txt file - 2/28/2006 16:18:30   
Hi,

Do not use a wysiwyg editor to make this file, use notepad.

Once created it goes in the root of your web site (where your index page resides). FP publish is OK to push it out with. As for hierarchy, don't worry about it.

Take care,

Brian

_____________________________


(in reply to oraclewiz)
coreybryant

 

Posts: 2422
Joined: 3/17/2002
From: Castle Rock CO USA
Status: offline

 
RE: Robot.txt file - 2/28/2006 17:44:20   
Your root is basically where your index.html file is (your home page of your website). Good robots will follow them, bad robots will not.

It does not have to be the first file - chances are it will not be - but the name does have to be robots.txt

_____________________________

Corey R. Bryant
Merchant Accounts | Toll Free Numbers | My Blog | Expression Web Blog

(in reply to Reflect)
Mojo

 

Posts: 2431
From: Chicago
Status: offline

 
RE: Robot.txt file - 2/28/2006 18:53:56   
quote:

If its missing you may be not indexed at all


Not true. Missing this file has zero impact on wether or not your site will be indexed.

_____________________________

Split Testing
Chicago Order Fulfillment
Emergency Kits

(in reply to coreybryant)
Peppergal

 

Posts: 2204
Joined: 9/20/2002
Status: offline

 
RE: Robot.txt file - 3/23/2006 22:25:35   
Here's a dumb question.

If I have a few pages in the main directory that I don't want indexed, and I just want to use the robots.txt file instead of <meta> tags on each of those pages, do I just use /page.html ? or do I have to have the entire URL?

or should I use the meta tags?

_____________________________

Northeast PA / Poconos/ Lake Wallenpaupack Real Estate
wallenpaupacklakeproperty.com
Karen's Real Estate Blog

(in reply to Mojo)
womble

 

Posts: 5526
Joined: 3/14/2005
From: Living on the edge
Status: offline

 
RE: Robot.txt file - 3/24/2006 5:30:53   
For a single page I'd use meta tags and use the robots.txt for if you want to block whole directories.

_____________________________

~~ "A cruel god ain't no god at all" ~~
:)

(in reply to Peppergal)
Reflect

 

Posts: 4769
From: USA
Status: offline

 
RE: Robot.txt file - 3/24/2006 9:06:05   
Peppergal,

robots.txt is also used to disallow individual pages. I use it to block individual pages all the time. Your syntax is correct also. I also use on the page the no index no follow meta just to hedge my bets :). I then make sure to leave the page out of my google sitemap and my normal site map.

Take care,

Brian

_____________________________


(in reply to womble)
Peppergal

 

Posts: 2204
Joined: 9/20/2002
Status: offline

 
RE: Robot.txt file - 3/24/2006 11:13:01   
google site map vs. normal site map?:) I didn't know there was a difference! I've been away from frontpagewebmaster too long!!

I can't leave the page(s) out of the site map, as it would be something the clients may need (legal document, Consumer Notice, as well as a contact form.) They just don't need to be indexed in the search engines - as just about every other real estate website must have them, by PA law. I doubt that even the clients read them [consumer notice], but they must be there.

_____________________________

Northeast PA / Poconos/ Lake Wallenpaupack Real Estate
wallenpaupacklakeproperty.com
Karen's Real Estate Blog

(in reply to Reflect)
Reflect

 

Posts: 4769
From: USA
Status: offline

 
RE: Robot.txt file - 3/27/2006 10:56:31   
Google sitemap explained (It's sort of new so don't sweat not knowing about it)....

http://www.google.com/webmasters/sitemaps/login

Take care,

Brian

_____________________________


(in reply to Peppergal)
Page:   [1]

All Forums >> Web Development >> Search Engine Optimization and Web Business >> Robot.txt file
Page: [1]
Jump to: 1





New Messages No New Messages
Hot Topic w/ New Messages Hot Topic w/o New Messages
Locked w/ New Messages Locked w/o New Messages
 Post New Thread
 Reply to Message
 Post New Poll
 Submit Vote
 Delete My Own Post
 Delete My Own Thread
 Rate Posts