Windows Vista Tips

Windows Vista Tips > Newsgroups > Windows Vista General Discussion > How to index HTML files locally even with ROBOTS noindex?

Reply
Thread Tools Display Modes

How to index HTML files locally even with ROBOTS noindex?

 
 
YMA
Guest
Posts: n/a

 
      09-14-2007
I have a local mirror copy of the Web sites I manage. Some of the HTML pages
I don't want to be indexed by Web spiders / robots, so I put the ROBOTS
meta-tag with "noindex" in them.

However, I would like those files to be indexed locally, so that I can find
things in them with the local Windows indexed search function. But the
Windows HTML filter intentionally does NOT index files with ROBOTS noindex,
so I don't get those files in my local searches.

Is there a way to tell the HTML filter to go ahead and index HTML files even
if they have the ROBOTS noindex meta-tag? I want my local and remote copies
to be indentical, so I don't want to have ROBOTS index locally and ROBOTS
noindex remotely.

Anybody else run into that problem? Anyone has a solution?

Thanks!

YMA
 
Reply With Quote
 
 
 
 
Synapse Syndrome
Guest
Posts: n/a

 
      09-14-2007
"YMA" <> wrote in message
news:542B26AA-FF01-4F27-854D-...
>I have a local mirror copy of the Web sites I manage. Some of the HTML
>pages
> I don't want to be indexed by Web spiders / robots, so I put the ROBOTS
> meta-tag with "noindex" in them.
>
> However, I would like those files to be indexed locally, so that I can
> find
> things in them with the local Windows indexed search function. But the
> Windows HTML filter intentionally does NOT index files with ROBOTS
> noindex,
> so I don't get those files in my local searches.
>
> Is there a way to tell the HTML filter to go ahead and index HTML files
> even
> if they have the ROBOTS noindex meta-tag? I want my local and remote
> copies
> to be indentical, so I don't want to have ROBOTS index locally and ROBOTS
> noindex remotely.
>
> Anybody else run into that problem? Anyone has a solution?



I just put a robots.txt file in the root folder of the website instead. I
do not know of the metatag, but maybe the text file is more flexible, as you
can define which folders the spiders can index or not.

Loads more info here:
http://www.google.co.uk/search?sourc...q=robots%2etxt

ss.


 
Reply With Quote
 
Synapse Syndrome
Guest
Posts: n/a

 
      09-14-2007
"Synapse Syndrome" <> wrote in message
news:...
>
> I just put a robots.txt file in the root folder of the website instead. I
> do not know of the metatag, but maybe the text file is more flexible, as
> you can define which folders the spiders can index or not.
>
> Loads more info here:
> http://www.google.co.uk/search?sourc...q=robots%2etxt
>



Also, it says that not all spiders listen to the metatag, according to this
page:
http://www.robotstxt.org/wc/exclusion.html#meta

ss.


 
Reply With Quote
 
YMA
Guest
Posts: n/a

 
      09-14-2007
Thanks for your answer, but I do not have access to the root folder of my
websites (with just one exception). So, I really need to be able to tweak the
local HTML filter on my machine...

BTW, I am aware of the limitations of the ROBOTS noindex meta-tag, but I can
live with them.

YMA

"Synapse Syndrome" wrote:

> "Synapse Syndrome" <> wrote in message
> news:...
> >
> > I just put a robots.txt file in the root folder of the website instead. I
> > do not know of the metatag, but maybe the text file is more flexible, as
> > you can define which folders the spiders can index or not.
> >
> > Loads more info here:
> > http://www.google.co.uk/search?sourc...q=robots%2etxt
> >

>
>
> Also, it says that not all spiders listen to the metatag, according to this
> page:
> http://www.robotstxt.org/wc/exclusion.html#meta
>
> ss.

 
Reply With Quote
 
 
 
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
index.html error Tresa Windows Vista General Discussion 2 12-26-2007 11:37 PM
Error Printing html files JamieE Windows Vista Printing / Faxing / Scanning 2 10-15-2007 04:14 PM
IE 7 in Vista won't load .swf files unless called from html? lforbes Windows Vista General Discussion 5 07-24-2007 06:56 AM
Desktop and HTML Files billbrandi Windows Vista General Discussion 2 06-13-2007 03:18 AM
Vista IE7 - trouble saving files locally when using IE7?! Matthew M \(UK\) Windows Vista General Discussion 6 03-15-2007 10:50 AM



1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59