Searching the archive

Post Reply
Rubicon String (imported)
Articles: 0
Posts: 24
Joined: Sun Dec 28, 2008 5:38 pm

Posting Rank

Searching the archive

Post by Rubicon String (imported) »

Hello,

is it anyhow possible to search for phrases in the archive?

Regards
kristoff
Articles: 0
Posts: 4756
Joined: Sat Sep 17, 2005 5:45 pm

Posting Rank

Re: Searching the archive

Post by kristoff »

Hello,
Rubicon String (imported) wrote: Sun Jul 15, 2018 4:31 pm is it anyhow possible to search for phrases in the archive?

Regards

What we have is the Advanced Search function, located on the menu bar. That is the only thing I am aware of
Paolo
Articles: 0
Posts: 9709
Joined: Wed May 16, 2001 8:53 am

Posting Rank

Re: Searching the archive

Post by Paolo »

Upper right hand corner, just under the LOG OUT button.

As this is a site requiring a log in, I doubt the Google instructions for searching a site would work.
TopManFL (imported)
Articles: 0
Posts: 924
Joined: Mon Oct 31, 2016 10:15 am

Posting Rank

Re: Searching the archive

Post by TopManFL (imported) »

I tried the Google Site search and it works fine on the forum.

Example:

site🌐//forums.eunuch.org androcur

Leave everything before the "androcur" and replace androcur with your search term. The sub domain "forums.eunuch.org" is archived by google.

It's true that google won't index sites behind a password. But, the sub domain forums.eunuch.org doesn't require a password to read it - just to comment.

I looked for any instructions that would tell search engines to not index this site and found none.

The main page of eunuch.org request that spiders (the robots that index websites for search engines) only come back every 15 days:

<meta name="revisit-after" content="15 days" />

But, the page makes no request that robots stay away or not follow links and index the site.

On the main page there is a robots.txt file (sort of a backup to the meta data) which does request that robots and spiders not follow to the directory called "forums":

User-agent: *

Disallow:

Disallow: /cgi-bin/

Disallow: /forums/

Disallow: /personals/

But, obviously Google doesn't read that as a prohibition for archiving the sub domain. I'm not sure if Google would see "eunuch.org/forums" differently than it sees "forums.eunuch.org".

So, I looked at the meta data on the sub domain of "forums.eunuch.org" and there is this tag:

<base href="http://forums.eunuch.org/" />

If that was supposed to tell Google to look back at the meta data and robots.txt on the "base", it didn't work (I'm not familiar with that meta data tag and really not sure why it's there. Maybe to help the database find the mysql file?)

Lastly, there is no robots.txt file in the sub domain of "forums.eunuch.org" and I think that's the reason Google archives the forum.

Since the sub domain is available with a direct url entry, Google will find it and look for a robots.txt and any meta data that asks it not to index. Google will honor the request if it finds it. Not all search engines honor the request.

I'm not sure it's ever been discussed if the community wants the site indexed by Google. Then again, the horse might be out of the barn by now and locking the door would be pointless.
Paolo
Articles: 0
Posts: 9709
Joined: Wed May 16, 2001 8:53 am

Posting Rank

Re: Searching the archive

Post by Paolo »

​I'm surprised, as Google doesn't like us anyway.
Post Reply

Return to “Archive Technical Help”