SPAM and archives (Was: Re: [plug] PHP question..)
Sol
sol at autonomon.net
Thu Jul 3 12:07:35 WST 2003
This might sound like a stupid question, but I'd love to know the answer. As I
understand it, the spiders that spammers use to harvest email addresses from
web pages use either hrefs from other sites or start at index pages and use
hrefs from there. If I have a page full of email addresses at
http://www.somedomain.com/somedir/somedir/emailaddresses.html
and there are no hrefs pointing to this page then no spider is going to find
it. Is this correct?
If it is it might be a simple solution for the plug archives and illegal hrefs
which spam spiders don't read ie:
wwwDOTcantechDOTnetDOTau/somedir/archivesDOThtml
Another possibility might be to use some configuration in the archive software
(or another archiving application) that interpolates email addresses. I've
seen this quite a bit on FLOSS web site mailing lists. ie:
sol AT autonomon DOT net
These are just some thoughts. I'm not that familiar with the technical
aspects, nor am I the poor so'n'so that would have to implement it. I'm not
complaining myself as it doesn't seem to be effecting me. Just thought I'd
offer some ideas.
sol
On Thursday 03 July 2003 11:41, Onno Benschop wrote:
> On Thu, 2003-07-03 at 11:34, Paul Wilson wrote:
> > Onno Benschop wrote:
> > > The solution is to stop SPAM in the first place.
> >
> > How do you propose to do this?
>
> I am not proposing to stop this, I am proposing to start making in
> harder by not archiving the email addresses on the PLUG archive.
>
> Is this a drop in the ocean?
>
> Sure, but right now I'm getting mostly wet from the PLUG archive.
>
> Onno Benschop
>
> Connected via Optus B3 from S33:37'33" - E115:07'30" (Dunsborough, WA)
--
==============================
Sol Hanna
sol at autonomon.net
More information about the plug
mailing list