They've Stolen My @ (at)
Related Email Articles
Many people place their email addresses on their Web sites or the Web sites they build. Then, as soon as the site is uploaded to the Internet, their email boxes are flooded with spam. How can you stop this? Try creating a link to your email address.
A common way to make a link to your email address is using the <a> tag:
<a href="mailto:your email address">Send an email</a>
for example:
<a href="mailto:joe@hotmail.com">Send an email</a>
When pressing this link, the email client software is opened with your email address in the form field. This is correct if the surfer is using an email client software such as Outlook or Eudora.
When people use Web mail such as gmail or hotmail, clicking a link to an email is useless. For those users, you must provide a different kind of a link to your email:
<a href="mailto:your email address">your email address</a>
for example:
<a href="mailto:joe@hotmail.com">joe@hotmail.com</a>
The Web mail users cannot USE the link, but they can see your email address, which they can copy and paste or write and type into their Web mail interface.
Specialized software called "spiders" scans the Web pages that are stored on a company's servers. The spiders are also known as "crawlers" or "knowledge-bots" or just "knowbots." Spiders surf the Internet. They are sent out by search engines such as Google, Yahoo, or MSN. Their job is to update the databases of the specific search engines that send them out.
Want a surprise? If you have Web pages that include your email address, then use your favorite search engine to do a search on your email address. The results may surprise you.
There Are Different Types of Spiders
As we've discussed, one type of spider is used to update search engine databases. Another type is used by people who are interested in collecting and using email addresses for their own purposes. These people use spiders to harvest email addresses from Web pages so they can send out advertisements or viruses or anything else that you don't want to the addresses they collect. These spider programs are called "spam bots," and the people who send them out are called "spammers." Spam bots scan Web pages exactly like the search engine spiders do, except that they look for the @ mark, or the source code "mailto:" that is used when constructing an email link.
What Does a Spider See on My Site?
Spiders don't see Web pages like you do. You see a picture of a page that is "rendered" or built by your browser. The behind-the-scenes building blocks used to create a Web page is called "source code" or "HTML code." The spider sees only the source code.
Here's an example. Let's assume that the following appears in the source code of a Web page:
<img src="imageFile.jpg" />
This is the source code to display an image file named "imageFile.jpg". Your browser reads the source code, accesses the image file, and then shows it on screen. The spider, on the other hand, only reads the source code. It doesn't try to access the actual image file. The spider sees only:
<img src="imageFile.jpg" />
In another example, the following appears in the source code:
<font color="green">Welcome</font>
When you use your browser, you see the text Welcome written in green letters: Welcome
The spider, on the other hand, doesn't see any colors. All it sees is the source code.
The source code for an email link on an HTML page looks like this:
<a href="mailto:name@domainName.com">Send an email</a>
or perhaps like this:
<a href="mailto:name@domainName.com">name@domainName.com</a>
Because HTML pages are plain text, it is easy for spiders to extract email addresses. A spider looks first for either the @ sign or the phrase "mailto:". Once the spider recognizes either of these items, it easily figures out the email address that is part of the source code. Therefore, never write your email address as part of the source code. Do not write it as a part of the src property, as a content, in the <title> tag, or in the title property. Make sure that your Web page does not contain your email address, and you will get less spam.
But, what if you still want your email address to appear on your Web page, in spite of the above? How can you prevent spiders from recognizing that it's an email address?
Note: Think about how much effort you are willing to invest in securing your email address. Remember that, in general, if you make it more difficult for the spiders to recognize your email, you will make it more difficult for end users to use your email. There is no perfect defense against spam or computer viruses. But, there are some simple anti-spam devices that can fool the spiders-and, thus, reduce spam.
Use ONE of the following methods to hide the following email address: joe@hotmail.com from the spiders.
Anti-spam Method No. 1: Add Text to the email Address
The concept: Add more characters to the email address so that the address becomes invalid. You can add one or more characters. Space is considered a character as well.
User action required: Anyone who tries to write to you via the email address will have to delete the extra characters you have added.
Spider: If everyone in the world does as you do and adds the same text as you do, then spiders will start to "understand" and will be able to figure out the real email address.
Example:
<a href="mailto:REMOVETHISjoe@hotmail.com">Send an email</a>
Clearly the address "REMOVETHISjoe@hotmail.com" doesn't exist. In order for the address to be valid and useful, the user must erase the text REMOVETHIS, which is easy.
But if everyone in the world adds REMOVETHIS, then the spiders will start to understand what is going on. To prevent this, use a variety of different phrases, for example:
<a href="mailto:joeTAKEITAWAY@hotmail.com">Send an email</a>
Another possibility:
<a href="mailto:joe-at-hotmail-dot-com">Send an email</a>
Anti-spam Method No. 2: Use Decimal Internet-safe Characters
The concept: Use the decimal equivalents of normal characters.
User action required: none.
Spider: If the spider has the required decimal/character conversion table, then it will be able to figure out the address.
The following table shows that the decimal values of the characters a to z are from 97 to 122.
| Dec |
45 |
46 |
64 |
95 |
97 |
98 |
99 |
100 |
101 |
102 |
103 |
104 |
105 |
106 |
107 |
| Char |
- |
. |
@ |
_ |
a |
b |
c |
d |
e |
f |
g |
h |
i |
j |
k |
| |
| Dec |
108 |
109 |
110 |
111 |
112 |
113 |
114 |
115 |
116 |
117 |
118 |
119 |
120 |
121 |
122 |
| Char |
l |
m |
n |
o |
p |
q |
r |
s |
t |
u |
v |
w |
x |
y |
z |
Example: using decimal equivalents, change the source code from:
<a href="mailto:joe@hotmail.com">Send an email</a>
to
The user will see Send an email in the browser window. When putting the cursor over the link, the email address can be viewed in the status bar.
As you can see, there is a mixture of regular and ASCII characters. When the user clicks on the link, everything will work fine. However, if users want to see the source code, they will find two surprises:/p>
- They will not find the @ sign.
- Special efforts will be required to decipher the code.
If the spider has the decimal/character conversion tables, then the effort to hide the address may be useless.
Anti-spam Method No. 3: Use Hexadecimal Internet-safe Characters
The concept: Use hexadecimal equivalents of normal characters.
User action required: none.
Spider: If the spider has access to hexadecimal conversion tables, then it will able to decipher the address.
The following table shows that the hexadecimal values of the characters a to z are between 61HEX and 7aHEX
| Hex |
2d |
2e |
40 |
5f |
61 |
62 |
63 |
64 |
65 |
66 |
67 |
68 |
69 |
6a |
6b |
| Char |
- |
. |
@ |
_ |
a |
b |
c |
d |
e |
f |
g |
h |
i |
j |
k |
| |
| Hex |
6c |
6d |
6e |
6f |
70 |
71 |
72 |
73 |
74 |
75 |
76 |
77 |
78 |
79 |
7a |
| Char |
l |
m |
n |
o |
p |
q |
r |
s |
t |
u |
v |
w |
x |
y |
z |
Example: using hexadecimal equivalents of characters, change the source code from:
<a href="mailto:joe@hotmail.com">Send an email</a>
to
Users will see Send an email in the browser window. When putting the cursor over the link, they will see the email address in the status bar.
When clicking on the link, everything will work fine. But, if the users want to see the source code, they will find two surprises:
- They will not find the @ sign.
- They will have difficulties in translating the hexadecimal numbers into English letters.
If the spider has hexadecimal conversion tables, then the effort will have been useless.
Anti-spam Method No. 4: Use an Image of the email Link
The concept: Instead of using the email address as a hot link, the same text appears as an image-an unlinked image.
User action required: To use an image as an address, users will have to manually copy the email address from the image and type it into their email program.
Spider: Doesn't understand that this is an email address!
Example:
Similar images of email addresses can be generated at the Digital Colony
If you are worried that the spider will be able to convert the image into text by using Optical Character Recognition (OCR) software, then check out this image.
It is doubtful that this image could be converted to text by OCR software.
Creating such an image takes time and effort. Only a human eye and a human mind can discern the email address that is hidden in the graphic.
What is the source code for this image?
<img src="imags/singEnc.jpg" />
That's the whole source code. Do NOT "wrap" the image with <a href="mailto:...> </a>. If you still want the image to be a link to your email address, read method no. 5.
Anti-spam Method No. 5: Use a Program to Hide the email Address
The concept: A program, written in JavaScript, produces the HTML tags including your email address when the page is shown on the end user's computer by the browser.
User action required: none.
Spider: Should not be able to understand what the script does. However, there are already sophisticated spiders that are able to run JavaScript. These spiders may be able to figure out the email address-if it exists.
Here is a JavaScript example:
<script> a = "joe"; b = "hotmail.com"; document.write("<a href='mailto:" + a + "@" + b + "' title='open your email client'>"); document.write("email"); document.write("</a>"); </script>
Note: The line that includes "document.write" needs to be written continuously (without the need to press Enter) until the semi-colon (;). Following the semi-colon, Enter can be pressed. Pay attention: the fourth line is a long one.
The script needs to be inserted in the Web page at exactly the spot on the page that you wish to have an email link.
If you want your email address to appear but not be hot linked, then use the following code.
<script> a = "joe"; b = "hotmail.com"; document.write(a + "@" + b); </script>
The script can be improved by using encryption. But, again, if the spider is capable of running JavaScript, the encryption will be of no value.
If users ask to see the source code, they will have to locate the @ sign and then figure out the email address. This is not an easy task.
Anti-spam Method No. 6: Combine Methods No. 4 and No. 5
The concept: Combine method no. 4, in which the email address appears as an image, with method no. 5, which uses JavaScript.
Example:
<script> a = "joe"; b = "hotmail.com"; document.write("<a href='mailto:" + a + "@" + b + "' title='open your email client'>"); document.write("<img src='images/signEnc.jpg' />"); document.write("</a>"); </script>
Note: The line that includes "document.write" needs to be written continuously (without the need to press Enter) until the semi-colon (;). After the semi-colon, Enter can be pressed. Pay attention: the fourth line is a long one.
After you have written the JavaScript, insert it into the Web page at exactly the place on the page where you wish to have an email link.
If you only want the address to appear without being hot linked, then use method no. 4 alone.
Anti-spam Method No. 7: Combine Data
The concept: The email address is part of a text file that is not part of the HTML page. It is inserted only when the Web page is executed (rendered) by the server. Just before sending the Web page to the client, the email address is taken from the text file and inserted into the HTML page.
User action required: none.
Web developer action required (that's you): You must host your Web site in a server with Windows Operating System.
Spider: The spider should not be able to figure out what the source code does. If the spider scans only the files ending with .html, .htm, or.asp, then all will be okay. If, on the other hand, the spider scans other types of files as well, then there potentially could be a problem, although it is possible to be more sophisticated and fool the spider.
The source code looks like this:
<object classid="clsid:333C7BC4-460F-11D0-BC04-0080C7055A83" id="ply" height="0" width="0"> <param name="DataURL" value="bla.vdr" /> <param name="UseHeader" value="True" /> </object> <a datasrc='#ply' datafld='PlayIt'>bla</a>
The text file, bla.vdr, looks like this:
PlayIt mailto:joe@hotmail.com
Note: This is a simple text file with the ending .vdr instead of .txt.
Explanation: Why did I use the ending .vdr for the text file? The reason is to lower the possibility that the spider will check that particular file. The extension .vdr is my own invention. Any other extension would work just as well. Most importantly, the extension should not be commonly known. Don't use extensions such as .pdf, .doc, or .txt that the spider might recognize and scan.
Users who want to see the source code will have two surprises:
- They won't find the @ sign.
- They will have to invest massive efforts to understand how the script connects to the text file with the .vdr ending.
Summary
Spammers use spiders to harvest email addresses from the source code that make up HTML pages.
There are a number of methods that you can use to protect your address from being harvested by spiders.
We have defined seven different anti-spam methods.
There are two main considerations in choosing an anti-spam method:
- How effectively will the anti-spam method protect the email address against spiders?
- Will the chosen anti-spam method make it harder or easier for a user to use the email link that appears on the Web page?
More articles from this pro: http://www.ArticlePros.com/author.php?Zohar Amihud
More on Computers & Internet
and
Email can be found here.
|