Advertisement
To print: Select File and then Print from your browser's menu
-------------------------------------------------------------- This story was printed from ZDNet Australia. --------------------------------------------------------------
Son of spam: 4 spam filtering packages tested


October 24, 2003
URL: http://www.zdnet.com.au/reviews/software/internet/soa/Son-of-spam-4-spam-filtering-packages-tested/0,139023437,120280115,00.htm




 Spam filtering sofware

 Anti-spam software:

 GFI MailEssentials
 NetIQ MailMarshal
 NAI/McAfee SpamKiller
 SurfControl

 Specifications
 How we tested
 Look out for...
 Final words
 About RMIT

Son on Spam: 4 Spam Filtering Packages Tested Can you trust software to block all the spam your company receives and let all your legitimate e-mail through? We evaluate four top spam filtering packages for their accuracy.

Three months ago in the July edition of Technology & Business we compiled an overview of five anti-spam filtering applications that were available at the time. That initial review addressed the introduction and overview of spam and its concepts and also the individual usability and technical implementations of those applications. However it did not look at the actual accuracy of those individual packages in filtering e-mail.

We are therefore now re-visiting the anti-spam issue with a more results-based review. We invited the same five vendors back for a head-to-head shootout to show the packages' accuracy in filtering unwanted e-mail while keeping as much useful e-mail as possible. All vendors accepted this challenge except for Clearswift, who cited the imminent release of a new redesigned application. We hope that in the next similar accuracy review, Clearswift will be involved.

As you will see in this review, testing these packages for accuracy is a tricky business and to do so fairly and accurately took several months.

As detailed in the previous review, anti-spam filters can be set up in any number of ways, utilising black lists, white lists, and custom made rule sets. Some applications come configured with basic rules, others come as a blank slate. Some also employ quite advanced learning techniques (touted by some vendors as heuristics or Bayesian analysis).

Not so simple
When we actually sat down to work out this test and what results we could achieve that were really correct, several issues presented themselves. Spam and spammers are dynamic, constantly evolving, and are always look to develop different techniques to get past the filters and deliver their message of "lose weight by eating more" and "XXX wholesale". Therefore the tests could only be a snapshot of the particular given period in time that the test was performed (naturally with some legacy "classic" spam messages thrown in for good measure). With this in mind, we collected a static data set over a couple of weeks to ensure that we had the "latest" in the spammers' arsenal.

In addition to running tests on this set of static data, we also needed to run the software on some live e-mail data to ensure similar results were achieved by the products, given the static test data may not be filtered exactly the same as it may be in a live environment.

In order to do this, each vendor needed to have their own test rig so that the live tests could be run simultaneously. Therefore we needed a domain name, sub domain name records in the name servers, and live public IP addresses etc to setup before the testing could commence.

The human factor
The last part of the testing which is often the most difficult--certainly when you consider the rules-based nature of these applications--is the human factor. This is why we took the measure of inviting the individual vendors to send their own engineers to the Labs, to install and configure the applications on the servers.

Sure, from a basic installation and administration point of view the Labs staff could have installed and configure the rule sets for all these applications as they did in the previous review. However, this is a far cry from being an "expert" in each application.

It is one thing to do a usability test to ensure that a person with a reasonable level of technical competency can install and configure an application to get it running. That's nothing like the skill of an engineer working for the company, who creates and maintains that application, and knows of the many little nuances and tweaks needed to be applied to achieve the best possible results. Remember, these are not basic antivirus applications that you can just install and download the latest definition file. The rules on many of these filtering systems are highly complex and evolved.

This is not particularly different from how it works in the real world, anyway. Because the anti-spam market is very competitive, vendors invest a great deal in keeping their products working efficiently. For instance, some vendors run training courses for your staff on the best ways to configure their product. And for your average medium-to-large installation, it's not at all out of the ordinary to have a technician come in to help you install and configure the product.

What we looked for
We designed this test with two overall tests in mind: firstly a static or controlled test using content we had gathered over a period of time that included:

  • defined unwanted e-mail (spam),
  • unsolicited circulars/newsletters (news spam),
  • legitimate e-mails (ham), and
  • solicited circulars/newsletters (news ham).

This ran to some 1800+ items of mail that we sent to each vendor's application. This static test was run through at least twice to ensure accuracy.

The second test was a "live" test combining several real world e-mail boxes into one and then splitting that box to each of the anti-spam filtering servers that the vendors had configured. This test ran for over two weeks, and we then took several days worth of collection and manually went through each e-mail that had arrived and sorted it according to its status.

This live testing period was useful to ensure that the static testing was doing it job correctly in a controlled environment. Naturally, if any large differences occurred, then that application and the testing methodology would need to come under closer scrutiny to find out where and why the differences had occurred. One would act as basically a validation of the other--but as it turned out there were no discrepancies.

Scoring
Once we ran through the static tests, we applied scores and the total overall score achieved at the end as follows:

    +1 point for each spam, e-mail, and solicited newsletter filtered correctly

    -2 points for every unwanted spam message allowed through (false negatives)

    -3 points for every unsolicited newsletter allowed through (false negatives) and

    -5 points for every legitimate e-mail blocked incorrectly (false positives).

The rationale behind this scoring is simple: spam allowed through is an annoyance, but legitimate e-mail blocked can have very serious repercussions. Ironically, it is the false negatives that are more likely to get administrators in trouble--especially if the boss receives a pornographic spam or the like--rather than the false positives, which can be a much more serious matter. But then how are people supposed to know they didn't receive an e-mail if they didn't receive it? While newsletters may be important, we acknowledge that they are more difficult to filter correctly and therefore have less points deducted for improper handling.

Live testing
As intended, the live testing did indeed prove that the static/controlled test results were correct. The live test results basically were identical given the volume of messages sent via both methods.

Due to the very nature of live testing there are also several variables that could be introduced, which potentially are beyond our control especially the "human" factor with counting and classifying the number of messages. Naturally the live testing could only be run once.

Interestingly, the vendors who noted that their applications apply "learning" principles to their filtering did indeed sometimes record different results during the course of the static testing when the same data sets were sent through. However since the captured test data was limited to less than 2000 messages, the variation would not have been sufficient to show any great differences in the test results here. However, this is a good sign that over the course of several months and thousands of messages, these packages may well get better at learning your e-mail pattens and filter better.

With that in mind, these applications did not always produce better results when the "smarts" were activated. In a couple of cases, the results went the other way, but only by one or two messages, and we're confident that with a combination of learning and tweaking, you could improve the accuracy of filtering.

GFI MailEssentials

The first of the "smart" applications is the product provided by GFI with its Bayesian engine. Excitingly, when we started the initial static tests and sent the messages through it was only misidentifying one or two spam messages as opposed to the tens that the other applications were letting through. It looked like GFI were going to run away with the gong by miles. That was until we started counting up the false positives, and very scarily the application started canning some legitimate e-mails and not just legitimate newsletters also but normal e-mail messages. So unfortunately, when the scoring system was applied to the results this package took a big hit due to the fact that it blocked more legit e-mails than the other packages. Total score is 383 points.

The configuration and monitoring for the GFI application are contained in two separate applications, which is similar to both the NetIQ and SurfControl applications. The admin console is quite logical, and makes a difficult task easier to complete particularly when it comes to configuring quite complex rules.

 Spam filtering sofware

 Anti-spam software:

 GFI MailEssentials
 NetIQ MailMarshal
 NAI/McAfee SpamKiller
 SurfControl

 Specifications
 How we tested
 Look out for...
 Final words
 About RMIT

Product: GFI MailEssentials for Exchange/SMTP V9.0 Beta 1
Price: AU$425 for up to 25 mailboxes, AU$1754.50 for unlimited mailboxes; freeware version available
Vendor: GFI Software
Phone: 1800 225 543
Web: www.gfi.com

Interoperability: ½
Supports Windows 2000 or XP Pro with any SMTP mail server.

Futureproofing:
Spam collection as tested was very good but subject to too many false positives.

ROI: <
Freeware version is excellent value if you don't need support. Price for unlimited mailboxes is also very good.

Service:
No support for freeware version; 12 months included in paid version.

Rating: ½

NetIQ MailMarshal

Initially, the results were beginning to look a bit worrying on the static controlled tests and we were very surprised when tabulating the results and applying the scoring to find that even though there were some obvious hits and misses with the MailMarshal application, overall its consistency won out. Instead of being amazingly accurate, this package applies the sure but steady approach to filtering the mail messages. Total score is 1383 points.

MailMarshal has two main operations windows: one for configuration, and the other to allow administrators to check the status of the messages being stopped. The messages can be filtered to a number of specific directories such as images, virus/worm, language etc.

 Spam filtering sofware

 Anti-spam software:

 GFI MailEssentials
 NetIQ MailMarshal
 NAI/McAfee SpamKiller
 SurfControl

 Specifications
 How we tested
 Look out for...
 Final words
 About RMIT

Product: MailMarshal 5.5 SMTP
Price: From AU$2315 for up to 75 mailboxes
Vendor: NetIQ
Phone: 02 9959 2313
Web: www.netiq.com

Interoperability:
Supports Windows 2000 with any SMTP mail server.

Futureproofing: ½
Testing results were best balanced overall.

ROI:
Excellent solution and investment for larger or growing enterprises with 100+ mailboxes to administer.

Service:
Additional contract available.

Rating:

Network Associates McAfee SpamKiller

Compared to the other three packages in this review, this application performed poorly. It performed some filtering functions, however far too many spam messages were being passed through undetected. On the flipside, very few legitimate e-mails were canned and only a few newsletters. Very little configuration exists for the system also. Total score is -323 points.

There is very limited specific configuration available for the McAfee application. This package is more of a set-and-forget app rather than a specific monitored rule-based mail filter. The management/administration console makes it relatively easy to configure settings like adding words to the subject line of messages considered to be spam. Rules and blacklist/whitelists can be readily set up under separate configuration tabs in the console window.

 Spam filtering sofware

 Anti-spam software:

 GFI MailEssentials
 NetIQ MailMarshal
 NAI/McAfee SpamKiller
 SurfControl

 Specifications
 How we tested
 Look out for...
 Final words
 About RMIT

Product: McAfee SpamKiller for Exchange Small Business
Price: From AU$28.65 per mailbox incl support (101 nodes)
Vendor: Network Associates
Phone: 1800 644 646
Web: www.networkassociates.com

Interoperability:
Supports Windows 2000 and Exchange Server only.

Futureproofing:
Accuracy as tested was very poor.

ROI:
Relatively cheap solution for SMEs with up to 100 mailboxes, however the accuracy would need to be tested with your setup before deploying this app.

Service: ½
12 months included service is very good.

Rating: ½

SurfControl

SurfControl is the second of two applications tested here that vendors tout as being "smart", and certainly the SurfControl application includes some fairly hefty weaponry in the name of defending the mailboxes. As we found with both "smart" applications, sometimes they were slightly too smart for their own good. While adequately filtering most spam messages, they tended to pick up a fair few legitimate e-mails and legitimate newsletters which impacted on the overall score. Total score is still a respectable 930 points.

With both "smart" applications, it would be interesting to see over a longer period if their learning capabilities improved their accuracy.

It should also be noted here that since we published the overview article on mail filtering in July, SurfControl has adjusted its pricing and is now within the same ballpark as many of the other players in this field.

SurfControl has a very good management window, similar to the NetIQ and GFI offerings. SurfControl has separate management and admin consoles, one for reviewing blocked messages and the other for configuring the rules.

Interestingly, the rules can be managed in a specific operator set sequence. For example, the language or header/image rules can be applied before the more encompassing spam rules, thereby allowing better filtering to different categories without the chance of missing more general spam messages.

 Spam filtering sofware

 Anti-spam software:

 GFI MailEssentials
 NetIQ MailMarshal
 NAI/McAfee SpamKiller
 SurfControl

 Specifications
 How we tested
 Look out for...
 Final words
 About RMIT

Product: SurfControl
Price: From $13.30 to $57 per mailbox depending on user count
Vendor: SurfControl
Phone: 02 9414 0000
Web: www.surfcontrol.com

Interoperability:
Supports Windows 2000 with any SMTP mail server.

Futureproofing: ½
Spam collection as tested was very good but false positives bring the overall score down.

ROI:
New pricing structure is far better than the original.

Service: ½
Additional contract available.

Rating: ½

Product GFI MailEssentials for Exchange/SMTP V9.0 Beta 1 MailMarshal 5.5 SMTP McAfee SpamKiller for Exchange Small Business SurfControl
Vendor GFI Software NetIQ Network Associates SurfControl
Web www.gfi.com www.netiq.com www.networkassociates.com www.surfcontrol.com
Phone 1800 225 543 02 9959 2313 1800 644 646 02 9414 0000
Price $425 for up to 25 mailboxes, $1754.50 for unlimited mailboxes; freeware version available From $2315 for up to 75 mailboxes From $28.65 per mailbox incl support (101 nodes) From $13.30 to $57 per mailbox depending on user count
Support E-mail, phone, Web support available. Maintenance agreement renewable at approx 25% of the list price, otherwise $120 per incident Essential Care annual maintenance plan available Included in price Phone, e-mail, and dial-in support; priority customer care program. Software maintenance and all new releases 25% of license price
Minimum hardware requirements Same as operating system 400 MHz Intel Pentium II, 128MB RAM, 5GB disk space 400MHz Intel Pentium II, 512MB RAM, 200MB disk space 600MHz Intel Pentium III, 512MB RAM, 1GB disk space
Operating system Windows 2000 Professional, Server, or Advanced Server SP3; Windows XP Professional Windows 2000 or higher Windows 2000 Server with Active Directory Windows 2000 server SP1
E-mail software Exchange 5.5/2000/2003 or any SMTP server Any SMTP server Microsoft Exchange 2000 (service pack 2 or later) Any SMTP server
Additional software requirements IIS5 or higher SMTP service Microsoft SQL Server 7.0/2000 or Microsoft Database Engine (MSDE) Microsoft .NET Framework, Microsoft Data Access Components (MDAC) 2.7 Microsoft Data Access Components (MDAC) 2.5 SP1
Maximum mail clients supported Unlimited Dependent on licensing 500 mailboxes Unlimited
 Spam filtering sofware

 Anti-spam software:

 GFI MailEssentials
 NetIQ MailMarshal
 NAI/McAfee SpamKiller
 SurfControl

 Specifications
 How we tested
 Look out for...
 Final words
 About RMIT

How we tested

 Spam filtering sofware

 Anti-spam software:

 GFI MailEssentials
 NetIQ MailMarshal
 NAI/McAfee SpamKiller
 SurfControl

 Specifications
 How we tested
 Look out for...
 Final words
 About RMIT

Aim
To test participating vendors' anti-spam software applications and their own technical configuration of the same in an effort to provide some statistics on the effectiveness of the current technologies ability to successfully filter desirable and undesirable e-mail messages.

How it worked
We provided each vendor with a server running Microsoft Exchange Server 2000 on Windows 2000 Server. All vendors were invited to send a technician to the Labs on the same day to install and configure their own products and the Exchange servers. The technicians could either install their product on the same server or on a separate server also running Windows 2000 Server.

All test servers were connected via the same switch and the Lab gateway to the Internet. Each server was allocated a pre-defined static publicly accessible IP address and each mail server was assigned a fully qualified sub-domain name. Each server was also assigned an external e-mail account.

Vendors were encouraged to implement their rule sets so they were "tight to catch as much spam as that package is capable of", but not too tight to block everything--false positives were to be avoided as much as possible. The use of current black and white lists was acceptable. Once the install/configuration period was over, the vendors were not allowed to see or access their systems again.

Static tests
A pre-collected set of over 1800 e-mails were sent from a few static mail accounts, and the results collected. We ran through these tests several times to ensure their accuracy, and to determine the effect, if any, of the packages' learning capabilities.

We used a Microsoft internal testing tool to send the static control messages to the servers. This tool was initially developed to test mail servers under load. We adapted its use to allow us to take messages that we have collected (provided their headers have not been corrupted) and then send with original or new headers.

The scores you see are based on the results of these static tests, although they are very similar to the results achieved in the live tests as well.

Live tests
Following the static tests, the packages were run on a number of live external mailboxes, providing a live test scenario with real-world mailboxes and real world external mail servers.

We used a Linux-based Sendmail server to combine all messages to a single account, then forward them to the multiple test accounts, which left the headers as if the messages had been sent directly from the spammers.

After running these two tests using the vendor's suggested configurations, we spent a bit of time altering the vendors' rule configurations to see if tweaking the products could alter their results. Although this did not contribute to the overall scores, because of the subjective and human factors involved, it gave us some valuable information on the ease of use and effectiveness for administrators who will need to constantly tweak the systems once they are in use.

A note on results
Any results achieved by this testing only have a limited lifespan before becoming redundant, due to the nature of the applications themselves, and the dynamic nature of the spam they are trying to filter. The bad guys will always be trying new ways to get around the good guys' defences, which the good guys will always be trying to improve.

A note on servers
Due to the large number of servers required for this test, since each vendor required up to two servers for their products, we were unable to provide enough identical servers to run the tests. We therefore hired all the servers from PC Hire. Each vendor contributed the costs of hiring their own server(s), and we'd like to thank PC Hire for helping us keep these costs to a minimum. We'd like to make it clear that--as always--vendors were not charged to participate in the tests, and were only required to pay the cost of the server hire.

Test bench

    Interoperability: Will the software work with your existing mail server and operating system?

    Futureproofing: Is the software accurate and flexible enough to suit your needs into the future?

    ROI: Does the price justify the accuracy and will you achieve productivity gains by using the software?

    Service: What options are available for service and support, and how much do they cost?

Things to look out for...

  • Easily deployed packages that work with your existing mail applications (mail servers and mail clients)
  • For "self managed" solutions look for apps with the ability to create, manage, and manipulate quite complex rules
  • Evaluate, evaluate, evaluate. If you are a larger organisation, most of the filtering vendors on the market would be more than happy to arrange a demo/test in your given environment, this should assist you in making the decision that is right for your enterprise.

Final Words
Naturally with all these applications, a system administrator can configure and change the rules and create custom rules also to adapt the system to their own particular organisation. Therefore it is not sufficient to take this accuracy test on its own, I would still advise any enterprise looking to implement any spam filtering application to perform their own tests in-house on any of these or the many other applications available to filter spam. Spam is certainly a hot topic at the moment, and doesn't look at being easily remedied in the near future. The innovations and implementation of each of these applications is quite impressive and very exciting--certainly much more involving than your average word processor or antivirus app.

Subscribe now to Australian Technology & Business magazine.

About RMIT Test Labs

 Spam filtering sofware

 Anti-spam software:

 GFI MailEssentials
 NetIQ MailMarshal
 NAI/McAfee SpamKiller
 SurfControl

 Specifications
 How we tested
 Look out for...
 Final words
 About RMIT

RMIT Test Labs RMIT IT Test Labs is an independent testing institution based in Melbourne, Victoria, performing IT product testing for clients such as IBM, Coles-Myer, and a wide variety of government bodies. In the Labs' testing for T&B, they are in direct contact with the clients supplying products and the magazine is responsible for the full cost of the testing. The findings are the Labs' own--only the specifications of the products to be tested are provided by the magazine. For more information on RMIT, please contact the Lab Manager, Steven Turvey.

Copyright © 2009 CBS Interactive, a CBS Company. All Rights Reserved.
ZDNET is a registered service mark of CBS Interactive. ZDNET Logo is a service mark of CBS Interactive.