Evening all,
As most of you will undoubtedly be aware, this week has seen a few problems with the email platform. This post like a few others that I've made recently, is intended to provide a little more information on our continued commitment to ensuring that we are doing everything possible in order to provide a first class email platform that is resilient, scalable and most importantly reliable.
BackgroundSome of you will already be aware that we have recently installed a brand new mail storage platform and today started moving customers across to the new system. Further details regarding this work can be seen
here. If that's a little too much to take in then make sure you check out the Service Status announcement
here as the work will involve a small interruption to your service.
In addition to this, we've also completely rebuilt our mxcore mail delivery platform and almost doubled the number of servers responsible for handling mail collection.
The above changes should certainly be welcomed and will go a long way towards safeguarding the future reliability of the email platform. There is however one core contributor to the problems we have seen that this work doesn't address. That problem lies firmly at the feet of everyone's worst enemy... Spam!
Spam!Spam is an issue on the rise. The amount of spam flying around the Internet is increasing; 2006 was the worst year for it so far; and this trend is expected to continue. It's clear just how significant a problem this presents by looking at the information gathered from our own systems. The following represents a snapshot of the volumes of email we were processing in the run up to Christmas:

We can clearly see that the volume of mail being processed by the platform increased by about 55% in the 2 weeks before Christmas alone!!!
Such increases in spam puts massive strain on the mail platform...
- We need extra resources to process the mail as it arrives. Without enough processing power mails begin to queue which results in delays for customers.
- Email that's awaiting delivery or awaiting collection has to be stored somewhere. This is what the storage platform is for. With increases like those exhibited above you can expect this to start running out pretty quickly unless you're effectively policing spam.
- All those GB's of mail have to get to us somehow so of course we have to scale the transit connections to the rest of the Internet to cope with the additional traffic.
So what are we doing about this? Well, to begin with we're changing the way in which customers mailboxes are set up when they create an account. By default we have always provided a 'catch-all' email address. What this means is that email is delivered to you irrespective as to what comes before the '@' sign. This makes you very susceptible to spam as emails are often sent to random aliases. This is something that we see crop up all too often in these forums so it will be nice to see the back of it. We're looking to make these changes in the 3rd week of April and I posted a bit of a teaser back in January containing more information that can be seen
here.
In light of the most recent spate of mail timeouts we've also taken the decision to make some configuration changes to the way in which our mail delivery servers handle spam. Before explaining these changes it's helpful to have an understanding of how the delivery servers handle your email.
Mail DeliveryWhen you send an email, the email program or web application you are using sends your message to a local mail delivery server (SMTP Server). The SMTP server then forwards the mail onto the server responsible for collecting and delivering messages for the person you're sending the email to. For emails sent to our customers this is our
mxcore platform.
Now consider what would happen if our mxcore servers were unavailable or out of service. It's important that the email isn't lost so it makes sense to have a backup server. These servers are what form our
mxlast platform.
Legitimate email rarely needs to pass through our mxlast platform as it is very uncommon for the mxcore servers to be unavailable.
You wouldn't have thought this looking at the volume of email that passes through the server though:

The reason for this is that most of this email is spam. One trick that spammers often use is to directly target the secondary mail server of a domain (in our case the mxlasts). The reason they do this is because these servers are far less likely to have as stringent spam scanning as the primary server meaning the spammers email is more likely to get successfully delivered.
Our SPAM filtering platformWe have already added spam filtering to the mxlast platform as part of our recent mail delivery server upgrade. Whilst this helped to increase spam detection rates, it didn't help reduce the previously mentioned processing and storage problems. This is because the servers still have to deliver the SPAM to the customer.
Our SPAM filtering is a 'multi-layered' process. This basically means that we use a number of different SPAM filtering mechanisms before emails are delivered to the customer. If an email isn't identified as spam then it will be checked by the next process which will pass it on to the next process etc. If any of these processes identify the email as SPAM then the tag [-SPAM-] is appended to the subject line of the email and delivered to the customer.
Proposed ChangesOn April 12th we will be making some changes to the way that our mxlast platform handles the mail that is sent to it. Details regarding this work have been posted as a Planned Maintenance Announcement
here.
Each email will be checked against one of our SPAM filtering processes and if determined to be SPAM will not be accepted by the platform. This will greatly reduce the volume of email our servers have to process as there will no longer be the overhead associated with delivering these junk emails to customers.
Some customers have aired concern in the past about legitimate email they have received in the past that has been marked incorrectly as [-SPAM-]. It is worth mentioning at this point that this does
*not* mean that we will now be dropping this email without the customer knowing.
These mistakes are the result of our Bayesian filter which is a completely separate process to the one we will be using to refuse email on the mxlasts. Because we use this process now as part of our spam filter we have been able to make sure we are confident as possible that no email will be refused incorrectly.
What does this mean?Well most noticeably we hope it means that anyone plagued by large volumes of spam should see a marked reduction in the amount of junk email they receive.
We have estimated that these changes will reduce mail volumes on the mxlast platform by around 60-70%. That in itself is about 20-30% of the total volume of email passing through our platform. Whilst making SPAM less of a nuisance for customers we are also helping protect the integrity of the mail platform and reduce the potential for problems similar to those witnessed earlier this week.
We are aware that we have many customers who prefer to manage their own email by running their own SMTP server. We would like to assure these customers that the changes to the mxlast platform will not affect them and they will continue to manage their own email as normal.
Feel free to discuss these plans and ask any questions you may have and I'll be more than happy to try and source the answers.