I'm considering using the structural "quality" of an XML/XHTML email as a
metric for spamminess, on the assumption that more markup errors make spam
more likely. Is there a lightweight validator that can quickly report how
many errors a message contains? Ideally this would be something I could
invoke from a Perl process, perhaps over a pipe to a validation server
(similar to the way ClamAV and SpamAssassin can be invoked).
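
To make the idea concrete, here is a rough sketch of the kind of number I have in mind. It is not a real validator: it only counts unbalanced tags using Python's standard `html.parser`, and it is written in Python purely for illustration. The names `TagBalanceCounter` and `count_structure_errors` are made up for this example, and the list of void elements is an assumption about which tags to ignore.

```python
from html.parser import HTMLParser

# Elements that never take a closing tag (assumed list, for illustration).
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TagBalanceCounter(HTMLParser):
    """Counts crude structural errors: stray close tags and unclosed tags."""

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open, non-void tags
        self.errors = 0   # errors detected so far

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_startendtag(self, tag, attrs):
        # Self-closed tags such as <br/> need no balancing.
        pass

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if tag not in self.stack:
            # Close tag with no matching open tag.
            self.errors += 1
            return
        # Pop back to the matching open tag; every tag skipped on the
        # way was left unclosed and counts as one error.
        while self.stack:
            if self.stack.pop() == tag:
                break
            self.errors += 1


def count_structure_errors(markup):
    """Return a crude error count for one message body."""
    counter = TagBalanceCounter()
    counter.feed(markup)
    counter.close()
    # Anything still open at the end of the message was never closed.
    return counter.errors + len(counter.stack)
```

For example, `count_structure_errors("<p>Hi<b>there</p>")` counts the unclosed `<b>` as one error. A real validator would also check against a DTD or schema, which this sketch does not attempt.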