Image of Ending Spam . Bayesian Content Filtering and the Art of Statistical Language Classification Zoom

Ending Spam . Bayesian Content Filtering and the Art of Statistical Language Classification

Other products of No Starch Press

Our Price: 35,00 € tax included

Out of stock.

Time to ship: between 3 and 14 working days approx.

Description

Through considerable research, creative minds have invented clever new ways to fight spam in all its nefarious forms. This landmark title describes, in depth, how statistical filtering is being used by next generation spam filters to identify and filter spam. Zdziarski explains how spam filtering works and how language classification and machine learning combine to produce remarkably accurate spam filters. Readers gain a complete understanding of the mathematical approaches used in today's spam filters, decoding, tokenization, the use of various algorithms (including Bayesian analysis and Markovian discrimination), and the benefits of using open-source solutions to end spam. Interviews with the creators of many of the best spam filters provide further insight into the anti-spam crusade.

Contenidos

Introduction

PART I: An Introduction to Spam Filtering

Chapter 1: The History of Spam

Chapter 2: Historical Approaches to Fighting Spam

Chapter 3: Language Classification Concepts

Chapter 4: Statistical Filtering Fundamentals

PART II: Fundamentals of Statistical Filtering

Chapter 5: Decoding: Uncombobulating Messages

Chapter 6: Tokenization: The Building Blocks of Spam

Chapter 7: The Low-Down Dirty Tricks of Spammers

Chapter 8: Data Storage for a Zillion Records

Chapter 9: Scaling in Large Environments

PART III: Advanced Concepts of Statistical Filtering

Chapter 10: Testing Theory

Chapter 11: Concept Identification: Advanced Tokenization

Chapter 12: Fifth-Order Markovian Discrimination

Chapter 13: Intelligent Feature Set Reduction

Chapter 14: Collaborative Algorithms

Appendix: Shining Examples of Filtering

Index

Contenidos extra

Download chapter 11: Sendmail

Más información

Editado: Julio 2005

312 páginas

ISBN-10: 1-593270-52-6

ISBN-13: 978-1-59327-052-0

Sobre los autores

Jonathan A. Zdziarski has been fighting spam for eight years, and has spent a significant portion of the past two years working on the next generation spam filter DSPAM, with up to 99.985% accuracy. Zdziarski lectures widely on the topic of spam.

Customer comments

No customer has given his opinion about this product yet.

Send a comment