Ending Spam: Bayesian Content Filtering and the Art of Statistical Language Classification

Ending Spam: Bayesian Content Filtering and the Art of Statistical Language Classification, 9781593270520 (1593270526), No Starch Press, 2005

Join author John Zdziarski for a look inside the brilliant minds that have conceived clever new ways to fight spam in all its nefarious forms. This landmark title describes, in-depth, how statistical filtering is being used by next-generation spam filters to identify and filter unwanted messages, how spam filtering works and how language classification and machine learning combine to produce remarkably accurate spam filters.

After reading Ending Spam, you'll have a complete understanding of the mathematical approaches used by today's spam filters as well as decoding, tokenization, various algorithms (including Bayesian analysis and Markovian discrimination) and the benefits of using open-source solutions to end spam. Zdziarski interviewed creators of many of the best spam filters and has included their insights in this revealing examination of the anti-spam crusade.

If you're a programmer designing a new spam filter, a network admin implementing a spam-filtering solution, or just someone who's curious about how spam filters work and the tactics spammers use to evade them, Ending Spam will serve as an informative analysis of the war against spammers.

TOC Introduction

PART I: An Introduction to Spam Filtering Chapter 1: The History of Spam Chapter 2: Historical Approaches to Fighting Spam Chapter 3: Language Classification Concepts Chapter 4: Statistical Filtering Fundamentals

PART II: Fundamentals of Statistical Filtering Chapter 5: Decoding: Uncombobulating Messages Chapter 6: Tokenization: The Building Blocks of Spam Chapter 7: The Low-Down Dirty Tricks of Spammers Chapter 8: Data Storage for a Zillion Records Chapter 9: Scaling in Large Environments

PART III: Advanced Concepts of Statistical Filtering Chapter 10: Testing Theory Chapter 11: Concept Identification: Advanced Tokenization Chapter 12: Fifth-Order Markovian Discrimination Chapter 13: Intelligent Feature Set Reduction Chapter 14: Collaborative Algorithms

Appendix: Shining Examples of Filtering

Comments

Amazing Books

Numerical Methods in Finance and Economics: A MATLAB-Based Introduction (Statistics in Practice)

John Wiley & Sons, 2006

A state-of-the-art introduction to the powerful mathematical and statistical tools used in the field of finance The use of mathematical models and numerical techniques is a practice employed by a growing number of applied mathematicians working on applications in finance. Reflecting this development, Numerical Methods in Finance and...

Salivary Gland Disorders

Springer, 2007

Co-edited by Eugene N. Myers, a world-famous expert in the field, this has got to be the last word on salivary gland disorders. The disorders themselves cover a broad array of diseases, both benign and malignant. Thus, the contents of this book have been organized to reflect the diverse nature of salivary gland anatomy, physiology, and...

Data Mining Algorithms in C++: Data Patterns and Algorithms for Modern Applications

Apress, 2017

Discover hidden relationships among the variables in your data, and learn how to exploit these relationships. This book presents a collection of data-mining algorithms that are effective in a wide variety of prediction and classification applications. All algorithms include an intuitive explanation of operation, essential...

Show Me Macromedia Flash MX 2004

Que, 2004

Show Me Macromedia Flash MX offers readers a fast, visual way to learn Flash MX, solve problems, and get work done!

Step-by-step instructions with accompanying visuals requires less time reading and more time learning this popular Web authoring...

Evolutionary Electronics: Automatic Design of Electronic Circuits and Systems by Genetic Algorithms

CRC Press, 2001

From the explosion of interest, research, and applications of evolutionary computation a new field emerges-evolutionary electronics. Focused on applying evolutionary computation concepts and techniques to the domain of electronics, many researchers now see it as holding the greatest potential for overcoming the drawbacks of conventional design...

Diagramming the Big Idea: Methods for Architectural Composition

Routledge, 2019

Becoming an architect is a daunting task. Beyond the acquisition of new skills and procedures, beginning designers face an entirely unfamiliar mode of knowledge: design thinking.

In Diagramming the Big Idea, Jeffrey Balmer and Michael T. Swisher introduce the fundamentals of design thinking by illustrating how...