|
Joint Colloquium of
the NYU Computer Science Department
and
the NYC NLP Forum
Stopping Spam
Joshua Goodman
Microsoft Research
Friday, October 8, 2004 11:30 A.M.
Room 1302 Warren Weaver Hall
251 Mercer Street
New York, NY 10012-1185
Directions: http://cs.nyu.edu/csweb/Location/directions.html
Colloquium Information: http://cs.nyu.edu/csweb/Calendar/colloquium/index.html
Hosts:
I. Dan Melamed melamed at cs dot nyu dot edu, (212) 998-3003
Abstract
Spam is a huge and growing problem. I'll first survey solutions to spam,
including filtering approaches (machine learning, fuzzy hashing, and
blackhole lists) and "postage" approaches, including reverse Turing
tests, computational puzzles, and monetary challenges. Our favorite
technique is a machine learning/text classification approach combined
with a challenge/response postage approach. I'll talk about problems and
solutions we've had in practice, especially how we have gotten millions
of messages of labeled training data, both good and spam. I'll also talk
briefly about my research on personalizing spam filters, which turns out
to be important, but harder than we thought. I'll show some analyses of
those millions of messages, including where spam actually comes from,
and why legal solutions can only stop a fraction of spam. Next, I'll
talk about why email in general and spam in particular need their own
new field, combining aspects of machine learning, networking,
cryptography/security, HCI, and economics.
Joint work with Geoff Hulten, Robert Rounthwaite, David Heckerman, and
others.
Bio
Joshua Goodman started his professional life as a developer at
Dragon Systems, working on speech recognition. He then went to grad
school at Harvard University, receiving a Ph.D. for his work in
statistical natural language processing, especially statistical parsing.
From there, he went to Microsoft Research, where he worked on language
modeling. For the past 2 and a half years, he has been working on
stopping spam, including helping start Microsoft's Anti-Spam Technology
Group. He is General Chair for the Conference on Email and Anti-Spam
2005, which everyone should attend.
top | contact webmaster@cs.nyu.edu
|