[Dailydave] approximate string matching - Bloom filters
Martin Roesch
roesch at sourcefire.com
Fri Sep 1 19:59:19 EST 2006
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Hi Mateusz,
Sorry it didn't work out, guess I got it backwards! :)
-Marty
On Sep 1, 2006, at 2:19 PM, Mateusz Berezecki wrote:
> On 9/1/06, Fausett, Mark (US SSA) <mark.fausett at baesystems.com> wrote:
>> Bloom filters are approximate in a different sense though -- think of
>> them as space-efficient but lossy token sets: you put a bunch of tokens
>> in, and can subsequently query whether a particular token was placed
>> into the set, to some degree of confidence.
>> Bloom filters are subject to false positives -- they'll sometimes
>> incorrectly tell you that a token is in the set -- but not false
>> negatives. Because hash functions are used to insert tokens into the
>> Bloom filter, the false positives have nothing to do with approximate
>> string matches.
>>
>
> By trial and error I already discovered this really unwanted
> behavior :-/
>
> It's very good for representing what is not in the set rather than
> representing the set itself.
>
> Mateusz
- --
Martin Roesch - Founder/CTO, Sourcefire Inc. - +1-410-290-1616
Sourcefire - Security for the Real World - http://www.sourcefire.com
Snort: Open Source IDP - http://www.snort.org
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.1 (Darwin)
iD8DBQFE+Ndnqj0FAQQ3KOARAq7+AJ4j4s9inQ1aQsYyCD1Sx9gmzUdQUQCeKJDt
3lrJrj0VJG+9twrg3ip3Buc=
=ML7k
-----END PGP SIGNATURE-----
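The behavior described in the quoted thread above (false positives but no false negatives, with collisions that depend on hashing rather than on string similarity) can be illustrated with a minimal sketch. Nothing here comes from the thread itself: the class name BloomFilter, the parameters num_bits and num_hashes, and the choice of SHA-256 with an index prefix to derive bit positions are all assumptions made for illustration.

```python
import hashlib


class BloomFilter:
    """Illustrative Bloom filter; names and hashing scheme are assumptions."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        # One byte per bit keeps the sketch simple; a real filter would pack bits.
        self.bits = bytearray(num_bits)

    def _positions(self, token):
        # Derive num_hashes positions by hashing the token with an index prefix.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{token}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, token):
        for pos in self._positions(token):
            self.bits[pos] = 1

    def __contains__(self, token):
        # True means "possibly present"; False means "definitely absent".
        return all(self.bits[pos] for pos in self._positions(token))


seen = BloomFilter()
for token in ("snort", "sourcefire", "dailydave"):
    seen.add(token)

# Every inserted token is reported as present: no false negatives.
print(all(t in seen for t in ("snort", "sourcefire", "dailydave")))  # True

# Degenerate one-bit filter: after a single insert, every query hits that
# bit, so an unrelated token is a guaranteed false positive.
tiny = BloomFilter(num_bits=1, num_hashes=1)
tiny.add("snort")
print("unrelated-token" in tiny)  # True
```

Because the bit positions come from a hash of the whole token, a near-miss such as "snorts" is no more likely to be reported as present than any other string, which is why the structure does not help with approximate string matching. A negative answer, on the other hand, is always reliable.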