gloom

A simple and intuitive implementation of a Bloom filter using enhanced double hashing.

Within gloom, enhanced double hashing is used to set bit positions. The choice for double hashing was shown to be effective without any loss in the asymptotic false positive probability, leading to less computation and potentially less need for randomness in practice by Adam Kirsch and Michael Mitzenmacher in Less Hashing, Same Performance: Building a Better Bloom Filter.

The enhanced double hash is of the form:

g_i(x) = (H₁(x) + iH₂(x) + f(i)) mod m, where

H₁ is FNV-1a 64-bit, H₂ is Murmur3 64-bit, and f(i) = i³

What is a Bloom filter?

A Bloom filter is a space-efficient probabilistic data structure used for set inclusion queries. False positive matches are possible, but false negatives are not – in other words, a query returns either "possibly in set" or "definitely not in set".

Essentially, a Bloom filter contains a single bit vector of size m and k independent and uniform hash functions to insert n set items. Upon inserting a set item into the filter, the k hash functions return all bit vector positions to set for said item.

To test inclusion of an item, all mapped k bit positions must contain a set bit. It is possible to get a false positive for an item, but under a desired and given probability. Although Bloom filters allow false positives, the space savings often outweigh this drawback.

API

To initialize a Bloom filter, a given set size and desired false positive probability is needed. The size of the bit vector and the number of hash functions to use is determined by these parameters.

import (
   	"github.com/alexanderbez/gloom"
)

bf, err := gloom.NewBloomFilter(n, gloom.DefaultFalsePosProb)

item := []byte("foo")
bf.Set(item)

ok, err := bf.Includes(item)

Tests

$ go test -v ./...

Contributing

Fork it
Create your feature branch (git checkout -b feature/my-new-feature)
Commit your changes (git commit -m 'Add some feature')
Push to the branch (git push origin feature/my-new-feature)
Create a new Pull Request

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
murmur3		murmur3
.gitignore		.gitignore
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
gloom.go		gloom.go
gloom_test.go		gloom_test.go
utils.go		utils.go
utils_test.go		utils_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

gloom

What is a Bloom filter?

API

Tests

Contributing

About

Uh oh!

Releases

Packages

Languages

License

alexanderbez/gloom

Folders and files

Latest commit

History

Repository files navigation

gloom

What is a Bloom filter?

API

Tests

Contributing

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages