Bloom Filter vs std::map in C++

Overview

This project demonstrates the implementation and performance comparison of a Bloom Filter and C++ std::map.

A Bloom Filter is a probabilistic data structure used for fast membership testing with very low memory usage. Unlike traditional data structures, it may return false positives, but it never returns false negatives.

The project benchmarks both data structures in terms of:

Insertion speed
Lookup performance
Memory consumption
Hit and miss query performance

Features

Custom Bloom Filter implementation
Multiple hash functions
Random string key generation
Memory usage tracking
Performance benchmarking
Interactive command-line interface
Comparison with ordered std::map

Technologies Used

C++
STL (map, vector, chrono)
Bloom Filter
Hashing
Performance Benchmarking

How Bloom Filter Works

A Bloom Filter uses:

A large bit array
Multiple hash functions

When inserting a key:

The key is hashed multiple times
Corresponding bit positions are set to 1

When searching:

The same hashes are computed
If any bit is 0 → key definitely does not exist
If all bits are 1 → key may exist

This makes Bloom Filters:

Extremely memory efficient
Very fast for lookups

But they can produce:

False positives
No false negatives

Project Structure

.
├── bloom.cpp
├── README.md
└── results.csv

Compilation

Linux / macOS

g++ bloom.cpp -O2 -o bloom
./bloom

Menu Options

1. Insert keys into std::map
2. Lookup hits (map)
3. Lookup misses (map)
4. Insert keys into Bloom Filter
5. Lookup hits (Bloom)
6. Lookup misses (Bloom)
7. Show total memory usage
8. Exit

Performance Metrics Measured

The program compares:

Insert time
Lookup hit time
Lookup miss time
Total memory usage

Why Bloom Filters Are Useful

Bloom Filters are widely used in:

Databases
Web browsers
Distributed systems
Network routers
Cache systems
Spell checkers

They are ideal when:

Memory efficiency is important
Fast lookups are required
Occasional false positives are acceptable

Example Workflow

Generate random keys
Store them in std::map
Store the same keys in Bloom Filter
Benchmark both structures
Compare memory and speed

Advantages of Bloom Filter

Very low memory usage
Extremely fast lookup
Scales well for huge datasets

Limitations

False positives are possible
Cannot retrieve stored values
Cannot remove elements easily (basic Bloom Filter)

Future Improvements

Counting Bloom Filter
Dynamic Bloom Filter
Better hash functions
Visualization graphs
CSV benchmark export
False positive rate analysis

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
benchmark_results.csv		benchmark_results.csv
bloomFilter.cpp		bloomFilter.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bloom Filter vs std::map in C++

Overview

Features

Technologies Used

How Bloom Filter Works

Project Structure

Compilation

Linux / macOS

Menu Options

Performance Metrics Measured

Why Bloom Filters Are Useful

Example Workflow

Advantages of Bloom Filter

Limitations

Future Improvements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Bloom Filter vs std::map in C++

Overview

Features

Technologies Used

How Bloom Filter Works

Project Structure

Compilation

Linux / macOS

Menu Options

Performance Metrics Measured

Why Bloom Filters Are Useful

Example Workflow

Advantages of Bloom Filter

Limitations

Future Improvements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages