estimate: new command for probabilistic algorithms starting with --cardinality-estimate option #2360
Labels
enhancement
New feature or request. Once marked with this label, it's in the backlog.
Currently, --cardinality is computed by maintaining frequency tables in memory, which can lead to OOM errors for very large datasets.
Add a "streaming" --cardinality-estimate option that estimates the cardinality using constant memory. Use the HyperLogLog algorithm to do this.
There are several implementations in the Rust ecosystem - https://crates.io/search?q=hyperloglog
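For reference, here is a minimal sketch of what a HyperLogLog estimator does, using only the standard library. It is not taken from any of the crates linked above, and the `HyperLogLog` type, its methods, the `estimate_cardinality` helper, and the choice of `DefaultHasher` are all assumptions made for illustration. A real implementation would probably use one of the existing crates and a faster hasher.

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::{Hash, Hasher};

/// Hypothetical minimal HyperLogLog estimator (illustration only).
/// Memory use is fixed at 2^precision one-byte registers,
/// regardless of how many values are inserted.
struct HyperLogLog {
    precision: u32,     // number of hash bits used as the register index
    registers: Vec<u8>, // one small counter per register
}

impl HyperLogLog {
    fn new(precision: u32) -> Self {
        assert!((4..=16).contains(&precision), "precision must be in 4..=16");
        HyperLogLog {
            precision,
            registers: vec![0; 1usize << precision],
        }
    }

    /// Hash the value, pick a register from the top bits, and record the
    /// position of the first 1-bit in the remaining bits.
    fn insert<T: Hash>(&mut self, value: &T) {
        let mut hasher = DefaultHasher::new();
        value.hash(&mut hasher);
        let hash = hasher.finish();

        let index = (hash >> (64 - self.precision)) as usize;
        let rest = hash << self.precision;
        // Cap the count so an all-zero remainder still yields a valid rank.
        let rank = (rest.leading_zeros().min(64 - self.precision) + 1) as u8;

        if rank > self.registers[index] {
            self.registers[index] = rank;
        }
    }

    /// Harmonic-mean estimate with the usual small-range correction
    /// (linear counting) when many registers are still empty.
    fn estimate(&self) -> f64 {
        let m = self.registers.len() as f64;
        let alpha = match self.registers.len() {
            16 => 0.673,
            32 => 0.697,
            64 => 0.709,
            _ => 0.7213 / (1.0 + 1.079 / m),
        };
        let sum: f64 = self
            .registers
            .iter()
            .map(|&r| 2f64.powi(-(r as i32)))
            .sum();
        let raw = alpha * m * m / sum;

        let zeros = self.registers.iter().filter(|&&r| r == 0).count();
        if raw <= 2.5 * m && zeros > 0 {
            m * (m / zeros as f64).ln()
        } else {
            raw
        }
    }
}

/// Hypothetical helper: stream over the values once and return the estimate.
fn estimate_cardinality<T: Hash, I: IntoIterator<Item = T>>(values: I) -> f64 {
    let mut hll = HyperLogLog::new(14); // 2^14 registers = 16 KiB
    for value in values {
        hll.insert(&value);
    }
    hll.estimate()
}
```

With a precision of 14 the expected relative error is roughly 1.04 / sqrt(2^14), or about 0.8%, and duplicates never change the state because each register only keeps a maximum. That is the trade-off against the exact --cardinality: a small, bounded error in exchange for fixed memory.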