I implemented @troyhunt's HIBP password list as a pure Python3 Bloom filter, in 629MB (false positive rate = 0.0005)https://gist.github.com/marcan/23e1ec416bf884dcd7f0e635ce5f2724 …
-
-
Is the password list really bigger than 659MB? Seems like a gzip on a sorted list would be smaller.
1 reply 1 retweet 1 like -
Over 5GB and distributed with hashed passwords so doesn't compresss well
1 reply 0 retweets 2 likes -
The text hash list is 13GB. In binary it would be 6.4GB and not compressible.
1 reply 2 retweets 5 likes
Replying to @marcan42 @copumpkin and
Also a sorted list lookup takes log2(n) disk seeks = 28. The Bloom filter takes 11.
0 replies
1 retweet
6 likes
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.