We need a new encoding which makes the maximum possible use of "safe" characters from the "good" range and uses none from the "bad" range.
-
Show this thread
-
If we can find 256 "good" characters (trivial), "Base256" would allow 280 bytes per 280-character Tweet, same as Base65536.
1 reply 0 retweets 3 likesShow this thread -
If we can find 512 "good" characters (easy?), "Base512" would express 9 bits per character, or 315 bytes per 280-character Tweet.
1 reply 0 retweets 2 likesShow this thread -
If we can find 1024 "good" characters (difficulty unclear), "Base1024" would express 10 bits per character = 350 bytes per Tweet
1 reply 0 retweets 2 likesShow this thread -
Base2048 -> 11 bits/character -> 385 bytes/Tweet (highly unlikely) Base4096 -> 12 bits/character -> 420 bytes/Tweet (definitely impossible)
1 reply 0 retweets 3 likesShow this thread -
Binary encodings using non-power-of-two Unicode character counts are possible but generally more inconvenient to implement.
1 reply 0 retweets 2 likesShow this thread -
With a power of two, you just dice the bits up and then put them back together in different groupings.
1 reply 0 retweets 2 likesShow this thread -
There are 0x1100 = 4352 good characters, and 1109760 bad ones.
1 reply 0 retweets 3 likesShow this thread -
A currently-unanswered question is how many of the good characters are "safe".
1 reply 0 retweets 3 likesShow this thread -
"Safeness" was a concept I defined while initially developing Base65536: https://github.com/qntm/base65536gen#what-makes-a-character-safe …
1 reply 0 retweets 4 likesShow this thread
I suppose I shall have to dig that old code out and do some further number crunching...
-
-
I think I'm going to call these characters "light" and "heavy" rather than "good" and "bad".
0 replies 0 retweets 4 likesShow this threadThanks. Twitter will use this to make your timeline better. UndoUndo
-
-
-
Thanks. Twitter will use this to make your timeline better. UndoUndo
-
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.