I made something silly. https://iojcc.org/
The size tool thinks 𝌆 is 2 bytes. It’s 4 in UTF-8. Which encoding are you using?
-
-
Looking at the code, it seems it’s not counting bytes at all, but rather UCS-2/UTF-16 code units. If this is intentional, you may want to clarify that.
-
Correct. That's semi-intentional. The original idea was to use 7-bit ASCII, and reject everything which has an octet > 127. Although, thinking about it now, that might be a little bit too restricting. I'll fix the size tool to count actual bytes.
End of conversation
New conversation -
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.
JavaScript, HTML, CSS, HTTP, performance, security, Bash, Unicode, i18n, macOS.