I still haven't figured out a way to *reliably* extract Japanese SJIS zip files on Linux. unzip/7z work on files with extended UTF-8 filenames. unzip always mangles SJIS. LANG=C 7z passes through SJIS and convmv can convert to UTF-8, but that mangles UTF-8 extended zips.
... turns out this is already a thing, but nobody builds unzip with -DUSE_ICONV_MAPPING. Well then. Gentoo ebuild patch incoming to add IUSE="iconv"...
Yeah, this totally works. Why does nobody build unzip with support for this option?! pic.twitter.com/fzGjbnIvhI
PR filed for Gentoo and bug filed for Arch. https://github.com/gentoo/gentoo/pull/6276 … https://bugs.archlinux.org/task/56428
or have some sort of support for uchardet inside unzip
I mean I'm fine specifying the encoding manually. Also, even just *not mangling* the bytes would be better, then I can just unconditionally use -UU (ignore UTF-8 filenames) and LANG=C and convmv.
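For what it's worth, the "specify the encoding manually, but leave UTF-8 names alone" behaviour can be sketched with Python's standard `zipfile` module. This is only a rough sketch, not what unzip or 7z do internally: `decoded_names` and `legacy_encoding` are names made up for illustration, and it assumes that any entry without the UTF-8 flag (general-purpose bit 11) uses the one legacy encoding you pass in. It relies on `zipfile` decoding unflagged names as cp437, which can be reversed to get the raw bytes back.

```python
import zipfile

UTF8_FLAG = 0x800  # general-purpose bit 11: the name is stored as UTF-8


def decoded_names(zip_source, legacy_encoding="shift_jis"):
    """Return (name_as_zipfile_sees_it, decoded_name) for each member.

    Hypothetical helper: assumes every unflagged name uses `legacy_encoding`.
    """
    pairs = []
    with zipfile.ZipFile(zip_source) as zf:
        for info in zf.infolist():
            if info.flag_bits & UTF8_FLAG:
                # Flagged entries were already decoded as UTF-8 by zipfile.
                name = info.filename
            else:
                # zipfile decoded the raw bytes as cp437; undo that to
                # recover the bytes, then decode with the chosen encoding.
                raw = info.orig_filename.encode("cp437")
                name = raw.decode(legacy_encoding)
            pairs.append((info.filename, name))
    return pairs
```

Calling `decoded_names("archive.zip")` (the path is a placeholder) lists the names without extracting anything, so a wrong encoding guess shows up as a `UnicodeDecodeError` or visible mojibake before any file is written.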
Yes, it is hard. Want me to show you how one can mix latin1/utf8/utf16... in a single filename? Knowing how to decode an arbitrary string is undecidable.
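A small illustration of the mixing problem, as a sketch only: the filename below is invented, and `candidate_decodings` is a hypothetical helper, not part of any tool mentioned in this thread. It glues a latin1-encoded part onto a UTF-8-encoded part and tries a few codecs on the result.

```python
def candidate_decodings(raw, encodings=("utf-8", "shift_jis", "latin-1", "cp437")):
    """Return {encoding: text} for every codec that decodes `raw` without error."""
    results = {}
    for enc in encodings:
        try:
            results[enc] = raw.decode(enc)
        except UnicodeDecodeError:
            pass  # this codec rejects the bytes outright
    return results


# Invented example: one filename whose two halves use different encodings.
intended = "café_日本"
mixed = "café_".encode("latin-1") + "日本".encode("utf-8")

results = candidate_decodings(mixed)
```

Codecs that accept every byte value (latin-1, cp437) always "succeed", so a decode that raises no error says little about whether the guess was right, and none of the single-codec results here gives back the intended name.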
Have you ever heard about WinRAR?