- cross-posted to:
- blahaj@lemmy.blahaj.zone
- programmer_humor@programming.dev
- darklamer ( @darklamer@lemmy.dbzer0.com ) 49•2 months ago
We have Unicode these days: blåhaj
- NeatNit ( @NeatNit@discuss.tchncs.de ) 26•2 months ago
Unicode in filenames? Are you crazy?!
Okay that was /s to some extent but I gotta rant, I’m totally convinced that there’s still new software today that completely trip over themselves when files or paths have non-ASCII characters, or sometimes even a space. Incompetence didn’t go anywhere.
- Rubanski ( @Rubanski@lemm.ee ) 22•2 months ago
I still use underscores for filenames, basically muscle memory at this point
- pmk ( @pmk@lemmy.sdf.org ) 14•2 months ago
Unicode in filenames can be a bad idea, since there are more than one way to achieve what looks like the same character. So matching patterns could fail if you think it’s one way, but it’s actually another representation in unicode.
- NeatNit ( @NeatNit@discuss.tchncs.de ) 5•2 months ago
Good point. Do filesystems use a normal form to at least prevent having two files with effectively the same name?
I should point out the flip side though, that there’s no avoiding Unicode in filenames. Users in languages that don’t use the Latin alphabet (such as Japanese, Chinese, Korean, Hebrew, Arabic, Greek and Russian, and the list could go on) can reasonably expect to be able to give a file a name they can read and understand with no extra effort. All the software woes that come with it - too bad, software needs to deal with it.
- pmk ( @pmk@lemmy.sdf.org ) 2•2 months ago
I’m not sure. A few years ago I remember that OpenBSD expected ASCII for files, but I think Linux expects utf-8. I could be wrong though.
- NeatNit ( @NeatNit@discuss.tchncs.de ) 3•2 months ago
I’m assuming Unicode anyway, and UTF-8 is by far the most natural because most files will be in ASCII. A “normal form” (see link above), you might think of it as a canonical form, is a way to check if two strings are equivalent, even if they encoded the text differently. Like the example mentioned on Wikipedia:
For example, the distinct Unicode strings “U+212B” (the angstrom sign “Å”) and “U+00C5” (the Swedish letter “Å”) are both expanded by NFD (or NFKD) into the sequence “U+0041 U+030A” (Latin letter “A” and combining ring above “°”) which is then reduced by NFC (or NFKC) to “U+00C5” (the Swedish letter “Å”).
- darklamer ( @darklamer@lemmy.dbzer0.com ) 8•2 months ago
Incompetence didn’t go anywhere.
Now that’s certainly true, but the beauty of open source software is that we can fix bugs when we encounter them.
- pingveno ( @pingveno@lemmy.ml ) 25•2 months ago
Why you torture blahaj?
- abcdqfr ( @abcdqfr@lemmy.ml ) 20•2 months ago
Why are we sttill kink shaming?
- pingveno ( @pingveno@lemmy.ml ) English20•2 months ago
Blahaj cannot speak, therefore Blahaj cannot give consent.
- yetAnotherUser ( @yetAnotherUser@discuss.tchncs.de ) 7•2 months ago
You don’t necessarily need speech for consent since non-verbal/mute people exist.
- ElectroLisa ( @ElectroLisa@lemmy.blahaj.zone ) 4•2 months ago
It can’t say no either /s
- flora_explora ( @flora_explora@beehaw.org ) 2•2 months ago
What? That shouldn’t be any basis for consent! Only if someone is able to consent, i.e. emphatically say ‘yes’ (or otherwise agree), should you start thinking about doing anything sexual involving them. If you do anything sexual involving someone who cannot say ‘no’ then that is a sexual violation of them.
- ElectroLisa ( @ElectroLisa@lemmy.blahaj.zone ) 4•2 months ago
This was a satire
- flora_explora ( @flora_explora@beehaw.org ) 1•2 months ago
Hard to spot and people might take it at face value.
- katy ✨ ( @cupcakezealot@lemmy.blahaj.zone ) 14•2 months ago
blahaj.zone
- kingthrillgore ( @KingThrillgore@lemmy.ml ) 4•2 months ago
You don’t need to tape archive it, it’s one thing
- Lena [she/her] ( @Xelnoc@lemmy.blahaj.zone ) 2•2 months ago
Yeah but you can
- No_Support_8363 ( @No_Support_8363@lemm.ee ) 1•20 days ago