! Please note that this is a snapshot of our old Bugzilla server, which is read only since May 29, 2020. Please go to gitlab.xfce.org for our new server !
Bad file/dir sorting when combining CJK characters with non-CJK ones in filen...
Status:
RESOLVED: WORKSFORME

Comments

Description Martin Sourada 2013-08-07 23:35:17 CEST
Description of problem:
Folders/files are sorted wrong when e.g. Japanese is combined with non-CJK characters.

How reproducible:
Always

Steps to Reproduce:
1. make an empty test directory
2. cd to the test dir
3. mkdir たまゆら らき☆すた たまこまーけっと たまこ たまゆら[BD] らき☆すた[BD] たまこまーけっと[BD] らきすた たまゆらBD
4. view the test dir in Thunar

Actual results:
Sort order is:
たまこ
たまゆら
らきすた
らき☆すた
たまこまーけっと
たまゆらBD
たまゆら[BD]
らき☆すた[BD]
たまこまーけっと[BD]

Expected results:
Sort order is:
たまこ 
たまこまーけっと 
たまこまーけっと[BD]
たまゆら
たまゆらBD
たまゆら[BD]
らき☆すた
らき☆すた[BD]
らきすた

Additional info:
The correct order is displayed e.g. by mc. I originally filled this at Fedora Bugzilla as https://bugzilla.redhat.com/show_bug.cgi?id=994396 but it's likely this is an upstream problem. It's possible it breaks for other non-latin scripts as well, but I didn't test that. In Japanese it's rather common to combine kana/kanji with non-CJK characters and having files apparently unsorted makes finding them really tedious.
Comment 1 Andre Miranda editbugs 2020-05-21 02:16:08 CEST
As of Thunar 1.8.14, the actual results are:

LANG=en_US.UTF-8:
たまこ
たまこまーけっと
たまこまーけっと[BD]
たまゆら
たまゆら[BD]
たまゆらBD
らき☆すた
らきすた
らき☆すた[BD]

LANG=ja_JP.UTF-8:
たまこ
たまこまーけっと
たまこまーけっと[BD]
たまゆら
たまゆら[BD]
たまゆらBD
らき☆すた
らき☆すた[BD]
らきすた

I think the results are satisfactory, if you disagree please reopen.

Bug #10287

Reported by:
Martin Sourada
Reported on: 2013-08-07
Last modified on: 2020-05-21

People

Assignee:
Jannis Pohlmann
CC List:
3 users

Version

Attachments

Additional information