file detects utf32-be wrongly

Bug #285309 reported by Michael Tsang
4
Affects Status Importance Assigned to Milestone
file (Debian)
Fix Released
Unknown
file (Ubuntu)
Fix Released
Undecided
Unassigned

Bug Description

michael@ubuntu:~/src/unicode$ file utf32-be.txt
utf32-be.txt: data

Revision history for this message
Michael Tsang (michaeldadmum-deactivatedaccount) wrote :
Revision history for this message
Adam Buchbinder (adam-buchbinder) wrote :

Confirmed in file 4.24-4. According to the Unicode FAQ[1], the Byte Order Mark is serialized as 00 00 FE FF in UTF-32. The magic in file 4.24-4 is looking for FE FF 00 00, which is why it's not detecting this file. A patch will be forthcoming.

[1] http://unicode.org/faq/utf_bom.html#bom4

Changed in file:
status: New → Confirmed
Revision history for this message
Adam Buchbinder (adam-buchbinder) wrote :

The attached debdiff adds a patch to debian/patches which fixes UTF-32BE detection. There's still a problem in Debian; I'll be opening a bug report upstream.

Changed in file:
status: Unknown → New
Revision history for this message
Adam Buchbinder (adam-buchbinder) wrote :

This has been fixed since the release of 5.01-1 by Debian. I'm not sure why the Debian task is still showing as open; I'll try to get that to refresh.

Changed in file (Ubuntu):
status: Confirmed → Fix Released
Changed in file (Debian):
status: New → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Bug attachments

Remote bug watches

Bug watches keep track of this bug in other bug trackers.