6.0.0-alpha12
6/10/25

[#3746] illegal utf-8 sequence when using Finnish language
Summary illegal utf-8 sequence when using Finnish language
Queue IMP
Queue Version 4.1
Type Bug
State Resolved
Priority 1. Low
Owners
Requester tpeland (at) tkukoulu (dot) fi
Created 04/08/2006 (7003 days ago)
Due
Updated 05/09/2006 (6972 days ago)
Assigned 04/09/2006 (7002 days ago)
Resolved 05/09/2006 (6972 days ago)
Github Issue Link
Github Pull Request
Milestone
Patch No

History
05/09/2006 12:46:50 PM Jan Schneider Comment #8
State ⇒ Resolved
Reply to this comment
That makes sense. Thanks for the update.
05/09/2006 05:55:54 AM tpeland (at) tkukoulu (dot) fi Comment #7 Reply to this comment
So, does anybody have an idea what or if we could do something about
this? Or do we have to accept that Finnish locales seem to be broken
in libc?
After upgrading php from 4.x to 5.1.2 the problem disappeaded. As php 
4.x installations are slowly fading so maybe we can ignore the 
problem. This also suggest the problem isn't in libc but in php.


05/08/2006 03:37:37 PM Jan Schneider Comment #6 Reply to this comment
So, does anybody have an idea what or if we could do something about 
this? Or do we have to accept that Finnish locales seem to be broken 
in libc?
04/11/2006 09:24:49 AM tpeland (at) tkukoulu (dot) fi Comment #5 Reply to this comment
Using the examples the resulting separator is (in hexcode):

locale fi_FI: 0xa0

locale fi_FI.UTF-8: 0xc2 0xa0



The character is non-breakable space.




04/11/2006 08:13:53 AM Jan Schneider Comment #4 Reply to this comment
Try this instead (in a console):



<?php

setlocale(LC_ALL, 'fi_FI.UTF-8');

$localeinfo = localeconv();

var_dump($localeinfo['thousands_sep']);



and add header('text/plain; charset="utf-8"') if viewing in a browser.
04/11/2006 03:41:03 AM Michael Slusarz Comment #3 Reply to this comment
Weird - I can't reproduce this on my system.  Running this PHP script:



<?php

setlocale(LC_ALL, 'fi_FI');

$localeinfo = localeconv();

var_dump($localeinfo['thousands_sep']);



Gives me:



string(1) " "
04/09/2006 10:37:37 AM Jan Schneider Comment #2
State ⇒ Feedback
Reply to this comment
It looks like the system's localeconv is returning separators in the 
wrong charset. I can reproduce it, but there is probably nothing we 
can do about this, because we already explicitely request the 
separator characters in the interface's charset.

I was not able to determine which charset the separator has in fact, 
definitely not iso-8859-1.
04/08/2006 05:59:34 PM tpeland (at) tkukoulu (dot) fi Comment #1
Priority ⇒ 1. Low
Type ⇒ Bug
Summary ⇒ illegal utf-8 sequence when using Finnish language
Queue ⇒ IMP
State ⇒ Unconfirmed
Reply to this comment
Start a new message and add an attachment that is at least 1 MB in 
size. The English interface shows the size of the file correctly but 
in Finnish interface the character between 1 and 024 is a non-utf-8 
character.



English:

1. 1M  (application/x-executable-file)   Size: 1,024.00 KB   [Preview]

Finnish (Suomi):

1. 1M  (application/x-executable-file)   Koko: 1Â024,00 KB   [Esikatselu]





I think it is meant to be 1'024,00


Saved Queries