
3310 charset and char map



LowCost
25-05-2003, 05:59 PM
Hi!
In a 3310, how can I know which character set is used with a particular language, and how can I get its character map?

Thanks! :)

Low

gkagg
25-05-2003, 10:11 PM
Try Al's PPM maker and look at each language's flag in the corresponding subchunk:

08000000 - GSM alphabet (Normal)
8C000000 - UTF-8
88000000 - Unicode
A8000000 - Compressed Unicode

As for the character map, use the same proggy and you can manipulate the charset easily!
Hope I helped...

Cheers...

LowCost
26-05-2003, 10:42 PM
The COMM subchunk has the flags '00000000'. Which charset is it using? :confused:

Low

gkagg
27-05-2003, 06:43 AM
Originally posted by LowCost
The COMM subchunk has the flags '00000000'. Which charset is it using? :confused:

Low


For the COMM chunk:

00000000 - GSM alphabet
84000000 - UTF-8
80000000 - Unicode
A0000000 - Compressed unicode
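
The two flag lists quoted in this thread can be put into a small lookup, sketched here in Python. The flag values and charset names are copied from the posts; the dictionary names and the helper `charset_for_flags` are made up for illustration and are not part of Al's PPM maker or any Nokia tool.

```python
# Flag values as quoted in this thread. Names of the tables and of the
# helper are hypothetical, chosen only for this sketch.
LANGUAGE_SUBCHUNK_FLAGS = {
    "08000000": "GSM alphabet (Normal)",
    "8C000000": "UTF-8",
    "88000000": "Unicode",
    "A8000000": "Compressed Unicode",
}

COMM_CHUNK_FLAGS = {
    "00000000": "GSM alphabet",
    "84000000": "UTF-8",
    "80000000": "Unicode",
    "A0000000": "Compressed Unicode",
}


def charset_for_flags(flags_hex, is_comm):
    """Return the charset name for a flags field given as 8 hex digits."""
    table = COMM_CHUNK_FLAGS if is_comm else LANGUAGE_SUBCHUNK_FLAGS
    # Any value not listed in the thread is reported as unknown, not guessed.
    return table.get(flags_hex.upper(), "unknown")


print(charset_for_flags("00000000", is_comm=True))   # GSM alphabet
print(charset_for_flags("8C000000", is_comm=False))  # UTF-8
```

In the eight values listed, each language-subchunk flag equals the matching COMM flag with 0x08000000 set. That is only an observation on these values, not a documented rule.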

Cheers...

LowCost
27-05-2003, 12:26 PM
One million thanks!! :)

Low

gkagg
27-05-2003, 02:20 PM
Originally posted by LowCost
One million thanks!! :)

Low

Thanks go to Al!!! Cheers mate wherever you are...

LowCost
27-05-2003, 03:44 PM
Just one last thing...
I've found in the COMM subchunk extended characters like \x88, \x89,... which do not belong to the standard GSM alphabet (which is a 7-bit charmap). Do you know which character map is used for these chars?
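
A quick way to spot such bytes, as a rough Python sketch: since the GSM alphabet is 7-bit, anything above 0x7F cannot be a plain GSM code. The function name `high_bytes` is made up for this example.

```python
def high_bytes(raw):
    """List (offset, value) for every byte that does not fit in 7 bits."""
    return [(i, b) for i, b in enumerate(raw) if b > 0x7F]


# Prints the offset and hex value of each byte above 0x7F.
for offset, value in high_bytes(b"Fran\x88ais"):
    print(offset, hex(value))
```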

Low

gkagg
27-05-2003, 08:37 PM
Be more specific... Where are these bytes located??

Cheers...

LowCost
27-05-2003, 09:11 PM
The first strings in the COMM chunk of a 3310 5.79 C flash I have are:
Czech
Dansk
Deutsch
English
Espaòol
Test:
Fran\x88ais
Greek
Hebrew
Italiano
Net monitor
Nederlands
Norsk

Polish
Portugu\x89s

...so \xF2 (ò) maps to character 'ñ', \x88 is 'ç' and \x89 is character 'é'.
Which charset could it be?!!! :confused:

Low

gkagg
28-05-2003, 06:50 AM
Originally posted by LowCost
The first strings in the COMM chunk of a 3310 5.79 C flash I have are:
...so \xF2 (ò) maps to character 'ñ', \x88 is 'ç' and \x89 is character 'é'.
Which charset could it be?!!! :confused:

Low

Extended Unicode...


Cheers...