• Am I not already on the "edge" version of 2v19?

    No - that's just the last release. Go to the firmware update app and use 'cutting edge' - it should be something like 2v19.60 or later

    How many Chinese characters?

    Well it's heartening to see that maybe 8000 chars would do it - now can you find out what their character codes are? :)

    Will it use up a lot of resources?

    Adding all 20,000 chars uses about 850kB of flash. It takes a while to upload, but the Bangle has 8MB and usually uses maybe only 2% of that - so there is plenty to go around.
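
The figures above can be turned into a rough budget for smaller character sets. This is only a sketch: it assumes flash use scales linearly with the number of glyphs (not stated in the thread) and takes 1 kB as 1000 bytes. The function names are made up for illustration.

```python
# Rough flash budget for a CJK font, using the figures quoted above.
# Assumption: flash use grows linearly with the number of glyphs.

FULL_SET_CHARS = 20_000          # "all 20,000 chars"
FULL_SET_BYTES = 850 * 1000      # "about 850kB", taking 1 kB = 1000 bytes
FLASH_BYTES = 8 * 1024 * 1024    # "the Bangle has 8MB"


def estimate_font_bytes(num_chars):
    """Linear estimate of the flash used by a font with num_chars glyphs."""
    bytes_per_char = FULL_SET_BYTES / FULL_SET_CHARS
    return num_chars * bytes_per_char


def flash_share(num_bytes):
    """Fraction of the total flash that num_bytes would occupy."""
    return num_bytes / FLASH_BYTES


for n in (1_000, 8_000, 20_000):
    size = estimate_font_bytes(n)
    print(f"{n:>6} chars: ~{size / 1000:.0f} kB, {flash_share(size):.1%} of flash")
```

Under that assumption an 8000-character subset would need roughly 340 kB, or about 4% of the 8MB.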

    What if users of other languages also ...

    Well, most of the UN could be handled by under 1000 chars which can easily be part of the main font. It's nothing compared to 20,000.

    My plan was to have separate font apps for different locales, but it seems that splitting the characters that way isn't actually very straightforward.

    If someone had a list of characters for each locale it'd make it really easy.
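
One possible way to get a per-locale character list is to ask a legacy encoding which code points it can represent. The sketch below assumes that GB2312 is a fair proxy for everyday Simplified Chinese and Big5 for everyday Traditional Chinese. That is an assumption, not something established in this thread, and the helper name is hypothetical.

```python
# Sketch: derive per-locale character lists from legacy encodings.
# Assumption: GB2312 approximates everyday Simplified Chinese and
# Big5 approximates everyday Traditional Chinese.

CJK_START, CJK_END = 0x4E00, 0x9FFF  # main CJK Unified Ideographs block


def chars_in_encoding(encoding):
    """Return the code points in the main CJK block that `encoding` can represent."""
    codes = []
    for cp in range(CJK_START, CJK_END + 1):
        try:
            chr(cp).encode(encoding)
        except UnicodeEncodeError:
            continue  # this encoding has no mapping for the character
        codes.append(cp)
    return codes


simplified = chars_in_encoding("gb2312")
traditional = chars_in_encoding("big5")
combined = sorted(set(simplified) | set(traditional))

print(len(simplified), len(traditional), len(combined))
print([f"U+{cp:04X}" for cp in combined[:5]])
```

The resulting lists of code points could then be used to decide which glyphs go into each font app. Whether these two encodings match what users actually need would still have to be checked.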

  • Hi,

    1. I just re-installed the firmware. The commit is still the same as above, but this time I am pretty sure I clicked 2v19.62-something.

    2. Negative, it still doesn't wrap at punctuation. By the way, I found that even in the message UI, the same SMS is displayed differently in the list and after it is "clicked into". By chance I can read the code in one of the two views, but since the rest of the text is off the screen, this is still not adequate.

    3. I just asked GPT-4; please fact-check if possible. By the way, GPT-4 is "sold out" right now: too many people want to buy their plans.

    GPT-4 result:
    You:

    in unicode, there are just about 10000 daily use traditional chinese characters and some 10000 daily used simplified chinese characters. do you know their codes? the code ranges?


    ChatGPT:

    In Unicode, Traditional and Simplified Chinese characters are not separated into distinct blocks based solely on their writing system. Instead, they are mostly included together in several blocks, primarily in the "CJK Unified Ideographs" block. The range of these characters in Unicode is quite extensive, encompassing multiple blocks. Here's an overview:

    CJK Unified Ideographs: This is the largest block of Chinese characters. It includes both Traditional and Simplified forms. The range is U+4E00 to U+9FFF. This block contains the majority of characters used in daily Chinese text.

    CJK Unified Ideographs Extension A: This block extends the basic set and includes more obscure and less frequently used characters. The range is U+3400 to U+4DBF.

    CJK Unified Ideographs Extension B to Extension G: These blocks contain even more rarely used, historic, or dialect-specific characters. The ranges are:

    Extension B: U+20000 to U+2A6DF
    Extension C: U+2A700 to U+2B73F
    Extension D: U+2B740 to U+2B81F
    Extension E: U+2B820 to U+2CEAF
    Extension F: U+2CEB0 to U+2EBEF
    Extension G: U+30000 to U+3134F

    CJK Compatibility Ideographs: This block includes characters that are variants of those in the main CJK block and are used for compatibility purposes with other character sets. The range is U+F900 to U+FAFF.

    CJK Compatibility Ideographs Supplement: This is an extension of the compatibility ideographs. The range is U+2F800 to U+2FA1F.

    These ranges cover both Traditional and Simplified characters, and the distinction between the two is often based on context and regional usage rather than Unicode encoding. Unicode primarily focuses on the unification of characters that have the same etymological root, regardless of regional script variations.


    You:

    for the above ranges, can you also tell how many characters are within them? thanks


    ChatGPT:

    Here is the number of characters in each Unicode block relevant to Chinese characters:

    CJK Unified Ideographs: 20,992 characters
    CJK Unified Ideographs Extension A: 6,592 characters
    CJK Unified Ideographs Extension B: 42,720 characters
    CJK Unified Ideographs Extension C: 4,160 characters
    CJK Unified Ideographs Extension D: 224 characters
    CJK Unified Ideographs Extension E: 5,776 characters
    CJK Unified Ideographs Extension F: 7,488 characters
    CJK Unified Ideographs Extension G: 4,944 characters
    CJK Compatibility Ideographs: 512 characters
    CJK Compatibility Ideographs Supplement: 544 characters

    These counts reflect the number of individual characters encoded in each respective block.
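
As a quick fact check, the counts above can be recomputed from the range endpoints. The sketch below only does arithmetic on the ranges quoted in the answer. It counts code points in each range, and a block can contain unassigned code points, so the number of characters actually defined may be somewhat lower for some blocks. The names `BLOCKS`, `block_size` and `block_of` are made up for illustration.

```python
# Sizes of the Unicode ranges listed above, computed from their endpoints.
# This counts code points per range, which is an upper bound on the number
# of characters actually assigned in each block.

BLOCKS = {
    "CJK Unified Ideographs": (0x4E00, 0x9FFF),
    "Extension A": (0x3400, 0x4DBF),
    "Extension B": (0x20000, 0x2A6DF),
    "Extension C": (0x2A700, 0x2B73F),
    "Extension D": (0x2B740, 0x2B81F),
    "Extension E": (0x2B820, 0x2CEAF),
    "Extension F": (0x2CEB0, 0x2EBEF),
    "Extension G": (0x30000, 0x3134F),
    "CJK Compatibility Ideographs": (0xF900, 0xFAFF),
    "CJK Compatibility Ideographs Supplement": (0x2F800, 0x2FA1F),
}


def block_size(start, end):
    """Number of code points in the inclusive range start..end."""
    return end - start + 1


def block_of(char):
    """Return the name of the listed block containing char, or None."""
    cp = ord(char)
    for name, (start, end) in BLOCKS.items():
        if start <= cp <= end:
            return name
    return None


for name, (start, end) in BLOCKS.items():
    print(f"{name}: U+{start:04X} to U+{end:04X}, {block_size(start, end):,} code points")

print("Total:", sum(block_size(s, e) for s, e in BLOCKS.values()))
print(block_of("中"))
```

The computed sizes agree with the numbers in the answer, which suggests those numbers are range sizes rather than counts of everyday characters.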


Thread started by @ccchan