• I could split every two characters which would be a bit more efficient, but it's just added complexity and it sounds like in some cases it might mess things up?

    what do you guys mean by every two characters?

    a chinese word = a chinese character. in the past, , it take up double the byte of that of an ASCII character.

    for the number of bytes you store "AB",
    you can only store 1 chinese "中".


  • While in Classical Chinese it was certainly the case that one character most often wrote one word, in modern Mandarin, it is more often the case that a word consists of two syllables, represented by two characters. This change can actually start to be seen as early as Xunzi, who uses a noticeably larger amount of disyllabic words than Mencius. Let's look again at the example sentence from the first article from the Universal Declaration of Human Rights, this time adding spaces between words*:

    人人 生 而 自由﹐在 尊嚴 和 權利 上 一律 平等。他們 賦有 理性 和 良心﹐並 應 以 兄弟 關係 的 精神 互相 對待。

    Sum of monosyllabic words (one graph, one word): 10
    Sum of disyllabic words (two graphs, one word): 15

    *Note that for the purposes of this conversation, a word should be understood as something one would encounter in daily life and also something that one would find in a dictionary of modern Mandarin.

  • i am not sure you natively speak chinese or not.
    for me, my mother tongue is chinese's cantonese.
    for your sentence:
    人人 生 而 自由﹐在 尊嚴 和 權利 上 一律 平等。他們 賦有 理性 和 良心﹐並 應 以 兄弟 關係 的 精神 互相 對待。

    you will be correct that some are "two words" , a short phase, like 人人, 自由 , 尊嚴, 權利, 一律, 平等。他們 , 理性 , 良心﹐, 兄弟 ,關係 , 精神, 互相, 對待。

    but some are "single words" like 在, 和 , 和 ,並 , 的.

    for a bangle.js v2, with that 512KB ram, there wont be a dictionary, there wont be an AI, i am pretty sure that is NO WAY for the bangle to understand the chinese.

    you may think this looks better, by adding empty spaces for easier reading:
    人人 生 而 自由﹐在 尊嚴 和 權利 上 一律 平等。他們 賦有 理性 和 良心﹐並 應 以 兄弟 關係 的 精神 互相 對待。

    but i am 100.000000% sure that, the lines looks 100.000000% same FUNCTIONALLY without the spaces. and the bangle watch dont have the luxury to add them.

    just A. wrap the msg untouched into new lines will be ok.
    if possible, then B. make news lines at puntuations.

    pls only do C. to introduce empty spaces only if you got AI, you got a dictionary to check for.

    ps: the author of the message will already consider whether the message is understandable, so you really DONT need to introduce empty spaces. this is a simple logic.

    人人生而自由﹐在尊嚴和權利上一律平等。他們賦有理性和良心﹐並應以兄弟關係 的精神互相對待。
    this one without empty spaces indeep is more formal than the above with the introduced spaces above.

    as seen in #1, the 2 messages i received dont have the empty spaces.
    those empty spaces may help some people, but formally not needed.
    ps: when i reply chinese in youtube's comment, i'll use those empty spaces, so that those keywords are separated out, and looks more like "highlighted", to emphasis what i want to express.


Avatar for houshou_m @houshou_m started