This note gives a brief explanation of Hangul in Unicode/ISO 10646-BMP,
a topic that has recently drawn attention. It is divided into three parts:

A. Hangul Code Points
B. Relationship between Unicode and ISO 10646-BMP
C. Definition of ISO 10646 levels 1-3

I hope it is helpful.

Lee Junyeop (이준엽)
at Courant Institute in New York Univ.


A. Hangul Code Points in ISO 10646-BMP/Unicode 1.1
-----------------------------------------------------------------

 . BMP is the 2-byte character set in ISO 10646, and the character set in
   Unicode 1.1 has been changed to be identical with ISO 10646-BMP.
 . Spacing and nonspacing consonants and vowels have been deleted in the
   final IS.

1. We have the following code points in BMP:

   6656 precomposed chars : 3400 - 4dff (26 pages * 256 chars/page)

2. The code points of the 240 Hangul jamos are as follows, according to N868:

   choseong (syllable-initial) : 1100 - 1159 (90 chars)
   choseong filler             : 115f
   jungseong filler            : 1160
   jungseong (syllable-peak)   : 1161 - 11a2 (66 chars)
   jongseong (syllable-final)  : 11a8 - 11fa (82 chars)
                                        ^^^^ cf) should be 11f9

From: kskim@hyowon.pusan.ac.kr (kyongsok kim)


B. Relationship between Unicode and ISO 10646-BMP
------------------------------

Unicode 1.1 is an implementation of level 2 (new level 3).

From: "K. Smith-Yoshimura"


C. Definition of ISO 10646 levels 1-3
----------------------------

From Jang Hyeonggyu's (장형규) post:

C:Uhyeong> The points I am not sure about are as follows.
C:Uhyeong>
C:Uhyeong> Are Unicode Level 1 and Unicode Level 2 exclusive?
C:Uhyeong> Is the union of Unicode Level 1 and Unicode Level 2 equal to
C:Uhyeong> Unicode Level 3?
C:
C:Junyeop> Unicode has three levels. At level 1, the 6656 precomposed
C:Junyeop> syllables and the 240 Hangul jamo can be used. At level 2,
C:Junyeop> Hangul that cannot be represented by the 6656 precomposed
C:Junyeop> syllables is composed from the 240 jamo. At level 3, Hangul
C:Junyeop> can be written either as precomposed syllables or as
C:Junyeop> combinations of the 240 jamo.
C:
C:> Date: Wed, 19 Aug 92 15:19:38 PDT
C:> To: unicode@Sun.COM
C:> From: "K. Smith-Yoshimura"
C:> Subject: Hangul support
C:>
C:> Combining jamo are allowed in the new implementation level "1.5"
C:> adopted (as requested by the UK), as well as level 2 (unrestricted
C:> use of combining characters.) Implementation level 1 is unchanged
C:> (precomposed only). The new level 1.5 (which will be the new level
C:> 2, with current level 2 becoming level 3) creates a separate level
C:> for scripts where no precomposed characters normally exist, and,
        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
C:> therefore, combining marks are mandatory, e.g., Indic scripts, Thai,
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
C:> Arabic, Hebrew, etc. Korea specifically requested that combining
C:> jamos be included in this new level 2. Some of this was discussed
C:> on the ISO10646@JHUVM listserv.
C:
C:Junyeop, I am asking because I have not been following international
C:developments closely. The old letter from Smith-Yoshimura above says
C:that level 2 has no precomposed characters at all, but it seems that
C:has changed now? I would appreciate it if you could post about recent
C:developments to hangul@cair and sg-inet.

Just before the final IS 10646 came out at the Seoul Meeting in June 1992,
there was an informal proposal to delete all precomposed (wansung) Hangul
from level 1 and to write Hangul only with combining jamos at level 2.
However, this proposal never became an official ISO/Unicode spec.

As shown above, jamo combination at level 2 of the final IS 10646 is to be
used only where no precomposed character exists (scripts where no
precomposed characters normally exist). In other words, every code point is
unique. Writing 'Kak' as 'K','a','k' is not allowed at level 2. However,
when the precomposed form is not in BMP, as with 'KK','yu','lm', the
syllable is represented as 3 code points * 2 bytes/code point = 6 bytes.

A more precise explanation of the levels can be found in the following
discussion.

-------------------------------------------------------------
From: SCHEIN@TOROLAB5.VNET.IBM.COM
Date: Wed, 2 Sep 92 17:22:25 EDT

S:>- Does level 2 only allow combined characters ?
S:>  For example, can you use "A with Diaeresis (code point C4)" in level 2 ?

S:Level 2 includes all characters from Level 1. "A DIAERESIS" is in level2.
  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

ISO 10646 levels 1, 2, 3 form an upward compatible scheme.

S:>- If you mix precomposed characters and combined characters,
S:>  is it always considered level 3 ?

The answer depends on whether or not the combined characters belong to the
precomposed set in BMP. If the combined characters are not in BMP as
precomposed characters, the answer is NO. However,

S:Practically YES.
S:Level 2 is simply defined by a list of combining characters which
S:are allowed in this level. But the combining characters in this list
S:have been chosen from the scripts where precomposed are not usually
S:used, for example, Thai and Indic scripts. But theoretically, if
S:tomorrow some precomposed Thai characters will be added, Thai combining
S:characters will still remain in level 2. Essentially the same effect
S:could be achieved by using subsets.

S:>- If someone chooses to use only combined characters in korean
S:>  implementation, do you say "level 2" or do you still say "level 3"
S:>  only because there are code points for some korean precomposed
S:>  characters in ISO-10646?
S:
S:Yes, it is still level 3. This level 2 & 3 scheme is far from perfect,
S:and, personally, I would rather do without.
S:
S:>Note that not all Korean syllables can be represented by precomposed
S:>characters.
S:
S:Yes, I know. The same is true for Latin script. This is why we have
S:level 3.
S:
S:-Isai

Finally, to answer Uhyeong's question: levels 1, 2, 3 form an upward
compatible scheme. However, because their usage and purpose differ, my
guess is that level 3 will not be used as an upgraded version of level 2.
Rather than a scheme that mixes precomposed characters and combining jamos
at random, level 3 will probably be a scheme that lets the user choose
either the level 2 scheme or the combination method.
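
The code point ranges in section A and the 6-byte arithmetic for
'KK','yu','lm' can be checked with a short Python sketch. It is only an
illustration under stated assumptions: the table holds the Unicode 1.1 /
ISO 10646-BMP ranges quoted in this note (with the 11f9 correction for
jongseong), not the layout of any other version of the standard. The names
RANGES, size, classify and ucs2_bytes are hypothetical helpers, and the
three jamo code points picked for 'KK', 'yu', 'lm' are my own reading of
the jamo block rather than something stated in the discussion.

```python
import struct

# Ranges as quoted in section A of this note (inclusive end points).
# Assumption: jongseong ends at 0x11F9, following the "should be 11f9" remark.
RANGES = {
    "choseong": (0x1100, 0x1159),
    "choseong filler": (0x115F, 0x115F),
    "jungseong filler": (0x1160, 0x1160),
    "jungseong": (0x1161, 0x11A2),
    "jongseong": (0x11A8, 0x11F9),
    "precomposed": (0x3400, 0x4DFF),
}


def size(name):
    """Number of code points in the named range."""
    lo, hi = RANGES[name]
    return hi - lo + 1


def classify(cp):
    """Return the name of the range containing cp, or None if it is in none."""
    for name, (lo, hi) in RANGES.items():
        if lo <= cp <= hi:
            return name
    return None


def ucs2_bytes(code_points):
    """Encode BMP code points as 2 bytes each (big-endian)."""
    return b"".join(struct.pack(">H", cp) for cp in code_points)


# Total number of jamo code points, fillers included.
jamo_total = sum(size(n) for n in RANGES if n != "precomposed")

# Illustrative jamo sequence for 'KK', 'yu', 'lm' (assumed code points).
kk_yu_lm = [0x1101, 0x1172, 0x11B1]

print(jamo_total, size("precomposed"))
print([classify(cp) for cp in kk_yu_lm], len(ucs2_bytes(kk_yu_lm)))
```

With the corrected jongseong end point the jamo ranges add up to the 240
stated in section A; using 11fa instead would give 241.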