ISO 10646-1/Unicode 1.1 ǥ ѱ ȣ Ұ



[漮 "ǻ ѱ̾߱" 21 Դϴ.]

<>

1.  ǥ ںȣ ISO 10646-1 ڵ(Unicode) 1.1Ұ           1
    1.1 ISO 10646-1 1992⿡  ǥ  ȣ Ȯ                                1
    1.2 ISO 10646 BMP(UCS-2) ڵ尡 ϳ                                       1
    1.3 UCS-2  ڸ  Ʈ Ÿ.                                        2
2.  ǥ ѱ ȣ Ұ                                             2
    2.1  ǥ ѱ ȣ迡  ִ ϼ                                           2
        2.1.1  ѱ 6656 Ҹ    ϼ  ִ.         2
        2.1.2 ϼ 6656 Ҹ  ټ   ʴ.               3
        2.1.3 KSC 5601  5657 ϼ 4280 Ҹ 
             ȣ ٲ 10646-1  ִ.                                      4
    2.2  ǥ ѱ ȣ迡  ̸ "ù ѱ ȣ"  ΰ?          4
        2.2.1 ù ѱ ȣ ο ̶  Ͽ                       4
        2.2.2 ù ȣ  ΰ?                                                 5
        2.2.3 ù ѱ ȣ 238 ڷ  ѱ Ӹ ƴ϶ 
             ѱ۵ Ϻϰ Ѵ.                                                   5
        2.2.4 ù ѱ ȣ ϼ  Ʈ  
               Ѳ Ǯ.                                                  6
3.  ǥѱ ȣ ü                                  7

1992 6-7 ISO/IEC JTC1/SC2/WG2 22 ȸǿ ISO 10646-1 ǥ ȣ Ȯ , 1993 5 ǥ μϿ Ͽ.

ISO 10646-1 ִ ǥ ѱۺȣ迡 ϼ Ҿ ù ѱ ȣ谡  ִ. 1992⿡ KSC 5601 ǥ  Ʈ ǥؿ ״  , ѱ Ÿ ־ Ѵٴ ù ȣ迡 ݿǾ ִ. ϼ ̹ 츮 ˰ , ù ȣ ǥؿ  ־, θ ˷ ʰ ִ. ۿ ǥ ȣ 10646-1 ˾ƺ , ISO 10646-1 ִ ǥ ѱ ȣ迡 ؼ .

1. ǥ ںȣ ISO 10646-1 ڵ(Unicode) 1.1Ұ

1.1 ISO 10646-1 1992⿡ ǥ ȣ Ȯ

1992 6-7 ISO/IEC JTC1/SC2/WG2 22 ȸǿ ISO-10646-1( ۿ ǥرںȣ Ǵ 10646-1 ̶ θڴ) ǥ(IS: international standard) ȮԿ ű⿡ ִ ѱ ȣ( ۿ ǥ ϴ ȣ θڴ) ȮǾ. 10646-1 μ⸦ ġ 1993 5 Ǿ. 10646-1 ѱ ϼ ȣ ù ѱ ȣ(̸ Ʈ ٸ) ΰ ִ.

1.2 ISO 10646 BMP(UCS-2) ڵ尡 ϳ

ڵ尡 ISO 10646 ν ȣ Ȯϰ ϸ ISO 10646-1 unicode version 1.1 . ISO 10646/unicode κ ϴ ܱ ȸ α׷ ̹ ȸ ִ.

ISO 10646-1 1980 ߹ݿ Ͽ 10 ɷ ܿ Ϻΰ ϼǾ. 10646-1 ( a, Z) ׹Ʈ Ÿ UCS-4(Universial Multiple-Octet Coded Character Set, 4 ׹Ʈ Ŵ) Ͽ ̷ ǥ Ȯ ϰ ִ ¿. ׷ 1980 ̱ ȸ簡 ߽ Ǿ Ʈ ü ڵ带 ü Ȱ Ͽ. ׷ٰ 1990 ʿ ISO ڵ尡 Ÿ ȣ谡 ϳ յǾ.

ISO-10646-1 ڵ带 ϳ ϴ ǥȭ ⱸ ϴ ׹Ʈ ü UCS-4 ̷ ιƮ ü UCS-2 Ͽ. ISO 10646-1 ڵ尡 Ǿٰ ϴµ ̸ Ȯϰ ڸ ISO 10646-1 UCS-2 ڵ尡 ϳ ̸ ISO 10646-1 UCS-4 쿡 ڵ尡 ؼ UCS-4 ƴ μ ϱ ƴ.

UCS-2 BMP(Basic Multilingual Plane) θ⵵ Ѵ. ISO 10646-1 ǥ "Universal Multiple-Octet Coded Character Set -Part 1: Architecture and Basic Multilingual Plane) Ǿ ־ UCS-2 10646 Ϻκп ʴ´ٴ ѷϰ ִ. ISO 1993 UCS-4 並 ϰ ִµ ̰ ǥ ˼ , ǥ DZ⿡  ɸ ǥ ǰ θ δ.

1.3 UCS-2 ڸ Ʈ Ÿ.

ǥ ȣ UCS-2 ڸ ιƮ Ἥ Ÿ ֿ ڸ Ÿ. ⸦ A 0x0041(⼭ 0x ڰ ´ٴ Ÿ ǥ̴) Ÿµ ̴ Ʈ Ѵ.

[׸1] ISO 10646-1 ƽŰ κ

2. ǥ ѱ ȣ Ұ

ǥ ѱ ȣ ISO 10646-1/Unicode 1.1ȿ ִ ѱ ȣ踦 ۵ ⺻ ϼ ù ȣ ֵ Ǿ ִ. ϼ ù ȣ迡 ؼ ˾ƺ , ǥѱ ȣ踦 ü ߵǾٴ 캸ڴ.

2.1 ǥ ѱ ȣ迡 ִ ϼ

2.1.1 ѱ 6656 Ҹ  ϼ  ִ.

10646-1 ϼδ 6656 Ҹ(, syllable)  ִµ, 6656 Ҹ ѱ̸, ѱ ϼ  ʴ. Ҹ ϼ ȣ(code position:ISO ϴ codeword:Ϲ codepoint:ڵ ) ŸǷ ᱹ ιƮ Ÿµ ϼ  ִ Ҹ 캸 .

  1. KSC 5601 2350 Ҹ ϼ
  2. KSC 5657 1930 Ҹ ϼ
  3. 1992⿡ ǥѱ ȣ迡  ǥؿ 𿡵 2376 Ҹ ϼ

3) 2376 Ҹ , ù Ҹ ߱ ؼ ߱ û ̸ 2370 Ҹ ѱۿ 11172 Ҹ  KSC 5601̳ 5657 ʴ Ҹμ ټ ó 2370 Ҹ̴. DIS 10646-1.2 KSC 5657 ִ ѱ 1673 Ҹ ־ 1992 6-7 ־ JTC1/SC2/WG2 22 ȸǿ ѱ 1673 Ҹ ѱ 2376 Ҹ ä.

ǥ 1) "Hangul" 2) "Hangul Supplementary-A" 3) "Hangul Supplementary-B" Ǿ θ ֵ ϱ ؼ ̱ۿ 1)2)3) ѱ ϼ ù° , ° , °  θڴ. ѱۿ 11,172 Ҹ  6656 Ҹ ϼ  Ƿ ϼ  4516 Ҹ ڿ ڼ ù ȣ Ÿ Ѵ. ѱ ϼ Ǿ Ƿ ѱ Ҹ ù ȣ Ÿ Ѵ.

[׸ 2] ISO 10646-KSC 5601 ϴ κ

[׸ 3] ISO 10646-KSC 0000 κ

2.1.2 ϼ 6656 Ҹ ټ  ʴ.

ϼ 6656 Ҹ ؼ Ѱ ˰ Ѿ ϼ ù°, °, °( û Ҹ  Ӹ ƴ϶ Ҹ ȿ Ƿ Ҹ )  ȿ Ҹ ̿ Ǿ ʴ. ̸ ڼ 캸

  1. ù°  (ISO 10646-KSC 5601) ִ 2350 Ҹ ȣ 0x3400 0x3d2d̸ ű⿡ ִ Ҹ (0x3400) (0x3401) (0x3402) ... (0x3d2d)̰
  2. °  (ISO 10646-KSC 5657) ִ 1930 Ҹ ȣ 0x3d2e 0x44b7̸ ű⿡ ִ Ҹ A(0x3d2e) E(0x3d2f) I N X... C J L O(0x44b7)̰
  3. °  ִ 2376 Ҹ ȣ 0x44b8 4dff ̸ ű⿡ ִ Ҹ

44b8 t44b9 44ba 44bb l44bc
44bd F44be C44bf D44c0 G44c1
44c2 I44c3 J44c4 K44c5 L44c6
...
Z4df7 a4df8 b4df9 c4dfa e4dfb
f4dfc g4dfd h4dfe i4dff ̴.

տ ù°  ȿ ִ 2350 Ҹ Ǿ °  ȿ ִ 1930 Ҹ ټ Ǿ ְ ̴ °  ù Ҹ ̴. ׷  ü Ǿ ʴٴ ִ.

2.1.3 KSC 5601 5657 ϼ 4280 Ҹ ȣ ٲ 10646-1  ִ.

KSC 5601 5657 ϼ 4280(2350+1930) Ҹ 10646-1 ٰ иϰ ˰ Ѿ ϼ Ҹ KSC 5601 5657 ȣ ״ ä 10646-1  ƴϸ ȣ ٲپ ٴ ̴. ⸦ Ҹ "" KSC 5601 16 1 (̸ 16 Ÿ 0x3021) msb 1 쿡 0xb0a1(=0x3021+0x8080) Ÿµ ؼ 10646-1 ִ ϼ Ҹ "" ȣ 0x3400̴.

ιƮ 10646-1 ״  Ʈ Ÿ ִ 11,172 Ҹ ǻ ù ȣ谡 Ÿ ֱ  Ʈ ȣ ٲپ 10646-1 ٰ 𸥴.

2.2 ǥ ѱ ȣ迡 ̸ "ù ѱ ȣ"  ΰ?

2.2.1 ù ѱ ȣ ο ̶  Ͽ

۾̴ 10646-1 238 ڸ ù ȣ θ 1ص ο ̶ ҷ. 10646-1 ü ڸ  ȣ迡 ̸ . 츮 ȣ迡 ̸ θ ִٰ .

ο ̶ ҷ Ʈ İ ٸ ׷ ڸ ؼ Ҹ Ÿٴ Ʈ Ƿ Ϲ ֵ ϱ ؼ̴.

Ϲ ֵ ϱ ؼ ο ̶ θ ó ̸ ؼ ٴ ϴٰ ߴ. "ο"̶ ʾƼ ο ö ̸ ϰ . ο ̶ ӽ÷  ùҸ-Ҹ-Ҹ ںȣȭ (syllable-initial-peak-final encoding approcah) ѱ ȣ θ ٶϴٰ . ̸ ù ѱ ȣ ٿ θ .

ѱ ȣ迡  캸 ͵ ѱ  µ ̴. ϼ Ҹ ϳġ ؼ ȣ ֱ syllable-encoding approcah(Ҹ ȣȭ ) ϰ, ù ȣ ڸ ϳġؼ ȣ ֱ character-encoding approcah( ȣȭ )̶ θ. Ʈ ̸ ̱Ⱑ  3x5 code θ⵵ Ѵ. ټ Ʈ  ٰ ؼ ׷ θµ ιƮ Ÿ ̸̶ ִ.

[׸ 4] ISO 10646-1 ù ѱ ȣ κ(1)(2)

2.2.2 ù ȣ  ΰ?

Ŀ ؼ 1988⿡ ó ִµ ϼó Ҹ(syllable:, , , ) ϳġ ؼ ȣȭ (letter or character:ùҸ : ... ): Ҹ ( ..) Ҹ Ǵ ħ( ) ϳġ ؼ ȣȭ Ҹ ڸ ؼ Ÿ ̴. Ҹ Ǵ ȣ Ÿ.

ù ѱ ȣ Ʈ ũ ٸ ϳ ִµ Ʈ Ҹ Ʈ ̰ Ǿ ù ѱ ȣ迡 ׷ ٴ ̴. ù ѱ ȣ 8Ʈ ȣ, 16Ʈ ȣ, 32Ʈ ȣ  ƹ ִ. ü ISO 8859 8Ʈ ȣ Ʋ ȿ ù ѱ ȣ ASCII ִ Ʈ ǥ ȣ踦 ִٴ ۾̰ ִ. 10646-1 ιƮ ȣ UCS-2 ׹Ʈ ȣ UCS-4 ƹ ù ѱ ȣ踦 ִ.

ù ѱ ȣ 8Ʈ ȣ迡 ѱڴ ѹƮ ѼҸ Ǵ Ʈ Ÿ 16Ʈ ȣ迡 ڴ Ʈ ѼҸ Ǵ Ʈ Ÿ ִ 뼺 . ̷ 뼺 16Ʈ ǥ ѱ ȣ迡 ù ѱ ȣԸ ä ̴. Ʈ ȣ UCS-4 ´ٰ ϴ ѱڴ Ʈ ѼҸ Ǵ Ʈ Ÿ ִ.

Ʈ ѱ ȣδ ó ˰ ιƮ ̰ Ʈ Ǿ Ѵٴ 2 ϼ̶ ƾ Ѵ. Ʈ 2 ϼ̶ ִ Ѱ ιƮ ѱ ٴ ̴. ѱ Ư ´ ѱ ȣ, ѱ ΰ? ιƮ ڰ ӵ ѱ ȭ Ǿ ʾҴ ѱ ⿩ ؾ ϰ Ʈ ù ѱ ȣ ٸ ̶ .

ù ѱ ȣ迡 ؼ 1988⿡[KimK 88] [KimK 90a, 90b, 92c] ƴµ ̴ ַ ܱ м ǥϿ. 츮 ȿ 1991⿡ 뱳[ 91] Ϻ[Ϻ91] Ͱ ѱ ȣ踦 ϱ⵵ Ͽ.

10646-1 ù ѱ ȣ迡 238 ѱ (Ҹ ƴ) 2 ä (ùҸ ä,  Ҹ ä) ȣ 0x1100 0x11f9̴.

Ѱ ˰ Ѿ KSC 5601 ִ 51 ( ) ׾ ѱ ȣ ȣȯ 10646-1 0x31a1 0x31fe ִ , ù ѱ ȣ ƴϴ.

ù ѱ ȣ 240 ѱ ڴ 1992 22 ȸǶ 10646-1 ó µ .

ùҸ 90 0x1100-1159
ڸ(Ȯ) 5ڸ 0x115a-115e
ùҸ ä 0x115f
Ҹ ä 0x1160
Ҹ 66 0x1161-11a2
ڸ(Ȯ)5ڸ 0x11a3-11a7
Ҹ 82 0x11a-11f9
ڸ(Ȯ) 6ڸ 0x11fa-11ff

ù ѱ ȣ迡 ѱ ڸ Ʈ Ÿ. ùҸ-Ҹ ڷ ̷ Ҹ. ٽ ؼ Ҹ ڰ Ҹ Ʈ Ÿ ùҸ-Ҹ-Ҹ ڷ ̷ Ҹ, ٽ ؼ Ҹ (ħ) ִ Ҹ Ʈ Ÿ. ̷ μ ѱ 11,172 Ҹ Ÿ ִ.

ù ѱ ȣ 238 ڷδ ѱ Ӹ ƴ϶ ѱ۵ Ÿ ֵ Ǿ ־ ѱ ϴ ڴ ̰ ѱ б л̳ 鵵 Ǿ ִ. ⿡ ( ) Ƽ Ѻȣ ־.

ѱ 쿡 ̹ ˰ ִ ڿ ȣ ־ ãƳ 𸣴 ڸ Ÿ ִ ־ ξ.

ѱ Ÿ ؼ Ҹ ʿ پ Ÿ ־ ϴµ Ȭ  ִ.

Ȭ(.) 0x303e: hagul single dot tone mark (:) 0x303f: hangul double dot tone mark

ֱ ̸ ġ ۾ ISO ϰ ִ.

2.2.4 ù ѱ ȣ ϼ Ʈ Ѳ Ǯ.

̹ ٿ ǥ ѱ ȣ ϼ  Ҹ(c氢) Ÿ ٴ Ǯ ιƮ ѱ ٷ ٴ Ǫ ǻ ѱ ϰ Ÿ ִٴ ü ̶ µ ع ο ִ.

3. ǥѱ ȣ ü

ǥ ѱ ȣ ڼ κп  ü Ƽ ߵ .

ù°  Ҹ Ÿ ־ ѱ ȣ谡 ѱ ʰ Ǿ ѱ ߴϰ 츮ȭ ߴ ְ Ǿ. ѱ Ư ´ ο ִ.

° ϼ ιƮ ѱ ȣ ο ̶ 3 ͼ ϼ Ʈ Ѳ Ǯμ ǻ ̳ ٸ ѱ ȣ ο ϰ Ǿ. 츮 ؾ ߿ ǥ ѱ ȣ ذ Ͽ ǥ ġ ǥ ѱ ȣ踦 ִ ϴ ̴.

츮 238 ù ȣ ϴ ϴ α׷ ν ѱ Ư ´ ѱ ȣ踦 ۵ ← ̴.