UTF-8 (c4, 84) Ą name: LATIN CAPITAL LETTER A WITH OGONEK | |
old name: | |
Adobe glyph name: Aogonek | |
mnemonic name(s): <A;> | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0041 0328 | |
comment: | |
found in charsets: CP775 (B5); CP1250 (A5); CENTEURO (84); 8859-10 (A1); 8859-2 (A1); 8859-4 (A1); CP852 (A4); 8859-13 (C0); CP1257 (C0); 8859-16 (A1); | |
found in languages: sla [Kashubian]; lt [Lithuanian]; pl [Polish]; | |
used in romanization of: | |
lowercase: 0105 |
UTF-8 (c4, 85) ą name: LATIN SMALL LETTER A WITH OGONEK | |
old name: | |
Adobe glyph name: aogonek | |
mnemonic name(s): <a;> | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0061 0328 | |
comment: | |
found in charsets: CP775 (D0); CP1250 (B9); CENTEURO (88); 8859-10 (B1); 8859-2 (B1); 8859-4 (B1); CP852 (A5); 8859-13 (E0); CP1257 (E0); 8859-16 (A2); | |
found in languages: sla [Kashubian]; lt [Lithuanian]; pl [Polish]; | |
used in romanization of: | |
uppercase: 0104 |
UTF-8 (c4, 8c) Č name: LATIN CAPITAL LETTER C WITH CARON | |
old name: | |
Adobe glyph name: Ccaron | |
mnemonic name(s): <C<> | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0043 030C | |
comment: | |
found in charsets: CP775 (B6); CP1250 (C8); CENTEURO (89); 8859-10 (C8); 8859-2 (C8); 8859-4 (C8); SAMI_MAC (A2); CP852 (AC); 8859-13 (C8); CP1257 (C8); SAMI_WIN (82); 8859-16 (B2); | |
found in languages: bs [Bosnian]; sami1 [Inari Sámi]; hr [Croatian]; sk [Slovak]; cs [Czech]; sorb2 [Upper Sorbian]; lv [Latvian]; sorb1 [Lower Sorbian]; be [Belarusian]; sami2 [North Sámi]; sr [Serbian]; sami4 [Skolt Sámi]; sl [Slovenian]; lt [Lithuanian]; | |
used in romanization of: bg_r [Bulgarian (cyrillic)]; mk_r [Macedonian (cyrillic)]; sr_r [Serbian (cyrillic)]; ru_r [Russian (cyrillic)]; be_r [Belarusian (cyrillic)]; | |
lowercase: 010D |
UTF-8 (c4, 8d) č name: LATIN SMALL LETTER C WITH CARON | |
old name: | |
Adobe glyph name: ccaron | |
mnemonic name(s): <c<> | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0063 030C | |
comment: | |
found in charsets: CP775 (D1); CP1250 (E8); CENTEURO (8B); 8859-10 (E8); 8859-2 (E8); 8859-4 (E8); SAMI_MAC (B8); CP852 (9F); 8859-13 (E8); CP1257 (E8); SAMI_WIN (84); 8859-16 (B9); | |
found in languages: bs [Bosnian]; sami1 [Inari Sámi]; hr [Croatian]; sk [Slovak]; cs [Czech]; sorb2 [Upper Sorbian]; lv [Latvian]; sorb1 [Lower Sorbian]; be [Belarusian]; sami2 [North Sámi]; sr [Serbian]; sami4 [Skolt Sámi]; sl [Slovenian]; lt [Lithuanian]; | |
used in romanization of: bg_r [Bulgarian (cyrillic)]; mk_r [Macedonian (cyrillic)]; sr_r [Serbian (cyrillic)]; ru_r [Russian (cyrillic)]; be_r [Belarusian (cyrillic)]; | |
uppercase: 010C |
UTF-8 (c4, 98) Ę name: LATIN CAPITAL LETTER E WITH OGONEK | |
old name: | |
Adobe glyph name: Eogonek | |
mnemonic name(s): <E;> | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0045 0328 | |
comment: | |
found in charsets: CP775 (B7); CP1250 (CA); CENTEURO (A2); 8859-10 (CA); 8859-2 (CA); 8859-4 (CA); CP852 (A8); 8859-13 (C6); CP1257 (C6); 8859-16 (DD); | |
found in languages: sla [Kashubian]; lt [Lithuanian]; pl [Polish]; | |
used in romanization of: | |
lowercase: 0119 |
UTF-8 (c4, 99) ę name: LATIN SMALL LETTER E WITH OGONEK | |
old name: | |
Adobe glyph name: eogonek | |
mnemonic name(s): <e;> | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0065 0328 | |
comment: | |
found in charsets: CP775 (D2); CP1250 (EA); CENTEURO (AB); 8859-10 (EA); 8859-2 (EA); 8859-4 (EA); CP852 (A9); 8859-13 (E6); CP1257 (E6); 8859-16 (FD); | |
found in languages: sla [Kashubian]; lt [Lithuanian]; pl [Polish]; | |
used in romanization of: | |
uppercase: 0118 |
UTF-8 (c4, 96) Ė name: LATIN CAPITAL LETTER E WITH DOT ABOVE | |
old name: | |
Adobe glyph name: Edotaccent | |
mnemonic name(s): <E.> | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0045 0307 | |
comment: | |
found in charsets: CP775 (B8); CENTEURO (96); 8859-10 (CC); 8859-4 (CC); 8859-13 (CB); CP1257 (CB); | |
found in languages: ulit [Ulithian]; lt [Lithuanian]; | |
used in romanization of: tg_r [Tajik (cyrillic)]; kk_r [Kazakh (cyrillic)]; | |
lowercase: 0117 |
UTF-8 (c4, 97) ė name: LATIN SMALL LETTER E WITH DOT ABOVE | |
old name: | |
Adobe glyph name: edotaccent | |
mnemonic name(s): <e.> | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0065 0307 | |
comment: | |
found in charsets: CP775 (D3); CENTEURO (98); 8859-10 (EC); 8859-4 (EC); 8859-13 (EB); CP1257 (EB); | |
found in languages: ulit [Ulithian]; lt [Lithuanian]; | |
used in romanization of: tg_r [Tajik (cyrillic)]; kk_r [Kazakh (cyrillic)]; | |
uppercase: 0116 |
UTF-8 (c4, ae) Į name: LATIN CAPITAL LETTER I WITH OGONEK | |
old name: | |
Adobe glyph name: Iogonek | |
mnemonic name(s): <I;> | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0049 0328 | |
comment: | |
found in charsets: CP775 (BD); CENTEURO (AF); 8859-10 (C7); 8859-4 (C7); 8859-13 (C1); CP1257 (C1); | |
found in languages: lt [Lithuanian]; | |
used in romanization of: | |
lowercase: 012F |
UTF-8 (c4, af) į name: LATIN SMALL LETTER I WITH OGONEK | |
old name: | |
Adobe glyph name: iogonek | |
mnemonic name(s): <i;> | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0069 0328 | |
comment: | |
found in charsets: CP775 (D4); CENTEURO (B0); 8859-10 (E7); 8859-4 (E7); 8859-13 (E1); CP1257 (E1); | |
found in languages: lt [Lithuanian]; | |
used in romanization of: | |
uppercase: 012E |
UTF-8 (c5, a0) Š name: LATIN CAPITAL LETTER S WITH CARON | |
old name: | |
Adobe glyph name: Scaron | |
mnemonic name(s): <S<> | |
HTML 4 mnemonic name:Š | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0053 030C | |
comment: | |
found in charsets: CP775 (BE); 8859-15 (A6); CP1250 (8A); CENTEURO (E1); 8859-10 (AA); 8859-2 (A9); CP1252 (8A); 8859-4 (A9); CP1122 (AC); CP1254 (8A); SAMI_MAC (B4); CP852 (E6); 8859-13 (D0); CP1257 (D0); CP1116 (D1); SAMI_WIN (8A); 8859-16 (A6); VENTURA_INT (D3); | |
found in languages: fi [Finnish]; et [Estonian]; bs [Bosnian]; sami1 [Inari Sámi]; tn [Tswana]; hr [Croatian]; sk [Slovak]; cs [Czech]; sorb2 [Upper Sorbian]; lv [Latvian]; sorb1 [Lower Sorbian]; be [Belarusian]; sami2 [North Sámi]; sr [Serbian]; nso [Northern Sotho]; livo [Livonian]; sami4 [Skolt Sámi]; sl [Slovenian]; lt [Lithuanian]; | |
used in romanization of: bg_r [Bulgarian (cyrillic)]; mk_r [Macedonian (cyrillic)]; sr_r [Serbian (cyrillic)]; ru_r [Russian (cyrillic)]; be_r [Belarusian (cyrillic)]; | |
lowercase: 0161 |
UTF-8 (c5, a1) š name: LATIN SMALL LETTER S WITH CARON | |
old name: | |
Adobe glyph name: scaron | |
mnemonic name(s): <s<> | |
HTML 4 mnemonic name:š | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0073 030C | |
comment: | |
found in charsets: CP775 (D5); 8859-15 (A8); CP1250 (9A); CENTEURO (E4); 8859-10 (BA); 8859-2 (B9); CP1252 (9A); 8859-4 (B9); CP1122 (8C); CP1254 (9A); SAMI_MAC (BB); CP852 (E7); 8859-13 (F0); CP1257 (F0); CP1116 (D0); SAMI_WIN (9A); 8859-16 (A8); VENTURA_INT (D4); | |
found in languages: fi [Finnish]; et [Estonian]; bs [Bosnian]; sami1 [Inari Sámi]; tn [Tswana]; hr [Croatian]; sk [Slovak]; cs [Czech]; sorb2 [Upper Sorbian]; lv [Latvian]; sorb1 [Lower Sorbian]; be [Belarusian]; sami2 [North Sámi]; sr [Serbian]; nso [Northern Sotho]; livo [Livonian]; sami4 [Skolt Sámi]; sl [Slovenian]; lt [Lithuanian]; | |
used in romanization of: bg_r [Bulgarian (cyrillic)]; mk_r [Macedonian (cyrillic)]; sr_r [Serbian (cyrillic)]; ru_r [Russian (cyrillic)]; be_r [Belarusian (cyrillic)]; | |
uppercase: 0160 |
UTF-8 (c5, b2) Ų name: LATIN CAPITAL LETTER U WITH OGONEK | |
old name: | |
Adobe glyph name: Uogonek | |
mnemonic name(s): <U;> | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0055 0328 | |
comment: | |
found in charsets: CP775 (C6); CENTEURO (F6); 8859-10 (D9); 8859-4 (D9); 8859-13 (D8); CP1257 (D8); | |
found in languages: lt [Lithuanian]; | |
used in romanization of: | |
lowercase: 0173 |
UTF-8 (c5, b3) ų name: LATIN SMALL LETTER U WITH OGONEK | |
old name: | |
Adobe glyph name: uogonek | |
mnemonic name(s): <u;> | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0075 0328 | |
comment: | |
found in charsets: CP775 (D6); CENTEURO (F7); 8859-10 (F9); 8859-4 (F9); 8859-13 (F8); CP1257 (F8); | |
found in languages: lt [Lithuanian]; | |
used in romanization of: | |
uppercase: 0172 |
UTF-8 (c5, aa) Ū name: LATIN CAPITAL LETTER U WITH MACRON | |
old name: | |
Adobe glyph name: Umacron | |
mnemonic name(s): <U-> | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0055 0304 | |
comment: | |
found in charsets: CP775 (C7); CENTEURO (ED); 8859-10 (AE); 8859-4 (DE); 8859-13 (DB); CP1257 (DB); | |
found in languages: mi [Maori]; mh [Marshallese]; lv [Latvian]; cor [Cornish]; livo [Livonian]; haw [Hawaiian]; lt [Lithuanian]; | |
used in romanization of: bn_r [Bengali (bengali)]; as_r [Assamese (assamese)]; ja_r [Japanese (sino-japanese)]; zh_r [Chinese (sino-japanese)]; ta_r [Tamil (tamil)]; pa_r [Punjabi]; kn_r [Kannada]; ml_r [Malayalam]; hi_r [Hindi (devanagari)]; te_r [Telugu]; fa_r [Persian (perso-arabic)]; gu_r [Gujarati]; or_r [Oriya]; kk_r [Kazakh (cyrillic)]; ar_r [Arabic (perso-arabic)]; ur_r [Urdu (perso-arabic)]; ps_r [Pashto (perso-arabic)]; | |
lowercase: 016B |
UTF-8 (c5, ab) ū name: LATIN SMALL LETTER U WITH MACRON | |
old name: | |
Adobe glyph name: umacron | |
mnemonic name(s): <u-> | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0075 0304 | |
comment: | |
found in charsets: CP775 (D7); CENTEURO (F0); 8859-10 (BE); 8859-4 (FE); 8859-13 (FB); CP1257 (FB); | |
found in languages: mi [Maori]; mh [Marshallese]; lv [Latvian]; cor [Cornish]; livo [Livonian]; haw [Hawaiian]; lt [Lithuanian]; | |
used in romanization of: bn_r [Bengali (bengali)]; as_r [Assamese (assamese)]; ja_r [Japanese (sino-japanese)]; zh_r [Chinese (sino-japanese)]; ta_r [Tamil (tamil)]; pa_r [Punjabi]; kn_r [Kannada]; ml_r [Malayalam]; hi_r [Hindi (devanagari)]; te_r [Telugu]; fa_r [Persian (perso-arabic)]; gu_r [Gujarati]; or_r [Oriya]; kk_r [Kazakh (cyrillic)]; ar_r [Arabic (perso-arabic)]; ur_r [Urdu (perso-arabic)]; ps_r [Pashto (perso-arabic)]; | |
uppercase: 016A |
UTF-8 (c5, bd) Ž name: LATIN CAPITAL LETTER Z WITH CARON | |
old name: | |
Adobe glyph name: Zcaron | |
mnemonic name(s): <Z<> | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 005A 030C | |
comment: | |
found in charsets: CP775 (CF); 8859-15 (B4); CP1250 (8E); CENTEURO (EB); 8859-10 (AC); 8859-2 (AE); CP1252 (8E); 8859-4 (AE); CP1122 (AE); SAMI_MAC (B7); CP852 (A6); 8859-13 (DE); CP1257 (DE); CP1116 (E8); SAMI_WIN (BE); 8859-16 (B4); | |
found in languages: fi [Finnish]; et [Estonian]; bs [Bosnian]; sami1 [Inari Sámi]; hr [Croatian]; sk [Slovak]; cs [Czech]; sorb2 [Upper Sorbian]; lv [Latvian]; sorb1 [Lower Sorbian]; be [Belarusian]; sami2 [North Sámi]; sr [Serbian]; livo [Livonian]; sami4 [Skolt Sámi]; tk [Turkmen]; sl [Slovenian]; lt [Lithuanian]; | |
used in romanization of: bg_r [Bulgarian (cyrillic)]; mk_r [Macedonian (cyrillic)]; sr_r [Serbian (cyrillic)]; ru_r [Russian (cyrillic)]; be_r [Belarusian (cyrillic)]; | |
lowercase: 017E |
UTF-8 (c5, be) ž name: LATIN SMALL LETTER Z WITH CARON | |
old name: | |
Adobe glyph name: zcaron | |
mnemonic name(s): <z<> | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 007A 030C | |
comment: | |
found in charsets: CP775 (D8); 8859-15 (B8); CP1250 (9E); CENTEURO (EC); 8859-10 (BC); 8859-2 (BE); CP1252 (9E); 8859-4 (BE); CP1122 (8E); SAMI_MAC (BD); CP852 (A7); 8859-13 (FE); CP1257 (FE); CP1116 (E7); SAMI_WIN (BF); 8859-16 (B8); | |
found in languages: fi [Finnish]; et [Estonian]; bs [Bosnian]; sami1 [Inari Sámi]; hr [Croatian]; sk [Slovak]; cs [Czech]; sorb2 [Upper Sorbian]; lv [Latvian]; sorb1 [Lower Sorbian]; be [Belarusian]; sami2 [North Sámi]; sr [Serbian]; livo [Livonian]; sami4 [Skolt Sámi]; tk [Turkmen]; sl [Slovenian]; lt [Lithuanian]; | |
used in romanization of: bg_r [Bulgarian (cyrillic)]; mk_r [Macedonian (cyrillic)]; sr_r [Serbian (cyrillic)]; ru_r [Russian (cyrillic)]; be_r [Belarusian (cyrillic)]; | |
uppercase: 017D |
Demanding publications such as dictionaries, maps, schoolbooks etc need additional diacritical marks to differentiate homographs.
Fot that purpose Lithuanian uses:
Small E and I (also with ogonek) must retain the dot when additional accent
mark is added to the character, the use of ì and
í (note the missing dot) is considered unacceptable.
[In the two pre-computer era books I saw, the dot was centered on E WITH DOT
ABOVE and to the left of acute on E WITH DOT ABOVE AND ACUTE. On both E and
I WITH TILDE, the dot moved slightly to the left. There was no dot on
I WITH ACUTE and I WITH GRAVE].
A proposal to allocate still missing characters is submitted
to the Unicode Consortium.