UTF-8 (c6, 8f) Ə name: LATIN CAPITAL LETTER SCHWA | |
old name: | |
Adobe glyph name: | |
mnemonic name(s): | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
comment: | |
found in charsets: | |
found in languages: tt [Tatar]; az [Azerbaijani]; | |
used in romanization of: | |
lowercase: 0259 |
UTF-8 (c9, 99) ə name: LATIN SMALL LETTER SCHWA | |
old name: | |
Adobe glyph name: | |
mnemonic name(s): | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
comment: | |
found in charsets: | |
found in languages: tt [Tatar]; az [Azerbaijani]; | |
used in romanization of: | |
uppercase: 018F |
UTF-8 (c3, 87) Ç name: LATIN CAPITAL LETTER C WITH CEDILLA | |
old name: | |
Adobe glyph name: Ccedilla | |
mnemonic name(s): <C,> | |
HTML 4 mnemonic name:Ç | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0043 0327 | |
comment: | |
found in charsets: CP863 (80); 8859-15 (C7); CP1250 (C7); 8859-3 (C7); CP1258 (C7); 8859-9 (C7); ROMAN (82); 8859-2 (C7); CP857 (80); CP1252 (C7); CP437 (80); CP860 (80); CP1122 (68); CP1254 (C7); SAMI_MAC (82); CP852 (80); CP865 (80); CP861 (80); CP850 (80); 8859-1 (C7); CP1116 (80); SAMI_WIN (C7); 8859-14 (C7); 8859-16 (C7); VENTURA_INT (80); | |
found in languages: ca [Catalan]; eu [Basque]; tr [Turkish]; tt [Tatar]; fr [French]; oc [Occitan]; wa [Walloon]; en [English]; az [Azerbaijani]; pt [Portuguese]; ku [Kurdish]; tk [Turkmen]; es [Spanish]; sq [Albanian]; | |
used in romanization of: ar_r [Arabic (perso-arabic)]; | |
lowercase: 00E7 |
UTF-8 (c3, a7) ç name: LATIN SMALL LETTER C WITH CEDILLA | |
old name: | |
Adobe glyph name: ccedilla | |
mnemonic name(s): <c,> | |
HTML 4 mnemonic name:ç | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0063 0327 | |
comment: | |
found in charsets: CP863 (87); 8859-15 (E7); CP1250 (E7); 8859-3 (E7); CP1258 (E7); 8859-9 (E7); ROMAN (8D); 8859-2 (E7); CP857 (87); CP1252 (E7); CP437 (87); CP860 (87); CP1122 (48); CP1254 (E7); SAMI_MAC (8D); CP852 (87); CP865 (87); CP861 (87); CP1256 (E7); CP850 (87); 8859-1 (E7); CP1116 (87); SAMI_WIN (E7); 8859-14 (E7); 8859-16 (E7); VENTURA_INT (87); | |
found in languages: ca [Catalan]; eu [Basque]; tr [Turkish]; tt [Tatar]; fr [French]; oc [Occitan]; wa [Walloon]; en [English]; az [Azerbaijani]; pt [Portuguese]; ku [Kurdish]; tk [Turkmen]; es [Spanish]; sq [Albanian]; | |
used in romanization of: ar_r [Arabic (perso-arabic)]; | |
uppercase: 00C7 |
UTF-8 (c4, 9e) Ğ name: LATIN CAPITAL LETTER G WITH BREVE | |
old name: | |
Adobe glyph name: Gbreve | |
mnemonic name(s): <G(> | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0047 0306 | |
comment: | |
found in charsets: 8859-3 (AB); 8859-9 (D0); CP857 (A6); CP1254 (D0); | |
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani]; | |
used in romanization of: | |
lowercase: 011F |
UTF-8 (c4, 9f) ğ name: LATIN SMALL LETTER G WITH BREVE | |
old name: | |
Adobe glyph name: gbreve | |
mnemonic name(s): <g(> | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0067 0306 | |
comment: | |
found in charsets: 8859-3 (BB); 8859-9 (F0); CP857 (A7); CP1254 (F0); | |
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani]; | |
used in romanization of: | |
uppercase: 011E |
UTF-8 (c4, b0) İ name: LATIN CAPITAL LETTER I WITH DOT ABOVE | |
old name: | |
Adobe glyph name: Idotaccent | |
mnemonic name(s): <I.> | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0049 0307 | |
comment: | |
found in charsets: 8859-3 (A9); 8859-9 (DD); CP857 (98); CP1254 (DD); | |
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani]; | |
used in romanization of: | |
lowercase: 0069 |
UTF-8 (c4, b1) ı name: LATIN SMALL LETTER DOTLESS I | |
old name: | |
Adobe glyph name: dotlessi | |
mnemonic name(s): <i.> | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
comment: | |
found in charsets: 8859-3 (B9); 8859-9 (FD); ROMAN (F5); CP857 (8D); CP1254 (FD); SAMI_MAC (F5); CP850 (D5); CP1116 (D5); | |
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani]; | |
used in romanization of: | |
uppercase: 0049 |
name: LATIN CAPITAL LETTER N WITH DESCENDER | |
old name: | |
Adobe glyph name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
comment: | |
found in charsets: | |
found in languages: tt [Tatar]; | |
used in romanization of: | |
lowercase: E01B |
name: LATIN SMALL LETTER N WITH DESCENDER | |
old name: | |
Adobe glyph name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
comment: | |
found in charsets: | |
found in languages: tt [Tatar]; | |
used in romanization of: | |
uppercase: E01A |
name: LATIN CAPITAL LETTER BARRED O | |
old name: | |
Adobe glyph name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
comment: | |
found in charsets: | |
found in languages: tt [Tatar]; | |
used in romanization of: | |
lowercase: E01D |
name: LATIN SMALL LETTER BARRED O | |
old name: | |
Adobe glyph name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
comment: | |
found in charsets: | |
found in languages: tt [Tatar]; | |
used in romanization of: | |
uppercase: E01C |
UTF-8 (c5, 9e) Ş name: LATIN CAPITAL LETTER S WITH CEDILLA | |
old name: | |
Adobe glyph name: Scommaaccent | |
mnemonic name(s): <S,> | |
HTML 4 mnemonic name: | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0053 0327 | |
comment: * | |
Note: Please see note 1 | |
found in charsets: CP1250 (AA); 8859-3 (AA); 8859-9 (DE); 8859-2 (AA); CP857 (9E); CP1254 (DE); CP852 (B8); | |
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani]; ku [Kurdish]; tk [Turkmen]; | |
used in romanization of: fa_r [Persian (perso-arabic)]; ar_r [Arabic (perso-arabic)]; ps_r [Pashto (perso-arabic)]; | |
lowercase: 015F |
UTF-8 (c5, 9f) ş name: LATIN SMALL LETTER S WITH CEDILLA | |
old name: | |
Adobe glyph name: scommaaccent | |
mnemonic name(s): <s,> | |
HTML 4 mnemonic name: | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0073 0327 | |
comment: * | |
found in charsets: CP1250 (BA); 8859-3 (BA); 8859-9 (FE); 8859-2 (BA); CP857 (9F); CP1254 (FE); CP852 (AD); | |
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani]; ku [Kurdish]; tk [Turkmen]; | |
used in romanization of: fa_r [Persian (perso-arabic)]; ar_r [Arabic (perso-arabic)]; ps_r [Pashto (perso-arabic)]; | |
uppercase: 015E |
UTF-8 (c3, 9c) Ü name: LATIN CAPITAL LETTER U WITH DIAERESIS | |
old name: | |
Adobe glyph name: Udieresis | |
mnemonic name(s): <U:> | |
HTML 4 mnemonic name:Ü | |
category: Lu (Letter, Uppercase) | |
combining: 0 | |
decomposition info: 0055 0308 | |
comment: | |
found in charsets: CP863 (9A); CP775 (9A); 8859-15 (DC); CP1250 (DC); 8859-3 (DC); CENTEURO (86); CP1258 (DC); 8859-10 (DC); 8859-9 (DC); ROMAN (86); 8859-2 (DC); CP857 (9A); CP1252 (DC); CP437 (9A); CP860 (9A); 8859-4 (DC); CP1122 (FC); CP1254 (DC); SAMI_MAC (86); CP852 (9A); CP865 (9A); CP861 (9A); 8859-13 (DC); CP850 (9A); CP1257 (DC); 8859-1 (DC); CP1116 (9A); SAMI_WIN (DC); 8859-14 (DC); 8859-16 (DC); VENTURA_INT (9A); | |
found in languages: lb [Luxembourgian]; ca [Catalan]; eu [Basque]; br [Breton]; tr [Turkish]; tt [Tatar]; et [Estonian]; fr [French]; de [German]; nl [Dutch]; ch [Chamorro]; bi [Bislama]; cy [Welsh]; cor [Cornish]; fy [Frisian]; sv [Swedish]; az [Azerbaijani]; gl [Galician]; pt [Portuguese]; tk [Turkmen]; sl [Slovenian]; es [Spanish]; hu [Hungarian]; | |
used in romanization of: ky_r [Kyrgyz (cyrillic)]; mn_r [Mongolian (cyrillic)]; zh_r [Chinese (sino-japanese)]; tk_r [Turkmen (cyrillic)]; kk_r [Kazakh (cyrillic)]; | |
lowercase: 00FC |
UTF-8 (c3, bc) ü name: LATIN SMALL LETTER U WITH DIAERESIS | |
old name: | |
Adobe glyph name: udieresis | |
mnemonic name(s): <u:> | |
HTML 4 mnemonic name:ü | |
category: Ll (Letter, Lowercase) | |
combining: 0 | |
decomposition info: 0075 0308 | |
comment: | |
found in charsets: CP863 (81); CP775 (81); 8859-15 (FC); CP1250 (FC); 8859-3 (FC); CENTEURO (9F); CP1258 (FC); 8859-10 (FC); 8859-9 (FC); ROMAN (9F); 8859-2 (FC); CP857 (81); CP1252 (FC); CP437 (81); CP860 (81); 8859-4 (FC); CP1122 (A1); CP1254 (FC); SAMI_MAC (9F); CP852 (81); CP865 (81); CP861 (81); 8859-13 (FC); CP1256 (FC); CP850 (81); CP1257 (FC); 8859-1 (FC); CP1116 (81); SAMI_WIN (FC); 8859-14 (FC); 8859-16 (FC); VENTURA_INT (81); | |
found in languages: lb [Luxembourgian]; ca [Catalan]; eu [Basque]; br [Breton]; tr [Turkish]; tt [Tatar]; et [Estonian]; fr [French]; de [German]; nl [Dutch]; ch [Chamorro]; bi [Bislama]; cy [Welsh]; cor [Cornish]; fy [Frisian]; sv [Swedish]; az [Azerbaijani]; gl [Galician]; pt [Portuguese]; tk [Turkmen]; sl [Slovenian]; es [Spanish]; hu [Hungarian]; | |
used in romanization of: ky_r [Kyrgyz (cyrillic)]; mn_r [Mongolian (cyrillic)]; zh_r [Chinese (sino-japanese)]; tk_r [Turkmen (cyrillic)]; kk_r [Kazakh (cyrillic)]; | |
uppercase: 00DC |
Latin alphabet 1927-1939, Yan(g)alif, used different letters for
dotless i (Cyrillic soft sign), j (barred z), and g with breve (U01a2).
A convention to use e for schwa, í for e, ñ for n with descender
and for barred o has also gained some popularity as more compatible with
other Turkic languages.
Unicode currently does not have neither precomposed allocation for N with descender nor combining descender. There is LATIN SMALL LETTER BARRED O (U0275), but the uppercase variant is called LATIN CAPITAL LETTER O WITH MIDDLE TILDE (U019f) with usage note 'African'.
The conversion from Cyrillic to Latin script is planned within years 2001-2011.