UTF-8 (c6, 8f) Ə name: LATIN CAPITAL LETTER SCHWA | |
| old name: | |
| Adobe glyph name: | |
| mnemonic name(s): | |
| HTML 4 mnemonic name: | |
| category: Lu (Letter, Uppercase) | |
| combining: 0 | |
| comment: | |
| found in charsets: | |
| found in languages: az [Azerbaijani]; tt [Tatar]; | |
| used in romanization of: | |
| lowercase: 0259 |
UTF-8 (c9, 99) ə name: LATIN SMALL LETTER SCHWA | |
| old name: | |
| Adobe glyph name: | |
| mnemonic name(s): | |
| HTML 4 mnemonic name: | |
| category: Ll (Letter, Lowercase) | |
| combining: 0 | |
| comment: | |
| found in charsets: | |
| found in languages: az [Azerbaijani]; tt [Tatar]; | |
| used in romanization of: | |
| uppercase: 018F |
UTF-8 (c3, 87) Ç name: LATIN CAPITAL LETTER C WITH CEDILLA | |
| old name: | |
| Adobe glyph name: Ccedilla | |
| mnemonic name(s): <C,> | |
| HTML 4 mnemonic name:Ç | |
| category: Lu (Letter, Uppercase) | |
| combining: 0 | |
| decomposition info: 0043 0327 | |
| comment: | |
| found in charsets: 8859-1 (C7); 8859-14 (C7); 8859-15 (C7); 8859-16 (C7); 8859-2 (C7); 8859-3 (C7); 8859-9 (C7); SAMI_MAC (82); CP1116 (80); CP1122 (68); CP1250 (C7); CP1252 (C7); CP1254 (C7); CP1258 (C7); CP437 (80); CP850 (80); CP852 (80); CP857 (80); CP860 (80); CP861 (80); CP863 (80); CP865 (80); ROMAN (82); SAMI_WIN (C7); VENTURA_INT (80); | |
| found in languages: az [Azerbaijani]; ca [Catalan]; en [English]; es [Spanish]; eu [Basque]; fr [French]; ku [Kurdish]; oc [Occitan]; pt [Portuguese]; sq [Albanian]; tk [Turkmen]; tr [Turkish]; tt [Tatar]; wa [Walloon]; | |
| used in romanization of: ar_r [Arabic (perso-arabic)]; | |
| lowercase: 00E7 |
UTF-8 (c3, a7) ç name: LATIN SMALL LETTER C WITH CEDILLA | |
| old name: | |
| Adobe glyph name: ccedilla | |
| mnemonic name(s): <c,> | |
| HTML 4 mnemonic name:ç | |
| category: Ll (Letter, Lowercase) | |
| combining: 0 | |
| decomposition info: 0063 0327 | |
| comment: | |
| found in charsets: 8859-1 (E7); 8859-14 (E7); 8859-15 (E7); 8859-16 (E7); 8859-2 (E7); 8859-3 (E7); 8859-9 (E7); SAMI_MAC (8D); CP1116 (87); CP1122 (48); CP1250 (E7); CP1252 (E7); CP1254 (E7); CP1256 (E7); CP1258 (E7); CP437 (87); CP850 (87); CP852 (87); CP857 (87); CP860 (87); CP861 (87); CP863 (87); CP865 (87); ROMAN (8D); SAMI_WIN (E7); VENTURA_INT (87); | |
| found in languages: az [Azerbaijani]; ca [Catalan]; en [English]; es [Spanish]; eu [Basque]; fr [French]; ku [Kurdish]; oc [Occitan]; pt [Portuguese]; sq [Albanian]; tk [Turkmen]; tr [Turkish]; tt [Tatar]; wa [Walloon]; | |
| used in romanization of: ar_r [Arabic (perso-arabic)]; | |
| uppercase: 00C7 |
UTF-8 (c4, 9e) Ğ name: LATIN CAPITAL LETTER G WITH BREVE | |
| old name: | |
| Adobe glyph name: Gbreve | |
| mnemonic name(s): <G(> | |
| HTML 4 mnemonic name: | |
| category: Lu (Letter, Uppercase) | |
| combining: 0 | |
| decomposition info: 0047 0306 | |
| comment: | |
| found in charsets: 8859-3 (AB); 8859-9 (D0); CP1254 (D0); CP857 (A6); | |
| found in languages: az [Azerbaijani]; tr [Turkish]; tt [Tatar]; | |
| used in romanization of: | |
| lowercase: 011F |
UTF-8 (c4, 9f) ğ name: LATIN SMALL LETTER G WITH BREVE | |
| old name: | |
| Adobe glyph name: gbreve | |
| mnemonic name(s): <g(> | |
| HTML 4 mnemonic name: | |
| category: Ll (Letter, Lowercase) | |
| combining: 0 | |
| decomposition info: 0067 0306 | |
| comment: | |
| found in charsets: 8859-3 (BB); 8859-9 (F0); CP1254 (F0); CP857 (A7); | |
| found in languages: az [Azerbaijani]; tr [Turkish]; tt [Tatar]; | |
| used in romanization of: | |
| uppercase: 011E |
UTF-8 (c4, b0) İ name: LATIN CAPITAL LETTER I WITH DOT ABOVE | |
| old name: | |
| Adobe glyph name: Idotaccent | |
| mnemonic name(s): <I.> | |
| HTML 4 mnemonic name: | |
| category: Lu (Letter, Uppercase) | |
| combining: 0 | |
| decomposition info: 0049 0307 | |
| comment: | |
| found in charsets: 8859-3 (A9); 8859-9 (DD); CP1254 (DD); CP857 (98); | |
| found in languages: az [Azerbaijani]; tr [Turkish]; tt [Tatar]; | |
| used in romanization of: | |
| lowercase: 0069 |
UTF-8 (c4, b1) ı name: LATIN SMALL LETTER DOTLESS I | |
| old name: | |
| Adobe glyph name: dotlessi | |
| mnemonic name(s): <i.> | |
| HTML 4 mnemonic name: | |
| category: Ll (Letter, Lowercase) | |
| combining: 0 | |
| comment: | |
| found in charsets: 8859-3 (B9); 8859-9 (FD); SAMI_MAC (F5); CP1116 (D5); CP1254 (FD); CP850 (D5); CP857 (8D); ROMAN (F5); | |
| found in languages: az [Azerbaijani]; tr [Turkish]; tt [Tatar]; | |
| used in romanization of: | |
| uppercase: 0049 |
| name: LATIN CAPITAL LETTER N WITH DESCENDER | |
| old name: | |
| Adobe glyph name: | |
| category: Lu (Letter, Uppercase) | |
| combining: 0 | |
| comment: | |
| found in charsets: | |
| found in languages: tt [Tatar]; | |
| used in romanization of: | |
| lowercase: E01B |
| name: LATIN SMALL LETTER N WITH DESCENDER | |
| old name: | |
| Adobe glyph name: | |
| category: Ll (Letter, Lowercase) | |
| combining: 0 | |
| comment: | |
| found in charsets: | |
| found in languages: tt [Tatar]; | |
| used in romanization of: | |
| uppercase: E01A |
| name: LATIN CAPITAL LETTER BARRED O | |
| old name: | |
| Adobe glyph name: | |
| category: Lu (Letter, Uppercase) | |
| combining: 0 | |
| comment: | |
| found in charsets: | |
| found in languages: tt [Tatar]; | |
| used in romanization of: | |
| lowercase: E01D |
| name: LATIN SMALL LETTER BARRED O | |
| old name: | |
| Adobe glyph name: | |
| category: Ll (Letter, Lowercase) | |
| combining: 0 | |
| comment: | |
| found in charsets: | |
| found in languages: tt [Tatar]; | |
| used in romanization of: | |
| uppercase: E01C |
UTF-8 (c5, 9e) Ş name: LATIN CAPITAL LETTER S WITH CEDILLA | |
| old name: | |
| Adobe glyph name: Scommaaccent | |
| mnemonic name(s): <S,> | |
| HTML 4 mnemonic name: | |
| category: Lu (Letter, Uppercase) | |
| combining: 0 | |
| decomposition info: 0053 0327 | |
| comment: * | |
| Note: Please see note 1 | |
| found in charsets: 8859-2 (AA); 8859-3 (AA); 8859-9 (DE); CP1250 (AA); CP1254 (DE); CP852 (B8); CP857 (9E); | |
| found in languages: az [Azerbaijani]; ku [Kurdish]; tk [Turkmen]; tr [Turkish]; tt [Tatar]; | |
| used in romanization of: ar_r [Arabic (perso-arabic)]; fa_r [Persian (perso-arabic)]; ps_r [Pashto (perso-arabic)]; | |
| lowercase: 015F |
UTF-8 (c5, 9f) ş name: LATIN SMALL LETTER S WITH CEDILLA | |
| old name: | |
| Adobe glyph name: scommaaccent | |
| mnemonic name(s): <s,> | |
| HTML 4 mnemonic name: | |
| category: Ll (Letter, Lowercase) | |
| combining: 0 | |
| decomposition info: 0073 0327 | |
| comment: * | |
| found in charsets: 8859-2 (BA); 8859-3 (BA); 8859-9 (FE); CP1250 (BA); CP1254 (FE); CP852 (AD); CP857 (9F); | |
| found in languages: az [Azerbaijani]; ku [Kurdish]; tk [Turkmen]; tr [Turkish]; tt [Tatar]; | |
| used in romanization of: ar_r [Arabic (perso-arabic)]; fa_r [Persian (perso-arabic)]; ps_r [Pashto (perso-arabic)]; | |
| uppercase: 015E |
UTF-8 (c3, 9c) Ü name: LATIN CAPITAL LETTER U WITH DIAERESIS | |
| old name: | |
| Adobe glyph name: Udieresis | |
| mnemonic name(s): <U:> | |
| HTML 4 mnemonic name:Ü | |
| category: Lu (Letter, Uppercase) | |
| combining: 0 | |
| decomposition info: 0055 0308 | |
| comment: | |
| found in charsets: CENTEURO (86); 8859-1 (DC); 8859-10 (DC); 8859-13 (DC); 8859-14 (DC); 8859-15 (DC); 8859-16 (DC); 8859-2 (DC); 8859-3 (DC); 8859-4 (DC); 8859-9 (DC); SAMI_MAC (86); CP1116 (9A); CP1122 (FC); CP1250 (DC); CP1252 (DC); CP1254 (DC); CP1257 (DC); CP1258 (DC); CP437 (9A); CP775 (9A); CP850 (9A); CP852 (9A); CP857 (9A); CP860 (9A); CP861 (9A); CP863 (9A); CP865 (9A); ROMAN (86); SAMI_WIN (DC); VENTURA_INT (9A); | |
| found in languages: az [Azerbaijani]; bi [Bislama]; br [Breton]; ca [Catalan]; ch [Chamorro]; cor [Cornish]; cy [Welsh]; de [German]; es [Spanish]; et [Estonian]; eu [Basque]; fr [French]; fy [Frisian]; gl [Galician]; hu [Hungarian]; lb [Luxembourgian]; nl [Dutch]; pt [Portuguese]; sl [Slovenian]; sv [Swedish]; tk [Turkmen]; tr [Turkish]; tt [Tatar]; | |
| used in romanization of: kk_r [Kazakh (cyrillic)]; ky_r [Kyrgyz (cyrillic)]; mn_r [Mongolian (cyrillic)]; tk_r [Turkmen (cyrillic)]; zh_r [Chinese (sino-japanese)]; | |
| lowercase: 00FC |
UTF-8 (c3, bc) ü name: LATIN SMALL LETTER U WITH DIAERESIS | |
| old name: | |
| Adobe glyph name: udieresis | |
| mnemonic name(s): <u:> | |
| HTML 4 mnemonic name:ü | |
| category: Ll (Letter, Lowercase) | |
| combining: 0 | |
| decomposition info: 0075 0308 | |
| comment: | |
| found in charsets: CENTEURO (9F); 8859-1 (FC); 8859-10 (FC); 8859-13 (FC); 8859-14 (FC); 8859-15 (FC); 8859-16 (FC); 8859-2 (FC); 8859-3 (FC); 8859-4 (FC); 8859-9 (FC); SAMI_MAC (9F); CP1116 (81); CP1122 (A1); CP1250 (FC); CP1252 (FC); CP1254 (FC); CP1256 (FC); CP1257 (FC); CP1258 (FC); CP437 (81); CP775 (81); CP850 (81); CP852 (81); CP857 (81); CP860 (81); CP861 (81); CP863 (81); CP865 (81); ROMAN (9F); SAMI_WIN (FC); VENTURA_INT (81); | |
| found in languages: az [Azerbaijani]; bi [Bislama]; br [Breton]; ca [Catalan]; ch [Chamorro]; cor [Cornish]; cy [Welsh]; de [German]; es [Spanish]; et [Estonian]; eu [Basque]; fr [French]; fy [Frisian]; gl [Galician]; hu [Hungarian]; lb [Luxembourgian]; nl [Dutch]; pt [Portuguese]; sl [Slovenian]; sv [Swedish]; tk [Turkmen]; tr [Turkish]; tt [Tatar]; | |
| used in romanization of: kk_r [Kazakh (cyrillic)]; ky_r [Kyrgyz (cyrillic)]; mn_r [Mongolian (cyrillic)]; tk_r [Turkmen (cyrillic)]; zh_r [Chinese (sino-japanese)]; | |
| uppercase: 00DC |
Latin alphabet 1927-1939, Yan(g)alif, used different letters for
dotless i (Cyrillic soft sign), j (barred z), and g with breve (U01a2).
A convention to use e for schwa, í for e, ñ for n with descender
and for barred o has also gained some popularity as more compatible with
other Turkic languages.
Unicode currently does not have neither precomposed allocation for N with descender nor combining descender. There is LATIN SMALL LETTER BARRED O (U0275), but the uppercase variant is called LATIN CAPITAL LETTER O WITH MIDDLE TILDE (U019f) with usage note 'African'.
The conversion from Cyrillic to Latin script is planned within years 2001-2011.