? Short description LDB Letter Database query page

Tatar

Required characters

U018F

decimal: Ə
UTF-8 (c6, 8f) Ə
name: LATIN CAPITAL LETTER SCHWA
old name:
Adobe glyph name:
mnemonic name(s):
HTML 4 mnemonic name:
category: Lu (Letter, Uppercase)
combining: 0
comment:
found in charsets:
found in languages: az [Azerbaijani]; tt [Tatar];
used in romanization of:
lowercase: 0259

U0259

decimal: ə
UTF-8 (c9, 99) ə
name: LATIN SMALL LETTER SCHWA
old name:
Adobe glyph name:
mnemonic name(s):
HTML 4 mnemonic name:
category: Ll (Letter, Lowercase)
combining: 0
comment:
found in charsets:
found in languages: az [Azerbaijani]; tt [Tatar];
used in romanization of:
uppercase: 018F

U00C7

decimal: Ç
UTF-8 (c3, 87) Ç
name: LATIN CAPITAL LETTER C WITH CEDILLA
old name: LATIN CAPITAL LETTER C CEDILLA
Adobe glyph name: Ccedilla
mnemonic name(s): <C,>
HTML 4 mnemonic name:&Ccedil;
category: Lu (Letter, Uppercase)
combining: 0
decomposition info: 0043 0327
comment:
found in charsets: 8859-1 (C7); 8859-14 (C7); 8859-15 (C7); 8859-16 (C7); 8859-2 (C7); 8859-3 (C7); 8859-9 (C7); SAMI_MAC (82); CP1116 (80); CP1122 (68); CP1250 (C7); CP1252 (C7); CP1254 (C7); CP1258 (C7); CP437 (80); CP850 (80); CP852 (80); CP857 (80); CP860 (80); CP861 (80); CP863 (80); CP865 (80); ROMAN (82); SAMI_WIN (C7); VENTURA_INT (80);
found in languages: az [Azerbaijani]; ca [Catalan]; en [English]; es [Spanish]; eu [Basque]; fr [French]; ku [Kurdish]; oc [Occitan]; pt [Portuguese]; sq [Albanian]; tk [Turkmen]; tr [Turkish]; tt [Tatar]; wa [Walloon];
used in romanization of: ar_r [Arabic (perso-arabic)];
lowercase: 00E7

U00E7

decimal: &#231;
UTF-8 (c3, a7) ç
name: LATIN SMALL LETTER C WITH CEDILLA
old name: LATIN SMALL LETTER C CEDILLA
Adobe glyph name: ccedilla
mnemonic name(s): <c,>
HTML 4 mnemonic name:&ccedil;
category: Ll (Letter, Lowercase)
combining: 0
decomposition info: 0063 0327
comment:
found in charsets: 8859-1 (E7); 8859-14 (E7); 8859-15 (E7); 8859-16 (E7); 8859-2 (E7); 8859-3 (E7); 8859-9 (E7); SAMI_MAC (8D); CP1116 (87); CP1122 (48); CP1250 (E7); CP1252 (E7); CP1254 (E7); CP1256 (E7); CP1258 (E7); CP437 (87); CP850 (87); CP852 (87); CP857 (87); CP860 (87); CP861 (87); CP863 (87); CP865 (87); ROMAN (8D); SAMI_WIN (E7); VENTURA_INT (87);
found in languages: az [Azerbaijani]; ca [Catalan]; en [English]; es [Spanish]; eu [Basque]; fr [French]; ku [Kurdish]; oc [Occitan]; pt [Portuguese]; sq [Albanian]; tk [Turkmen]; tr [Turkish]; tt [Tatar]; wa [Walloon];
used in romanization of: ar_r [Arabic (perso-arabic)];
uppercase: 00C7

U011E

decimal: &#286;
UTF-8 (c4, 9e) Ğ
name: LATIN CAPITAL LETTER G WITH BREVE
old name: LATIN CAPITAL LETTER G BREVE
Adobe glyph name: Gbreve
mnemonic name(s): <G(>
HTML 4 mnemonic name:
category: Lu (Letter, Uppercase)
combining: 0
decomposition info: 0047 0306
comment:
found in charsets: 8859-3 (AB); 8859-9 (D0); CP1254 (D0); CP857 (A6);
found in languages: az [Azerbaijani]; tr [Turkish]; tt [Tatar];
used in romanization of:
lowercase: 011F

U011F

decimal: &#287;
UTF-8 (c4, 9f) ğ
name: LATIN SMALL LETTER G WITH BREVE
old name: LATIN SMALL LETTER G BREVE
Adobe glyph name: gbreve
mnemonic name(s): <g(>
HTML 4 mnemonic name:
category: Ll (Letter, Lowercase)
combining: 0
decomposition info: 0067 0306
comment:
found in charsets: 8859-3 (BB); 8859-9 (F0); CP1254 (F0); CP857 (A7);
found in languages: az [Azerbaijani]; tr [Turkish]; tt [Tatar];
used in romanization of:
uppercase: 011E

U0130

decimal: &#304;
UTF-8 (c4, b0) İ
name: LATIN CAPITAL LETTER I WITH DOT ABOVE
old name: LATIN CAPITAL LETTER I DOT
Adobe glyph name: Idotaccent
mnemonic name(s): <I.>
HTML 4 mnemonic name:
category: Lu (Letter, Uppercase)
combining: 0
decomposition info: 0049 0307
comment:
found in charsets: 8859-3 (A9); 8859-9 (DD); CP1254 (DD); CP857 (98);
found in languages: az [Azerbaijani]; tr [Turkish]; tt [Tatar];
used in romanization of:
lowercase: 0069

U0131

decimal: &#305;
UTF-8 (c4, b1) ı
name: LATIN SMALL LETTER DOTLESS I
old name:
Adobe glyph name: dotlessi
mnemonic name(s): <i.>
HTML 4 mnemonic name:
category: Ll (Letter, Lowercase)
combining: 0
comment:
found in charsets: 8859-3 (B9); 8859-9 (FD); SAMI_MAC (F5); CP1116 (D5); CP1254 (FD); CP850 (D5); CP857 (8D); ROMAN (F5);
found in languages: az [Azerbaijani]; tr [Turkish]; tt [Tatar];
used in romanization of:
uppercase: 0049

UE01A

not an UCS character!
name: LATIN CAPITAL LETTER N WITH DESCENDER
old name:
Adobe glyph name:
category: Lu (Letter, Uppercase)
combining: 0
comment:
found in charsets:
found in languages: tt [Tatar];
used in romanization of:
lowercase: E01B

UE01B

not an UCS character!
name: LATIN SMALL LETTER N WITH DESCENDER
old name:
Adobe glyph name:
category: Ll (Letter, Lowercase)
combining: 0
comment:
found in charsets:
found in languages: tt [Tatar];
used in romanization of:
uppercase: E01A

UE01C

not an UCS character!
name: LATIN CAPITAL LETTER BARRED O
old name:
Adobe glyph name:
category: Lu (Letter, Uppercase)
combining: 0
comment:
found in charsets:
found in languages: tt [Tatar];
used in romanization of:
lowercase: E01D

UE01D

not an UCS character!
name: LATIN SMALL LETTER BARRED O
old name:
Adobe glyph name:
category: Ll (Letter, Lowercase)
combining: 0
comment:
found in charsets:
found in languages: tt [Tatar];
used in romanization of:
uppercase: E01C

U015E

decimal: &#350;
UTF-8 (c5, 9e) Ş
name: LATIN CAPITAL LETTER S WITH CEDILLA
old name: LATIN CAPITAL LETTER S CEDILLA
Adobe glyph name: Scommaaccent
mnemonic name(s): <S,>
HTML 4 mnemonic name:
category: Lu (Letter, Uppercase)
combining: 0
decomposition info: 0053 0327
comment: *
Note: Please see note 1
found in charsets: 8859-2 (AA); 8859-3 (AA); 8859-9 (DE); CP1250 (AA); CP1254 (DE); CP852 (B8); CP857 (9E);
found in languages: az [Azerbaijani]; ku [Kurdish]; tk [Turkmen]; tr [Turkish]; tt [Tatar];
used in romanization of: ar_r [Arabic (perso-arabic)]; fa_r [Persian (perso-arabic)]; ps_r [Pashto (perso-arabic)];
lowercase: 015F

U015F

decimal: &#351;
UTF-8 (c5, 9f) ş
name: LATIN SMALL LETTER S WITH CEDILLA
old name: LATIN SMALL LETTER S CEDILLA
Adobe glyph name: scommaaccent
mnemonic name(s): <s,>
HTML 4 mnemonic name:
category: Ll (Letter, Lowercase)
combining: 0
decomposition info: 0073 0327
comment: *
found in charsets: 8859-2 (BA); 8859-3 (BA); 8859-9 (FE); CP1250 (BA); CP1254 (FE); CP852 (AD); CP857 (9F);
found in languages: az [Azerbaijani]; ku [Kurdish]; tk [Turkmen]; tr [Turkish]; tt [Tatar];
used in romanization of: ar_r [Arabic (perso-arabic)]; fa_r [Persian (perso-arabic)]; ps_r [Pashto (perso-arabic)];
uppercase: 015E

U00DC

decimal: &#220;
UTF-8 (c3, 9c) Ü
name: LATIN CAPITAL LETTER U WITH DIAERESIS
old name: LATIN CAPITAL LETTER U DIAERESIS
Adobe glyph name: Udieresis
mnemonic name(s): <U:>
HTML 4 mnemonic name:&Uuml;
category: Lu (Letter, Uppercase)
combining: 0
decomposition info: 0055 0308
comment:
found in charsets: CENTEURO (86); 8859-1 (DC); 8859-10 (DC); 8859-13 (DC); 8859-14 (DC); 8859-15 (DC); 8859-16 (DC); 8859-2 (DC); 8859-3 (DC); 8859-4 (DC); 8859-9 (DC); SAMI_MAC (86); CP1116 (9A); CP1122 (FC); CP1250 (DC); CP1252 (DC); CP1254 (DC); CP1257 (DC); CP1258 (DC); CP437 (9A); CP775 (9A); CP850 (9A); CP852 (9A); CP857 (9A); CP860 (9A); CP861 (9A); CP863 (9A); CP865 (9A); ROMAN (86); SAMI_WIN (DC); VENTURA_INT (9A);
found in languages: az [Azerbaijani]; bi [Bislama]; br [Breton]; ca [Catalan]; ch [Chamorro]; cor [Cornish]; cy [Welsh]; de [German]; es [Spanish]; et [Estonian]; eu [Basque]; fr [French]; fy [Frisian]; gl [Galician]; hu [Hungarian]; lb [Luxembourgian]; nl [Dutch]; pt [Portuguese]; sl [Slovenian]; sv [Swedish]; tk [Turkmen]; tr [Turkish]; tt [Tatar];
used in romanization of: kk_r [Kazakh (cyrillic)]; ky_r [Kyrgyz (cyrillic)]; mn_r [Mongolian (cyrillic)]; tk_r [Turkmen (cyrillic)]; zh_r [Chinese (sino-japanese)];
lowercase: 00FC

U00FC

decimal: &#252;
UTF-8 (c3, bc) ü
name: LATIN SMALL LETTER U WITH DIAERESIS
old name: LATIN SMALL LETTER U DIAERESIS
Adobe glyph name: udieresis
mnemonic name(s): <u:>
HTML 4 mnemonic name:&uuml;
category: Ll (Letter, Lowercase)
combining: 0
decomposition info: 0075 0308
comment:
found in charsets: CENTEURO (9F); 8859-1 (FC); 8859-10 (FC); 8859-13 (FC); 8859-14 (FC); 8859-15 (FC); 8859-16 (FC); 8859-2 (FC); 8859-3 (FC); 8859-4 (FC); 8859-9 (FC); SAMI_MAC (9F); CP1116 (81); CP1122 (A1); CP1250 (FC); CP1252 (FC); CP1254 (FC); CP1256 (FC); CP1257 (FC); CP1258 (FC); CP437 (81); CP775 (81); CP850 (81); CP852 (81); CP857 (81); CP860 (81); CP861 (81); CP863 (81); CP865 (81); ROMAN (9F); SAMI_WIN (FC); VENTURA_INT (81);
found in languages: az [Azerbaijani]; bi [Bislama]; br [Breton]; ca [Catalan]; ch [Chamorro]; cor [Cornish]; cy [Welsh]; de [German]; es [Spanish]; et [Estonian]; eu [Basque]; fr [French]; fy [Frisian]; gl [Galician]; hu [Hungarian]; lb [Luxembourgian]; nl [Dutch]; pt [Portuguese]; sl [Slovenian]; sv [Swedish]; tk [Turkmen]; tr [Turkish]; tt [Tatar];
used in romanization of: kk_r [Kazakh (cyrillic)]; ky_r [Kyrgyz (cyrillic)]; mn_r [Mongolian (cyrillic)]; tk_r [Turkmen (cyrillic)]; zh_r [Chinese (sino-japanese)];
uppercase: 00DC


None of the codepages in this database contain all the characters listed above.
Best match (10 out of 16 characters) is:
  1. 8859-3
  2. 8859-9
  3. CP1254
  4. CP857

Apostrophe is required and has two different meanings. It is used to denote glottal stop in Arabic loanwords and palatalization in (mostly Russian) loans.

Latin alphabet 1927-1939, Yan(g)alif, used different letters for dotless i (Cyrillic soft sign), j (barred z), and g with breve (U01a2).
A convention to use e for schwa, í for e, ñ for n with descender and for barred o has also gained some popularity as more compatible with other Turkic languages.

Unicode currently does not have neither precomposed allocation for N with descender nor combining descender. There is LATIN SMALL LETTER BARRED O (U0275), but the uppercase variant is called LATIN CAPITAL LETTER O WITH MIDDLE TILDE (U019f) with usage note 'African'.

The conversion from Cyrillic to Latin script is planned within years 2001-2011.


Please send your comments to kiisu@eki.ee