? Short description LDB Letter Database query page

Tatar

Required characters

U018F

decimal: Ə
UTF-8 (c6, 8f) Ə
name: LATIN CAPITAL LETTER SCHWA
old name:
Adobe glyph name:
mnemonic name(s):
HTML 4 mnemonic name:
category: Lu (Letter, Uppercase)
combining: 0
comment:
found in charsets:
found in languages: tt [Tatar]; az [Azerbaijani];
used in romanization of:
lowercase: 0259

U0259

decimal: ə
UTF-8 (c9, 99) ə
name: LATIN SMALL LETTER SCHWA
old name:
Adobe glyph name:
mnemonic name(s):
HTML 4 mnemonic name:
category: Ll (Letter, Lowercase)
combining: 0
comment:
found in charsets:
found in languages: tt [Tatar]; az [Azerbaijani];
used in romanization of:
uppercase: 018F

U00C7

decimal: Ç
UTF-8 (c3, 87) Ç
name: LATIN CAPITAL LETTER C WITH CEDILLA
old name: LATIN CAPITAL LETTER C CEDILLA
Adobe glyph name: Ccedilla
mnemonic name(s): <C,>
HTML 4 mnemonic name:&Ccedil;
category: Lu (Letter, Uppercase)
combining: 0
decomposition info: 0043 0327
comment:
found in charsets: CP863 (80); 8859-15 (C7); CP1250 (C7); 8859-3 (C7); CP1258 (C7); 8859-9 (C7); ROMAN (82); 8859-2 (C7); CP857 (80); CP1252 (C7); CP437 (80); CP860 (80); CP1122 (68); CP1254 (C7); SAMI_MAC (82); CP852 (80); CP865 (80); CP861 (80); CP850 (80); 8859-1 (C7); CP1116 (80); SAMI_WIN (C7); 8859-14 (C7); 8859-16 (C7); VENTURA_INT (80);
found in languages: ca [Catalan]; eu [Basque]; tr [Turkish]; tt [Tatar]; fr [French]; oc [Occitan]; wa [Walloon]; en [English]; az [Azerbaijani]; pt [Portuguese]; ku [Kurdish]; tk [Turkmen]; es [Spanish]; sq [Albanian];
used in romanization of: ar_r [Arabic (perso-arabic)];
lowercase: 00E7

U00E7

decimal: &#231;
UTF-8 (c3, a7) ç
name: LATIN SMALL LETTER C WITH CEDILLA
old name: LATIN SMALL LETTER C CEDILLA
Adobe glyph name: ccedilla
mnemonic name(s): <c,>
HTML 4 mnemonic name:&ccedil;
category: Ll (Letter, Lowercase)
combining: 0
decomposition info: 0063 0327
comment:
found in charsets: CP863 (87); 8859-15 (E7); CP1250 (E7); 8859-3 (E7); CP1258 (E7); 8859-9 (E7); ROMAN (8D); 8859-2 (E7); CP857 (87); CP1252 (E7); CP437 (87); CP860 (87); CP1122 (48); CP1254 (E7); SAMI_MAC (8D); CP852 (87); CP865 (87); CP861 (87); CP1256 (E7); CP850 (87); 8859-1 (E7); CP1116 (87); SAMI_WIN (E7); 8859-14 (E7); 8859-16 (E7); VENTURA_INT (87);
found in languages: ca [Catalan]; eu [Basque]; tr [Turkish]; tt [Tatar]; fr [French]; oc [Occitan]; wa [Walloon]; en [English]; az [Azerbaijani]; pt [Portuguese]; ku [Kurdish]; tk [Turkmen]; es [Spanish]; sq [Albanian];
used in romanization of: ar_r [Arabic (perso-arabic)];
uppercase: 00C7

U011E

decimal: &#286;
UTF-8 (c4, 9e) Ğ
name: LATIN CAPITAL LETTER G WITH BREVE
old name: LATIN CAPITAL LETTER G BREVE
Adobe glyph name: Gbreve
mnemonic name(s): <G(>
HTML 4 mnemonic name:
category: Lu (Letter, Uppercase)
combining: 0
decomposition info: 0047 0306
comment:
found in charsets: 8859-3 (AB); 8859-9 (D0); CP857 (A6); CP1254 (D0);
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani];
used in romanization of:
lowercase: 011F

U011F

decimal: &#287;
UTF-8 (c4, 9f) ğ
name: LATIN SMALL LETTER G WITH BREVE
old name: LATIN SMALL LETTER G BREVE
Adobe glyph name: gbreve
mnemonic name(s): <g(>
HTML 4 mnemonic name:
category: Ll (Letter, Lowercase)
combining: 0
decomposition info: 0067 0306
comment:
found in charsets: 8859-3 (BB); 8859-9 (F0); CP857 (A7); CP1254 (F0);
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani];
used in romanization of:
uppercase: 011E

U0130

decimal: &#304;
UTF-8 (c4, b0) İ
name: LATIN CAPITAL LETTER I WITH DOT ABOVE
old name: LATIN CAPITAL LETTER I DOT
Adobe glyph name: Idotaccent
mnemonic name(s): <I.>
HTML 4 mnemonic name:
category: Lu (Letter, Uppercase)
combining: 0
decomposition info: 0049 0307
comment:
found in charsets: 8859-3 (A9); 8859-9 (DD); CP857 (98); CP1254 (DD);
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani];
used in romanization of:
lowercase: 0069

U0131

decimal: &#305;
UTF-8 (c4, b1) ı
name: LATIN SMALL LETTER DOTLESS I
old name:
Adobe glyph name: dotlessi
mnemonic name(s): <i.>
HTML 4 mnemonic name:
category: Ll (Letter, Lowercase)
combining: 0
comment:
found in charsets: 8859-3 (B9); 8859-9 (FD); ROMAN (F5); CP857 (8D); CP1254 (FD); SAMI_MAC (F5); CP850 (D5); CP1116 (D5);
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani];
used in romanization of:
uppercase: 0049

UE01A

not an UCS character!
name: LATIN CAPITAL LETTER N WITH DESCENDER
old name:
Adobe glyph name:
category: Lu (Letter, Uppercase)
combining: 0
comment:
found in charsets:
found in languages: tt [Tatar];
used in romanization of:
lowercase: E01B

UE01B

not an UCS character!
name: LATIN SMALL LETTER N WITH DESCENDER
old name:
Adobe glyph name:
category: Ll (Letter, Lowercase)
combining: 0
comment:
found in charsets:
found in languages: tt [Tatar];
used in romanization of:
uppercase: E01A

UE01C

not an UCS character!
name: LATIN CAPITAL LETTER BARRED O
old name:
Adobe glyph name:
category: Lu (Letter, Uppercase)
combining: 0
comment:
found in charsets:
found in languages: tt [Tatar];
used in romanization of:
lowercase: E01D

UE01D

not an UCS character!
name: LATIN SMALL LETTER BARRED O
old name:
Adobe glyph name:
category: Ll (Letter, Lowercase)
combining: 0
comment:
found in charsets:
found in languages: tt [Tatar];
used in romanization of:
uppercase: E01C

U015E

decimal: &#350;
UTF-8 (c5, 9e) Ş
name: LATIN CAPITAL LETTER S WITH CEDILLA
old name: LATIN CAPITAL LETTER S CEDILLA
Adobe glyph name: Scommaaccent
mnemonic name(s): <S,>
HTML 4 mnemonic name:
category: Lu (Letter, Uppercase)
combining: 0
decomposition info: 0053 0327
comment: *
Note: Please see note 1
found in charsets: CP1250 (AA); 8859-3 (AA); 8859-9 (DE); 8859-2 (AA); CP857 (9E); CP1254 (DE); CP852 (B8);
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani]; ku [Kurdish]; tk [Turkmen];
used in romanization of: fa_r [Persian (perso-arabic)]; ar_r [Arabic (perso-arabic)]; ps_r [Pashto (perso-arabic)];
lowercase: 015F

U015F

decimal: &#351;
UTF-8 (c5, 9f) ş
name: LATIN SMALL LETTER S WITH CEDILLA
old name: LATIN SMALL LETTER S CEDILLA
Adobe glyph name: scommaaccent
mnemonic name(s): <s,>
HTML 4 mnemonic name:
category: Ll (Letter, Lowercase)
combining: 0
decomposition info: 0073 0327
comment: *
found in charsets: CP1250 (BA); 8859-3 (BA); 8859-9 (FE); 8859-2 (BA); CP857 (9F); CP1254 (FE); CP852 (AD);
found in languages: tr [Turkish]; tt [Tatar]; az [Azerbaijani]; ku [Kurdish]; tk [Turkmen];
used in romanization of: fa_r [Persian (perso-arabic)]; ar_r [Arabic (perso-arabic)]; ps_r [Pashto (perso-arabic)];
uppercase: 015E

U00DC

decimal: &#220;
UTF-8 (c3, 9c) Ü
name: LATIN CAPITAL LETTER U WITH DIAERESIS
old name: LATIN CAPITAL LETTER U DIAERESIS
Adobe glyph name: Udieresis
mnemonic name(s): <U:>
HTML 4 mnemonic name:&Uuml;
category: Lu (Letter, Uppercase)
combining: 0
decomposition info: 0055 0308
comment:
found in charsets: CP863 (9A); CP775 (9A); 8859-15 (DC); CP1250 (DC); 8859-3 (DC); CENTEURO (86); CP1258 (DC); 8859-10 (DC); 8859-9 (DC); ROMAN (86); 8859-2 (DC); CP857 (9A); CP1252 (DC); CP437 (9A); CP860 (9A); 8859-4 (DC); CP1122 (FC); CP1254 (DC); SAMI_MAC (86); CP852 (9A); CP865 (9A); CP861 (9A); 8859-13 (DC); CP850 (9A); CP1257 (DC); 8859-1 (DC); CP1116 (9A); SAMI_WIN (DC); 8859-14 (DC); 8859-16 (DC); VENTURA_INT (9A);
found in languages: lb [Luxembourgian]; ca [Catalan]; eu [Basque]; br [Breton]; tr [Turkish]; tt [Tatar]; et [Estonian]; fr [French]; de [German]; nl [Dutch]; ch [Chamorro]; bi [Bislama]; cy [Welsh]; cor [Cornish]; fy [Frisian]; sv [Swedish]; az [Azerbaijani]; gl [Galician]; pt [Portuguese]; tk [Turkmen]; sl [Slovenian]; es [Spanish]; hu [Hungarian];
used in romanization of: ky_r [Kyrgyz (cyrillic)]; mn_r [Mongolian (cyrillic)]; zh_r [Chinese (sino-japanese)]; tk_r [Turkmen (cyrillic)]; kk_r [Kazakh (cyrillic)];
lowercase: 00FC

U00FC

decimal: &#252;
UTF-8 (c3, bc) ü
name: LATIN SMALL LETTER U WITH DIAERESIS
old name: LATIN SMALL LETTER U DIAERESIS
Adobe glyph name: udieresis
mnemonic name(s): <u:>
HTML 4 mnemonic name:&uuml;
category: Ll (Letter, Lowercase)
combining: 0
decomposition info: 0075 0308
comment:
found in charsets: CP863 (81); CP775 (81); 8859-15 (FC); CP1250 (FC); 8859-3 (FC); CENTEURO (9F); CP1258 (FC); 8859-10 (FC); 8859-9 (FC); ROMAN (9F); 8859-2 (FC); CP857 (81); CP1252 (FC); CP437 (81); CP860 (81); 8859-4 (FC); CP1122 (A1); CP1254 (FC); SAMI_MAC (9F); CP852 (81); CP865 (81); CP861 (81); 8859-13 (FC); CP1256 (FC); CP850 (81); CP1257 (FC); 8859-1 (FC); CP1116 (81); SAMI_WIN (FC); 8859-14 (FC); 8859-16 (FC); VENTURA_INT (81);
found in languages: lb [Luxembourgian]; ca [Catalan]; eu [Basque]; br [Breton]; tr [Turkish]; tt [Tatar]; et [Estonian]; fr [French]; de [German]; nl [Dutch]; ch [Chamorro]; bi [Bislama]; cy [Welsh]; cor [Cornish]; fy [Frisian]; sv [Swedish]; az [Azerbaijani]; gl [Galician]; pt [Portuguese]; tk [Turkmen]; sl [Slovenian]; es [Spanish]; hu [Hungarian];
used in romanization of: ky_r [Kyrgyz (cyrillic)]; mn_r [Mongolian (cyrillic)]; zh_r [Chinese (sino-japanese)]; tk_r [Turkmen (cyrillic)]; kk_r [Kazakh (cyrillic)];
uppercase: 00DC


None of the codepages in this database contain all the characters listed above.
Best match (10 out of 16 characters) is:
  1. 8859-3
  2. 8859-9
  3. CP1254
  4. CP857

Apostrophe is required and has two different meanings. It is used to denote glottal stop in Arabic loanwords and palatalization in (mostly Russian) loans.

Latin alphabet 1927-1939, Yan(g)alif, used different letters for dotless i (Cyrillic soft sign), j (barred z), and g with breve (U01a2).
A convention to use e for schwa, í for e, ñ for n with descender and for barred o has also gained some popularity as more compatible with other Turkic languages.

Unicode currently does not have neither precomposed allocation for N with descender nor combining descender. There is LATIN SMALL LETTER BARRED O (U0275), but the uppercase variant is called LATIN CAPITAL LETTER O WITH MIDDLE TILDE (U019f) with usage note 'African'.

The conversion from Cyrillic to Latin script is planned within years 2001-2011.


Please send your comments to kiisu@eki.ee