GetTeluguCategory

Signature

function GetTeluguCategory(CP: Cardinal): Integer;

Purpose

Pure lookup from a Unicode codepoint to its Telugu syllabic category; no font state is required. Returns one of 13 category codes, using the same numbering as GetDevanagariCategory. The table below lists 11 of them; code 8 does not appear in it.

Return values

Code  Category                      Example codepoints
0     Other                         U+0C04 reserved, U+0C3D Avagraha
1     Consonant                     U+0C15–U+0C28, U+0C2A–U+0C39, U+0C58–U+0C5A
2     Independent vowel             U+0C05–U+0C0C, U+0C0E–U+0C10, U+0C12–U+0C14, U+0C60–U+0C61
3     Matra (dependent vowel sign)  U+0C3E–U+0C44, U+0C46–U+0C48, U+0C4A–U+0C4C, U+0C55–U+0C56, U+0C62–U+0C63
4     Virama (Halant)               U+0C4D
5     Nukta                         U+0C3C
6     Bindu                         U+0C00–U+0C02 (Combining candrabindu / Candrabindu / Anusvara)
7     Visarga                       U+0C03
9     Digit                         U+0C66–U+0C6F
10    ZWJ                           U+200D
11    ZWNJ                          U+200C
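
The table above can be restated as a range lookup. The following is a minimal sketch, not the library's actual implementation: the function name SketchTeluguCategory and the tc* constant names are hypothetical, and only the numeric codes and codepoint ranges are taken from the table. It assumes that any codepoint not listed falls through to 0 (Other).

```pascal
program TeluguCategorySketch;
{$IFDEF FPC}{$mode objfpc}{$ENDIF}

const
  // Hypothetical names; only the numeric values come from the table.
  tcOther = 0; tcConsonant = 1; tcIndependentVowel = 2; tcMatra = 3;
  tcVirama = 4; tcNukta = 5; tcBindu = 6; tcVisarga = 7;
  tcDigit = 9; tcZWJ = 10; tcZWNJ = 11;

  // The word "Telugu" in Telugu script: TA, E, LA, U, GA, U.
  Sample: array[0..5] of Cardinal =
    ($0C24, $0C46, $0C32, $0C41, $0C17, $0C41);

// Illustrative restatement of the table as a case statement.
function SketchTeluguCategory(CP: Cardinal): Integer;
begin
  case CP of
    $0C15..$0C28, $0C2A..$0C39, $0C58..$0C5A:
      Result := tcConsonant;
    $0C05..$0C0C, $0C0E..$0C10, $0C12..$0C14, $0C60..$0C61:
      Result := tcIndependentVowel;
    $0C3E..$0C44, $0C46..$0C48, $0C4A..$0C4C, $0C55..$0C56, $0C62..$0C63:
      Result := tcMatra;
    $0C4D: Result := tcVirama;
    $0C3C: Result := tcNukta;
    $0C00..$0C02: Result := tcBindu;
    $0C03: Result := tcVisarga;
    $0C66..$0C6F: Result := tcDigit;
    $200D: Result := tcZWJ;
    $200C: Result := tcZWNJ;
  else
    // Assumption: everything not listed in the table is Other.
    Result := tcOther;
  end;
end;

var
  I: Integer;
begin
  // Prints one category code per codepoint of the sample word.
  for I := Low(Sample) to High(Sample) do
    WriteLn(SketchTeluguCategory(Sample[I]));
end.
```

For the sample word, the codes alternate between 1 (Consonant) and 3 (Matra), which is the pattern a syllable segmenter would expect for three consonant-plus-vowel-sign syllables.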

Notable Telugu-specific assignments

  • No pre-base matras. MatraPos = 1 is unused for Telugu — every Telugu matra is above-base (3), below-base (4), or split (5).
  • Above-base matras (MatraPos = 3): AA (U+0C3E), I (U+0C3F), II (U+0C40), E (U+0C46), EE (U+0C47), O (U+0C4A), OO (U+0C4B), AU (U+0C4C), length mark (U+0C55).
  • Below-base matras (MatraPos = 4): U (U+0C41), UU (U+0C42), Vocalic R (U+0C43), Vocalic RR (U+0C44), AI length mark (U+0C56), Vocalic L matra (U+0C62), Vocalic LL matra (U+0C63).
  • Split matra (MatraPos = 5): only U+0C48 AI, decomposed by ApplyTeluguReorder into U+0C46 (above) + U+0C56 (below).
  • Halant: U+0C4D.
  • Three bindu codepoints (combining candrabindu / candrabindu / anusvara) at U+0C00–U+0C02; visarga at U+0C03.
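
The matra assignments in the list above can be sketched the same way. This is an illustration only: SketchTeluguMatraPos and SketchDecomposeSplitMatra are hypothetical names, returning 0 for a non-matra codepoint is an assumption, and the real decomposition is performed inside ApplyTeluguReorder, whose signature is not shown here.

```pascal
program TeluguMatraSketch;
{$IFDEF FPC}{$mode objfpc}{$ENDIF}

// Hypothetical helper restating the list above.
// Assumption: 0 is returned for anything that is not a Telugu matra.
function SketchTeluguMatraPos(CP: Cardinal): Integer;
begin
  case CP of
    $0C3E..$0C40, $0C46, $0C47, $0C4A..$0C4C, $0C55:
      Result := 3;  // above-base
    $0C41..$0C44, $0C56, $0C62, $0C63:
      Result := 4;  // below-base
    $0C48:
      Result := 5;  // split (AI)
  else
    Result := 0;
  end;
end;

// Returns True and the two parts when CP is the split matra U+0C48.
// Otherwise returns False, First = CP, and Second = 0.
function SketchDecomposeSplitMatra(CP: Cardinal;
  out First, Second: Cardinal): Boolean;
begin
  Result := CP = $0C48;
  if Result then
  begin
    First := $0C46;   // E, above-base part
    Second := $0C56;  // AI length mark, below-base part
  end
  else
  begin
    First := CP;
    Second := 0;
  end;
end;

var
  A, B: Cardinal;
begin
  WriteLn(SketchTeluguMatraPos($0C3F));  // prints 3
  WriteLn(SketchTeluguMatraPos($0C41));  // prints 4
  WriteLn(SketchTeluguMatraPos($0C48));  // prints 5
  if SketchDecomposeSplitMatra($0C48, A, B) then
    // Each part should land in the expected position class.
    WriteLn(SketchTeluguMatraPos(A), ' ', SketchTeluguMatraPos(B));  // prints 3 4
end.
```

Decomposing U+0C48 before positioning means that each resulting part has a single position, so later stages only need to handle the above-base and below-base cases.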

See also

Version history

  • v2.119.74 — Introduced in Phase 8f.5.