GetTeluguCategory
Signature
function GetTeluguCategory(CP: Cardinal): Integer;
Purpose
Maps a Unicode codepoint to its Telugu syllabic category. The lookup is
pure: it depends only on CP and needs no font state. Returns one of 13
category codes, numbered the same way as GetDevanagariCategory; the table
below documents 11 of them.
Return values
| Code | Category | Example codepoints |
|---|---|---|
| 0 | Other | U+0C04 reserved, U+0C3D Avagraha |
| 1 | Consonant | U+0C15–U+0C28, U+0C2A–U+0C39, U+0C58–U+0C5A |
| 2 | Independent vowel | U+0C05–U+0C0C, U+0C0E–U+0C10, U+0C12–U+0C14, U+0C60–U+0C61 |
| 3 | Matra (dependent vowel sign) | U+0C3E–U+0C44, U+0C46–U+0C48, U+0C4A–U+0C4C, U+0C55–U+0C56, U+0C62–U+0C63 |
| 4 | Virama (Halant) | U+0C4D |
| 5 | Nukta | U+0C3C |
| 6 | Bindu | U+0C00–U+0C02 (Combining candrabindu / Candrabindu / Anusvara) |
| 7 | Visarga | U+0C03 |
| 9 | Digit | U+0C66–U+0C6F |
| 10 | ZWJ | U+200D |
| 11 | ZWNJ | U+200C |
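
The table can be read as a range lookup. The following is a minimal sketch in Free Pascal that mirrors only the ranges listed above; `SketchTeluguCategory` is a hypothetical stand-in written for illustration, not the actual body of GetTeluguCategory, which may be implemented differently (for example with a lookup table) and may handle codepoints this table does not list.

```pascal
program TeluguCategorySketch;

{$mode objfpc}{$H+}

// Hypothetical stand-in for GetTeluguCategory, reconstructed only from the
// documented table. Anything not listed there falls through to 0 (Other).
function SketchTeluguCategory(CP: Cardinal): Integer;
begin
  case CP of
    $0C15..$0C28, $0C2A..$0C39, $0C58..$0C5A:
      Result := 1;   // Consonant
    $0C05..$0C0C, $0C0E..$0C10, $0C12..$0C14, $0C60..$0C61:
      Result := 2;   // Independent vowel
    $0C3E..$0C44, $0C46..$0C48, $0C4A..$0C4C, $0C55..$0C56, $0C62..$0C63:
      Result := 3;   // Matra (dependent vowel sign)
    $0C4D:
      Result := 4;   // Virama (Halant)
    $0C3C:
      Result := 5;   // Nukta
    $0C00..$0C02:
      Result := 6;   // Bindu
    $0C03:
      Result := 7;   // Visarga
    $0C66..$0C6F:
      Result := 9;   // Digit
    $200D:
      Result := 10;  // ZWJ
    $200C:
      Result := 11;  // ZWNJ
  else
    Result := 0;     // Other (includes U+0C04 and U+0C3D per the table)
  end;
end;

begin
  WriteLn(SketchTeluguCategory($0C15)); // KA, prints 1
  WriteLn(SketchTeluguCategory($0C4D)); // virama, prints 4
  WriteLn(SketchTeluguCategory($0C04)); // listed as Other, prints 0
  WriteLn(SketchTeluguCategory($200D)); // ZWJ, prints 10
end.
```

Because the function takes only the codepoint, a sketch like this can be exercised without loading a font, which matches the "no font state" note in Purpose.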
Notable Telugu-specific assignments
- No pre-base matras. MatraPos = 1 is unused for Telugu: every Telugu matra is above-base (3), below-base (4), or split (5).
- Above-base matras (MatraPos = 3): AA (U+0C3E), I (U+0C3F), II (U+0C40), E (U+0C46), EE (U+0C47), O (U+0C4A), OO (U+0C4B), AU (U+0C4C), length mark (U+0C55).
- Below-base matras (MatraPos = 4): U (U+0C41), UU (U+0C42), Vocalic R (U+0C43), Vocalic RR (U+0C44), AI length mark (U+0C56), Vocalic L matra (U+0C62), Vocalic LL matra (U+0C63).
- Split matra (MatraPos = 5): only U+0C48 AI, decomposed by ApplyTeluguReorder into U+0C46 (above) + U+0C56 (below).
- Halant: U+0C4D.
- Three bindu codepoints (combining candrabindu / candrabindu / anusvara) at U+0C00–U+0C02; visarga at U+0C03.
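
The matra positions and the split-matra decomposition listed above can be sketched as follows. Both `SketchTeluguMatraPos` and `SketchDecomposeSplitMatra` are hypothetical names introduced here for illustration; the source only states that ApplyTeluguReorder performs the decomposition, not how. The return value 0 for "not a matra" is also an assumption of this sketch.

```pascal
program TeluguMatraSketch;

{$mode objfpc}{$H+}

uses
  SysUtils;

// Hypothetical helper: returns the MatraPos value from the list above,
// or 0 (this sketch's own marker) when CP is not a Telugu matra.
function SketchTeluguMatraPos(CP: Cardinal): Integer;
begin
  case CP of
    $0C3E..$0C40, $0C46..$0C47, $0C4A..$0C4C, $0C55:
      Result := 3;   // above-base
    $0C41..$0C44, $0C56, $0C62..$0C63:
      Result := 4;   // below-base
    $0C48:
      Result := 5;   // split (AI)
  else
    Result := 0;     // not a matra; MatraPos = 1 (pre-base) is never produced
  end;
end;

// Hypothetical helper: splits U+0C48 into its above and below parts.
// Returns False and leaves CP whole for every other codepoint.
function SketchDecomposeSplitMatra(CP: Cardinal;
  out Above, Below: Cardinal): Boolean;
begin
  Result := CP = $0C48;
  if Result then
  begin
    Above := $0C46;  // E, above-base
    Below := $0C56;  // AI length mark, below-base
  end
  else
  begin
    Above := CP;
    Below := 0;
  end;
end;

var
  A, B: Cardinal;
begin
  WriteLn(SketchTeluguMatraPos($0C3F)); // I, prints 3
  WriteLn(SketchTeluguMatraPos($0C41)); // U, prints 4
  WriteLn(SketchTeluguMatraPos($0C48)); // AI, prints 5
  if SketchDecomposeSplitMatra($0C48, A, B) then
    WriteLn(IntToHex(Int64(A), 4), ' + ', IntToHex(Int64(B), 4)); // prints 0C46 + 0C56
end.
```

Both halves of the decomposition are themselves category-3 matras in the Return values table, so the output of a split can be passed back through the same category lookup.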
Version history
- v2.119.74 — Introduced in Phase 8f.5.