IITM Syllable Encoding Scheme
Definition files
The two-byte, syllable-level encoding followed in the IITM software is based on a superset of the vowels, consonants and samyuktakshars seen across the languages of the country. The set of syllables has been chosen from manuscripts and texts prepared in the country over a long period of time.
This set is sufficiently representative of the syllables one might encounter in normal use of the languages, but it is by no means complete. The three tables below specify numerical values for the basic vowels, consonants and special characters used across the scripts. Each table has a three-column format. The name given for a vowel, consonant or special character is the ASCII string used to refer to it in the programs which process the codes. These names have been chosen arbitrarily and have no relationship with any transliteration scheme.
Column 1 - Name of the vowel, consonant or special character
Column 2 - Numeric code assigned
Column 3 - Recommended ASCII value of the key for mapping the vowel, consonant or special character
Generic.vow
(The set of vowels)
| Name | Code | Key |
| ah | 0 | 0x61 |
| aah | 1 | 0x41 |
| ih | 2 | 0x69 |
| ieh | 3 | 0x49 |
| ou | 4 | 0x75 |
| oou | 5 | 0x55 |
| ru | 6 | 0x21 |
| yeh | 7 | 0x65 |
| yaeh | 8 | 0x45 |
| eye | 9 | 0x57 |
| oh | 10 | 0x6f |
| ooh | 11 | 0x4f |
| au | 12 | 0x77 |
| am | 13 | 0x40 |
| aha | 14 | 0x23 |
| ak | 15 | 0x71 |
Generic.con
(The set of consonants)
| Name | Code | Key |
| ka | 1 | 0x6B |
| kha | 2 | 0x4B |
| ga | 3 | 0x67 |
| gha | 4 | 0x47 |
| nga | 5 | 0x48 |
| cha | 6 | 0x63 |
| chha | 7 | 0x43 |
| ja | 8 | 0x6A |
| jha | 9 | 0x4A |
| nja | 10 | 0x4d |
| ta | 11 | 0x7a |
| tta | 12 | 0x5a |
| dda | 13 | 0x66 |
| ddha | 14 | 0x46 |
| nna | 15 | 0x4e |
| tha | 16 | 0x74 |
| ttha | 17 | 0x54 |
| da | 18 | 0x64 |
| dha | 19 | 0x44 |
| na | 20 | 0x6e |
| pa | 21 | 0x70 |
| pha | 22 | 0x50 |
| ba | 23 | 0x62 |
| bha | 24 | 0x42 |
| ma | 25 | 0x6d |
| ya | 26 | 0x79 |
| yab | 27 | 0x59 |
| ra | 28 | 0x72 |
| ra1 | 29 | 0x52 |
| rrte | 30 | 0x24 |
| la | 31 | 0x6c |
| llte | 32 | 0x25 |
| va | 33 | 0x76 |
| ca | 34 | 0x53 |
| sha | 35 | 0x78 |
| sa | 36 | 0x73 |
| ha | 37 | 0x68 |
| ksha | 38 | 0x58 |
| zha | 39 | 0x5b |
| lla | 40 | 0x4c |
| rra | 41 | 0x56 |
| nas | 42 | 0x5d |
| visarg | 43 | 0x3a |
| music | 44 | 0x26 |
| vedic | 45 | 0x5e |
| urdu | 46 | 0x51 |
Generic.spl
(The special characters including punctuation)
| Name | Code | Key |
| semcol | 0x00 | 0x3b |
| viram | 0x01 | 0x2e |
| pviram | 0x02 | 0x3d |
| avagrah | 0x03 | 0x7b |
| excla | 0x04 | 0x7c |
| bindu | 0x05 | 0x27 |
| cbindu | 0x06 | 0x22 |
| quote | 0x07 | 0x60 |
| slash | 0x08 | 0x2f |
| comma | 0x09 | 0x2c |
| qmark | 0x0a | 0x3f |
| star | 0x0b | 0x2a |
| lpar | 0x0c | 0x28 |
| rpar | 0x0d | 0x29 |
| res1 | 0x0e | 0x2b |
| res2 | 0x0f | 0x2d |
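The three-column layout lends itself to a simple lookup structure. The sketch below is only an illustration: the on-disk layout of the definition files is not described here, so it assumes whitespace-separated "name code key" rows, and the names `parse_rows` and `key_to_vowel` are hypothetical. The sample rows are copied from the tables above. Since the code column is written in decimal for vowels and consonants and in hexadecimal for special characters, the sketch accepts both.

```python
def parse_rows(text):
    """Parse 'name code key' rows (assumed whitespace-separated) into tuples."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 3:
            continue  # skip blank or malformed lines
        name, code, key = fields
        # int(x, 0) accepts both decimal ("10") and hexadecimal ("0x0a") text
        rows.append((name, int(code, 0), chr(int(key, 0))))
    return rows


# A few rows copied from the vowel and special character tables.
SAMPLE_VOWELS = """
ah   0  0x61
aah  1  0x41
ih   2  0x69
"""
SAMPLE_SPECIALS = """
semcol 0x00 0x3b
qmark  0x0a 0x3f
"""

vowels = parse_rows(SAMPLE_VOWELS)
specials = parse_rows(SAMPLE_SPECIALS)

# Reverse lookup: which vowel does a typed key select?
key_to_vowel = {key: (name, code) for name, code, key in vowels}

print(key_to_vowel["A"])  # ('aah', 1)
print(specials[1])        # ('qmark', 10, '?')
```

Keeping the key as a one-character string rather than an integer makes the mapping easy to compare against keyboard input, but that is a design choice of this sketch, not something the definition files prescribe.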
Generic.cnj
(The list of defined syllables)
Each definition specifies the base consonant first, followed by a colon separator and the conjunct definitions. Each consonant listed after the colon can form a second-level conjunct with the base consonant. Consonant names within parentheses refer to third-level conjuncts formed with the base consonant and the second-level consonant they follow. The numeric code assigned to a conjunct is its position, counting from 1, in the list for its base consonant; third-level conjuncts are counted in sequence immediately after the second-level conjunct they extend. For instance, the conjunct value for (ka, cha) is 3 and for (ga, na, ya) is 15. Note that the codes conform to the proper lexical ordering of the samyuktakshars.
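The numbering rule can be checked with a short sketch. It reads the rule as: count from 1 through the list for a base consonant, giving each parenthesised third-level conjunct the next number after its second-level conjunct. That reading is inferred from the two worked values (3 and 15) rather than stated explicitly, and the function name `conjunct_codes` is hypothetical. The two definitions are copied from the list that follows.

```python
import re


def conjunct_codes(definition):
    """Map each conjunct tuple to its 1-based position in one definition."""
    base, _, body = definition.partition(":")
    base = base.strip()
    codes = {}
    position = 0
    # Each match is a second-level consonant plus an optional "(...)" group
    for second, group in re.findall(r"(\w+)\s*(?:\(([^)]*)\))?", body):
        position += 1
        codes[(base, second)] = position
        # Third-level conjuncts take the positions right after their parent
        for third in filter(None, (t.strip() for t in group.split(","))):
            position += 1
            codes[(base, second, third)] = position
    return codes


KA = ("ka : ka , kha, cha, ja, ta (ra), nna, tha (ya, ra, va), "
      "ttha (na , ya), da, na (ya), pa, ba, ma (ya), ya, ra, la, va, ca, "
      "sa (tha, va), lla, urdu")
GA = ("ga : ga, gha (rra), cha, chha, ja, ta, tta, dda, nna, tha, da, dha, "
      "na(ya), pa, ba, bha(rra), ma, ya, ra(ya), la, va (ya), sha, sa, "
      "lla, rra, urdu")

print(conjunct_codes(KA)[("ka", "cha")])       # 3
print(conjunct_codes(GA)[("ga", "na", "ya")])  # 15
```

Both results agree with the values quoted above, which supports this reading of the rule; it does not show how the conjunct code is combined with the consonant and vowel codes in the two-byte value.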
ka : ka, kha, cha, ja, ta (ra), nna, tha (ya, ra, va), ttha (na, ya), da, na (ya), pa, ba, ma (ya), ya, ra, la, va, ca, sa (tha, va), lla, urdu
kha : kha, tha, na, ma, ya, ra, la, va, ca, sha, ksha, lla, rra, urdu
ga : ga, gha (rra), cha, chha, ja, ta, tta, dda, nna, tha, da, dha, na (ya), pa, ba, bha (rra), ma, ya, ra (ya), la, va (ya), sha, sa, lla, rra, urdu
gha : gha, nna, na (ya), ma, ya, ra, la, va, ca, sha, sa, lla, rra
nga : ka (tha, ya, ra, sha), kha (ya), ga (ya, ra, la, rra), gha (ya, ra, rra), nga, tha (va), ma, ya, ksha
cha : ga, cha, chha (cha, ra, va, rra), nja, na, pa, pha, bha, ma, ya, ra, la, va, ca, sha, sa, lla, rra
chha : cha, chha, na, ma, ya, ra, la, va, sa, rra
ja : ga, cha, ja (va), jha, nja (ya), tha, da, dha, na, pa, pha, ba, bha, ma, ya, ra, la, va, ca, sha, sa, rra, urdu
jha : jha, na, ma, ya, ra, la, va, sha, sa, rra
nja : cha, chha, ja (cha, ya, va), nja (ya), ddha, pa, ba, ya, rra
ta : ka, kha, gha, cha, ta (ya), tta (ya), tha (ra, rra), pa, ba, ma, ya, ra, la, va, sa (tha), lla, rra
tta : ka, tta, ya, ra, va, lla, rra
dda : ga, gha (ra), cha, ja, dda, ddha, tha, ma, ya, ra, la, lla, rra, urdu
ddha : ddha, ya, ra, lla, rra, urdu
nna : ka, ga, ta (ya), tta (ya), dda, ddha, nna, pa, ba, ma, ya, va, lla, rra
tha : ka (ra), cha, ja, tha (pha, ya, ra), ttha, na (ya), pa, bha, ma (ya), ya, ra (ya), la, va, ca, sha, sa (na, ya, va), ksha, lla, rra (ya)
ttha : ttha (ya), na, pa, bha (ya), ma, ya, ra, la, va, ca, sa, lla, rra
da : ga (rra), gha, cha, ja, da, dha (ya, va, rra), na (ya), pa, pha (ya), ba (ra, rra), bha (ya, rra), ma, ya, ra, la, va (ya, ra, rra), lla, rra
dha : dha, na (ya), ma, ya, ra (ya), va, lla, rra
na : ka, ga, cha, ja, ta, tha (ya, ra, va), ttha (ya), da (ra), dha (ya, ra, rra), na, pa (ra), bha, ma, ya, ra, la, va, ca, sha, sa, ha, rra
pa : ka, tha (ya, va), dha (va), na, pa, ma, ya, ra, la, va, sa (va), lla, rra
pha : ka, tha (va), pha, ya, ra, lla, rra, nas, urdu
ba : ka, ja, tha, da, dha (va), na, ba, bha, ma, ya, ra, va, lla, rra
bha : ka, na, bha, ya, ra, va, rra
ma : ka, cha, chha, ja, na, pa (ra, rra), pha, ba (ya), bha, ma, ya, ra (ya), la, va, ca, sa, ha, lla, rra
ya : ka (ka), ga, nga, cha, tha, da, na, pa, ya, ra, la, va, sa
yab : ka, yab
ra : ka (ya), kha, ga (ya), gha (ga), nga (ja), cha (ya, rra), chha, ja (ya), jha, ta, tta, nna (ya), tha (ma, ya), ttha (ya), da (ya, ra, va)
ra1 : dha (na, ya, va), na, pa, pha, ba, bha, ma (ya), ya, ra, la, va (ya), ca (ya, va), sha (ka, ta, nna, ya), sa, ha, ksha (ya, va)
rrte : ka, rrte
la : ka, ga, ta, tta, tha, ttha, da, na, pa (ya), ba, bha (ya, ra), ma (ya), ya, la, va, ha, rra
llte : ka, llte
va : ka, tha, na, ya, ra, va, lla, rra
ca : ka, kha, cha (ya), chha, nja, tha, na, ma, ya, ra (ya), la, va (ya), ca (pa, ba), lla, rra (ya)
sha : ka (ra), ta (ya, ra, va), tta (ya, ra, va), nna (ya), pa (ra), ma (ya), ya, ra, va, sha, lla, rra (ya)
sa : ka (rra), kha, ja, ta, tta, tha (ya, ra, va), ttha (ya), na (ya), pa (ra), pha, ma, ya, ra, la, va, sa (ttha, ya, va), lla, rra (ya)
ha : ka, nna, na (ya), ma, ya, ra, la, va, ha, lla, rra
ksha : nna, tha, na, pa, ma, ya, la, va, ksha, rra
zha : ka, kha, ga, gha, nga, cha, chha, ja, nja, ta, tta, dda, nna, tha (tha), da, dha, na, pa, ba, bha, ma, ya, va, ca, sha, sa, ha, zha
lla : ka, kha, ga, gha, nga, cha, chha, ja, ta, tta, dda, nna, tha, ttha, da, dha, na, pa, ba, bha, ma, ya, va, ca, sha, sa, ha, lla, rra
rra : ka, kha, ga, gha, nga, cha, chha, ja, nja, ta, tta, dda, nna, tha, ttha, da, dha, na, pa, ba, bha, ma, ya, la, va, ca, sha, sa, ha, lla, rra
nas : ka, kha, ga, gha, nga, cha, chha, ja, nja, ta, tta, dda, nna, tha, da, dha, na, pa, ba, bha, ma, ya, va, ca, sha, sa, ha, zha, nas
visarg : ka, cha, ta, tha, pa, ba, ya, ra, la, va, ca, sha, sa, ha
music : ga, da, na, pa, ma, ra, sa
vedic : ka, cha, ta, tha, pa, ya, ra, la, va, ca, sha, sa, ha
urdu : ka, kha, ga, gha, nga, cha, chha, ja, jha, nja, ta, tta, dda, nna, tha, ttha, da, dha, na, pa, ba, bha, ma, ya, la, va, ca, sha, sa, ha, ksha