Beruflich Dokumente
Kultur Dokumente
Abstract : Now a days Natural language processing is one in Telugu language, system need to understand different
of the most emerging technology in computer science. properties of Chandhassu types. To achieve this her we
Challenging topics in this field is processing the Indian are using rule based approach it is one of the NLP
languages. The main objective here is to reduce human
technique in syntax analysis. Here rules are defined by
effort to learn other languages like artificial languages to
linguistic experts.
operate a machines and make a system to understand
Natural language processing (NLP) is a
natural languages. Chandhasu is a basic information to
know whether a given poem is syntactically correct or not. subfield of artificial intelligence and linguistics. It
Proposed system is help full to check whether a given studies the problems of automated understanding and
poem is syntactically correct or not. This system can be generation of natural human languages. Natural
developed based on predefined rules for each type of language understanding systems convert samples of
Chandhasu. In Sanskrit Chandhasu types are called human language into more formal representations that
vruthas, some vruthas are adapted to write telugu poetry, are easier for computer programs to manipulate and
Vruthas based on number of letters. In Telugu Chandhasu
natural language generation systems convert
types are called jaathulu and upajaathulu. These are
information from computer databases into normal-
dependent on number of ganalu. And also it checks
sounding human language. Language processing
whether poem follows yathi and prasa properties. Poetry
can be useful to represent large amount of information in a applications will use knowledge about language. In
small and structured format such a way that it can be easy theory, natural-language processing is a very attractive
to remember. method of human-computer interaction .Modern NLP
algorithms are grounded in machine learning, especially
Keywords:Chandhassu, Laguvu, Guruvu, Ganalu, Yathi,
Prasa, Labels of Ganalu, Vruthalu, Jathulu, Upajathulu, in rule based and statistical machine learning.
tokenization, Poems, Unicode, Rule based approach.
CHANDHASSU:
www.ijcsit.com 1425
I. Reddy Sekhar Reddy et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 7 (3) , 2016, 1425-1434
number of letters in particular order and that will very In telugu Chandhassu types are called jathulu and
delight to lesion. upajathulu. This are dependent on indhraganalu and
suryaganalu those are
There are 26 types of Chandhassu in Sanskrit.
iంద్రగణాలు
Each type of Chandhassu is recognized by the number
నల IIII
of akxaras present in each line of the poem. As an
నగ IIIU
akxara can be either a ‘laghuvu’ or a ‘guruvu’. Some
corers of vruthas are possible with this 26 types. Among భగణము UII
those mostly below two hundred vruthas are regularly రగణము UIU
used in writing a slokas, among this some vruthas are తగణము UUI
used in writing a telugu poetry. All this vruthas are సల IIUI
dependent on “య మా తా రా జ భా న స ల గం “ ganalu. In సూర్యగణాలు
నగణము III
samavruthas number of lines four, number of letters in హగణము UI
each line should be same, second letter in each line call
SYSTEM ARCHITECTURE:
it as a prasa.
యగణము IUU Chandhassu Recognizer takes input as a telugu
మగణము UUU poem, check whether a given poem fallow any type of
తగణము UUI Chandhassu rules or not. If it fallow Chandhassu rules
రగణము UIU then it produce which type of Chandhassu it fallowed,
జగణము IUI letter used for prasa if it present, letters used in
భగణము UII yathimaithri places. If it was not fallowed any type of
నగణము III Chandhassu then it produce output as syntax error in a
సగణము IIU given poem. Architecture for RBSA shown in below
లగం(వగణము) IU figure
గగ UU
www.ijcsit.com 1426
I. Reddy Sekhar Reddy et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 7 (3) , 2016, 1425-1434
Chandhassu Recognizer (C.R) take input as a to give request to the C.R and C.R can send the
poem then send it to deciding of lagu or guru stage. In response to the user, through this module we
this stage C.R count number of lines, each line divided can establish the communication between
into words, each word divided into letters then each client and C.R. In this page user able to know
latter labelled with either lagu or guru. That labelled about the overview of the whole application
data send to labelled with ganalu. Hear labelled data is and having some link to get better knowledge
going to be divided into ganalu, each ganam is labelled about the whole application.
with appropriate latter from “ya ma tha ra jab ha na sa la
Deciding of Lagu/Guru:
gam”. This data is send to Chandhassu mapper it is
going to be identify type of Chandhassu and return to In this module it takes input as a poem
user and also send it to recognizer of yathi and prasa and output would be each letter in a poem
letters. It takes poem and poem type as a input and tagged with either lagu or guru. For this
finally it produce prasa letter, yathi mithri placed module input space is {all the telugu letters}
between the letters. output space is{ U,I }
SYNTAX ANALYZER FOR TELUGU POEMS: String tokenizer :
C.R takes input as a Telugu poem, check String tokenizer takes input as a given
whether a given poem fallow any type of Chandhassu poem. Poem divided into words based on word
rules or not. If it fallow then it produce which type of delimiter, Hear’-’.
Chandhassu it fallowed, letter used in prasa and Symbol tokenizer :
yathimaithri places. If it was not fallowed any type of Symbol tokenizer takes input words
Chandhassu then it produce output as syntax error in a from String tokenizer, each word divided into
given poem. letters.
Labelling with Lagu /Guru :
Modules: Labelling with Lagu/Guru takes input
Fallowing 5 modules are involves in Syntax analyzer for as letters, word boundaries from symbol
Telugu poems tokenizer and different types of rules regarding
1. User interface lagu/Guru from proposed ‘algorithm 1’,this
2. Deciding of Lagu/Guru algorithm works based on Unicode’s, finally it
3. Divided and Labelled with Ganalu gives output as each letter in given poem
4. Chandhassu mapper labelled with either Laguvu or Guruvu.
5. Recognizer of Yathi and Prasa Letters Initially given input poem divided into
word and letters. Hear each letter need to check
User interface
all properties defined in algorithm 3.1. To
In this module we create a user page using
recognize dirgakxaralu, dithva, samyukthaxalu
GUI, which will be the media to interact user
it is going to be use Unicode representations of
with the C.R and through which client can able
www.ijcsit.com 1427
I. Reddy Sekhar Reddy et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 7 (3) , 2016, 1425-1434
label with 'U‘ if ‘i’ is not starting of a word Step7:Repet Step4 to step6 for remaining three lines
Step6:Poluhallu occur then label with 'U' step8: Check names of ganas in each line in the poem is
Step7:Other wise label with 'I' equals to "భరనభభరవ" and n.of letters in each
Step8:Stop line equals to 20 then
Divided and Labelled with Ganalu: return "uత్పలమాల"
Divided and Labelled with Ganalu take input or equals to "నజభజజజర" and n.of letters in
www.ijcsit.com 1428
I. Reddy Sekhar Reddy et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 7 (3) , 2016, 1425-1434
or equals to “తభజజగగ" and n.of letters in each line or equals to "నభభర" and n.of letters in each line equals
equals to 14 then to 12 then
return “వసంతతిలకము" ్ర తవిలంబిత"
return “దు
or equals to "రసజజభర" and n.of letters in each line or equals to “మసజగ" and n.of letters in each line equals
equals to 18 then to 10 then
return “మత్తకోకిల" return “శుద్ధవిరాటి"
or equals to “తతజగగ" and n.of letters in each line or equals to "నసననసననన" and n.of letters in each line
equals to 11 then equals to 24 then
return “iంద్రవజ్ర" return “లయవిభాతి"
or equals to “సససస" and n.of letters in each line equals or equals to “జతజగగ" and n.of letters in each line
to 12 then equals to 11 then
return “తోటకము" return “uపేంధ్రవజా
్ర "
or equals to “భభభగగ" and n.of letters in each line or equals to "రరరర" and n.of letters in each line equals
equals to 11 then to 12 then
return “దోధకము" return “స్రగి్వణి"
or equals to "సజసజగ" and n.of letters in each line equals or equals to “మనయగ" and n.of letters in each line
to 13 then equals to 10 then
return “మంజుభాషిణి" return “ప్రణవ"
or equals to "నజజయ" and n.of letters in each line Step9:stop
To recognize vruthas initially it is going to be
equals to 12 then
return “తోదకము" check basic property of vruthas that is number of letters
or equals to “యయయయ" and n.of letters in each line in each line. If number of letters in each line is equal in
equals to 12 then a given poem then it decides that a given poem belongs
return “భుజంగప్రయాతము"
to vruthas, otherwise it was going to be check for other
or equals to "భభభభభభభగ" and n.of letters in each line
Chandhassu types. To recognize type of Chandhassu for
equals to 22 then
return “మానినీ" a given input poem after decided as it belongs to vruthas
or equals to "సజసజసజస" and n.of letters in each line it is going to be divide input labels generated by
equals to 21 then algorithm 3.1 into ganas such that each ganamu contains
return “మణిమాల" three labels. To achieve this number of letters in a line
or equals to "రనభగగ" and n.of letters in each line equals divided with 3 reminder is removed from total number
to 11 then of letters and divide into ganas. Remaining labels can
return “సా్వగత"
consider it as a one ganamu. To recognize Chandhassu
or equals to "భజసనగగ" and n.of letters in each line
types initially C.R is going to check number of letters.
equals to 14 then
return “వనమయూర" Number of letters is equals to appropriate Chandhassu
or equals to "నజభజజవ" and n.of letters in each line type letters then it is going to be check for labels of
equals to 17 then ganalu. Based on labels it is going to be decide type of
return “నరు్కట"
Chandhassu.
or equals to “ననననసభభభగ" and n.of letters in each line
జాతులు మరియు uపజాతులు
equals to 25 then
return “భంధుర" కంధము:
www.ijcsit.com 1429
I. Reddy Sekhar Reddy et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 7 (3) , 2016, 1425-1434
Algorithm 3.3: To recognize kandhamu: combine first and second lines into one line. Third and
Step1:Start fourth lines into another line. Now it is going to be
Step2:Check number of lines in poem is equals to 4 then check for odd ganas labels should not be ‘ja’ and finally
Step3:Check number of mathras in each line it is going to check for sixth ganam should be ‘nala or
If 'U' occur take it as two mathras ja’ then RBSA is going to tell ‘Kandhamu’ is a type of
If 'I' occur take it as one mathra. Chandhassu for a given input poem.
If number of mathras in line 1&3 is euals to 12 ది్వపద:
, 2&4 is equals to 20 then
Algorithm 3.4 : To recognize Dvipadha :
Step4:Check first letter label of each line is equal then
Step1:Start
Step5:chek last letter label of 2 and 4 th line is euql to
Step2:Check number of lines in poem is equals to 2 then
'U' then
Step3:Inshialize ganalu=0
Step6:Divide into ganalu such that each ganam haveing
Step4:If laghu occur then divide 4 labels into one
4 mathras
ganame other wise three labels
Step5:Label ganalu based on "గగ,భ,జ,స,నల"
into one ganamu
Step6:consider 1&2 into one line , 3&4 into one line ganalu+1
Step7:Check labels of 1,3,5,7 ganalu not equal to 'జ' Step5:Repet Step4 untill ganalu<3
Step6:Label ganalu based on "Indhraganalu"
then
Step7:Remining labels equals to 'III' then 'న ' or equals
Step8:check label of 6 ganam is equal to 'నల or జ ' then
to 'UI' then 'హ'
Step9: Return కందము
Step8:Repet step3 to step7 for second line
Step10:stop
Step9:If it satisfied then return "ది్వపదము"
To recognize kandhamu initially RBSA is
going to check number of mathras in each line in a Step10:Stop
given poem. Number of mathras calculated based on To recognize dvipada type poems initially C.R
lagu guruvulu came from Algorithm 3.1. If ‘U‘ occur is going to be check for number of lines in a given
count it as a two mathras. If ‘I’ occurs count it as a one poem. If number of lines in a given poem is equals to
st rd
mathra. In kandhamu 1 and 3 line contains 12 two then it is going to check for dvipada properties in a
nd th
mathras, 2 and 4 lines contains 20 mathras. Initially it given input poem. Dvipada contains three indhraganalu
is going to check number of matharas if it satisfies and one suryaganamu in each line. Indhra and surya
above conditions then only it is going to be check for ganalu represented in
remaining properties. Kandhamu form based on Table 2.4. To divide labels forwarded from algorithm
chathurmathraganalu, those are gaga, bha, ja, sa, nala. 3.1 into ganas it fallowed this processor. If ganamu
First and third line countains three ganalu. Second and starting with ‘U’ means it is going to be divide three
fourth lines contains five ganalu. Then it is going to be labels into one ganamu or else if ganamu starting with
check first letter label in each line is same are not. After ‘I’ then it is going to be divide four labels into one
this it is going to be check for second and fourth line ganamu. To give names to ganas it refers indhraganalu
last letters labels should be ‘U’. Next it is going to be from ‘Table 2.4’. After completion of first ganam it is
www.ijcsit.com 1430
I. Reddy Sekhar Reddy et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 7 (3) , 2016, 1425-1434
going to check suryaganamu. To Recognize this it is suryaganamulu in second and fourth line. Recognition
going to check current label is ‘U’ then two labels, of suryaganamulu based on number of mathras. If
otherwise if current label is ‘I’ then three labels consider number of mathras is equals to three then make it as a
it as a one ganamu. To label gans with appropriate one ganamu up to three ganas in first and third line, up
names it refers suryaganalu from ‘Table 2.4’. Finally if to five ganas in second and fourth line. After that If
a given poem satisfies then it is going to be tell given ganamu starting with ‘U’ means it is going to be divide
poem is ‘dvipada’ Chandhassu type. three labels into one ganamu or else if ganamu starting
ఆటవెలది: with ‘I’ then it is going to be divide four labels into one
ganamu in first and third line only. To give names to
Algorithm 3.5 : To recognize Ataveladhi :
ganas it refers indhraganalu from ‘Table 2.4’.Finally if a
Step1:Start
given poem satisfies above conditions then it is going to
Step2:Check number of lines in poem is equals to 4 then
be tell given poem is ‘ataveladhi’ Chandhassu type.
Step3:Inshialize ganalu=0
తేటగీతి:
Step4:If laghu occur then divide 3 labels into one
ganam other wise two labels Algorithm 3.6 : To recognize Thetagethi :
into one ganamu ganalu+1 Step1:Start
Step5:Repet Step4 until ganalu<3 for 1&3 lines, Step2:Check number of lines in poem is equals to 4 then
ganalu<5 for 2&4 lines Step3:If laghu occur then divide 3 labels into one
Step6:Label ganalu based on"Suryaganalu" ganam
Step7:Skip step8 to 11 for 2&4 lines other wise two labels into one ganamu
Step8:Inshialize ganalu=0 Step4:Label ganalu based on"Suryaganalu"
Step9:If laghu occur then divide 4 labels into one Step5:If laghu occur then divide 4 labels into one
ganame other wise three labels ganame
into one ganamu other wise three labels into one ganamu
ganalu+1 Step6:Label ganalu based on"Indhraganalu"
Step10:Repet Step4 until ganalu<2 Step7:Repet Step5 and 6 once againe
Step11:Label ganalu based on "Indhraganalu" Step8:Repet Step3 and 4 two times
Step12:If it satisfy then return "ఆటవెలది" Step9:Repet Step3 to 8 for all lines in a poem
Step8:If it satisfy then return "తేటగీతి"
Step11:Stop
To recognize Ataveladhi initially C.R is going Step9:Stop
to check number of mathras in second and fourth line in Thetagethi contains one suryaganamu, two
a given poem. Number of mathras calculated based on indhraganamulu and then two suryaganamulu in each
lagu guruvulu came from Algorithm 3.1. If ‘U‘ occur line. To recognition of suryaganamulu based on number
count it as a two mathras. If ‘I’ occurs count it as a one of mathras. If number of mathras is equals to three then
nd th
mathra. In Ataveladhi 2 and 4 line contains 15 make it as a one ganamu . After that If ganamu starting
mathras, Ataveladhi contains three suryaganamulu and with ‘U’ means it is going to be divide three labels into
two indhraganamulu in first and third line, five one ganamu or else if ganamu starting with ‘I’ then it is
www.ijcsit.com 1431
I. Reddy Sekhar Reddy et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 7 (3) , 2016, 1425-1434
going to be divide four labels into one ganamu up to to ganas it refers from ‘Table 2.4’.Finally if a given
two ganas, and then If number of mathras is equals to poem satisfies above conditions then it is going to be
three then make it as a one ganamu up to two ganas in tell given poem is ‘Sesamu’ Chandhassu type.
four lines. To give names to ganas it refers from ‘Table
2.4’.Finally if a given poem satisfies above conditions uతా్సహ:
then it is going to be tell given poem is ‘Thetagethi’
Algorithm 3.8 : To recognize Usthaha :
Chandhassu type.
Step1:Start
Step2:Check number of lines in poem is equals to 4 then
సీసము:
Step3:compute number of mathras in each line 'U' occur
Algorithm 3.7 : To recognize Sisamu : 2 else 1
Step1:Start check number of mathras in each line is equals
Step2:Check number of lines in poem is equals to 4 then to 23 then
Step3:Inshialize ganalu=0 Step4:divide into ganalu such that each ganam haveing
Step4:If laghu occur then divide 4 labels into one thrre mathras
ganame other wise up to 7 ganas in each line
three labels into one ganamu Step5:last label of each line is equals to'U' then
ganalu+1 Step6:Return "uసా్తహ"
Step5:Repet Step4 unlitl ganalu<6
Step7:Stop
Step6:Label ganalu based on"Indhraganalu"
To recognize Usthaha initially RBSA is going
Step7:If laghu occur then divide 3 labels into one
to check number of mathras in each line in a given
ganam
poem. Number of mathras calculated based on lagu
other wise two labels into one ganamu
guruvulu came from Algorithm 3.1. If ‘U‘ occur count it
Step8:Label ganalu based on"Suryaganalu"
as a two mathras. If ‘I’ occurs count it as a one mathra.
Step9:Repet once againe
In Usthaha contains 23 mathras in each line. Initially it
Step10:Repet step3 to step9 for all line in apoem
is going to check number of matharas if it satisfies
Step11:If it satisfy then return "సీసము"
above conditions then only it is going to be check for
Step12:Stop remaining properties. In Usthaha each line contains
Sesamu contains six indhraganamulu and then seven suryaganamulu and then one guruvu. To
two suryaganamulu in each line. To recognize recognize suryaganamulu based on number of mathras.
indhraganamulu If ganamu starting with ‘U’ means it is If number of mathras is equals to three then make it as a
going to be divide three labels into one ganamu or else one ganamu up to seven ganas and then last label should
if ganamu starting with ‘I’ then it is going to be divide be guruvu in four lines. To give names to ganas it refers
four labels into one ganamu up to six ganas, to from ‘Table 2.4’.Finally if a given poem satisfies above
recognize suryaganamulu based on number of mathras. conditions then it is going to be tell given poem is
If number of mathras is equals to three then make it as a ‘Usthaha’ Chandhassu type.
one ganamu up to two ganas four lines. To give names
www.ijcsit.com 1432
I. Reddy Sekhar Reddy et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 7 (3) , 2016, 1425-1434
www.ijcsit.com 1433
I. Reddy Sekhar Reddy et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 7 (3) , 2016, 1425-1434
RBSA is very useful. Where ever the Chandhassu is [1] Thelugu Bhashalo Chandhorethulu By Ravuri Dhorasamisharma
used in all that cases to identify those Chandhassu [2] Rule Based Syntax Analyzer for Telugu Poems by Mr. I. Reddy
RBSA is usefull. sekhar Reddy
www.ijcsit.com 1434