Assignment 1 :- Syllable Detection in Indian Languages

Plots made by plotly

Code is present here


Language 1

Language Chosen :- Hindi

Corpus :- Premchand's Novel :- Godan

Corpus : Godan

Complete list of syllables and frequency :- Hindi_Godan


Plot of Decreasing Log Frequency of top 1000 syllables for Godan, base 2


Language 2

Language Chosen :- Sanskrit

Corpus : Cleaned Gita Corpus (only devanagari alphabets)

Complete list of syllables with thier respective frequencies :- Sanskrit_bgita


Plot of Decreasing Log Frequency of top 1000 syllables for Gita, base 2