Skip to content

The goal of this project is to learn about language development in India. In particular we wish to see how the two languages, Sanskrit, and it's degenerate Pali, influenced the creation of Marathi and Hindi differently.

Notifications You must be signed in to change notification settings

vanshcsingh/indic-lang-development

Repository files navigation

indic-lang-development

The goal of this project is to use principles of Natural Language Processing to discover how the ancient languages Sanskrit and it's degenerate Pali, influenced the creation of the two modern languages Hindi and Marathi differently.

Dependencies

This program is built on Python 2.7 and requires the user to have installed NLTK. It builds upon the IndicNLP library, which is included within the project.

Data

To run the program on our data files on Windows, run python confusability_matrix.py. To run the program on our data files on Unix, run make. This will output the data in a file called program.data

Note: be prepared to wait a very large amount of time

Results

Our paper, FinalPaper.pdf, summarizes the results of our historical linguistic experiments. We found that Hindi was much more similar to Sanskrit than was Marathi. and Marathi was much more similar to Pali than was Hindi.

About

The goal of this project is to learn about language development in India. In particular we wish to see how the two languages, Sanskrit, and it's degenerate Pali, influenced the creation of Marathi and Hindi differently.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages