CS19-001HW-5 workspace consists of code cky.py which takes in a binarized pcfg and returns a debinarized parsed tree. It also consists of replace_counts.py which reads a training file of tree and returns a dictionary of lexicons whose frequency is more than once in trainning file and returns another trainning file that replaces lexicons of frequency 1.
Ex-2 workspace consists of character language model that is smoothed unigram, bigram and trigram.py which generates wfsa, trained over a text file.