Correct spellings of misspelled Odia words.
See the dependencies in requirements.txt.
The code has been tested with Python 3.6.
We use Fasttext for correcting spelling mistakes. Check out this to get a guide 📘 to Fasttext.
- First download Odia text data with mixed vocabulary.
mkdir data
cd data
!wget https://storage.googleapis.com/ai4bharat-public-indic-nlp-corpora/data/monolingual/indicnlp_v1/sentence/or.txt.gz
tar -zxvf or.txt.gz
head or
- Then download text with clean vocabulary from Kaggle.
Put that inside
datadirectory. - Train Fasttext embeddings. See the notebook
fasttext.ipynb. - Build two sets of vocabulary, one mixed and another clean. See the notebook
vocabulary.ipynb. - Finally run
controller.pyto start the web app. Go to http://127.0.0.1:31137/spelling to access the web app.
# web app
python controller.py # open http://127.0.0.1:31137/spelling in browser