N-grams

Introduction to generative language modeling using an n-gram model.

This project is an assignment for the Park Tudor data science class. See assignment.md for detailed instructions.

Requirements

This repo requires Python 3.12 or later. There are no additional dependencies.

Name	Description
assignment.md	The instructions for the assignment
tiny_shakespeare.txt	The dataset we use to train our language model
--	--
dataset.py	Utilities for loading and splitting the dataset
model.py	The n-gram model implementation
--	--
train.py	A CLI script to train the model
generate.py	A CLI script to generate text with the model
grade.py	A CLI script to grade the assignment
--	--
grading_utils.py	Utilities for grading, can be ignored

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.vscode		.vscode
.gitignore		.gitignore
README.md		README.md
assignment.md		assignment.md
dataset.py		dataset.py
generate.py		generate.py
grade.py		grade.py
grading_utils.py		grading_utils.py
model.py		model.py
tiny_shakespeare.txt		tiny_shakespeare.txt
train.py		train.py
tune.py		tune.py