model-optimization

Prepare and optimize a transformers-based PyTorch model for deployment.

This repository explores the following options and their impact on runtime and memory consumption:

Model serialization with TorchScript or ONNX
Model quantization with TorchScript or ONNX Runtime
Pruning attention heads
Different model architectures (Hugging Face roberta-base vs. distilroberta-base)
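Comparing these options comes down to measuring runtime and memory for the same inference call before and after each change. The following is a minimal sketch of such a measurement using only the Python standard library. It is not taken from this repository's scripts: `profile_call` and `dummy_inference` are hypothetical names introduced for illustration, and `dummy_inference` is a stand-in for a real model forward pass.

```python
import time
import tracemalloc
from statistics import mean
from typing import Any, Callable, Dict


def profile_call(fn: Callable[[], Any], repeats: int = 5) -> Dict[str, float]:
    """Time `fn` over several runs and record peak Python-level memory.

    Hypothetical helper for illustration; not part of this repository.
    """
    if repeats < 1:
        raise ValueError("repeats must be at least 1")

    # Time the calls without tracing, because tracemalloc adds overhead.
    durations = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        durations.append(time.perf_counter() - start)

    # Measure peak memory allocated through Python during one extra call.
    tracemalloc.start()
    try:
        fn()
        _, peak_bytes = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()

    return {
        "mean_seconds": mean(durations),
        "min_seconds": min(durations),
        "peak_mib": peak_bytes / (1024 * 1024),
    }


def dummy_inference() -> int:
    # Hypothetical stand-in for a model forward pass.
    return sum(i * i for i in range(100_000))


if __name__ == "__main__":
    print(profile_call(dummy_inference))
```

To compare two variants, pass each one as a zero-argument callable (for example, a lambda that runs the baseline model on a fixed batch, and another that runs the quantized model on the same batch). Note that `tracemalloc` only sees allocations made through Python's allocator, so memory held by native tensor storage would likely need a process-level or framework-level measurement instead.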

Getting Started

Clone the repository:

git clone https://github.com/prolego-team/model-optimization.git

Use pyenv to install Python 3.9.2:

pyenv install 3.9.2

Use poetry to create the environment and install dependencies. Note: to have the virtual environment created inside the project's root directory, first run poetry config virtualenvs.in-project true.

cd model-optimization
poetry install
