Skip to content

prolego-team/model-optimization

Repository files navigation

model-optimization

Prepare and optimize a transformers-based PyTorch model for deployment.

The following options, as well as their impact on runtime and memory consumption, are explored:

  • Model serialization with TorchScript or ONNX
  • Model quantization with TorchScript or ONNX Runtime
  • Pruning attention heads
  • Different model architectures (Huggingface.co roberta-base vs. distilroberta-base)

Getting Started

Clone the repository:

git clone https://github.com/prolego-team/model-optimization.git

Use pyenv to install python v. 3.9.2:

pyenv install 3.9.2

Use poetry to create the environment and install dependencies. Note: If you would like the virtual environment to be created inside the project's root directory, first configure poetry settings (poetry config virtualenvs.in-project true)

cd model-optimization
poetry install

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors