Skip to content

FFCV imagenet serial#28

Open
ksreenivasan wants to merge 40 commits intomasterfrom
ffcv_imagenet_serial
Open

FFCV imagenet serial#28
ksreenivasan wants to merge 40 commits intomasterfrom
ffcv_imagenet_serial

Conversation

@ksreenivasan
Copy link
Owner

@ksreenivasan ksreenivasan commented Mar 22, 2022

Creating a PR to merge FFCV imagenet serial code.
Note: Currently things work but loss is infinity (My guess is that this is an issue with ResNet18 architecture that we have in the codebase. It might be a tinyimagenet vs imagenet thing)

TODO:

  • Fix inf loss by implementing resnet18 for imagenet
  • Check to see if half precision training is violated anywhere else
  • Include scaler and other training tricks from FFCV
  • Use reasonable hyperparamaters

ksreenivasan and others added 30 commits March 21, 2022 11:51
bunch of things will break but we'll have to fix them along
the way.
my guess is that the last layer is not right. will fix.
turns out, using .half() causes some issues with the scaler.
need mixed precision for running ffcv.
looks like scaler is None for the finetuning step
which causes an issue. Need to handle this a little
better in the future but for now, this works.
this isn't the right branch for it, but needed to keep it somewhere
…ning_is_enough into regular_imagenet_serial
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant