Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.
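Most of the repositories on this page implement the topic's core trick: gradient accumulation, which runs several small forward/backward passes and applies the optimizer once, giving the effective batch size of a large batch without its memory cost. A minimal PyTorch sketch of the pattern (the toy model, data, and the `accum_steps` name are illustrative, not taken from any repository listed here):

```python
import torch
from torch import nn

# Toy setup for illustration: a tiny model and random data.
model = nn.Linear(32, 10)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
criterion = nn.CrossEntropyLoss()
loader = [(torch.randn(8, 32), torch.randint(0, 10, (8,))) for _ in range(16)]

accum_steps = 4  # effective batch size = 8 * 4 = 32

optimizer.zero_grad()
for step, (x, y) in enumerate(loader):
    loss = criterion(model(x), y) / accum_steps  # scale so the summed grads average
    loss.backward()                              # grads accumulate in param.grad
    if (step + 1) % accum_steps == 0:
        optimizer.step()       # one update per accumulation window
        optimizer.zero_grad()  # reset buffers for the next window
```

Dividing the loss by `accum_steps` keeps the accumulated gradient equal to the gradient of one large batch, so learning-rate settings carry over.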
SSD-based object and text detection with Keras, SSD, DSOD, TextBoxes, SegLink, TextBoxes++, CRNN
Distributed training (multi-node) of a Transformer model
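Multi-node training like the entry above is typically built on PyTorch's DistributedDataParallel; a minimal sketch of that pattern under the assumption that the script is launched with `torchrun` (the model and data are placeholders, not the repository's code):

```python
import os
import torch
import torch.distributed as dist
from torch import nn
from torch.nn.parallel import DistributedDataParallel as DDP

# Launch (2 nodes, 4 GPUs each): torchrun --nnodes=2 --nproc_per_node=4 train.py
dist.init_process_group(backend="nccl")      # NCCL backend for multi-GPU/multi-node
local_rank = int(os.environ["LOCAL_RANK"])   # set by torchrun
torch.cuda.set_device(local_rank)

model = nn.Linear(32, 10).cuda(local_rank)   # placeholder for a Transformer
model = DDP(model, device_ids=[local_rank])  # all-reduces gradients across ranks
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)

for _ in range(10):
    x = torch.randn(8, 32, device=local_rank)
    y = torch.randint(0, 10, (8,), device=local_rank)
    loss = nn.functional.cross_entropy(model(x), y)
    optimizer.zero_grad()
    loss.backward()                          # DDP synchronizes gradients here
    optimizer.step()

dist.destroy_process_group()
```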
🎯 Gradient Accumulation for TensorFlow 2
TorchHandle makes your PyTorch development more efficient and makes PyTorch more comfortable to use
Gradient accumulation on tf.estimator
A simple implementation of Multi-passage BERT
🎯 Production-ready implementation of video prediction models using PyTorch. Features Enhanced ConvLSTM with temporal attention, PredRNN with spatiotemporal memory, and Transformer-based architecture.
Gradient accumulation for TensorFlow 2 Keras
This project aims to help people quickly implement TensorFlow model pipelines for different NLP tasks.
Gradient accumulation with TensorFlow 2.x
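The tf.estimator, TensorFlow 2 Keras, and TensorFlow 2.x entries above all target the same pattern; a hedged sketch of how gradient accumulation commonly looks in a TensorFlow 2 custom training loop (the toy model and the `accum` buffer name are illustrative assumptions):

```python
import tensorflow as tf

accum_steps = 4
model = tf.keras.Sequential([tf.keras.Input(shape=(32,)), tf.keras.layers.Dense(10)])
optimizer = tf.keras.optimizers.Adam(1e-3)
loss_fn = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)

# One non-trainable buffer per variable holds the running gradient sum.
accum = [tf.Variable(tf.zeros_like(v), trainable=False)
         for v in model.trainable_variables]

def train_step(x, y, step):
    with tf.GradientTape() as tape:
        loss = loss_fn(y, model(x, training=True)) / accum_steps  # average the window
    grads = tape.gradient(loss, model.trainable_variables)
    for buf, g in zip(accum, grads):
        buf.assign_add(g)                   # accumulate instead of applying
    if (step + 1) % accum_steps == 0:
        optimizer.apply_gradients(zip(accum, model.trainable_variables))
        for buf in accum:
            buf.assign(tf.zeros_like(buf))  # clear buffers for the next window
```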
Comprehensive PyTorch Lightning framework featuring 20+ educational notebooks, advanced ML patterns, and production-ready workflows. Covers vision, NLP, tabular, and time series domains with distributed training, mixed precision, custom loops, and deployment pipelines. Complete with synthetic data generators and testing.
Research code implementing the "Attention Is All You Need" architecture. Engineers a stable training loop for a 163M-parameter LLM using reduced-precision techniques on free-tier compute.
Implementation of gradient accumulation for low-memory fine-tuning of language-modelling transformers.
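The two fine-tuning entries above pair naturally: on limited memory, gradient accumulation is usually combined with reduced precision. A minimal PyTorch sketch using automatic mixed precision (a common recipe, not either repository's actual code; the model and data are placeholders):

```python
import torch
from torch import nn

device = "cuda" if torch.cuda.is_available() else "cpu"
model = nn.Linear(32, 10).to(device)  # stand-in for a transformer
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)
scaler = torch.cuda.amp.GradScaler(enabled=(device == "cuda"))
accum_steps = 8

optimizer.zero_grad()
for step in range(64):
    x = torch.randn(4, 32, device=device)
    y = torch.randint(0, 10, (4,), device=device)
    with torch.autocast(device_type=device, enabled=(device == "cuda")):
        loss = nn.functional.cross_entropy(model(x), y) / accum_steps
    scaler.scale(loss).backward()  # scale the loss to avoid fp16 gradient underflow
    if (step + 1) % accum_steps == 0:
        scaler.step(optimizer)     # unscales grads; skips the step on inf/NaN
        scaler.update()
        optimizer.zero_grad()
```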
Classifying images of flowers into 17 categories using EfficientNet-B0 and PyTorch.
AlzMRI-Net: Classify Alzheimer's stages from MRI scans.