19

My fine-tuning run kept crashing until I set the gradient accumulation steps to 4

It was failing on a 12GB VRAM card after about 20 minutes every single time. Anyone know other tricks for memory issues with smaller models?
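
For anyone wondering why gradient accumulation helps here: as far as I understand it, the gradients from several small micro-batches are summed before a single optimizer step, so only one micro-batch's activations need to sit in memory at a time. Below is a rough pure-Python sketch of that idea on a toy one-parameter model. The model, the data, and the helper names (`grad_mse`, `accumulated_grad`) are all made up for illustration and are not from any particular training library.

```python
# Toy model: y_hat = w * x, loss = mean squared error over the batch.
# Everything here is a hypothetical illustration, not a real training setup.

def grad_mse(w, batch):
    """Gradient of mean((w*x - y)^2) with respect to w for one batch."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)


def accumulated_grad(w, batch, accum_steps):
    """Split the batch into equal micro-batches and accumulate their gradients."""
    micro_size = len(batch) // accum_steps
    total = 0.0
    for i in range(accum_steps):
        micro = batch[i * micro_size:(i + 1) * micro_size]
        # Dividing by accum_steps makes the sum match the full-batch mean gradient.
        total += grad_mse(w, micro) / accum_steps
    return total


data = [(1.0, 2.0), (2.0, 4.5), (3.0, 5.5), (4.0, 8.0)]  # made-up (x, y) pairs
w = 0.5

full = grad_mse(w, data)                          # one batch of 4
accum = accumulated_grad(w, data, accum_steps=4)  # four micro-batches of 1
print(full, accum)
```

The two gradients agree, so batch size 1 with 4 accumulation steps gives the same update as batch size 4 in this toy case. It still costs four forward/backward passes per update, so it trades speed for memory.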
2 comments

gavin_kim3
Try lowering your batch size to one.
6
ericfox · 7h ago
Honestly, lowering the batch size just makes things slower for me.
4