How to fine-tune a VERY large model if it doesn’t fit on your GPU
Memory-efficient techniques to overcome the “CUDA memory error…” problem during training
https://bestasoff.medium.com/how-to-fine-tune-very-large-model-if-it-doesnt-fit-on-your-gpu-3561e50859af