Avoid deepspeed plugin converting the whole model #20543
-
I am using the DeepSpeed plugin in Lightning to train my model. I want the first part of my model to be float32 and the second part to be bfloat16 (the optimizer only trains the first part). However, I found that Lightning converts the whole model to float32 if I do not specify the precision. How can I keep my pre-defined model dtypes untouched?
Answered by Boltzmachine · Jan 11, 2025
Replies: 1 comment
-
I found you should implement the dtype conversion in your LightningModule and prevent DeepSpeedStrategy from converting your module:

```python
from lightning.pytorch.plugins import DeepSpeedPrecision
from lightning.pytorch.strategies import DeepSpeedStrategy
from typing_extensions import override


class DeepSpeedPrecisionWithoutModuleConversion(DeepSpeedPrecision):
    @override
    def convert_module(self, module):
        # Skip Lightning's dtype conversion and leave the module untouched
        return module
```

and pass it to the trainer as

```python
trainer = Trainer(
    ...,
    strategy=DeepSpeedStrategy(
        stage=2,
        precision_plugin=DeepSpeedPrecisionWithoutModuleConversion('32-true'),
    ),
)
```
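With the conversion disabled in the precision plugin, the mixed-dtype setup from the question can then be expressed directly in the LightningModule. The following is a minimal sketch under assumptions not stated in the original answer: the two `nn.Linear` blocks and their sizes are placeholders for the real submodules, and the float32/bfloat16 casts happen in `__init__` and `forward`.

```python
import torch
from torch import nn
import lightning.pytorch as pl


class MixedDtypeModule(pl.LightningModule):
    def __init__(self):
        super().__init__()
        # First part: trained by the optimizer, kept in float32 (default dtype)
        self.trainable_part = nn.Linear(128, 64)
        # Second part: frozen and cast to bfloat16
        self.frozen_part = nn.Linear(64, 10).to(torch.bfloat16)
        for p in self.frozen_part.parameters():
            p.requires_grad = False

    def forward(self, x):
        h = self.trainable_part(x)                    # float32 compute
        out = self.frozen_part(h.to(torch.bfloat16))  # bfloat16 compute
        return out.to(torch.float32)

    def training_step(self, batch, batch_idx):
        x, y = batch
        return nn.functional.cross_entropy(self(x), y)

    def configure_optimizers(self):
        # Optimizer only sees the float32 parameters
        return torch.optim.AdamW(self.trainable_part.parameters(), lr=1e-4)
```

Because `convert_module` now returns the module unchanged, these dtypes survive trainer setup instead of being overridden by the precision setting.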
0 replies
Answer selected by Boltzmachine