-
Notifications
You must be signed in to change notification settings - Fork 3.4k
Lightning-AI pytorch-lightning Discussions
Pinned Discussions
Sort by:
Latest activity
Categories, most helpful, and community links
Categories
Community links
Discussions
-
You must be logged in to vote 😎 -
You must be logged in to vote ⚡ -
You must be logged in to vote ⚡ correct way to launch on slurm clusters?
environment: slurmmin-xu-ai askedDec 10, 2022 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Answered -
You must be logged in to vote 💬 -
You must be logged in to vote 💬 -
You must be logged in to vote 🤖 -
You must be logged in to vote 🤖 -
You must be logged in to vote 🤖 extra process when running ddp across multiple GPUs
strategy: ddpDistributedDataParallel accelerator: cudaCompute Unified Device Architecture GPU -
You must be logged in to vote ⚡ trainer and callback concurrency
atuysuz askedFeb 20, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ Decimal rounding - logged metrics
Arderiu askedFeb 19, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote 😎 -
You must be logged in to vote 💬 -
You must be logged in to vote ⚡ -
You must be logged in to vote 💬 -
You must be logged in to vote 🤖 Multi gpus resume error
checkpointingRelated to checkpointing accelerator: cudaCompute Unified Device Architecture GPU -
You must be logged in to vote 🤖 -
You must be logged in to vote ⚡ How to define example_input_array for computational graph
minwang-ai askedNov 8, 2021 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote ⚡ 16-mixed precision returns nan when multiplying tensors
mshooter askedOct 5, 2023 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote 😎 -
You must be logged in to vote ⚡ PermissionError
with ModelCheckpointingaaprasad askedFeb 2, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered -
You must be logged in to vote 💬 -
You must be logged in to vote 🤖 -
You must be logged in to vote ⚡ MLFlowLogger implementation: checkpoints downloaded during mlflow.pytorch.load_model.
croth1-liveeo askedFeb 2, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Closed · Unanswered -
You must be logged in to vote ⚡ -
You must be logged in to vote ⚡ How to predict with 1000s of dataloaders?
HadiSDev askedFeb 1, 2024 in Lightning Trainer API: Trainer, LightningModule, LightningDataModule · Unanswered