Add experimental Megatron-FSDP fully_shard implementation#4976
Closed
wujingyue wants to merge 13 commits into
Closed
Add experimental Megatron-FSDP fully_shard implementation#4976wujingyue wants to merge 13 commits into
wujingyue wants to merge 13 commits into
Loading