Dear Authors,
May I ask how many steps are needed for the training on CIFAR10 to converge with the default setting in the code base? I ran for about 20k steps with a batchsize 64 instead of 256, but I still get very noisy outputs and wonder whether you have any insights.
Really appreciate your help.
Best,
ChicyChen
Dear Authors,
May I ask how many steps are needed for the training on CIFAR10 to converge with the default setting in the code base? I ran for about 20k steps with a batchsize 64 instead of 256, but I still get very noisy outputs and wonder whether you have any insights.
Really appreciate your help.
Best,
ChicyChen