News

i want to use videos as input training data, the per iter time cost increase so much! when i train on images, the shape of data after vae is torch.Size ( [1, 16, 1, 128, 200]), 4s/iter when i train on ...