
Boosting Wan2.2 I2V inference on 8xH100s, 56% faster than baseline with Sequence Parallelism
Authored by Muhammad Ali Afridi.

Introduction

Open-source video generation DiT models have developed rapidly, with examples such as Wan2.1[1] and Wan2.2[2], the latter using a Mixture-of-Experts (MoE) architecture. It is exciting to see these open-source generation models on track to beat closed-source models on benchmarks. However, the inference