Interesting that they compare against a completely different set of models than the Stable Fast 3D paper does. I’m not clear on the relative sizes of VFusion3D and Stable Fast 3D, but I do think the training idea is good and novel: getting relatively good quality out of movies is a much easier ask, in terms of collecting training data, than getting 3D model renderings.