Welcome to the official repository for GemDepth! GemDepth is a framework built on the insight that an explicit awareness of camera motion and global 3D structure is a prerequisite for 3D consistency.
All frames were generated directly by the text2video model, without any post-processing. More cases, including 1-2 minute videos, are available in the project. yongen_c.mp4 (masterpiece ...