The paper arXiv:2504.01016 introduces GeometryCrafter, a framework designed to estimate temporally consistent, high-quality point maps from open-world videos. These point maps are crucial for applications like 3D/4D reconstruction, depth-based video editing, and camera parameter estimation.
I tested it this week with the GeometryCrafter project on huggingface.co. It works, but only with a short video because I don't have a Pro account ...
GeometryCrafter can generate .npz files. These are compressed NumPy archives that store the predicted point maps for each frame of the input video. Each .npz file holds the 3D spatial information derived from the video frames, which makes further processing or visualization easier.
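To see what is actually stored in one of these archives, a minimal sketch like the one below lists every array with its shape and dtype. I am not sure of the exact key names GeometryCrafter uses, so the names "point_map" and "mask" and the (frames, height, width, 3) layout in the stand-in file are only assumptions for illustration; point the helper at a real output file to see the real ones.

```python
import os
import tempfile

import numpy as np


def summarize_npz(path):
    """Return {array_name: (shape, dtype)} for every array in an .npz archive."""
    with np.load(path) as archive:
        return {
            name: (archive[name].shape, str(archive[name].dtype))
            for name in archive.files
        }


# Stand-in file so the sketch runs without a real GeometryCrafter output.
# ASSUMPTION: the key names and the (frames, H, W, 3) layout are hypothetical.
demo_path = os.path.join(tempfile.mkdtemp(), "demo.npz")
np.savez_compressed(
    demo_path,
    point_map=np.zeros((2, 4, 6, 3), dtype=np.float32),
    mask=np.ones((2, 4, 6), dtype=bool),
)

summary = summarize_npz(demo_path)
for name, (shape, dtype) in summary.items():
    print(name, shape, dtype)
```

With a real file, replace `demo_path` with the path to the downloaded .npz and read the printed names before writing any further processing code.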
I tested a Python script that loads the .npz files in Blender 3D and it works, but it is not finished yet!
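This is not my Blender script, only a rough sketch of one possible route: write the points of a single frame to an ASCII PLY file with plain NumPy, then bring it into Blender through its Stanford PLY import. The helper name `point_map_to_ply`, the (height, width, 3) frame layout, and the optional validity mask are assumptions, not something taken from GeometryCrafter itself.

```python
import os
import tempfile

import numpy as np


def point_map_to_ply(points, ply_path, valid=None):
    """Write an (H, W, 3) point map as an ASCII PLY point cloud.

    Returns the number of vertices written. Points that are masked out
    or contain NaN/inf values are skipped.
    """
    pts = np.asarray(points, dtype=np.float32).reshape(-1, 3)
    if valid is not None:
        pts = pts[np.asarray(valid, dtype=bool).reshape(-1)]
    pts = pts[np.isfinite(pts).all(axis=1)]

    header = "\n".join([
        "ply",
        "format ascii 1.0",
        f"element vertex {len(pts)}",
        "property float x",
        "property float y",
        "property float z",
        "end_header",
    ])
    # comments="" stops NumPy from prefixing the header lines with "# ".
    np.savetxt(ply_path, pts, fmt="%.6f", header=header, comments="")
    return len(pts)


# Synthetic 4x6 frame standing in for one frame of a real point map.
frame = np.random.default_rng(0).random((4, 6, 3)).astype(np.float32)
frame[0, 0] = np.nan  # one invalid point, which should be dropped

ply_path = os.path.join(tempfile.mkdtemp(), "frame_0000.ply")
written = point_map_to_ply(frame, ply_path)
print(written, "points written to", ply_path)
```

ASCII PLY is slow and large for full-resolution frames, so this is only meant for quick checks on a few frames.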