video_depth_estimation_mapper

February 4, 2026

Performs depth estimation on a video.

Type: mapper

Tags: gpu, video

🔧 Parameter Configuration

| name | type | default | desc |
|---|---|---|---|
| `video_depth_model_path` | `<class 'str'>` | `'video_depth_anything_vitb.pth'` | The path to the Video-Depth-Anything model. If the model is a 'metric' model, the code will automatically switch to metric mode, and the user should input the path for storing point clouds. |
| `point_cloud_dir_for_metric` | `<class 'str'>` | `DATA_JUICER_ASSETS_CACHE` | The path for storing point clouds (for a 'metric' model). |
| `max_res` | `<class 'int'>` | `1280` | The maximum resolution threshold for videos; videos exceeding this threshold will be resized. |
| `torch_dtype` | `<class 'str'>` | `'fp16'` | The floating point type used for model inference. Can be one of ['fp32', 'fp16']. |
| `if_save_visualization` | `<class 'bool'>` | `False` | Whether to save visualization results. |
| `save_visualization_dir` | `<class 'str'>` | `DATA_JUICER_ASSETS_CACHE` | The path for saving visualization results. |
| `grayscale` | `<class 'bool'>` | `False` | If True, the colorful palette will not be applied. |
| `args` | | `''` | |
| `kwargs` | | `''` | |
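The parameter descriptions above can be sketched as a few plain-Python checks. This is a minimal illustration, not the operator's implementation: `build_config`, `is_metric_model`, and `fit_to_max_res` are hypothetical helpers, the file-name test for 'metric' models is an assumption, and so is the rule that the longer side is scaled down to `max_res` with the aspect ratio preserved. `None` stands in for `DATA_JUICER_ASSETS_CACHE`, which Data-Juicer resolves itself.

```python
from typing import Any, Dict, Tuple

# Defaults copied from the parameter table. None is a placeholder for
# DATA_JUICER_ASSETS_CACHE, which is resolved by Data-Juicer at runtime.
DEFAULTS: Dict[str, Any] = {
    "video_depth_model_path": "video_depth_anything_vitb.pth",
    "point_cloud_dir_for_metric": None,
    "max_res": 1280,
    "torch_dtype": "fp16",
    "if_save_visualization": False,
    "save_visualization_dir": None,
    "grayscale": False,
}
ALLOWED_DTYPES = ("fp32", "fp16")


def build_config(**overrides: Any) -> Dict[str, Any]:
    """Merge overrides into the documented defaults and check torch_dtype."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise KeyError(f"unknown parameters: {sorted(unknown)}")
    cfg = {**DEFAULTS, **overrides}
    if cfg["torch_dtype"] not in ALLOWED_DTYPES:
        raise ValueError(f"torch_dtype must be one of {ALLOWED_DTYPES}")
    return cfg


def is_metric_model(model_path: str) -> bool:
    """Assumption: a metric checkpoint has 'metric' in its file name."""
    return "metric" in model_path.rsplit("/", 1)[-1].lower()


def fit_to_max_res(height: int, width: int, max_res: int) -> Tuple[int, int]:
    """Assumption: scale the longer side down to max_res, keeping aspect ratio."""
    longest = max(height, width)
    if max_res <= 0 or longest <= max_res:
        return height, width  # already within the threshold
    return round(height * max_res / longest), round(width * max_res / longest)


cfg = build_config(video_depth_model_path="metric_video_depth_anything_vits.pth")
print(cfg["torch_dtype"], is_metric_model(cfg["video_depth_model_path"]))
print(fit_to_max_res(1080, 1920, cfg["max_res"]))
```

Under these assumptions, a metric checkpoint path signals that `point_cloud_dir_for_metric` should be set, and a 1920x1080 video would be reduced so that its longer side equals `max_res`.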

📊 Effect demonstration

test

VideoDepthEstimationMapper(video_depth_model_path='video_depth_anything_vits.pth', point_cloud_dir_for_metric=DATA_JUICER_ASSETS_CACHE, max_res=1280, torch_dtype='fp16', if_save_visualization=True, save_visualization_dir=DATA_JUICER_ASSETS_CACHE, grayscale=False)

📥 input data

Sample 1: 1 video (video4.mp4)
Sample 2: 1 video (video3.mp4)

📤 output data

Sample 1: empty
depth_data: [673, 360, 480]
fps: 30.0
Sample 2: empty
depth_data: [1190, 640, 362]
fps: 24.0

test_metric

VideoDepthEstimationMapper(video_depth_model_path='metric_video_depth_anything_vits.pth', point_cloud_dir_for_metric=DATA_JUICER_ASSETS_CACHE, max_res=1280, torch_dtype='fp16', if_save_visualization=True, save_visualization_dir=DATA_JUICER_ASSETS_CACHE, grayscale=False)

📥 input data

Sample 1: 1 video (video4.mp4)
Sample 2: 1 video (video3.mp4)

📤 output data

Sample 1: empty
depth_data: [673, 360, 480]
fps: 30.0
Sample 2: empty
depth_data: [1190, 640, 362]
fps: 24.0
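One way to sanity-check the outputs above is to relate `depth_data` to `fps`. The sketch below assumes the three-element lists are array shapes in (frames, height, width) order, which this page does not state; `clip_seconds` is a hypothetical helper, not part of the operator.

```python
from typing import Sequence


def clip_seconds(depth_shape: Sequence[int], fps: float) -> float:
    """Duration in seconds, assuming depth_shape[0] is the frame count."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return depth_shape[0] / fps


# (depth_data, fps) pairs copied from the two samples shown above.
samples = [([673, 360, 480], 30.0), ([1190, 640, 362], 24.0)]
for shape, fps in samples:
    print(f"{shape[0]} frames at {fps} fps -> {clip_seconds(shape, fps):.2f} s")
```

If the assumption holds, the computed durations should match the lengths of video4.mp4 and video3.mp4, which gives a quick check that every frame received a depth map.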