DIAMOND vs Real KungFuMaster — World Model Prediction

DIAMONDNeurIPS 2024 Spotlight·64x64 @ 15fps
Ground Truth
Atari KungFuMaster real gameplay (ground truth frames, 64x64 @ 15fps)
Prediction
Atari KungFuMaster predicted by DIAMOND world model (64x64 @ 15fps)
SSIMStructural Similarity Index. Measures pixel-level structural similarity. Range 0-1, higher = more similar.Higher is better
0.9711
FVDFrechet Video Distance. Measures distribution-level similarity of video features. Lower = closer to real.Lower is better
10.6
LPIPSLearned Perceptual Image Patch Similarity. Measures perceptual similarity via deep features. Range 0-1, lower = more similar.Lower is better
0.0171
FramesPrediction horizon
100