We propose a simple yet effective model to fuse working memory features in BEV space for online vectorized HD map construction. MemFusionMap focuses on improving the network's temporal reasoning capability while also maintaining a versatile design for scalability and compatibility.
We propose a novel design of maintaining a temporal overlap heatmap, providing a strong cue for the model to reason across a history of frames and also implicitly encoding valuable insights of the vehicle's trajectory.
We conduct extensive evaluation on nuScenes and Argoverse2 to demonstrate the effectiveness of MemFusionMap. The proposed method significantly outperforms the state-of-the-art method, achieving a maximum improvement of 5.4% in mAP.
Qualitative results on nuScenes at the 100x50 m perception ranges. We show a consecutive sequence of 5 frames.
@InProceedings{song2025memfusionmap,
author = {Song, Jingyu and Chen, Xudong and Lu, Liupei and Li, Jie and Skinner, Katherine A.},
title = {MemFusionMap: Working Memory Fusion for Online Vectorized HD Map Construction},
booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)},
month = {February},
year = {2025},
pages = {9230-9239}
}