OctFormer: Efficient Octree-Based Transformer for Point Cloud Compression with Local Enhancement

Published in Proceedings of the 2023 AAAI Conference on Artificial Intelligence (AAAI), 2023

Point cloud compression with a higher compression ratio and tiny loss is essential for efficient data transportation. However, previous methods that depend on 3D convolution or frequent multi-head self-attention operations bring huge computations. To address this problem, we propose an octree-based Transformer compression method called OctFormer, which does not rely on the occupancy information of sibling nodes. Our method uses non-overlapped context windows to construct octree node sequences and share the result of a multi-head self-attention operation among a sequence of nodes. %them. Besides, we introduce a locally-enhance module for exploiting the sibling features and a positional encoding generator for enhancing the translation invariance of the octree node sequence. Compared to the previous state-of-the-art works, our method obtains up to 17\% Bpp savings compared to the voxel-context-based baseline and saves an overall 99\% coding time compared to the attention-based baseline.

Recommended citation: M. Cui, J. Long, M. Feng, B. Li, and K Huang*, OctFormer: Efficient Octree-Based Transformer for Point Cloud Compression with Local Enhancement. Proceedings of the 2023 AAAI Conference on Artificial Intelligence (AAAI).
Download Paper | Download Slides