TensoRF

Tensorial Radiance Fields (ECCV 2022)

TensoRF: Tensorial Radiance Fields

Anpei Chen, Zexiang Xu, Andreas Geiger, Jingyi Yu, Hao Su

paper :
https://arxiv.org/abs/2203.09517
project website :
https://apchenstu.github.io/TensoRF/
code :
https://github.com/apchenstu/TensoRF
referenced blog :
https://xoft.tistory.com/42

Abstract

Radiance Field (Scene)에 대해 Tensor Decomposition을 적용해보자!
fast training and less computational cost

Tensor Decomposition

외적 (outer product) :
\(\begin{bmatrix} 1 & 2 \\ 2 & 4 \\ 4 & 8 \end{bmatrix} = \begin{bmatrix} 1 \\ 2 \\ 4 \end{bmatrix} \circ \begin{bmatrix} 1 & 2 \end{bmatrix}\)
위의 예시는
shape (3, 2) matrix를
shape (3,) vector와 shape (2,) vector의 외적으로 표현
Tensor Decomposition :
- \(n\)-dim.의 data를 \(n\)개의 1D vector들의 외적으로 표현할 수 있다!
  이 때, 정보 손실이 발생할 수 있으므로
  \(R\) 개의 rank에 대해 외적들을 더해 \(n\)-dim. data를 근사
- 장점 :
  고차원 data를 1D vector들로 표현할 수 있으므로
  speed 개선
- 단점 :
  수많은 1D vector들로 표현하므로
  GPU memory 많이 소요
- 종류 :
  CP(CANDECOMP/PARAFAC) decomposition
  Tucker Decomposition
  Block Term Decomposition

reference : https://www.kolda.net/publication/TensorReview.pdf

Tensor Decomposition w. Trilinear Interpolation :
interpolation으로 1D vector A와 B의 길이를 증가시키고
그 값으로 원본 matrix 표현

https://xoft.tistory.com/42

TensoRF Tensor Decomposition

CP(CANDECOMP/PARAFAC) Decomposition

https://xoft.tistory.com/42

shape (i, j, k)의 \(T\) 에 대해
\(T = \sum_{r=1}^R v_{r}^1 \circ v_{r}^2 \circ v_{r}^3\)

VM(vector-matrix) Decomposition

https://xoft.tistory.com/42

shape (i, j, k)의 \(T\) 에 대해
\(T = \sum_{r=1}^{R_1} v_{r}^1 \circ M_{r}^{2,3} + \sum_{r=1}^{R_2} v_{r}^2 \circ M_{r}^{1,3} + \sum_{r=1}^{R_3} v_{r}^3 \circ M_{r}^{1,2}\)

Grid-based 연구들이 training speed 높이는 데 많은 기여를 하고 있으니
다른 논문들도 한 번 읽어보자
- Plenoxel (CVPR 2022)
- Instant-NGP (SIGGRAPH 2022)
- DVGO (CVPR 2022)
Grid-based 연구들
- 장점 : speed 개선
- 단점 : 해상도가 증가하면 GPU memory 많이 소요
  기존 연구들은 space complexity \(O(N^3)\) 인데,
  TensoRF는 이를 \(O(N^2)\) 으로 줄임

Algorithm

Algorithm :
- step 1)
  scene을 bounded cubic (grid)로 제한
- step 2)
  ray를 쏴서 sampled points를 구한 뒤
  각 rank의 선과 면으로 projection하고
  외적한 값들을 이용해서
  color와 volume density 계산
- step 2-1)
  volume density는 단순히 외적한 값들을 더해서 구함
  (VM Decomposition)
- step 2-2)
  color는 외적한 값들을 concat한 뒤
  function B와 function S에 통과시켜 얻음
  - function B :
    1개의 FC-layer
    appearance commonalities를 추상화는 Global Apperance Dictionary 역할
  - function S :
    MLP 또는 SH(Spherical Harmonics) 함수

Loss

sparse input images일 경우
적게 관측된 view에서는 outlier 혹은 noise가 발생할 수 있어
overfitting 혹은 local minima 문제 발생
\(\rightarrow\)
regularization term 추가한 loss 사용
e.g. TV(total variation) loss :
pixel 값 간의 급격한 변화 (noise or outlier)를 억제하기 위해
\(I_{i+1, j} - I_{i, j}\) 항과 \(I_{i, j+1} - I_{i, j}\) 항을 loss에 추가

Coarse-to-Fine

NeRF의 coarse-to-fine 기법 :
\(w_i = T_i \alpha_{i}\) 의 PDF 분포에 따라
일부 구간을 더 많이 sampling
Mip-NeRF 360의 coarse-to-fine 기법 :
- small coarse proposal-MLP는 many samples로 여러 번 evaluate하여 weight \(\hat w\) 를 구하고
  large fine NeRF-MLP는 less samples로 딱 한 번 evaluate하여 weight \(w\) 와 color \(c\) 를 구함
- proposal loss를 이용하여 NeRF-MLP의 지식을 proposal-MLP가 따라잡도록 함
TensoRF의 coarse-to-fine 기법 :
단순히 grid 크기를 upsampling
Grid size(resolution)이 커질수록 선 또는 면이 더 촘촘해져서 3D scene의 high-freq. feature를 더 잘 잡아낼 수 있음

Implementation

Decomposition Rank : 총 48개
- RGB : 16, 4, 4
- volume density : 16, 4, 4
Grid size : coarse-to-fine
\(128^3\) 에서 \(300^3\) 으로 점점 증가시키면서 학습
(2000, 3000, 4000, 5500, 7000 step에서 점차 증가시킴)
Batch size : 4096 pixels
Adam optimizer, V100 GPU(16GB)

Evaluation

기존 Grid-based 연구들과
training speed는 유사하지만
PSNR이 높고
모델 사이즈 및 GPU memory 사용량이 적음

Ours-VM-192 : 4DGS, VM Decomposition, 192개의 rank
- speed, PSNR : Ours-VM-192를 15000 iter.만큼만 진행했을 때 8분만에 기존 연구들보다 PSNR 높음
- memory : 기존 연구들보다 확연히 memory size 적음

Param. 실험 :
- Grid size가 증가할수록 성능 좋아지지만 speed 느려지고 model size 커짐
- CP Decomposition보다 VM Decomposition이 성능 더 좋음
- rank 개수가 증가할수록 성능 좋아짐
iter. :
- iter.이 증가할수록 PSNR이 증가
  5k iter.만 해도 PSNR이 30에 가까워지고, 점점 변동폭이 작아짐

Limitation

scene을 bounded cubic 안에 제한해야
projection을 통해 VM Decomposition이 가능하므로
unbounded scene은 다루지 못함

TensoRF

TensoRF: Tensorial Radiance Fields

Anpei Chen, Zexiang Xu, Andreas Geiger, Jingyi Yu, Hao Su

Abstract

Tensor Decomposition

TensoRF Tensor Decomposition

CP(CANDECOMP/PARAFAC) Decomposition

VM(vector-matrix) Decomposition

Related Works

Algorithm

Loss

Coarse-to-Fine

Implementation

Evaluation

Limitation