Skip to content

Comparative analysis

Select the best architecture from the below

Encoder :

  • variations:
    • standard conv2
    • depth-wise separable convolution (1)
    • MobileNet
  • initialization:
    • random (1)
    • pre-train as classification on ImageNet
  • metric at component level: number of weights, lower is better.

Decoder:

  • Fix the "UpConv Block"
  • Variations:
    • NNConv3 with interpolation using nearest (1)
    • NNConv3 with pixel shuffle
    • Just pixel shuffle
  • Note: Why using convolution during decoding before upsampling.

Loss function:

  • train with only the last decoder (decorer_0) (1)
  • train with multi-scale depth prediction.

Training method:

  • Supervised learning (1)
  • Unsupervised learning using stereo images.
Edited by Harley Nelson Lara Alonso