Unsupervised monocular depth estimation with omnidirectional camera for 3D reconstruction of grape berries in the wild

doi:10.1371/journal.pone.0317359

Fig 1.

An example of general Japanese table grape fields targeting this research.

More »

Expand

Fig 2.

The procedure of estimating 3D positions of a whole grape bunch.

(a) The input video is taken with an omnidirectional camera. (b) The input video is split into several parts, and 3D reconstruction is performed for each part. The results are integrated to obtain (c) the 3D bunch. The details of the beige rectangle are shown in Fig 6.

More »

Expand

Fig 3.

Overview of the stereo vision.

Best viewed in color.

More »

Expand

Fig 4.

Unsupervised monocular depth estimation using differentiable DIBR.

More »

Expand

Fig 5.

Comparison of camera models.

Coordinate systems of C_t and are in green or blue, respectively.

More »

Expand

Fig 6.

Flow of estimation berry positions of grape bunches.

More »

Expand

Fig 7.

Estimated 3D shapes of grape berries.

(a) and (b) show different bunches. First, second, third and forth columns are input, inverse depth estimation results (in Jet colormap, namely the closer the redder), generated point cloud (diagonal view), and generated point cloud (side view), respectively. Regions closer than a certain degree are converted to point clouds. Red points and numbers are manual annotations which show berry correspondence.

More »

Expand

Fig 8.

Camera poses and berry positions after bundle adjustment.

Points and pyramids in black indicate initial positions and poses of berries and cameras. Those in blue or red indicate results after bundle adjustment.

More »

Expand