Object-Level Semantic Segmentation on The
High-Resolution Gaofen-3 FUSAR-Map Dataset

Xianzheng Shi, Shilei Fu, Feng Wang and Feng Xu
Key Laboratory of Information Science of Electromagnetic
Waves(MoE), Fudan University,
Shanghai 200433, China

Abstract

This paper explores the potential of object-level semantic segmentation of high-resolution single-pol SAR images, in particular tailored for the Gaofen-3 (GF-3) sensor. Firstly, a well-annotated GF-3 segmentation dataset 'FUSAR-Map' is presented for SAR semantic segmentation. It consists of 610 high-resolution GF-3 single-pol SAR images with the size of 1024 x 1024. Secondly, an encoder-decoder network based on transfer learning is employed to implement semantic segmentation of GF-3 SAR images. Our algorithm obtains 4th place about the PolSAR image semantic segmentation on the '2020 Gaofen Challenge on Automated High-Resolution Earth Observation Image Interpretation'.

Citation

@article{,
title={Object-level Semantic Segmentation on The High-Resolution Gaofen-3 FUSAR-Map Dataset},
author={},
journal={},
year={2021}
}

FUSAR-Map: a benchmark dataset for SAR semantic segmentation

The FUSAR-Map dataset consists of 610 high-resolution GF-3 single-pol SAR images with the size of 1024×1024. It contains 8 different areas from 6 provinces of more than 4, 500 km2 in China. The annotation layer contains four terrain types, i.e. water, road, building, and vegetation.

In FUSAR-Map, each class is labeled with different colors, i.e. water in the blue, road in yellow, building in red, and vegetation in green. Pixels of the undetermined or unknown class are colored in black.

The advantages of the dataset include:
(1) Accurate latitude and longitude coordinates are provided for each sample.
(2) Accurate masks of water, road, and building.
(3) Difference in features of the same category in the same or different patch images

Information of the GF-3 single-pol SAR images used to generate FUSAR-Map:

1.'文件名统称(location_图片序号_范围序号_行序号_列序号_序号)', the data given is the naming method of the corresponding SAR image, corresponding to this image block from left to right is the geographical location (city, district), the serial number corresponding to the Gaofen-3 SAR image to which it belongs (from 8 source images in total), the serial number of the area block in the corresponding large image (the range of continuous non-overlapping), and the row number of the area image to which it belongs ( in the row), the column number of the region image (in the column), the picture data type, the picture number, for example, the name 'JiuJiang_01_01_01_01_SAR_001.tif' : the first range area of the first SAR image in Jiujiang, Jiangxi SAR image block in the first row and first column is numbered as the first image.
2.'高分3号来源', the name of the Gaofen-3 source data of the corresponding image block, for example, the first data source: 'GF3_MYN_UFS_999992_E116.0_N29.7_20160815_L1A_DH_L10002010515'
3.'景中心俯仰角(度/°)', corresponding to the elevation angle of the field center imaged by Gaofen-3;
4.'近端俯仰角(度/°)', corresponding to the near-end elevation angle of Gaofen-3;
5.'远端俯仰角(度/°)', corresponding to the far end pitch angle of Gaofen-3;
6.'宽分辨率(米/m)', which corresponds to the wide resolution of Gaofen-3;
7.'高分辨率(米/m)', corresponding to the high resolution of Gaofen-3;
8.'标称分辨率(米/m)', corresponding to the nominal resolution of Gaofen-3;
9.'光学图像分辨率(米/m)', the corresponding optical remote sensing image resolution is 1.02m;
10.' DEM分辨率(米/m)', the original resolution of the corresponding DEM data is 30m;
11.'BoundingBox[min_lon, min_lat; max_lon, max_lat]', corresponding to the longitude and latitude information of the image block, respectively [minimum longitude, minimum latitude; maximum longitude, maximum latitude], which corresponds to the lower left corner of the image [minimum longitude, Minimum latitude], upper right corner [maximum longitude, maximum latitude];
12.'Boundingbox_row_col[min_row, min_col; max_row, max_col]', corresponding to the position of the image block in the original image size, [minimum row value, minimum column value; maximum row value, maximum column value], taking the top left vertex of the image as the origin of coordinates, that is, the coordinates of the upper left corner of the corresponding image] [minimum row value, minimum column value, and lower right coordinates [maximum row value, maximum column value].

Description of different types of data formats:

1.The image size of the SAR dataset is 1024*1024, and the value type is ‘uint8’;
2.The Gaofen-3 SAR images in SAR_geocode have the value type of ‘single’.
3.The image size of the label image dataset is 1024*1024*3, and the value type is uint8, where red [255, 0, 0] represents the building label, green [0, 255, 0] represents the vegetation label, and blue [0 , 0, 255] represents water area label, yellow [255, 255, 0] represents road label, [0, 0, 0] represents unknown;

FUSAR-Map can be downloaded from Baidudrive:

FUSAR-Map

(extraction code:ae39)

Model code is availabel at:

FUSAR-Map Code

Contact

E-mail: fengxu@fudan.edu.cn 、xzshi19@fudan.edu.cn