After having fixed the groundtruth images of the "13434" dataset, an older but full resolution dataset, has to be repaired too.
Repair of the "overlapping_chromosomes_examples.h5" dataset:
This dataset contained originally 2854 (grayscaled+groundtruth) pairs of 190x189 images, stored in a unique numpy array. Its shape was 2854x190x189x2.
The grayscaled images could suffer from two problems:
- Some grayscaled images components had black dots: those images were removed (with their corresponding groundtruth labels).
- The images dtype was int64, it is now np.uint8
The labels of the groundtruth don't have no more spurious pixels:
Dataset format:
Once downloaded, the dataset shape is (2164, 190, 189, 2) available as: