Pascal VOC 2012 examples

The purpose of the Pascal VOC 2012(PASCAL Visual Object Classes) dataset is to recognize objects in realistic scenarios from a variety of visual object types that are not pre-segmented objects and is basically used for the supervised learning task. The dataset can be used for different object recognition challenges such as classification, detection, segmentation, and person layout. There are a whole total of twenty object classes chosen. There are 11,530 images in the train/val data set, including 27,450 ROI-tagged objects and 6,929 segmentations.

Conversion from original data into dataset without the borders and void labels (index=255) is achieved with the convert.py Python script:

./venv/bin/python convert.py <JPEGImages dir> <SegmentationClass dir> <output dir>

Citation

License

The VOC2012 data includes images obtained from the "flickr" website. Use of these images must respect the corresponding terms of use:

For the purposes of the challenge, the identity of the images in the database, e.g. source and name of owner, has been obscured. Details of the contributor of each image can be found in the annotation to be included in the final release of the data, after completion of the challenge. Any queries about the use or ownership of the data should be addressed to the organizers.