name: CaffeNet fine-tuned on the Oxford 102 category flower dataset
caffemodel: oxford102.caffemodel
caffemodel_url: https://s3.amazonaws.com/jgoode/oxford102.caffemodel
gist_id: 0179e52305ca768a601f
license: BSD-3
See https://github.com/jimgoo/caffe-oxford102 for full code.
The CNN is a BVLC reference CaffeNet fine-tuned for the Oxford 102 category flower dataset. The number of outputs in the inner product layer has been set to 102 to reflect the number of flower categories. Hyperparameter choices reflect those in Fine-tuning CaffeNet for Style Recognition on “Flickr Style” Data. The global learning rate is reduced while the learning rate for the final fully connected is increased relative to the other layers.
The split file (setid.mat) lists 6,149 images in the test set and 1,020 images in the training set. We have instead trained this model on the larger set of 6,149 images and tested against the smaller set of 1,020 images.
After 50,000 iterations, the top-1 error is 7% on the test set of 1,020 images:
I0215 15:28:06.417726 6585 solver.cpp:246] Iteration 50000, loss = 0.000120038
I0215 15:28:06.417789 6585 solver.cpp:264] Iteration 50000, Testing net (#0)
I0215 15:28:30.834987 6585 solver.cpp:315] Test net output #0: accuracy = 0.9326
I0215 15:28:30.835072 6585 solver.cpp:251] Optimization Done.
I0215 15:28:30.835083 6585 caffe.cpp:121] Optimization Done.
Note that this uses the mean file for ILSVRC 2012 instead of the mean for the actual Oxford dataset.
I have the same question myself, as I couldn't find the mappings between integer categories and actual flower names anywhere on the website (http://www.robots.ox.ac.uk/~vgg/data/flowers/). I did a quick sanity check by counting the number of integer labels for a given class and comparing that number to the category counts shown here: http://www.robots.ox.ac.uk/~vgg/data/flowers/102/categories.html. This however is not an effective strategy for all categories because some have the same number of counts (for instance corn poppy, globe-flower and grape hyacinth both have 41 images, so you can't tell which integer ID with 41 occurrences is which.
I suppose the authors could provide it if emailed. If you are able to get this info, please let me know.