TODO :

MoCo: Momentum Contrast for Unsupervised Visual Representation Learning

Forked from the original MoCo PyTorch repo.
The code is minimal modification on the original code, with small modifications for our experiments.
The project is made in the context of the TissueNet: Detect Lesions in Cervical Biopsies competition.

The idea is to adapt the MoCo method to the specificities of biopsy scans, with the goal of detecting cancers.

Multiple changes are to be tested :
[] Change in the pretext task used to train the MoCo encoder, and experiment with different contrastive losses
[] Change in the key and query representation to give help the key and query have more semantics
[] Experiment with different transformations that might be more suited to our task [] Experiment with MoCo v2 data augmentations

Unsupervised Training

This implementation only supports multi-gpu, DistributedDataParallel training, which is faster and simpler; single-gpu or DataParallel training is not supported.

To do unsupervised pre-training of a ResNet-50 model on ImageNet in an 8-gpu machine, run:

python main_moco.py -a resnet34 --lr 0.03 --batch-size 32 --dist-url 'tcp://localhost:10001' --multiprocessing-distributed --world-size 1 --rank 0 [your imagenet-folder with train and val folders]

This script uses all the default hyper-parameters as described in the MoCo v1 paper. To run MoCo v2, set --mlp --moco-t 0.2 --aug-plus --cos.

Note: for 4-gpu training, we recommend following the linear lr scaling recipe: --lr 0.015 --batch-size 128 with 4 gpus. We got similar results using this setting.

Linear Classification

With a pre-trained model, to train a supervised linear classifier on frozen features/weights in an 8-gpu machine, run:

python main_lincls.py \
  -a resnet50 \
  --lr 30.0 \
  --batch-size 256 \
  --pretrained [your checkpoint path]/checkpoint_0199.pth.tar \
  --dist-url 'tcp://localhost:10001' --multiprocessing-distributed --world-size 1 --rank 0 \
  [your imagenet-folder with train and val folders]

Linear classification results on ImageNet using this repo with 8 NVIDIA V100 GPUs :

	pre-train epochs	pre-train time	MoCo v1 top-1 acc.	MoCo v2 top-1 acc.
ResNet-50	200	53 hours	60.8±0.2	67.5±0.1

Here we run 5 trials (of pre-training and linear classification) and report mean±std: the 5 results of MoCo v1 are {60.6, 60.6, 60.7, 60.9, 61.1}, and of MoCo v2 are {67.7, 67.6, 67.4, 67.6, 67.3}.

TODO :

[] Change the normalization to dataset-relevant means and standard deviations
[] Properly test the model with validation data
[] Compare the pretrained data classifier to a purely supervised classifier

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.github		.github
detection		detection
moco		moco
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
load_model_and_test.ipynb		load_model_and_test.ipynb
main_lincls.py		main_lincls.py
main_moco.py		main_moco.py
moco_utils.py		moco_utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MoCo: Momentum Contrast for Unsupervised Visual Representation Learning

Unsupervised Training

Linear Classification

TODO :

About

Uh oh!

Releases

Packages

Languages

License

mintiti/moco

Folders and files

Latest commit

History

Repository files navigation

MoCo: Momentum Contrast for Unsupervised Visual Representation Learning

Unsupervised Training

Linear Classification

TODO :

About

Resources

License

Code of conduct

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages