- create a virtual environment
python3 -m venv .env
then activate it
source .env/bin/activate
- install pytorch
- install clip-retrieval
pip install clip-retrieval
- install s3fs
pip install s3fs
- (optional) install wandb
pip install wandb
and log in
wandb login
- create a folder of image & txt pairs that share the same filename (except for the extension)
- example: img0.png, img0.txt
- fill out the input_dataset and output_dataset fields of the script below
- change the wandb project name & toggle if using wandb
- adjust the clip model preference
- adjust the slurm job comment to use your team's account
- set your slurm cache path (can be anything you'd like)
more notes & advanced usage at: https://github.com/rom1504/clip-retrieval
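
Before launching a job, it can help to confirm that every image in the dataset folder has a matching .txt file, as the image & txt pairing step above requires. The following is a minimal sketch of such a check, not part of clip-retrieval: the function name `find_unpaired` and the set of image extensions are assumptions made for illustration, so adjust them to your data.

```python
from pathlib import Path

# Assumed set of image extensions; extend it to match your dataset.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".webp"}


def find_unpaired(folder):
    """Return (images_without_txt, txts_without_image) as sorted lists of file stems."""
    files = [p for p in Path(folder).iterdir() if p.is_file()]
    # The stem is the filename without its extension, e.g. "img0" for "img0.png".
    image_stems = {p.stem for p in files if p.suffix.lower() in IMAGE_EXTENSIONS}
    text_stems = {p.stem for p in files if p.suffix.lower() == ".txt"}
    return sorted(image_stems - text_stems), sorted(text_stems - image_stems)


# Example usage (hypothetical folder name):
# missing_txt, missing_img = find_unpaired("my_dataset")
# print("images without a caption:", missing_txt)
# print("captions without an image:", missing_img)
```

Both returned lists are empty when every file is paired, which is the state the folder should be in before you point input_dataset at it.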