The Python script validator.py (see the pip-compatible requirements file for dependencies) compares the checksums of the data files associated with the Y6 BAO data release, which are stored in the S3 bucket at:
https://ncsa.osn.xsede.org/phy240006-bucket01/despublic/y6a2_files/y6_bao/
These files are browsable via an HTTP proxy at https://desdr-server.ncsa.illinois.edu/despublic/y6a2_files/y6_bao/.
To run the validation script, first download the manifest file, then run the script on it as shown below:
$ python validator.py /path/to/downloaded/y6_bao_manifest.20250124.json
100%|█████████████████████████████████████████████| 2014/2014 [00:04<00:00, 409.76it/s]
Dataset is valid.
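For readers curious about what a checksum comparison of this kind involves, here is a minimal sketch. It is not the code in validator.py. The manifest schema (a list of entries with "path" and "checksum" keys), the choice of SHA-256, and the helper names `sha256_hex` and `find_mismatches` are all assumptions made for illustration, and an in-memory dictionary stands in for the remote bucket.

```python
import hashlib
from typing import Callable, Iterable, List, Mapping


def sha256_hex(data: bytes) -> str:
    """Return the hexadecimal SHA-256 digest of a byte string."""
    return hashlib.sha256(data).hexdigest()


def find_mismatches(
    entries: Iterable[Mapping[str, str]],
    fetch: Callable[[str], bytes],
) -> List[str]:
    """Return the paths whose computed checksum differs from the manifest.

    ASSUMPTION: each entry carries hypothetical "path" and "checksum" keys;
    the real manifest schema and hash algorithm may differ.
    """
    mismatches = []
    for entry in entries:
        computed = sha256_hex(fetch(entry["path"]))
        if computed != entry["checksum"].lower():
            mismatches.append(entry["path"])
    return mismatches


# Toy in-memory "bucket" standing in for the remote object store.
bucket = {"a.fits": b"alpha", "b.fits": b"beta"}

# Hypothetical manifest: the second entry deliberately has a wrong checksum.
manifest = [
    {"path": "a.fits", "checksum": sha256_hex(b"alpha")},
    {"path": "b.fits", "checksum": sha256_hex(b"not beta")},
]

bad = find_mismatches(manifest, bucket.__getitem__)
print("Dataset is valid." if not bad else f"Mismatched files: {bad}")
```

Passing the fetch step in as a function keeps the comparison logic independent of how the bytes are retrieved, so the same loop could be pointed at local files or at HTTP downloads.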
Alternatively, build a Docker image and run the script inside a container with the commands below:
$ docker build . -t validator
...
=> exporting to image
=> => exporting layers
...
$ docker run --rm -it validator bash
root@bd85371657f5:/tmp# curl -O https://desdr-server.ncsa.illinois.edu/despublic/y6a2_files/y6_bao/y6_bao_manifest.20250124.json
root@bd85371657f5:/tmp# python validator.py y6_bao_manifest.20250124.json
100%|███████████████████████████████████████████████████████████| 2014/2014 [00:57<00:00, 35.19it/s]
Dataset is valid.