-- CompoundValet CSV: no structural identifiers → can't join to other databases
-- Paste into https://shell.duckdb.org/
--
-- The Drug Repurposing Hub is a similar drug-target database but includes
-- InChIKey, SMILES, and PubChem CID, making it instantly joinable.
-- CompoundValet has only drug names.

-- Load CompoundValet
CREATE TABLE cv AS
SELECT * FROM read_csv_auto(
#!/bin/bash
#
# Calculate the checksum corresponding to the entity tag (ETag) of an Amazon S3 object
#
# Usage: compute_etag.sh <filename> <part_size_mb>
#
# filename: file to process
# part_size_mb: chunk size in MiB used for multipart uploads.
#               The AWS CLI defaults to 8 MB. See:
#               https://docs.aws.amazon.com/cli/latest/topic/s3-config.html#multipart_chunksize
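
The preview cuts off before the script body, so what follows is not the author's code. It is a minimal sketch of the commonly observed multipart ETag scheme: take the MD5 of each part, concatenate the raw digests, take the MD5 of that, and append `-<part count>`. The function name `compute_etag` is hypothetical. The sketch assumes GNU `md5sum` and `dd`, a fixed part size, an object stored without SSE-KMS or SSE-C (where the ETag is not an MD5 digest), and that files smaller than one part were uploaded in a single request.

```shell
#!/bin/bash
# Sketch only: reproduces the widely observed multipart ETag layout.
# compute_etag is a hypothetical helper, not part of any AWS tool.
compute_etag() {
  local file=$1
  local part_size_mb=${2:-8}                     # same default as the AWS CLI
  local part_size=$(( part_size_mb * 1024 * 1024 ))
  local size=$(( $(wc -c < "$file") ))

  # Assumption: files smaller than one part were uploaded in a single
  # request, so the ETag is the plain MD5 of the content.
  if (( size < part_size )); then
    md5sum < "$file" | cut -d' ' -f1
    return
  fi

  # Number of parts, rounding up for a short final part.
  local parts=$(( (size + part_size - 1) / part_size ))
  local hex_digests="" i
  for (( i = 0; i < parts; i++ )); do
    # MD5 of part i, read as one block of part_size bytes.
    hex_digests+=$(dd if="$file" bs="$part_size" skip="$i" count=1 2>/dev/null \
      | md5sum | cut -d' ' -f1)
  done

  # Turn the hex digests into \xHH escapes so printf emits the raw bytes,
  # then hash the concatenated binary digests.
  local escaped
  escaped=$(printf '%s' "$hex_digests" | sed 's/../\\x&/g')
  printf '%s-%d\n' "$(printf "$escaped" | md5sum | cut -d' ' -f1)" "$parts"
}
```

For example, `compute_etag backup.tar 8` prints a value that can be compared with the ETag reported by `aws s3api head-object`, provided the upload used 8 MiB parts. A mismatch may only mean that a different part size or upload threshold was in effect.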