If I am right to assume that the cram/crai files in the bucket gs://topmed-irc-share/genomes/ should mirror the files in the bucket s3://nih-nhlbi-datacommons/, then there are at least 201 cases where the file sizes do not match. I've gathered the problematic files in a TSV file.
topmed-files-mismatch.txt
If I wrongly assumed that the files should be the same, then ignore this, of course.
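For anyone who wants to repeat the check, a size comparison like this could be sketched as below. This is only a minimal sketch, not the script that produced the attached list: it assumes the object names and sizes from each bucket have already been collected into plain dictionaries (for example by parsing the output of `gsutil ls -l` and `aws s3 ls --recursive`), and the function names, column names, and sample file names are made up for illustration.

```python
import csv
import io


def find_size_mismatches(gcs_sizes, s3_sizes):
    """Return (name, gcs_size, s3_size) tuples for files present in both
    listings whose sizes differ. Files missing from either side are skipped."""
    mismatches = []
    for name in sorted(gcs_sizes.keys() & s3_sizes.keys()):
        if gcs_sizes[name] != s3_sizes[name]:
            mismatches.append((name, gcs_sizes[name], s3_sizes[name]))
    return mismatches


def to_tsv(mismatches):
    """Render the mismatches as tab-separated text with a header row."""
    buf = io.StringIO()
    writer = csv.writer(buf, delimiter="\t", lineterminator="\n")
    writer.writerow(["file", "gcs_size", "s3_size"])  # hypothetical column names
    writer.writerows(mismatches)
    return buf.getvalue()


if __name__ == "__main__":
    # Hypothetical listings: file name -> size in bytes.
    gcs = {"sample_a.cram": 100, "sample_a.cram.crai": 10, "sample_b.cram": 200}
    s3 = {"sample_a.cram": 100, "sample_a.cram.crai": 12, "sample_b.cram": 250}
    print(to_tsv(find_size_mismatches(gcs, s3)), end="")
```

Comparing sizes only flags candidates cheaply; a matching size does not prove the contents are identical, so a checksum comparison would be needed to confirm that.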
Originally posted by @mvucenovic in #26 (comment)