The file names we process in this ETL embed some additional data we should maintain (and have cataloged) e.g. the refseq db name (e.g. refseq88) and the kmer length (like k31).
we should enumerate, figure out how we want to store/recall and then do the things.
The file names we process in this ETL embed some additional data we should maintain (and have cataloged) e.g. the refseq db name (e.g. refseq88) and the kmer length (like k31).
we should enumerate, figure out how we want to store/recall and then do the things.