gscissors is a Perl script for the extraction of sequences from a multifasta file, allowing the selection of specific sequences by specifying their start and end positions.
- Extraction of sequences from a multifasta file.
- Specification of start and end positions of the sequences to be extracted.
- Changing of sequence names and addition of comments.
- Command-line interface.
- Support for different input formats, including GFF, BED and TXT.
To use gscissors, follow these steps:
- Clone the gscissors.pl file from the GitHub repository Link.
- Open a terminal on your operating system and navigate to the folder where the gscissors.pl file is located.
- Run the perl gscissors.pl command followed by the necessary arguments to execute the script. Required arguments include the input multifasta file, the start and end positions of the sequences to be extracted, and the output multifasta file.
- Optionally, you can specify the input and output file formats, as well as change sequence names and add comments.
./gscissors.pl --fasta input.fasta --coordinates position_of_my_sequence.txt --format txt --output output.fasta
./gscissors.pl --fasta input.fasta --coordinates blast_result --format blast --output output.fasta
./gscissors.pl --fasta input.fasta --coordinates file.gff --format gff --output output.fasta
./gscissors.pl --fasta input.fasta --coordinates file.bed --format bed --output output.fasta
./gscissors.pl --help
GScissors extracts the sequence of positions in the input.fasta file, renames it, and saves the resulting sequence in the output.fasta file.
Pull requests and π stars are always welcome.- For major changes, please open an issue first to discuss what you would like to change.
- Please make sure to update tests as appropriate.
βοΈ @MathiasMangino
MIT Β© Mathias Mangino