A tool to fetch protein and nucleotide sequences using Entrez from Biopython
- Python 3.12.1
- tqdm
- Clone the repository to your local machine:
git clone https://github.com/DEHourigan/fetch-that.git
- Navigate to the cloned directory:
cd fetch-that
- Install the required dependencies:
conda env create -f fetchthat.yml conda activate fetchthat
python fetchthat.py --infile test_data/protein_acc.txt --db prot --email [email here] --outfile test_data/out.faa
- Prepare your data:
- input is a text file with 1 accession per line
fasta file containing desired sequences
DEHourigan
For any queries, please reach out via GitHub issues or directly to [email protected]
.