Skip to content

python tool to fetch protein or nucleotide sequences from GenBank

Notifications You must be signed in to change notification settings

DEHourigan/fetch-that

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

fetch that

A tool to fetch protein and nucleotide sequences using Entrez from Biopython

Example of Image created using DALL-E

Requirements

  • Python 3.12.1
  • tqdm

Installation

  1. Clone the repository to your local machine:
    git clone https://github.com/DEHourigan/fetch-that.git
  2. Navigate to the cloned directory:
    cd fetch-that
  3. Install the required dependencies:
    conda env create -f fetchthat.yml
    conda activate fetchthat

Usage

python fetchthat.py --infile test_data/protein_acc.txt --db prot --email   [email here] --outfile test_data/out.faa
  1. Prepare your data:
    • input is a text file with 1 accession per line

Output

fasta file containing desired sequences

Contributing

DEHourigan

Contact

For any queries, please reach out via GitHub issues or directly to [email protected].


About

python tool to fetch protein or nucleotide sequences from GenBank

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages