The pipeline that we will be running will fetch protein sequences from the UniProt database and align them using mafft. Graphically, we’ll be doing this:
wget: fetches files from the web directly; cat: fuses 2 or more files together; mafft: aligns all sequences in a file
Don’t worry if you don’t understand exactly what it does and how…