Exercise 0 objective – BIOI2 – Integrative BIOInformatics platforme

Getting started with Snakemake


Exercise 0 - run your first snakefile

In this exercise, we will take our first steps with Snakemake. We will:

learn how to fetch (set up) and run a small Snakemake pipeline (o1)
understand the basic architecture of Snakemake files (o2)
get familiar with the output log (o3)

The pipeline that we will be running will fetch protein sequences from the UniProt database and align them using mafft. Graphically, we’ll be doing this:

Legend: wget fetches files directly from the web; cat concatenates two or more files into one; mafft aligns all the sequences in a file.

Don’t worry if you don’t understand exactly what it does and how…
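For readers who would like a closer look anyway, the three steps (wget, cat, mafft) can be spelled out as the shell commands such a pipeline might run. The sketch below only builds the command strings and does not execute anything. The UniProt download URL pattern, the accession IDs and the file names are assumptions chosen for illustration; they are not taken from the course material, and the actual Snakefile used in this exercise may differ.

```python
# Sketch only: builds the shell commands for the three steps, without running them.
# The accession IDs and file names used here are hypothetical placeholders.

# Assumed download pattern for a single UniProt entry in FASTA format.
UNIPROT_URL = "https://rest.uniprot.org/uniprotkb/{acc}.fasta"


def build_commands(accessions, merged="all_proteins.fasta", aligned="aligned.fasta"):
    """Return the shell commands for fetch -> concatenate -> align, in order."""
    fasta_files = [f"{acc}.fasta" for acc in accessions]

    # Step 1: wget downloads one FASTA file per protein.
    commands = [
        f"wget -O {fasta} {UNIPROT_URL.format(acc=acc)}"
        for acc, fasta in zip(accessions, fasta_files)
    ]

    # Step 2: cat concatenates the downloaded files into one multi-FASTA file.
    commands.append(f"cat {' '.join(fasta_files)} > {merged}")

    # Step 3: mafft aligns every sequence in the merged file.
    commands.append(f"mafft {merged} > {aligned}")
    return commands


if __name__ == "__main__":
    for command in build_commands(["P10415", "P01308"]):
        print(command)
```

Each command consumes the files produced by the previous one, and this dependency between inputs and outputs is what Snakemake rules describe.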