Event box

Workflow languages – your foundation for accuracy and reproducibility in data analysis

Workflow languages – your foundation for accuracy and reproducibility in data analysis Online

Instructor: Leah Kemp, ESR


Are you working with big data? Do you need to pass your data through various software? If you’ve ever been in this situation (as I have in a population genetics masters project), you would know it can become very difficult to maintain reproducibility and accuracy; wait, have I updated this output file? The more manual steps we do, the more human errors are inevitably introduced into our analysis, hampering accuracy and reproducibility.

Be lazy, the machine does it better.

Workflow languages automate your data analysis workflow . But this isn’t all, they ensure all your analysis logs are captured in an organised fashion, they explicitly outline the software (and exact software versions) used, the input and output files at each step. Lastly, when your data inevitably becomes big data, you can easily scale up from running your analysis on your laptop, to running your analysis on a high performance cluster (HPC) such as NeSi.

In this workshop, we will work through an introduction to Snakemake, a workflow language with its basis in the popular programming language, Python. This Workshop is intended for anyone who has several steps in their data analysis workflow, particularly when many different software are involved.

#resbazNZ2020 #ResBazPickNMix

Date:
Friday 27 November 2020
Time:
2:30pm - 3:30pm
Time Zone:
Auckland (change)
Campus:
Online Webinar
Online:
This is an online event. Event URL will be sent via registration email.
Categories:
  Research Webinars  
Registration has closed.

Event Organizer

Profile photo of Matt Plummer
Matt Plummer

More events like this...