Course Advert: NERC Workshop on Population and Metagenomics Analysis

I help out with a fair few workshops on genomic analysis, but most are limited by being restricted to just a day or two, hardly enough time to cover more than the basics. So the 10 day long NERC Population and Metagenomics Analysis course, organised by the fantastic Konrad Paszkiewicz from the University of Exeter is a quite amazing opportunity. The teaching will be from some of the best people in the field (and me!) and of the highest quality. Incredibly, if you are a UK-funded researcher the costs of the course are entirely waived, with accommodation and meals also included! Spaces are limited but I would definitely urge you to sign-up. Full details below,  and please head over to the course workshop page to register. Hurry!

Workshop Overview

A ten-day workshop taking place between 25 February - 6 March 2014 providing detailed hands-on training for population and meta-genomics analysis for researchers with little or no background in mathematics or computing.

Venue: Dartington Hall, Totnes, Devon (nearest train station - Totnes)

Times: 25 February - 6th March 2014.

Arrival evening of Tuesday 25 February 2014. Departure morning of 6th March 2014. The course itself will take place 9am-12pm, 2pm-5pm and on some evenings 7pm-10pm everyday 26 February-5th March. Students are expected to attend the entire course.

Contact: research-events@exeter.ac.uk

Registration

The course itself is free of charged and is funded by a Professional Postgraduate Development Award from NERC.

A total of 30 funded places are available which cover the costs of accommodation and food, but not the cost of transportation to/from the venue.

An additional 10 places are available for participants from industry. The cost of accommodation and meals will need to be covered by the participants.

You should register your interest by 31 December 2013. Participants will be informed by 10th January 2014 as to whether they have been selected. Please note that preference will be given to researchers funded by NERC.

Accomodation and Transport:

For UK-based academic researchers:

The course is free of charge for up to 30 academic researchers working at recognised UK HEIs and research institutes. Accommodation at Dartington Hall is included and includes breakfast, lunch and dinner. Transportation to/from Dartington hall is NOT included.

For all other participants:

Whilst course fees will be waived, the cost of accommodation and meals will NOT be included. If selected, you will need to book accommodation with Dartington Hall separately. A special rate of £102 per night plus VAT has been negotiated (this includes breakfast, lunch and dinner).

Requirements:


Selected participants must bring their own laptops to the course. This needs to be a modern laptop (Windows, Mac OSX or Linux) with a full-size screen and wireless (please do not bring netbooks etc). Ethernet sockets may not be available during the course so please plan accordingly.

 

Draft Programme

An outline of the short course is given below with lead instructors in brackets. Please note that this is subject to change.

Tuesday 25 February 2014:

Arrival at Dartington Hall

Evening welcome buffet

Wednesday 26 February 2014:

Breakfast

Morning session 9am-12pm:

Introduction to the course

Hands on-workshop: Introduction to Amazon EC2 cloud (Konrad Paszkiewicz)

Lunch

Afternoon session 2pm-5pm:

Hands-on workshop: Introduction to Linux (Konrad Paszkiewicz and Julian Catchen)

Dinner

Evening session 7pm-10pm:

Hands-on workshop: Introduction to Linux (Konrad Paszkiewicz and Julian Catchen)

Thursday 27 February 2014:

Breakfast

Morning session 9am-12pm:

Hands-on workshop: Introduction to Linux (Konrad Paszkiewicz and Julian Catchen)

Lunch

Afternoon session 2pm-5pm:

Lecture: Introduction to genomics and bioinformatics (David Studholme)

Dinner

Evening session 7pm-10pm:

Hands-on workshop: Short read genomics (Konrad Paszkiewicz and David Studholme)

Friday 28 February 2014:

Breakfast

Morning session 9am-12pm:

Lecture: Introduction to RAD-seq (William Cresko and Julian Catchen)

Lunch

Afternoon session 2pm-5pm:

Hands-on workshop: RAD-seq (William Cresko and Julian Catchen)

Dinner

Evening session 7pm-10pm:

Hands-on workshop: RAD-seq (William Cresko and Julian Catchen)

Saturday 1 March 2014:

Breakfast

Morning session 9am-12pm:

Hands-on workshop: RAD-seq (William Cresko and Julian Catchen)

Lunch

Afternoon session 2pm-5pm:

Participant presentations

Dinner

Evening session 7pm-10pm:

 

Sunday 2 March 2014:

Free day with organised activities

Monday 3 March 2014:

Breakfast

Morning session 9am-12pm:

Lecture: Marker-based metagenomics introduction (Jose Clemente)

Lecture: Statistical challenges in metagenomics analysis (Chris Quince)

Lunch

Afternoon session 2pm-5pm:

Hands-on workshop: Introduction to QIIME (Jose Clemente, Daniel McDonald)

Dinner

Evening session 7pm-10pm:

Hands-on workshop: Introduction to QIIME (Jose Clemente, Daniel McDonald)

 

Tuesday 4 March 2014:

Breakfast

Morning session 9am-12pm:

Lecture: Whole genome metagenomics introduction (Nick Loman)

Lunch

Afternoon session 2pm-5pm:

Hands-on workshop: Whole genome metagenomics (Nick Loman, Chris Quince)

Dinner

Evening session 7pm-10pm:

Hands-on workshop: Whole genome metagenomics (Nick Loman, Chris Quince)

Wednesday 5 March 2014:

Breakfast

Morning session 9am-12pm:

Hands-on workshop: Free session (bring your own data)

Lunch

Afternoon session 2pm-5pm:

Hands-on workshop: Free session (bring your own data)

Dinner

Evening session 7pm-10pm:

End of workshop party

Thursday 6 March 2014:

Breakfast

Departure

Full event information

Population and metagenomics analysis are fields which have developed rapidly over the recent years and have opened up new methodologies to researchers in ecology, systematics, evolutionary development and ecotoxicology. However, the software which has been developed to analyse these types of data are typically non-graphical and complex to master for researchers in biological sciences who have not been specifically trained in bioinformatics. In this short course the Amazon EC2 cloud will be used for training using laptops.

One of the most significant recent developments in population genomics is Restriction-site Associated DNA sequencing (RAD-seq)1. This technique uses high-throughput sequencing to simultaneously sequence and genotype organisms at tens of thousands of loci. The number of markers generated makes analysis much more sensitive than traditional microsatellite-based approaches, enabling resolution between very closely related individuals who belong to the same microsatellite type. It also requires comparatively less development and optimization time since the number of markers is proportional to the number of fragments digested. It can be applied even in the absence of a reference genome and can assist with genome assembly as well as provide functional information1,2.

Metagenomics involves the study of communities of microbial organisms in particular environments. The combination of uncultured-based techniques and high-throughput sequencing technology has made possible a comprehensive characterization of whole communities for a fraction of the cost. This enables studies of particular environmental niches over time or changing conditions at different resolution levels3,8.

We have arranged for leading population genomic and metagenomic experts in the US and the UK to serve as instructors for this short-course. The US instructors below have either developed the molecular methods, the theory behind the analysis, or have actively developed the relevant software to perform the analysis.

 

Workshop instructors

  1. Professor William Cresko is a Principle Investigator and Director of the Institute of Ecology and Evolution at the University of Oregon. He is a pioneer of the RAD-seq technique, and has used the approach extensively to perform genetic mapping of stickleback fish phenotypic variation1 as well as the evolutionary genomics of pipefishes and seahorses (http://creskolab.uoregon.edu/).
  2. Dr Julian Catchen is a Postdoctoral Research Fellow at the University of Oregon Institute of Ecology and Evolution. He is the author of Stacks – the most popular software package designed to process and analyse RAD-seq data.
  3. Prof. Jose Clemente is based at the Icahn School of Medicine at Mount Sinai, New York. He is a contributing author to the QIIME (Quantitative Insights Into Microbial Ecology)4 software package which is one of the most popular software tools for performing metagenomic analysis. His lab at Mount Sinai is particularly focused on characterizing the mechanisms of action of the microbiome in IBD.
  4. Dr Nick Loman is an MRC Special Training Research Fellow currently working at the University of Birmingham. His research program focuses on the genomic and metagenomic analysis of microbial sequence data in a clinical context.
  5. Daniel McDonald is a graduate student in the Interdisciplinary Quantitative Biology program in the BioFrontiers Institute at the University of Colorado, and a part of Prof. Rob Knight's lab, a recognized leader in microbiome research. Daniel is a contributing author of QIIME and a core software developer on the project.
  6. Dr David Studholme is a Senior Lecturer in Bioinformatics at the University of Exeter. His research interests encompass applications of genomics, transcriptomics and metagenomics to plant-pathogen interactions. His recent projects have focussed on tree-pathogens Chalara fraxinea and Phytophthora ramorum as well as bacterial pathogens of banana, enset, tomato and other crops.
  7. Dr Konrad Paszkiewicz is the Director of the Wellcome Trust Biomedical Informatics hub. He is responsible for the provision of training for PhD students and researchers as well as bioinformatics facilities and capabilities within the University of Exeter.
  8. Prof. Peter Kille is the director of Bio-Initiatives at the University of Cardiff. His primary research expertise lies in the application of molecular techniques such as proteomics and genomics to eco-toxicology. His research interests encompass the effect on biological systems of the release of heavy metals into the environment.
  9. Dr Christopher Quince is a Reader at the University of Glasgow. He leads the Computational Microbial genomics group which focuses on the development of novel algorithms to aid the analysis of microbial community structures. The group also develop engineering systems using microbial communities including microbial fuel cells and filtration systems. He is the author of PyroNoise and AmpliconNoise which are integral to the analysis of many high-throughput metagenomic datasets.

All of the instructors have extensive experience teaching in a short-course/workshop environment. These include the National Evolutionary Synthesis Centre Workshops (NESCent) in Next Generation Sequencing 2011 and 2012 () held at Duke University, North Carolina, USA and the Evomics workshop held every January in Český Krumlov, Czech Republic. The QIIME group hold regular workshops in the US and worldwide. Much of the teaching material for the proposed short course has already been produced, delivered and tested to a large number of audiences. Amazon cloud images for each section of the short course have already been produced by the instructors and have been extensively tested in previous workshops.

The Amazon and Linux training will be based on a modified version of the ‘Unix & Perl for Biologists’7 course adapted for use on the cloud with an extensive EC2 tutorial. Pre-existing in-house workshop materials will be used to teach the basics of remapping, assembly and variant calling. The Stacks software suite will be used in conjunction with R to teach RAD-seq analysis. We will use QIIME4, MEGAN5 and MetaPhlAn6 software packages to teach various aspects of marker based and shotgun metagenomics.

 

References:

  1. Hohenlohe PA, Bassham S, Etter PD, Stiffler, N. Cresko, W.A. Population Genomics of Parallel Adaptation in Threespine Stickleback using Sequenced RAD Tags. PLoS Genetics 2010;6.
  2. J. Catchen, P. Hohenlohe, S. Bassham, A. Amores, and W. Cresko. Stacks: an analysis tool set for population genomics. Molecular Ecology. 2013.
  3. J Rousk, E Bååth, PC Brookes, CL Lauber, C Lozupone, JG Caporaso, R Knight Soil bacterial and fungal communities across a pH gradient in an arable soilThe ISME journal 4 (10), 1340-1351
  4. J. Caporaso et al. QIIME allows analysis of high-throughput community sequencing data. Nature Methods 7, 335-336 (2010)
  5. Huson, DH, Mitra, S, Weber, N, Ruscheweyh, H, and Schuster, SC (2011). Integrative analysis of environmental sequences using MEGAN4. Genome Research, 21:1552-1560
  6. Segata, N et al. Metagenomic microbial community profiling using unique clade-specific marker genes. Nature Methods 9, 811-814 (2012).
  7. Bates, S. et al. Global biogeography of highly diverse protisan communities in soil. ISME J 7: 652-659; Dec. 2012. http://korflab.ucdavis.edu/Unix_and_Perl/