Skip to content

Latest commit

 

History

History
53 lines (37 loc) · 2.89 KB

command-line-workflows.md

File metadata and controls

53 lines (37 loc) · 2.89 KB

The command line for bioinformatics workflows

In this lesson we'll go through an introduction to what sequencing is, learn about the genomic data file types and how to start processing genomic data using bioinformatics tools at the command line.

Learning objectives

  • Knowledge of cloud computing resources and why it's important in bioinformatics
  • Understand how to connect to cloud computing resources via ssh
  • Review navigating at the command line
  • Understand the genomic data file format, FASTQ
  • How to use command line functions to look at and get information about FASTQ files
  • Understand a general bioinformatics workflow
  • Understand how to use command line bioinformatics tools to process genomic data, using FastQC as an example
  • Know where to find more information on working with genomic data and specific analyses

Lesson Outline

I. Using cloud computing for fun and genomic data

Goals: Discuss needs for cloud computing and what are some options. Connect to Amazon AWS resources using ssh. Provide documentation on how to get access to cloud computing resources.

Materials: Introduction to cloud computing

Additional resources:

II. The genomic data file type and the command line

Goals: Work with the FASTQ file type at the command line.

Materials:

III. Running bioinformatics programs

Goals: How to run bioinformatics programs at the command line, using FastQC as an example.

Materials: Bioinformatics programs - quality control

Other Resources

Lessons

Articles