Computer Science 1020
Introduction to Computer Science: Bioinformatics

Homework Assignment 05

Sequence Alignment

Collaboration Policy

For this assignment, you must work alone. Please make sure you adhere to the policies on academic integrity in this regard.

Overview

Topic: Sequence Alignment
Related Reading: class notes
Due: 2:10pm, Monday, 26 March 2018

This homework mirrors an earlier in-class exploration of an algorithm for computing the longest common subsequence, and other more advanced forms of sequence alignment. Specifically, we wish for you to compute alignments that optimize two metrics:

Longest-common Subsequence
A scored alignment in which the metric includes
- +1 for a matched pair that are aligned
- -1 for a mismatched pair that are aligned
- -1 for a character matched with an inserted gap in the opposite sequence

Detailed Walkthrough

For a detailed explanation of the algorithms for computing these measures, and for a detailed walkthrough of an example alignment, see this attached description.

Your Task

Your task is to anaylze a new pair of strings and to compute both an optimal LCS alignment and optimal scored alignment for those strings (akin to what is given in the above detailed walkthrough). Specifically, for each of the two measures, you must do provide the following:

Identify the maximum value of an alignment of the strings.
Portray an alignment that achieves that optimal value. (That is, place one string immediately below the other, with gaps inserted to achieve the desired alignment.)
Provide the entire table of scores that was used to compute the above results.

For your convenience, here is a blank form that you can use for your solutions (although you are not required to use this particular form).

Your input strings: We have created a custom pair of strings for each student. To download YOUR individual data set, you must use your official SLUnet id (e.g. goldwamh), which is not to be confused with your SLU email address nor your Banner ID. The basic form of the URL to download your data set is http://cs.slu.edu/~goldwasser/1020/homeworks/align/input/username.txt except with your SLUnet username in place of username, using entirely lowercase letters. If you wish, enter your username into the form below (lowercased), and we'll take you directly to your data set.

Grading Standards

This assignment is worth 40 points, which will be assessed with 20 points for LCS computations and 20 for the scored alignment, broken down as

(6 points)
For correctly computing the value of the best alignment
(6 points)
For correctly portraying an actual alignment achieving the optimal score
(8 points)
For providing the full table that you used to compute the results.

Michael Goldwasser

CSCI 1020, Spring 2018
Last modified: Sunday, 18 March 2018

Saint Louis University

Computer Science 1020
Introduction to Computer Science: Bioinformatics

Michael Goldwasser

Spring 2018

Computer Science Department

Homework Assignment 05

Sequence Alignment

Contents:

Collaboration Policy

Overview

Detailed Walkthrough

Your Task

Grading Standards

Saint Louis University

Computer Science 1020 Introduction to Computer Science: Bioinformatics

Michael Goldwasser

Spring 2018

Computer Science Department

Homework Assignment 05

Sequence Alignment

Contents:

Collaboration Policy

Overview

Detailed Walkthrough

Your Task

Grading Standards

Computer Science 1020
Introduction to Computer Science: Bioinformatics