Saint Louis University |
Computer Science 1020
|
Computer Science Department |
For this assignment, you must work alone. Please make sure you adhere to the policies on academic integrity in this regard.
Topic: Sequence Alignment
Related Reading: class notes
Due:
2:10pm, Monday, 26 March 2018
This homework mirrors an earlier in-class exploration of an algorithm for computing the longest common subsequence, and other more advanced forms of sequence alignment. Specifically, we wish for you to compute alignments that optimize two metrics:
Longest-common Subsequence
A scored alignment in which the metric includes
For a detailed explanation of the algorithms for computing these measures, and for a detailed walkthrough of an example alignment, see this attached description.
Your task is to anaylze a new pair of strings and to compute both an optimal LCS alignment and optimal scored alignment for those strings (akin to what is given in the above detailed walkthrough). Specifically, for each of the two measures, you must do provide the following:
Identify the maximum value of an alignment of the strings.
Portray an alignment that achieves that optimal value. (That is, place one string immediately below the other, with gaps inserted to achieve the desired alignment.)
Provide the entire table of scores that was used to compute the above results.
Your input strings: We have created a custom pair of strings for each student. To download YOUR individual data set, you must use your official SLUnet id (e.g. goldwamh), which is not to be confused with your SLU email address nor your Banner ID. The basic form of the URL to download your data set is http://cs.slu.edu/~goldwasser/1020/homeworks/align/input/username.txt except with your SLUnet username in place of username, using entirely lowercase letters. If you wish, enter your username into the form below (lowercased), and we'll take you directly to your data set.
This assignment is worth 40 points, which will be assessed with 20 points for LCS computations and 20 for the scored alignment, broken down as
For correctly computing the value of the best alignment
For correctly portraying an actual alignment achieving the optimal score
For providing the full table that you used to compute the results.