CSE5290: Algorithms for Bioinformatics

Course Objectives

News:

Administrivia

References:
  1. Gusfield D (1997) Algorithms on Strings, Trees and Sequences
  2. A. Tozeren, S. Byers (2004): New Biology for Engineers and Computer Scientists. Pearson Prentice Hall.
Evaluation: Your marks are here
Project: Possible project topics:

List of topics and Lecture schedule (tentative):

  1. Introduction:
  2. Lecture 1 (Sep 08): Administrivia: Objectives, course overview. My slides are here.
    Please consider participating in the York Programming contests. Details are here.
    Some notes on installing and using R are here.
  3. Lecture 2 (Sep 15). Finish introduction, Genome assembly. My slides are here.
  4. Lecture 3 (Sep 28): Exhaustive search algorithms. Same slides as the last lecture. Fourier methods for genome analysis.
  5. Lecture 4 (Oct 4): Greedy algorithms (Ch 5). Intro to dynamic programming (Ch 6). My slides are here.
  6. Lecture 5 (Oct 18): Sequence alignment using dynamic programming (Ch 6). My slides are here.
  7. Lecture 6 (Oct 25): Sequence assembly using graph algorithms (Ch 7). My slides are here.
  8. Lecture 7 (Nov 2): midterm. Suffix trees and repeat finding. My slides are here.
  9. Lecture 8 (Nov 9): Clustering, phyllogenetic trees. My slides are here. A r-file that demonstrates some problems with the k-means algorithm is here.
  10. Lecture 9 (Nov 16): phyllogenetic trees contd. Markov chains and hidden markov models. My slides are here.
  11. Lecture 10 (Nov 23): Hidden markov models - contd. Randomized algorithms. My slides are here.
  12. Lecture 11 (Nov 30): Extra topics. My slides are here.
Assignments: R resources Other links

While it is not required for the course, this is a good opportunity to start using LaTeX. A good starting point is this page.