Special Topics in Math Biology
Of related interest: 1997 CMU School of Computer Science
Alchemy of Genomics
The genome contained within a human cell is very large and complex.
It holds all of the genetic information necessary for the cell's creation
and function, encoded in a total of about six feet of DNA. The goals of the
Human Genome Initiative (HGI), as framed by the National Institutes of
Health and the Department of Energy, are to generate a complete map,
containing well-defined markers, and to sequence the entire human
genome within the next seven years or fewer. The sequencing aspects
of this project will have to deal with approximately 3 billion base
pairs. A large number of genes (70,000-100,000) will be identified
and characterized in terms of biochemical, developmental, and clinical
criteria. Additionally, the development of approaches to characterize
messages (RNA transcripts, which direct the synthesis of specific
proteins) globally and quantitatively will also play a major role in
virtually every aspect of biological, pharmaceutical and clinical
The sciences of computational genomics and bioinformatics have been
created out of this massive sea of sequence data and the need to
establish the function of genes largely from similarities
discerned at the level of the DNA code, bypassing the need for
extensive biochemical characterization.
This emerging subfield relies on some classical and many novel
mathematical, statistical and algorithmic ideas that are essential to
accomplish this task. This course deals mainly with these mathematical
and computational approaches. The course is self-contained, developing
the biological, statistical, probabilistic and algorithmic tools and
techniques along the way.
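As a small illustration of what "similarities discerned at the level of the DNA code" can mean in practice, the sketch below scores a global alignment of two sequences with the classical Needleman-Wunsch dynamic program. It is not taken from the course materials: the function name and the scoring values (match +1, mismatch -1, gap -2) are assumptions chosen only for this example.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Return the Needleman-Wunsch global alignment score of a and b.

    The scoring parameters are illustrative assumptions, not values
    prescribed by the course.
    """
    # prev[j] holds the best score for aligning the prefix of `a`
    # processed so far against the first j characters of `b`.
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        # Aligning the first i characters of `a` to an empty prefix of `b`
        # costs i gaps.
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            curr.append(max(
                prev[j - 1] + pair,  # align a[i-1] with b[j-1]
                prev[j] + gap,       # a[i-1] aligned to a gap
                curr[j - 1] + gap,   # b[j-1] aligned to a gap
            ))
        prev = curr
    return prev[-1]


if __name__ == "__main__":
    # Two short, made-up DNA fragments that differ by one inserted base.
    print(global_alignment_score("GATTACA", "GATTTACA"))
```

A higher score indicates greater similarity under the chosen scoring scheme. Only two rows of the table are kept, so memory use is linear in the length of the second sequence; recovering the alignment itself, rather than just its score, would require keeping the full table or a traceback.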
Professor B. Mishra
Introduction & History.
Some Molecular Biology: DNA, Transfer RNA and Protein
DDP (Double Digestion Problem): Complexity and Algorithms.
Cloning and Clone Libraries.
Physical Genome Maps (Oceans, Islands and Anchors):
Alignment of Two and Multiple Sequences.
Lander-Waterman Statistics and Applications to Sequence
RNA Secondary Structure.
Optical Mapping and Map-Based Sequence Assembly.
Office Hours: TBA
Office Phone: 212.998.3464
Email Address: firstname.lastname@example.org
- Day and Time:
Tuesdays, 5:00-6:50pm EST
Room 101, WWH, 251 Mercer Street.
- Credits for Course:
Mathematical Maturity, Combinatorics, Statistics and
- Required Text(s):
Statistical Genomics: Linkage, Mapping and QTL
By Ben Hui Liu, CRC Press, ISBN 0-8493-3166-8.
Introduction to Computational Molecular Biology.
By Setubal & Meidanis, PWS Publishing Company, ISBN 0-534-95262-3.
Introduction to Computational Biology: Maps, Sequences
By Michael Waterman, Chapman and Hall, ISBN 0-412-99391-0
Analysis of Human Genetic Linkage,
By Jurg Ott, The Johns Hopkins University Press, ISBN
- Recommended Text(s):
Principles of Genome Analysis.
By S.B. Primrose, Blackwell Science, ISBN 0-86542-946-4.
The Human Genome Project: Deciphering the Blueprint of
Edited by N.G. Cooper, University Science Books, ISBN
- Midterm Date:
- Final Date:
November 1 1998