Abstract
This chapter is the longest in the book as it deals with both general principles and practical aspects of sequence and, to a lesser degree, structure analysis. Although these methods are not, in themselves, part of genomics, no reasonable genome analysis and annotation would be possible without understanding how these methods work and some practical experience with their use. Inappropriate use of sequence analysis procedures may result in numerous errors in genome annotation (we have already touched upon this subject in the previous chapter and further discuss it in Chapter 5). We attempted to strike a balance between generalities and specifics, aiming to give the reader a clear perspective of the computational approaches used in comparative and functional genomics, rather than discuss any one of these approaches in great detail. In particular, we refrained from any extensive discussion of the statistical basis and algorithmic aspects of sequence analysis because these can be found in several recent books on computational biology and bioinformatics (see ♦4.8) and, no less importantly, because we cannot claim advanced expertise in this area. We also tried not to duplicate the “click here”-type tutorials, which are available on many web sites. However, we deemed it important to point out some difficult and confusing issues in sequence analysis and warn the readers against the most common pitfalls.
We aligned sequences by eye. Troy CS, MacHugh DE, Bailey JF, Magee DA, Loftus RT, Cunningham P, Chamberlain AT, Sykes BC, Bradley DG. Genetic evidence for Near-Eastern origins of European cattle. Nature, 2001, vol. 410, p. 1091
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Further Reading
Doolittle RF. 1986. Of Urfs and Orfs: A primer on how to analyze derived amino acid sequences. University Science Books, San Diego.
Baxevanis AD and Ouellette BFF (eds). 2001. Bioinformatics: a practical guide to the analysis of genes and proteins. John Wiley & Sons, New York.
Mount DW. 2000. Bioinformatics: Sequence and genome analysis. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. Chapter 1.
Durbin R, Eddy SR, Krogh A and Mitchison G. 1997. Biological Sequence Analysis: Probabilistic models of proteins and nucleic acids. Cambridge University Press, Cambridge, UK.
Waterman MS. 1995. Introduction to Computational BiologyMaps, Sequences and Genomes. CRC Press, Boca Raton, FL.
Gusfield D. 1997. Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge University Press, Cambridge, UK.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Koonin, E.V., Galperin, M.Y. (2003). Principles and Methods of Sequence Analysis. In: Sequence — Evolution — Function. Springer, Boston, MA. https://doi.org/10.1007/978-1-4757-3783-7_5
Download citation
DOI: https://doi.org/10.1007/978-1-4757-3783-7_5
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4419-5321-6
Online ISBN: 978-1-4757-3783-7
eBook Packages: Springer Book Archive