Abstract
In this paper, we present a new model for RNA multiple sequence structural alignment based on the longest common subsequence. We consider both the off-line and on-line cases. For the off-line case, i.e., when the longest common subsequence is given as a linear graph with n vertices, we first present a polynomial O(n 2) time algorithm to compute its maximum nested loop. We then consider a slightly different problem – the Maximum Loop Chain problem and present a factor-2 approximation which runs in O(n 2.5) time. For the on-line case, i.e., given m RNA sequences of lengths n, compute the longest common subsequence of them such that this subsequence either induces a maximum nested loop or the maximum number of matches, we present efficient algorithms using dynamic programming when m is small.
This research is partially supported by EPSCOR Visiting Scholar’s Program and MSU Short-term Professional Development Program.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Cormen, T., Leiserson, C., Rivest, R., Stein, C.: Introduction to Algorithms, 2nd edn. MIT Press, Cambridge (2001)
Dayhoff, M.: Computer aids to protein sequence determination. J. Theoret. Biology 8(1), 97–112 (1965)
Dayhoff, M.: Computer analysis of protein evolution. Scientific American 221(1), 86–95 (1969)
Davydov, E., Batzoglu, S.: A computational model for RNA multiple structural alignment. In: Sahinalp, S.C., Muthukrishnan, S.M., Dogrusoz, U. (eds.) CPM 2004. LNCS, vol. 3109, pp. 254–269. Springer, Heidelberg (2004)
Deng, X., Li, G., Li, Z., Ma, B., Wang, L.: A PTAS for distinguishing (sub)string selection. In: Widmayer, P., Triguero, F., Morales, R., Hennessy, M., Eidenbenz, S., Conejo, R. (eds.) ICALP 2002. LNCS, vol. 2380, pp. 740–751. Springer, Heidelberg (2002)
Eddy, S.R.: Computational genomics of noncoding RNA genes. Cell 109, 137–140 (2002)
Goldman, D., Istrail, S., Papadimitriou, C.: Algorithmic aspects of protein structure similarity. In: Proc. 40th Ann. Symp. Foundations of Computer Science (FOCS 1999), pp. 512–522 (1999)
Gabow, H.: An efficient implementation of Edmond’s algorithm for maximum matching on graphs. J. ACM 23(2), 221–234 (1976)
Greenberg, R.I.: Bounds on the Number of the Longest Common Subsequence Problem. CoRR cs.DM/0301030 (2003)
Hirschberg, D.: The longest common subsequence problem. PhD Thesis, Princeton University (1975)
Hsu, W.J., Du, M.W.: Computing a longest common subsequence for a set of strings. BIT 24, 45–59 (1984)
Jiang, T., Li, M.: On the approximation of shortest common supersequences and longest common subsequences. SIAM J. Comput. 24(5), 1122–1139 (1995)
Lanctot, K., Li, M., Ma, B., Wang, S., Zhang, L.: Distinguishing string selection problems. In: Proc. 6th Ann. ACM-SIAM Symp. on Discrete Algorithms, pp. 633–642 (1999)
Li, M., Ma, B., Wang, L.: Finding similar regions in many strings. In: Proc. 31st ACM Symp. on Theory of Computing (STOC 1999), pp. 473–482 (1999)
Micali, S., Vazirani, V.: An O(_|V ||E|2) algorithm for finding maximum matching in general graphs. In: Proc. 21st Ann. Symp. Foundations of Computer Science (FOCS 1980), pp. 17–27 (1980)
Nussinov, R., Pieczenik, G., Griggs, J., Kleitman, D.: Algorithms for loop matching. SIAM J. Applied Math. 35, 68–82 (1978)
Rick, C.: Efficient Computation of All Longest Common Subsequences. In: Halldórsson, M.M. (ed.) SWAT 2000. LNCS, vol. 1851, pp. 407–418. Springer, Heidelberg (2000)
Smith, T.F., Waterman, M.S.: Identification of common molecular subsequences. J. Molecular Biology 147, 195–197 (1981)
Zucker, M., Stiegler, P.: Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information. Nucleic Acids Research 9, 133–148 (1981)
Zucker, M.: Computer prediction of RNA structure. Methods in Enzymology 180, 262–288 (1989)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bereg, S., Zhu, B. (2005). RNA Multiple Structural Alignment with Longest Common Subsequences. In: Wang, L. (eds) Computing and Combinatorics. COCOON 2005. Lecture Notes in Computer Science, vol 3595. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11533719_6
Download citation
DOI: https://doi.org/10.1007/11533719_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28061-3
Online ISBN: 978-3-540-31806-4
eBook Packages: Computer ScienceComputer Science (R0)