Abstract
The Spliced Alignment Problem is a well-known problem in Bioinformatics with applications to the gene prediction task. It consists of finding an ordered subset of non-overlapping substrings of a subject sequence g that best fits a target sequence t. In this work we present an approximation algorithm for a variant of the problem, called the Multiple Spliced Alignment Problem, that involves more than one target sequence. Under a metric, the algorithm is proved to be a 3-approximation for the problem, and in practice it gives good results that compare to those obtained by four heuristics previously developed for the Multiple Spliced Alignment Problem.
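To make the single-target problem statement concrete, the following is a minimal brute-force sketch in Python. It is not the approximation algorithm of the paper. It assumes that "best fits" means minimum unit-cost edit distance, that the candidate substrings of g are supplied as half-open (start, end) index pairs, and that at least one block must be chosen; the names `edit_distance`, `is_chain`, and `spliced_alignment` are hypothetical. The search is exponential in the number of candidates and is meant only for tiny inputs.

```python
from itertools import combinations


def edit_distance(a: str, b: str) -> int:
    """Unit-cost edit (Levenshtein) distance, computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            curr.append(min(prev[j] + 1,                # delete ca
                            curr[j - 1] + 1,            # insert cb
                            prev[j - 1] + (ca != cb)))  # match or substitute
        prev = curr
    return prev[-1]


def is_chain(blocks) -> bool:
    """True if consecutive half-open blocks do not overlap."""
    return all(blocks[k][1] <= blocks[k + 1][0]
               for k in range(len(blocks) - 1))


def spliced_alignment(g: str, t: str, candidates):
    """Try every ordered, non-overlapping subset of candidate blocks of g
    and return (cost, chain) for the one whose concatenation is closest
    to t under edit distance (assumed objective)."""
    ordered = sorted(candidates)  # sorting by start keeps chains in order
    best_cost, best_chain = float("inf"), ()
    for r in range(1, len(ordered) + 1):
        for chain in combinations(ordered, r):
            if not is_chain(chain):
                continue
            spliced = "".join(g[s:e] for s, e in chain)
            cost = edit_distance(spliced, t)
            if cost < best_cost:
                best_cost, best_chain = cost, chain
    return best_cost, best_chain


if __name__ == "__main__":
    g = "ACGTTTGGCATT"
    t = "ACGGGCATT"
    candidates = [(0, 3), (5, 8), (6, 9), (9, 12)]
    print(spliced_alignment(g, t, candidates))
```

In this toy instance the blocks (5, 8) and (6, 9) overlap, so no chain may contain both. The multiple-target variant would score a chain against several targets at once; how those scores are combined is not stated in the abstract, so it is left out of the sketch.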
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Mazaro, R.B., de Lima, L.I.S., Adi, S.S. (2014). A 3-Approximation Algorithm for the Multiple Spliced Alignment Problem and Its Application to the Gene Prediction Task. In: Moura, E., Crochemore, M. (eds) String Processing and Information Retrieval. SPIRE 2014. Lecture Notes in Computer Science, vol 8799. Springer, Cham. https://doi.org/10.1007/978-3-319-11918-2_13
DOI: https://doi.org/10.1007/978-3-319-11918-2_13
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11917-5
Online ISBN: 978-3-319-11918-2
eBook Packages: Computer Science (R0)