ACM International Conference Proceeding Series, Pages 7-11 , 27/12/2017

Using suffix tray and longest previous factor for pattern searching

Jongsuk Kongsen, Supaporn Chairungsee

Abstract

Finding patterns in genomic sequences needs an effective searching algorithm. Although there are different types of string repetition such as tandem repeat, palindrome, or clumps, we are interested in searching for the pattern occurrences in a sequence which can be overlapped. We propose a new algorithm constructed with the suffix tray data structure to ensure the linear time running. We present the algorithm to compute the Longest Previous Factor of a string from its suffix tray. We also apply the notion of suffix links and the attribute m. The proposed algorithm to compute the Longest Previous Factor table of the text, t, of length n requires O(n) time.

Document Type

Conference Paper

Source Type

Conference Proceeding

ISBN

[9781450363518]

ISSN

Keywords

Clustered-clumpsLongest previous factorSuffix linkSuffix tray

ASJC Subject Area

Computer Science : SoftwareComputer Science : Computer Networks and CommunicationsComputer Science : Human-Computer InteractionComputer Science : Computer Vision and Pattern Recognition


Bibliography


Kongsen, J., & Chairungsee, S. (2017). Using suffix tray and longest previous factor for pattern searching. ACM International Conference Proceeding Series7-11. doi:10.1145/3176653.3176662

Copy | Save