CINECA IRIS Institutional Research Information System

We introduce a new notion of motifs, called masks, that succinctly represents the repeated patterns for an input sequence T of n symbols drawn from an alphabet Σ. We show how to build the set of all frequent maximal masks of length L in O (2 L n) time and space in the worst case, using the Karp-Miller-Rosenberg approach. We analytically show that our algorithm performs better than the method based on constant-time enumerating and checking all the potential (| Σ | + 1) L candidate patterns in T, after a polynomial-time preprocessing of T. Our algorithm is also cache-friendly, attaining O (2 L s o r t (n)) block transfers, where s o r t (n) is the cache complexity of sorting n items.

Masking Patterns in Sequences: A New Class of Motif Discovery with Don't Cares

G. BATTAGLIA;GROSSI, ROBERTO;D. CANGELOSI;PISANTI, NADIA

2009-01-01

Abstract

We introduce a new notion of motifs, called masks, that succinctly represents the repeated patterns for an input sequence T of n symbols drawn from an alphabet Σ. We show how to build the set of all frequent maximal masks of length L in O (2 L n) time and space in the worst case, using the Karp-Miller-Rosenberg approach. We analytically show that our algorithm performs better than the method based on constant-time enumerating and checking all the potential (| Σ | + 1) L candidate patterns in T, after a polynomial-time preprocessing of T. Our algorithm is also cache-friendly, attaining O (2 L s o r t (n)) block transfers, where s o r t (n) is the cache complexity of sorting n items.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2009
			
	Codice DOI
	
				https://dx.doi.org/10.1016/j.tcs.2009.07.014
			
	Tutti gli autori
	
						G., Battaglia; Grossi, Roberto; D., Cangelosi; Pisanti, Nadia
					
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11568/131273

Citazioni

ND

7

5

social impact