TY - GEN
T1 - Finding all minimal infrequent multi-dimensional intervals
AU - Elbassioni, Khaled M.
PY - 2006
Y1 - 2006
N2 - Let D be a database of transactions on n attributes, where each attribute specifies a (possibly empty) real closed interval I = [a,b] ⊆ ℝ. Given an integer threshold t, a multi-dimensional interval I = ([a1, b1], . . . ,[an, bn]) is called t-frequent, if (every component interval of) I is contained in (the corresponding component of) at least t transactions of D and otherwise, I is said to be t-infrequent. We consider the problem of generating all minimal t-infrequent multi-dimensional intervals, for a given database D and threshold t. This problem may arise, for instance, in the generation of association rules for a database of time-dependent transactions. We show that this problem can be solved in quasi-polynomial time. This is established by developing a quasi-polynomial time algorithm for generating maximal independent elements for a set of vectors in the product of lattices of intervals, a result which may be of independent interest. In contrast, the generation problem for maximal frequent intervals turns out to be NP-hard.
AB - Let D be a database of transactions on n attributes, where each attribute specifies a (possibly empty) real closed interval I = [a,b] ⊆ ℝ. Given an integer threshold t, a multi-dimensional interval I = ([a1, b1], . . . ,[an, bn]) is called t-frequent, if (every component interval of) I is contained in (the corresponding component of) at least t transactions of D and otherwise, I is said to be t-infrequent. We consider the problem of generating all minimal t-infrequent multi-dimensional intervals, for a given database D and threshold t. This problem may arise, for instance, in the generation of association rules for a database of time-dependent transactions. We show that this problem can be solved in quasi-polynomial time. This is established by developing a quasi-polynomial time algorithm for generating maximal independent elements for a set of vectors in the product of lattices of intervals, a result which may be of independent interest. In contrast, the generation problem for maximal frequent intervals turns out to be NP-hard.
UR - http://www.scopus.com/inward/record.url?scp=33745630312&partnerID=8YFLogxK
U2 - 10.1007/11682462_40
DO - 10.1007/11682462_40
M3 - Conference contribution
AN - SCOPUS:33745630312
SN - 354032755X
SN - 9783540327554
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 423
EP - 434
BT - LATIN 2006
T2 - LATIN 2006: Theoretical Informatics - 7th Latin American Symposium
Y2 - 20 March 2006 through 24 March 2006
ER -