You are here

Mining Time-constrained Sequential Patterns with Constraint Programming

TitleMining Time-constrained Sequential Patterns with Constraint Programming
Publication TypeJournal Article
Year of Publication2017
AuthorsAoga, John O. R., Guns Tias, and Schaus Pierre
JournalConstraints
Keywordsconstraint programming, Data mining, Gap constraint, Global Constraint, Sequential pattern mining, Span constraint, Time constraint
Abstract

Constraint Programming (CP) has proven to be an effective platform for constraint-based sequence mining. Previous work has focused on standard frequent sequence mining, as well as frequent sequence mining with a maximum ’gap’ between two matching events in a sequence. The main challenge in the latter is that this constraint can not be imposed independently of the omnipresent frequency constraint. Indeed, the gap constraint changes whether a subsequence is included in a sequence, and hence its frequency. In this work, we go beyond that and investigate the integration of timed events and constraining the minimum/maximum gap as well as minimum/maximum span. The latter constrains the allowed time between the first and last matching event of a pattern. We show how the three are interrelated, and what the required changes to the frequency constraint are. Key in our approach is the concept of an extension window defined by gap/span and we develop techniques to avoid scanning the sequences needlessly, as well as using a backtracking-aware data structure. Experiments demonstrate that the proposed approach outperforms both specialized and CP-based approaches in almost all cases and that the advantage increases as the minimum frequency threshold decreases. This paper is an extension of the original manuscript presented at CPAIOR’17 [1].


References

  1. Mining Time-constrained Sequential Patterns with Constraint Programming, Aoga, John O. R., Guns Tias, and Schaus Pierre , International Conference on Integration of AI and OR Techniques in Constraint Programming for Combinatorial Optimization Problems (CPAIOR17), (2017)
Full text: