Abstract
We study a family of pattern-detection problems in vertex-colored temporal graphs. In particular, given a vertex-colored temporal graph and a multiset of colors as a query, we search for temporal paths in the graph that contain the colors specified in the query. These types of problems have several applications, for example, in recommending tours for tourists or detecting abnormal behavior in a network of financial transactions. For the family of pattern-detection problems we consider, we establish complexity results and design an algebraic-algorithmic framework based on constrained multilinear sieving. We demonstrate that our solution scales to massive graphs with up to a billion edges for a multiset query with 5 colors and up to 100 million edges for a multiset query with 10 colors, despite the problems being non-deterministic polynomial time-hard. Our implementation, which is publicly available, exhibits practical edge-linear scalability and is highly optimized. For instance, in a real-world graph dataset with >6 million edges and a multiset query with 10 colors, we can extract an optimal solution in <8 minutes on a Haswell desktop with four cores.
Original language | English |
---|---|
Pages (from-to) | 335-362 |
Number of pages | 28 |
Journal | Big Data |
Volume | 8 |
Issue number | 5 |
DOIs | |
Publication status | Published - 1 Oct 2020 |
MoE publication type | A1 Journal article-refereed |
Keywords
- algebraic algorithms
- constrained multilinear sieving
- pattern detection
- temporal paths
- temporal patterns