A unified platform for higher-order interaction modeling in dynamic networks.
Understanding relations arising out of interactions among entities can be very difficult, and predicting them is even more challenging. This problem has many applications in various fields, such as financial networks and e-commerce. These relations can involve much more complexities than just involv- ing more than two entities. One such scenario is evolving recursive relations between multiple entities, and so far, this is still an open problem. This work addresses the problem of forecasting higher-order interaction events that can be multirelational and recursive. We pose the problem in the frame- work of representation learning of temporal hypergraphs that can capture complex relationships involving multiple entities. The proposed model, Relational Recursive Hyperedge Temporal Point Process (RRHyperTPP) uses an encoder that learns a dynamic node representation based on the historical interaction patterns and then a hyperedge link prediction- based decoder to model the occurrence of interaction events. These learned representations are then used for downstream tasks involving forecasting the type and time of interactions. The main challenge in learning from hyperedge events is that the number of possible hyperedges grows exponentially with the number of nodes in the network. This will make the computation of negative log-likelihood of the temporal point process expensive, as the calculation of survival function requires a summation over all possible hyperedges. In our work, we develop a noise contrastive estimation method to learn the parameters of our model, and we have experimentally shown that our models perform better than previous state-of-the-art methods for interaction forecasting.
We use a multi-relational recursive hyperedges structure to incorporate these complex higher-order interactions. Here, hyperedges can act as nodes in other hyperedges (like CCed or receiver groups in email interactions) and contain relation types indicating their role in the interactions. Then, a TPP is defined with these hyperedges as event types with a conditional intensity function parameterized using representations/embeddings of the nodes and re- lations involved in the interactions.
Furthermore, the previous approaches in TPP-based interaction forecasting use negative sampling to approximate the loss associated with the survival function, which is intractable to calculate due to the huge number of possible event types in the TPP. This can introduce biases in training, especially in these complex interactions. To address these challenges, we propose the model Relational Recursive Hyperedge Temporal Point Process (RRHyperTPP). The following are the contributions of this work:
Problem. Given E(t) = {(e1, t1), ..., (en, tn)} historical interactions until time t > tn, we aim to forecast the next interaction (tn+1, en+1), where tn+1 > t > tn. Here, we want to estimate the time tn+1 and the type of interaction en+1.
This work mainly focuses on higher-order interactions in communications networks and real-world events stored in source and target entities. The higher-order interaction in communications networks is represented using a recur- sive hyperedge graph of depth one h = {(hi0, ri0)}i=0k0. Here, {ri0}i=03 relations are used to represent the sender, recipients, and carbon-copied recipients’ addresses in case of email exchange and {ri0}i=04 represents the sender, re- ceiver, retweeter, and retweeted nodes in the Twitter network. The higher-order interactions in real-world events are repre- sented using depth two hyperedges h = {(h01, r01), (h11, r11)} and h0/11 = {(hi0, ri0)}i=03. Here, h01, h11 are the two entities involved in the interactions, r01 and r11 are the inverse relation pairs indicating the type of interactions, {ri0}i=03 are relations indicating the actor, sector, and country of the entity. Figure 1 shows an example of this type of higher-order interaction.
The RRHyperTPP model is a temporal point process (TPP) designed for hypergraphs. It defines event probabilities using a conditional intensity function and a survival function. The likelihood function is optimized using Noise Contrastive Estimation (NCE) instead of Maximum Likelihood Estimation (MLE).
The probability of a hyperedge event h occurring at time t is given by:
Pₕ(t) = λₕ(t) Sₕ(tₚₕ, t)
We evaluate our model on a real-world dataset of email interactions and Twitter retweet events. The dataset contains information about the sender, receiver, and carbon-copied recipients in the case of email interactions and sender, receiver, retweeter, and retweeted nodes in the case of Twitter retweets.
Our code is available on GitHub.