[go: up one dir, main page]

Skip to main content
Log in

Using core-periphery structure to predict high centrality nodes in time-varying networks

  • Published:
Data Mining and Knowledge Discovery Aims and scope Submit manuscript

Abstract

Vertices with high betweenness and closeness centrality represent influential entities in a network. An important problem for time varying networks is to know a-priori, using minimal computation, whether the influential vertices of the current time step will retain their high centrality, in the future time steps, as the network evolves. In this paper, based on empirical evidences from several large real world time varying networks, we discover a certain class of networks where the highly central vertices are part of the innermost core of the network and this property is maintained over time. As a key contribution of this work, we propose novel heuristics to identify these networks in an optimal fashion and also develop a two-step algorithm for predicting high centrality vertices. Consequently, we show for the first time that for such networks, expensive shortest path computations in each time step as the network changes can be completely avoided; instead we can use time series models (e.g., ARIMA as used here) to predict the overlap between the high centrality vertices in the current time step to the ones in the future time steps. Moreover, once the new network is available in time, we can find the high centrality vertices in the top core simply based on their high degree. To measure the effectiveness of our framework, we perform prediction task on a large set of diverse time-varying networks. We obtain F1-scores as high as 0.81 and 0.72 in predicting the top m closeness and betweenness centrality vertices respectively for real networks where the highly central vertices mostly reside in the innermost core. For synthetic networks that conform to this property we achieve F1-scores of 0.94 and 0.92 for closeness and betweenness respectively. We validate our results by showing that the practical effects of our predicted vertices match the effects of the actual high centrality vertices. Finally, we also provide a formal sketch demonstrating why our method works.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Notes

  1. In this paper, when we mention highly central vertices, we specifically refer to high closeness or betweenness centrality and not other types of centralities.

  2. We tried with other stretches of size 15, 25 etc. The results do not seem to be affected by such minor variations. Ideally this size should not be too large thus consuming a lot of data for prediction, nor it should be too small thus having too few points to correctly predict. Through experimentation, we find that a size close to 20 strikes an ideal balance.

  3. Note that if we keep increasing the number of top vertices, the prediction results can only get better. Through experiments, we observe that small numbers like 5 and 10 are judicial choices.

References

  • Agneessens F, Borgatti SP, Everett MG (2017) Geodesic based centrality: unifying the local and the global. Soc Netw 49:12–26

    Article  Google Scholar 

  • Alvarez-Hamelin JI, Dall’Asta L, Barrat A, Vespignani A (2006) Large scale networks fingerprinting and visualization using the k-core decomposition. In: Advances in neural information processing systems (NIPS), pp 41–50

  • Baeza-Yates R, Ribeiro-Neto B et al (1999) Modern information retrieval, vol 463. ACM Press, New York

    Google Scholar 

  • Barucca P, Tantari D, Lillo F (2016) Centrality metrics and localization in core-periphery networks. J Stat Mech Theory Exp 2016(2):023401

    Article  MathSciNet  Google Scholar 

  • Batagelj V, Zaversnik M (2003) An o(m) algorithm for cores decomposition of networks. arXiv preprint arXiv:cs.DS/0310049

  • Benyahia O, Largeron C, Jeudy B, Zaïane OR (2016) Dancer: dynamic attributed network with community structure generator. In: Joint European conference on machine learning and knowledge discovery in databases. Springer, pp 41–44

  • Borgatti SP, Carley KM, Krackhardt D (2006) On the robustness of centrality measures under conditions of imperfect data. Soc Netw 28(2):124–136

    Article  Google Scholar 

  • Box GE, Jenkins GM, Reinsel GC, Ljung GM (2015) Time series analysis: forecasting and control. Wiley, Hoboken

    MATH  Google Scholar 

  • Braha D, Bar-Yam Y (2006) From centrality to temporary fame: dynamic centrality in complex networks. Complexity 12(2):59–63

    Article  Google Scholar 

  • Braha D, Bar-Yam Y (2009) Time-dependent complex networks: dynamic centrality, dynamic motifs, and cycles of social interactions. In: Gross T, Sayama H (eds) Adaptive networks: theory, models and applications. Springer studies on complexity. Springer, Berlin, pp 39–50

    Chapter  Google Scholar 

  • Carnes T, Nagarajan C, Wild SM, Van Zuylen A (2007) Maximizing influence in a competitive social network: a follower’s perspective. In: Proceedings of the 9th international conference on electronic commerce. ACM, pp 351–360

  • Freeman LC (1978) Centrality in social networks conceptual clarification. Soc Netw 1(3):215–239

    Article  Google Scholar 

  • Govindan P, Soundarajan S, Eliassi-Rad T, Faloutsos C (2016) Nimblecore: a space-efficient external memory algorithm for estimating core numbers. In: IEEE/ACM international conference on advances in social networks analysis and mining (ASONAM), 2016. IEEE, pp 207–214

  • Gutfraind A, Safro I, Meyers LA (2015) Multiscale network generation. In: 18th international conference on information fusion (FUSION), 2015. IEEE, pp 158–165

  • Hamilton JD (1989) A new approach to the economic analysis of nonstationary time series and the business cycle. Econometrica 57:357–384

    Article  MathSciNet  MATH  Google Scholar 

  • Hempel S, Koseska A, Kurths J, Nikoloski Z (2011) Inner composition alignment for inferring directed networks from short time series. Phys Rev Lett 107(5):054101

    Article  Google Scholar 

  • Hill SA, Braha D (2010) Dynamic model of time-dependent complex networks. Phys Rev E 82(4):046105

    Article  Google Scholar 

  • Holme P (2015) Modern temporal network theory: a colloquium. Euro Phys J B 88(9):1–30

    Article  Google Scholar 

  • Holme P, Saramäki J (2012) Temporal networks. Phys Rep 519(3):97–125

    Article  Google Scholar 

  • Khaouid W, Barsky M, Srinivasan V, Thomo A (2015) K-core decomposition of large networks on a single PC. Proc VLDB Endow 9(1):13–23

    Article  Google Scholar 

  • Kim H, Anderson R (2012) Temporal node centrality in complex networks. Phys Rev E 85(2):026107

    Article  Google Scholar 

  • Kim H, Tang J, Anderson R, Mascolo C (2012) Centrality prediction in dynamic human contact networks. Comput Netw 56(3):983–996

    Article  Google Scholar 

  • Kitsak M, Gallos LK, Havlin S, Liljeros F, Muchnik L, Stanley HE, Makse HA (2010) Identification of influential spreaders in complex networks. Nat Phys 6(11):888–893

    Article  Google Scholar 

  • Kunegis J (2013) Konect: the Koblenz network collection. In: Proceedings of the 22nd international conference on world wide web (WWW). ACM, pp 1343–1350

  • Lerman K, Ghosh R, Kang JH (2010) Centrality metric for dynamic networks. In: Proceedings of the 8th workshop on mining and learning with graphs. ACM, pp 70–77

  • Leskovec J, Krevl A (2014) SNAP datasets: Stanford large network dataset collection. http://snap.stanford.edu/data. Accessed Jan 2016

  • Liu HL, Ma C, Xiang BB, Tang M, Zhang HF (2018) Identifying multiple influential spreaders based on generalized closeness centrality. Phys A 492:2237–2248

    Article  Google Scholar 

  • Morselli C, Masias VH, Crespo F, Laengle S (2013) Predicting sentencing outcomes with centrality measures. Secur Inf 2(1):4

    Article  Google Scholar 

  • Nicosia V, Tang J, Mascolo C, Musolesi M, Russo G, Latora V (2013) Graph metrics for temporal networks. In: Holme P, Saramäki J (eds) Temporal networks, understanding complex systems. Springer, Berlin, pp 15–40

    Google Scholar 

  • OBrien MP, Sullivan BD (2014) Locally estimating core numbers. In: IEEE international conference on data mining (ICDM), 2014. IEEE, pp 460–469

  • Peng C, Kolda TG, Pinar A (2014) Accelerating community detection by using k-core subgraphs. arXiv preprint arXiv:1403.2226

  • Rozenshtein P, Gionis A (2016) Temporal pagerank. In: Joint European conference on machine learning and knowledge discovery in databases. Springer, pp 674–689

  • Scherrer A, Borgnat P, Fleury E, Guillaume JL, Robardet C (2008) Description and simulation of dynamic mobility networks. Comput Netw 52(15):2842–2858

    Article  MATH  Google Scholar 

  • Seidman SB (1983) Network structure and minimum degree. Soc Netw 5(3):269–287

    Article  MathSciNet  Google Scholar 

  • Sikdar S, Ganguly N, Mukherjee A (2016) Time series analysis of temporal networks. Euro Phys J B 89(1):1–11

    Article  Google Scholar 

  • Taylor D, Myers SA, Clauset A, Porter MA, Mucha PJ (2015) Eigenvector-based centrality measures for temporal networks. arXiv preprint arXiv:1507.01266

  • Ufimtsev V, Sarkar S, Mukherjee A, Bhowmick S (2016) Understanding stability of noisy networks through centrality measures and local connections. In: Proceedings of the 25th ACM international on conference on information and knowledge management (CIKM). ACM, pp 2347–2352

  • Yang Y, Dong Y, Chawla NV (2014) Predicting node degree centrality with the node prominence profile. Sci Rep 4:7236

    Article  Google Scholar 

  • Zhou H, Xu S, Huang C (2015) Temporal centrality prediction in opportunistic mobile social networks. In: International conference on internet of vehicles. Springer, pp 68–77

  • Zhou H, Leung VC, Zhu C, Xu S, Fan J (2017) Predicting temporal social contact patterns for data forwarding in opportunistic mobile networks. IEEE Trans Veh Technol 66(11):10372–10383

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Soumya Sarkar.

Additional information

Responsible editors: Jesse Davis, Elisa Fromont, Derek Greene, Björn Bringmann.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sarkar, S., Sikdar, S., Bhowmick, S. et al. Using core-periphery structure to predict high centrality nodes in time-varying networks. Data Min Knowl Disc 32, 1368–1396 (2018). https://doi.org/10.1007/s10618-018-0574-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10618-018-0574-x

Keywords

Navigation