Skip to main content

Analysis of Network Flow Data

  • Chapter
  • First Online:

Part of the book series: Use R! ((USE R,volume 65))

Abstract

Many networks serve as conduits—either literally or figuratively—for flows , in the sense that they facilitate the movement of something, such as materials, people, or information. For example, transportation networks (e.g., of highways, railways, and airlines) support flows of commodities and people, communication networks allow for the flow of data, and networks of trade relations among nations reflect the flow of capital.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Data were originally collected by Manfred Fischer and Petra Staufer.

  2. 2.

    More precisely, we can write (9.7) in the form \( \log (\mu ) = \mathbf{M}\gamma \), where M is an (IJ) × (I + J + K) matrix, and \( \gamma = {(\alpha _{1},\ldots,\alpha _{I},\beta _{1},\ldots,\beta _{J},\theta _{1},\ldots,\theta _{K})}^{T} \) is an (I + J + K) × 1 vector. The first I + J columns of M are binary vectors, indicating the appropriate origin and destination for each entry of μ, and are redundant in that both the first I and the next J sum to the unit vector. The last K columns correspond to the K variables defining the c ij . Assuming that the latter are linearly independent of themselves and of the former, the rank of M will be (I + J − 1) + K. See Sen and Smith [130, Chap. 5.2].

  3. 3.

    The AIC statistic for a likelihood-based model, with k-dimensional parameter η, is defined as \( AIC = -2\ell(\hat{\eta }) + 2k \), where \( \ell(\eta ) \) is the log-likelihood evaluated at η, and \( \hat{\eta } \) is the maximum likelihood estimate of η. This statistic, as with others of its type, provides an estimate of the generalization error associated with the fitted model, in this case effectively by off-setting the assessment of how well the model fits the data by a measure of its complexity. See, for example, Hastie, Tibshirani, and Friedman [71, Chap. 7.5] for additional details.

  4. 4.

    If multiple routes are possible, the entries of B are instead fractions representing, for example, the proportion of traffic from i to j that is expected to use the link e.

References

  1. J. Cao, D. Davis, S. Wiel, B. Yu, Time-varying network tomography: router link data. J. Am. Stat. Assoc. 95(452), 1063–1075 (2000)

    Article  MATH  Google Scholar 

  2. H. Carey, Principles of Social Science (Lippincott, Philadelphia, 1858)

    Google Scholar 

  3. R. Castro, M. Coates, G. Liang, R. Nowak, B. Yu, Network tomography: recent developments. Stat. Sci. 19(3), 499–517 (2004)

    Article  MATH  MathSciNet  Google Scholar 

  4. W. Deming, F. Stephan, On a least squares adjustment of a sampled frequency table when the expected marginal totals are known. Ann. Math. Stat. 11(4), 427–444 (1940)

    Article  MathSciNet  Google Scholar 

  5. M. Fischer, S. Gopal, Artificial neural networks: a new approach to modeling interregional telecommunication flows. J. Reg. Sci. 34(4), 503–527 (1994)

    Article  Google Scholar 

  6. T. Hastie, R. Tibshirani, J. Friedman, The Elements of Statistical Learning (Springer, New York, 2001)

    Book  MATH  Google Scholar 

  7. P. McCullagh, J. Nelder, Generalized Linear Models (Chapman & Hall/CRC, London, 1989)

    Book  MATH  Google Scholar 

  8. A. Sen, T. Smith, Gravity Models of Spatial Interaction Behavior (Springer, Berlin, 1995)

    Book  Google Scholar 

  9. J. Stewart, An inverse distance variation for certain social influences. Science 93(2404), 89–90 (1941)

    Article  Google Scholar 

  10. Y. Vardi, Network tomography: estimating source-destination traffic intensities from link data. J. Am. Stat. Assoc. 91(433), 365–377 (1996)

    Article  MATH  MathSciNet  Google Scholar 

  11. Y. Zhang, M. Roughan, C. Lund, D. Donoho, An information-theoretic approach to traffic matrix estimation. In Proceedings of SIGCOMM’03, 2003

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media New York

About this chapter

Cite this chapter

Kolaczyk, E.D., Csárdi, G. (2014). Analysis of Network Flow Data. In: Statistical Analysis of Network Data with R. Use R!, vol 65. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-0983-4_9

Download citation

Publish with us

Policies and ethics