Analysis of Network Flow Data

Kolaczyk, Eric D.; Csárdi, Gábor

doi:10.1007/978-1-4939-0983-4_9

Analysis of Network Flow Data

Eric D. Kolaczyk⁶ &
Gábor Csárdi⁷

Chapter
First Online: 01 January 2014

18k Accesses
1 Citations

Part of the book series: Use R! ((USE R,volume 65))

Abstract

Many networks serve as conduits—either literally or figuratively—for flows , in the sense that they facilitate the movement of something, such as materials, people, or information. For example, transportation networks (e.g., of highways, railways, and airlines) support flows of commodities and people, communication networks allow for the flow of data, and networks of trade relations among nations reflect the flow of capital.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

1.
Data were originally collected by Manfred Fischer and Petra Staufer.
2.
More precisely, we can write (9.7) in the form \( \log (\mu ) = \mathbf{M}\gamma \), where M is an (IJ) × (I + J + K) matrix, and \( \gamma = {(\alpha _{1},\ldots,\alpha _{I},\beta _{1},\ldots,\beta _{J},\theta _{1},\ldots,\theta _{K})}^{T} \) is an (I + J + K) × 1 vector. The first I + J columns of M are binary vectors, indicating the appropriate origin and destination for each entry of μ, and are redundant in that both the first I and the next J sum to the unit vector. The last K columns correspond to the K variables defining the c _ij. Assuming that the latter are linearly independent of themselves and of the former, the rank of M will be (I + J − 1) + K. See Sen and Smith [130, Chap. 5.2].
3.
The AIC statistic for a likelihood-based model, with k-dimensional parameter η, is defined as \( AIC = -2\ell(\hat{\eta }) + 2k \), where \( \ell(\eta ) \) is the log-likelihood evaluated at η, and \( \hat{\eta } \) is the maximum likelihood estimate of η. This statistic, as with others of its type, provides an estimate of the generalization error associated with the fitted model, in this case effectively by off-setting the assessment of how well the model fits the data by a measure of its complexity. See, for example, Hastie, Tibshirani, and Friedman [71, Chap. 7.5] for additional details.
4.
If multiple routes are possible, the entries of B are instead fractions representing, for example, the proportion of traffic from i to j that is expected to use the link e.

References

J. Cao, D. Davis, S. Wiel, B. Yu, Time-varying network tomography: router link data. J. Am. Stat. Assoc. 95(452), 1063–1075 (2000)
Article MATH Google Scholar
H. Carey, Principles of Social Science (Lippincott, Philadelphia, 1858)
Google Scholar
R. Castro, M. Coates, G. Liang, R. Nowak, B. Yu, Network tomography: recent developments. Stat. Sci. 19(3), 499–517 (2004)
Article MATH MathSciNet Google Scholar
W. Deming, F. Stephan, On a least squares adjustment of a sampled frequency table when the expected marginal totals are known. Ann. Math. Stat. 11(4), 427–444 (1940)
Article MathSciNet Google Scholar
M. Fischer, S. Gopal, Artificial neural networks: a new approach to modeling interregional telecommunication flows. J. Reg. Sci. 34(4), 503–527 (1994)
Article Google Scholar
T. Hastie, R. Tibshirani, J. Friedman, The Elements of Statistical Learning (Springer, New York, 2001)
Book MATH Google Scholar
P. McCullagh, J. Nelder, Generalized Linear Models (Chapman & Hall/CRC, London, 1989)
Book MATH Google Scholar
A. Sen, T. Smith, Gravity Models of Spatial Interaction Behavior (Springer, Berlin, 1995)
Book Google Scholar
J. Stewart, An inverse distance variation for certain social influences. Science 93(2404), 89–90 (1941)
Article Google Scholar
Y. Vardi, Network tomography: estimating source-destination traffic intensities from link data. J. Am. Stat. Assoc. 91(433), 365–377 (1996)
Article MATH MathSciNet Google Scholar
Y. Zhang, M. Roughan, C. Lund, D. Donoho, An information-theoretic approach to traffic matrix estimation. In Proceedings of SIGCOMM’03, 2003
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics and Statistics, Boston University Professor, Boston, MA, USA
Eric D. Kolaczyk
Department of Statistics, Harvard University Research Associate, Cambridge, MA, USA
Gábor Csárdi

Authors

Eric D. Kolaczyk
View author publications
You can also search for this author in PubMed Google Scholar
Gábor Csárdi
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Kolaczyk, E.D., Csárdi, G. (2014). Analysis of Network Flow Data. In: Statistical Analysis of Network Data with R. Use R!, vol 65. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-0983-4_9

Download citation

DOI: https://doi.org/10.1007/978-1-4939-0983-4_9
Published: 17 April 2014
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-0982-7
Online ISBN: 978-1-4939-0983-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics