Skip to main content

Geographical, Statistical, and Qualitative Network Analysis: A Multifaceted Method-Bridging Tool to Reveal and Model Meaningful Structures in Education Research

  • Chapter
  • First Online:
Higher Education: Handbook of Theory and Research

Part of the book series: Higher Education: Handbook of Theory and Research ((HATR,volume 34))

Abstract

A network consists of a group of units where sets of connections may be observed. When these connections are recorded in social settings, a network is depicted as a social network. Nonetheless, a network is realized whenever researchers can offer feasible definitions of rules and boundaries for when these connections among units may be noted. This latter property enables networks to not be constrained to social settings, but rather to take place in an almost infinite set of contexts. The overarching goal of this chapter is to showcase some of these contexts and leverage the power of network analysis to unveil and model meaningful structures in (higher) education research. Accordingly, with the goal to strengthen the study of higher education, this chapter depicts the manners in which network principles are merged with geographical, statistical, and qualitative approaches to test innovative, timely, and relevant hypotheses. Examples of these tools are the analysis of dependence among units, wherein connections can be established considering measures of geographical and/or social proximity. The method-bridging properties resulting from network principles are also used to reveal the mathematical and dynamic structure contained in qualitative data. All the analyses can be freely replicated using the R code and data provided herein – some analyses rely on protected data, accordingly those data and its corresponding code are not included in this chapter but the code is available upon request.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    This is also true for time. All the social and spatial analyses can incorporate spatio-temporal analyses, but these analyses are beyond the scope of the chapter due to space limitations. However, readers may see González Canché (2019c) for an example of this methodology applied to a difference and differences framework.

  2. 2.

    In certain cases, like when dealing with the interconnection between social and spatial network analysis as detailed below, the weights of two-mode matrices are ignored, and such weights are treated as presence (cells with values greater that zero) or absence (cells with zero) of relationships before these transformations take place.

  3. 3.

    Located here: https://drive.google.com/file/d/1zLtnmuCXeaXeJa5QRC1BlPy8V3NdmRuX/view?usp=sharing

  4. 4.

    This example is not implying that gender is binary or constant but is simply used as an example of a possible characteristic of actors configuring the network.

  5. 5.

    Visit https://nces.ed.gov/ipeds/datacenter/data/EF2016C.zip to directly access these data.

  6. 6.

    For visualization purposes, when reading this edgelist as a network, analysts should execute an extra line of code g <− simplify(g), where g is the graph retrieved from Eq. (12.6) to remove self-loops or self-selections while keeping all isolated actors in the network.

  7. 7.

    Vectorized figures in color can be accessed in the following link as those figures do not lose resolution and can be zoomed in: https://drive.google.com/file/d/11BBOhUdULZN4Kp-oiXh8gSU1yyV8n_6f/view?usp=sharing

  8. 8.

    Once more, these procedures can be replicated using the code provided in the appendix.

  9. 9.

    Cross-walk refers to a merging procedure where there is no direct link between datasets, but there is one intermediary dataset that contained a common key for this merging process to be conducted.

  10. 10.

    The social network data analyzed herein is provided in the R code appendix available. Although the institution-level data cannot be shared, the code provided in the appendix will enable replication of such analyses when analysts apply these procedures to their own databases.

  11. 11.

    Following Bivand et al. (2013), the zI score for the statistic is computed as: \( {\mathrm{z}}_{\mathrm{I}}=\frac{\mathrm{I}-\mathrm{E}\left[\mathrm{I}\right]}{\sqrt{\mathrm{V}\left[\mathrm{I}\right]}} \), where \( \mathrm{E}\left[\mathrm{I}\right]=\frac{-1}{\left(\mathrm{n}-1\right)},\mathrm{and}\ \mathrm{V}\left[\mathrm{I}\right]=\mathrm{E}\left[{\mathrm{I}}^2\right]-\mathrm{E}{\left[\mathrm{I}\right]}^2 \)

  12. 12.

    See Bivand et al. (2013), for a detailed explanation of the process to deal with spatial dependence of the error terms in SAR.

  13. 13.

    Another example of the use of lagged variables can be seen in González Canché (2014) who captured mean proportion of nonresident students attending neighboring institutions for institution i to test the hypothesis that this lagged indicator should be negatively associated with tuition prices charged to nonresident students at such an institution i. The rationale behind this hypothetical relationship was that, to the extent neighboring institutions attract more nonresident students, a given institution would be losing those students if such an institution makes no changes to the final tuition charges modified to attract nonresident students. A hypothesis that was corroborated. González Canché (2014) also tested for the proportion of neighboring institutions for institution i that were research intensive to test for the hypothesis that these institutions would help institution i increase its tuition prices given a spillover effect (González Canché, 2017d) of being located in close proximity to highly selective colleges who tend to charge higher tuition prices. This hypothesis was also corroborated.

  14. 14.

    These analyses can be replicated with the code provided but the corresponding output table is not included herein due to space limitations.

  15. 15.

    This form of bias becomes a problem when indicators that may be associated with the outcome of interest are not included in the model and therefore become part of the error term (as in the population, not the model residuals realized from our sample). It follows then that if at least one of these omitted indicators is correlated with an observed variable included in our models, then this observed variable will be correlated with the error terms which would bias our estimates.

  16. 16.

    RQDA is a free and powerful version to conduct qualitative analysis. It can be found here http://rqda.r-forge.r-project.org/

  17. 17.

    The qualitative codes were created from the written information contained in a set of interviews and followed a rigorous qualitative coding process. No differentiation of participants’ roles were performed when the codes were generated. After complete transcription of the interviews through open codes (Creswell & Creswell, 2017; S. Lewis, 2015) in NVivo (Charmaz, 2011) and process coding (Bogdan & Biklen, 2007) to highlight practices, interpretation, and activities. The codes identified were compared and refined through two additional rounds of review and deliberation. Redundant codes were collapsed to generate larger categories.

  18. 18.

    Note that human actors are represented in squares and codes in circles. See color figures online as indicated in footnote 7.

  19. 19.

    These exchanges can also be downloaded from this link https://drive.google.com/file/d/1-bXjtlgUJ3qbWTjFG6-dKUtEA522KVRB/view?usp=sharing

  20. 20.

    See color version here https://drive.google.com/file/d/11BBOhUdULZN4Kp-oiXh8gSU1yyV8n_6f/view?usp=sharing

  21. 21.

    Distance considerations are particularly important for community college students who tend to enroll in college much closer than their public four-year counterparts, despite of college availability. For example, in the ELS sample, students have a median and mean home to community college distance of 7.68 and 11.37 miles (s.d. = 12.28 miles) regardless of whether they attended college. Notably, these distances are similar when compared to home to a public four-year college (median = 9.92, mean = 14.85 [s.d. = 15.13] miles). However, compared to their four-year counterparts, community college students attended college within 9.92 (median) and 49 miles (mean) from their high school home (s.d. = 191.65 miles), whereas public four-year students enrolled within a median and mean distances of 62.572 and 130 miles (s.d. = 241.36 miles) from their parents’ home.

References

  • Bassett, D. S., & Sporns, O. (2017). Network neuroscience. Nature Neuroscience, 20(3), 353.

    Article  Google Scholar 

  • Bell, M. G., & Iida, Y. (1997). Transportation network analysis. New York: Wiley Online Library.

    Book  Google Scholar 

  • Bivand, R. S., Pebesma, E. J., Gomez-Rubio, V., & Pebesma, E. J. (2013). Applied spatial data analysis with r (Vol. 747248717). New York: Springer.

    Book  Google Scholar 

  • Bogdan, R., & Biklen, S. (2007). Qualitative research for education: An introduction to theory and practice. Needham Heights, MA: Allyn and Bacon.

    Google Scholar 

  • Borgatti, S. P. (2005). Centrality and network flow. Social Networks, 27(1), 55–71.

    Article  Google Scholar 

  • Borgatti, S. P. (2006). Identifying sets of key players in a social network. Computational & Mathematical Organization Theory, 12(1), 21–34.

    Article  Google Scholar 

  • Borgatti, S. P., Mehra, A., Brass, D. J., & Labianca, G. (2009). Network analysis in the social sciences. Science, 323(5916), 892–895.

    Article  Google Scholar 

  • Borgatti, S. P., & Molina, J.-L. (2005). Toward ethical guidelines for network research in organizations. Social Networks, 27(2), 107–117.

    Article  Google Scholar 

  • Breiger, R. L. (1974). The duality of persons and groups. Social Forces, 53(2), 181–190.

    Article  Google Scholar 

  • Charmaz, K. (2011). Grounded theory methods in social justice research. The Sage Handbook of Qualitative Research, 4(1), 359–380.

    Google Scholar 

  • Cliff, A., & Ord, K. (1969). The problem of spatial autocorrelation. In A. J. Scott (Ed.), London papers in regional science (pp. 25–55). London: Pion.

    Google Scholar 

  • Cliff, A., & Ord, K. (1972). Testing for spatial autocorrelation among regression residuals. Geographical Analysis, 4(3), 267–284.

    Article  Google Scholar 

  • Cressie, N. A. (2015). Statistics for spatial data. New York: Wiley Online Library.

    Google Scholar 

  • Creswell, J. W., & Creswell, J. D. (2017). Research design: Qualitative, quantitative, and mixed methods approaches. Los Angeles: Sage Publications.

    Google Scholar 

  • Csárdi, G., & Nepusz, T. (2006). The igraph software package for complex network research. InterJournal, Complex Systems, 1695(5), 1–9.

    Google Scholar 

  • Curtis, G. E., & Karacan, T. (2002, December). The nexus among terrorists, narcotics traffickers, weapons proliferators, and organized crime networks in western Europe. Washington, DC: The Library of Congress.

    Google Scholar 

  • Elijah, A. (1990). Streetwise: Race, class, and change in an urban community. Chicago: University of Chicago.

    Google Scholar 

  • ESRI. (1998). Shapefile technical description, Jul. 1998. Environmental Systems Research Institute [ESRI], Inc. Retrieved from https://www.esri.com/library/whitepapers/pdfs/shapefile.pdf

  • Fortunato, S. (2010). Community detection in graphs. Physics Reports, 486(3–5), 75–174.

    Article  Google Scholar 

  • Freeman, L. C. (1978). Centrality in social networks conceptual clarification. Social Networks, 1(3), 215–239.

    Article  Google Scholar 

  • González Canché, M. S. (2014). Localized competition in the non-resident student market. Economics of Education Review, 43, 21–35.

    Article  Google Scholar 

  • González Canché, M. S. (2017a). Community college scientists and salary gap: Navigating socioeconomic and academic stratification in the U.S. higher education system. The Journal of Higher Education, 88(1), 1–32. Retrieved from https://doi.org/10.1080/00221546.2016.1243933, https://doi.org/10.1080/00221546.2016.1243933

  • González Canché, M. S. (2017b). Financial benefits of rapid student loan repayment: An analytic framework employing two decades of data. The Annals of the American Academy of Political and Social Science, 671(1), 154–182.

    Article  Google Scholar 

  • González Canché, M. S. (2017c). The heterogeneous non-resident student body: Measuring the effect of out-of-state students’ home-state wealth on tuition and fee price variations. Research in Higher Education, 58(2), 141–183.

    Article  Google Scholar 

  • González Canché, M. S. (2017d). Measuring universities’ spillover effects on community college students’ educational outcomes. In Association for the study of higher education, 42nd annual conference. Houston, TX.

    Google Scholar 

  • González Canché, M. S. (2018a). Geographical network analysis and spatial econometrics as tools to enhance our understanding of student migration patterns and benefits in the U.S. higher education network. The Review of Higher Education, 41(2), 169–216.

    Article  Google Scholar 

  • González Canché, M. S. (2018b). Nearby college enrollment and geographical skills mismatch: (re)conceptualizing student out-migration in the American higher education system. The Journal of Higher Education, 1–43.

    Google Scholar 

  • González Canché, M. S. (2018c). Reassessing the two-year sector’s role in the amelioration of a persistent socioeconomic gap: A proposed analytical framework for the study of community college effects in the big and geocoded data and quasi-experimental era. In M. B. Paulsen (Ed.), Higher education: Handbook of theory and research: Published under the sponsorship of the association for institutional research (air) and the association for the study of higher education (ASHE) (pp. 175–238). Cham, Switzerland: Springer International Publishing. Retrieved from https://doi.org/10.1007/978-3-319-72490-4_5; https://doi.org/10.1007/978-3-319-72490-4_5

  • González Canché, M. S. (2018d). Geographical Bias in testing: Is cultural Bias a problem of the past or are we simply not looking in the right space? Diverse Issues in Higher Education. Available from https://diverseeducation.com/article/126527/

  • González Canché, M. S. (2019a). Challenges and opportunities in the use of big and geocoded data in higher education research and policy. In G. M. & A. Castro-Samoya (Eds.), Contemporary issues in higher education (Ist edn., Chapter 3). New York: Routledge. Retreived from https://www.routledge.com/Contemporary-Issues-in-Higher-Education-1st-Edition/Gasman-Samayoa/p/book/9781138344617.

  • González Canché, M. S. (2019b). Spatial econometrics and network analysis as means to assess the assumption of independence in higher education research: Interrogating social dependence using spatial econometrics. New Directions for Institutional Research, 179, 11–29.

    Google Scholar 

  • González Canché, M. S. (2019c). The statistical power of “zooming” in: Applying geographically-based difference in differences using Spatio-temporal Analysis to the study of college aid and access. New Directions for Institutional Research, 179, 71–89.

    Google Scholar 

  • González Canché, M. S., D’Amico, M. M., Rios-Aguilar, C., & Salas, S. (2014). It’s who you know: Leveraging social networks for college and careers. The Community College Enterprise, 20(1), 17.

    Google Scholar 

  • González Canché, M. S., & Rios-Aguilar, C. (2015). Critical social network analysis in community colleges: Peer effects and credit attainment. New Directions for Institutional Research, 2014(163), 75–91.

    Article  Google Scholar 

  • Griffith, D. A. (1993). Advanced spatial statistics for analyzing and visualizing geo-referenced data. International Journal of Geographical Information Science, 7(2), 107–123.

    Google Scholar 

  • Hunter, D. R., Handcock, M. S., Butts, C. T., Goodreau, S. M., & Morris, M. (2008). ergm: A package to fit, simulate and diagnose exponential-family models for networks. Journal of statistical software, 24(3), nihpa54860.

    Google Scholar 

  • Jargowsky, P. A., & Tursi, N. O. (2015). Concentrated disadvantage. In J. D. Wright (Ed.), International encyclopedia of the social and behavioral sciences (2nd Ed., pp. 525–530). Oxford, UK: Elsevier. Retrieved from https://www.sciencedirect.com/science/article/pii/B9780080970868321924. https://doi.org/10.1016/B978-0-08-097086-8.32192-4

  • Kadushin, C. (2005). Who benefits from network analysis: ethics of social network research. Social Networks, 27(2), 139–153.

    Article  Google Scholar 

  • Kolaczyk, E. D., & Csárdi, G. (2014). Statistical analysis of network data with R (Vol. 65). New York: Springer.

    Google Scholar 

  • Lazega, E. (2001). The collegial phenomenon: The social mechanisms of cooperation among peers in a corporate law partnership. Oxford, UK: Oxford University Press on Demand.

    Book  Google Scholar 

  • Lewis, D. (1973). Causation. The Journal of Philosophy, 70, 556–567.

    Article  Google Scholar 

  • Lewis, S. (2015). Qualitative inquiry and research design: Choosing among five approaches. Health Promotion Practice, 16(4), 473–475.

    Article  Google Scholar 

  • Liu, B. (2011). Social network analysis. In Web data mining (pp. 269–309). Springer.

    Google Scholar 

  • Mall, R., Cerulo, L., Bensmail, H., Iavarone, A., & Ceccarelli, M. (2017). Detection of statistically significant network changes in complex biological networks. BMC Systems Biology, 11(1), 32.

    Article  Google Scholar 

  • Mastrobuoni, G., & Patacchini, E. (2012). Organized crime networks: An application of network analysis techniques to the American mafia. Review of Network Economics, 11(3).

    Google Scholar 

  • McMillen, D., Singell Jr., L., & Waddell, G. (2007). Spatial competition and the price of college. Economic Inquiry, 45(4), 817–833.

    Article  Google Scholar 

  • Miller, J. H., & Page, S. E. (2007). Complex adaptive systems. An introduction to computational models of social life. Princeton, NJ: Princeton University Press.

    Google Scholar 

  • Mills, C. W. (2000). The sociological imagination. New York: Oxford University Press.

    Google Scholar 

  • Moreno, J. L. (1934). Who shall survive? A new approach to the problem of human interrelations. Philadelphia, PA: Nervous and Mental Disease Publishing Co.

    Book  Google Scholar 

  • Pacione, M. (1997). The geography of educational disadvantage in Glasgow. Applied Geography, 17(3), 169–192.

    Article  Google Scholar 

  • Page, L., Brin, S., Motwani, R., & Winograd, T. (1999). The PageRank citation ranking: Bringing order to the web.

    Google Scholar 

  • Pastor, M. J. (2001). Geography and opportunity. In N. J. Smelser, W. J. Wilson, & M. Faith (Eds.), America becoming: Racial trends and their consequences (Vol. 1, pp. 435–468). Washington, D.C: National Academies Press.

    Google Scholar 

  • Pavlopoulos, G. A., Secrier, M., Moschopoulos, C. N., Soldatos, T. G., Kossida, S., Aerts, J., et al. (2011). Using graph theory to analyze biological networks. BioData mining, 4(1), 10.

    Google Scholar 

  • Phipson, B., & Smyth, G. K. (2010). Permutation p-values should never be zero: Calculating exact p-values when permutations are randomly drawn. Statistical Applications in Genetics and Molecular Biology, 9, 1.

    Google Scholar 

  • Rapino, M. A., & Fields, A. K.. 2013. Mega Commuters in the U.S.: Time and Distance in Defining the Long Commute Using the American Community Survey. Working Paper 2013–03, United States Census Bureau, Atlanta, GA.

    Google Scholar 

  • Rubin, D. B. (2005). Causal inference using potential outcomes. Journal of the American Statistical Association, (469), 100.

    Google Scholar 

  • Schabenberger, O., & Gotway, C. A. (2017). Statistical methods for spatial data analysis. CRC press.

    Google Scholar 

  • Tate IV, W. F. (2008). Geography of opportunity: Poverty, place, and educational outcomes. Educational Researcher, 37(7), 397–411.

    Google Scholar 

  • Turley, R. N. L. (2009). College proximity: Mapping access to opportunity. Sociology of Education, 82(2), 126–146.

    Google Scholar 

  • USDAERS. (1993). Codes, rural-urban continuum. US Department of Agriculture. Retrieved from https://www.ers.usda.gov/webdocs/DataFiles/53241/ruca1990.xls?v=0.

  • USDAERS. (2003). Codes, rural-urban continuum. US Department of Agriculture. Retrieved from https://www.ers.usda.gov/webdocs/DataFiles/53241/ruca00.xls?v=0.

  • USDAERS. (2013). Codes, rural-urban continuum. US Department of Agriculture. Retrieved from https://www.ers.usda.gov/webdocs/DataFiles/53241/ruca2010.xlsx?v=0.

  • Wasserman, S., & Faust, K. (1994). Social network analysis: Methods and applications (Vol. 8). Cambridge university press.

    Google Scholar 

  • Webber, K. L., & González Canché, M. S. (2015). Not equal for all: Gender and race differences in salary for doctoral degree recipients. Research in Higher Education, 56(7), 645–672.

    Google Scholar 

  • Whitbred, R. (2011). Quadratic assignment procedure (qap). In G. A. Barnett (Ed.), Encyclopedia of social networks (Vol. 1, pp. 733–734). Thousand Oaks, CA: SAGE Publications, Inc.

    Google Scholar 

  • Zachary, W. W. (1977). An information flow model for conflict and fission in small groups. Journal of Anthropological Research, 452–473.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Manuel S. González Canché .

Editor information

Editors and Affiliations

Appendices

Appendices

12.1.1 Appendix A: Code for Simulated Example

A programme code to create simulated examples.
A programme code to create simulated examples.
A programme code to create simulated examples.
A programme code to create simulated examples.

12.1.2 Appendix B: Code Geographical Network Analysis

A programme code to create simulated examples.
A programme code to create simulated examples.
A programme code to create simulated examples.

12.1.3 Appendix C: Social Dependence Procedures

A programme code to create simulated examples.
A programme code to create simulated examples.
A programme code to create simulated examples.

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

González Canché, M.S. (2019). Geographical, Statistical, and Qualitative Network Analysis: A Multifaceted Method-Bridging Tool to Reveal and Model Meaningful Structures in Education Research. In: Paulsen, M.B., Perna, L.W. (eds) Higher Education: Handbook of Theory and Research. Higher Education: Handbook of Theory and Research, vol 34. Springer, Cham. https://doi.org/10.1007/978-3-030-03457-3_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-03457-3_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-03456-6

  • Online ISBN: 978-3-030-03457-3

  • eBook Packages: EducationEducation (R0)

Publish with us

Policies and ethics