Abstract
A network consists of a group of units where sets of connections may be observed. When these connections are recorded in social settings, a network is depicted as a social network. Nonetheless, a network is realized whenever researchers can offer feasible definitions of rules and boundaries for when these connections among units may be noted. This latter property enables networks to not be constrained to social settings, but rather to take place in an almost infinite set of contexts. The overarching goal of this chapter is to showcase some of these contexts and leverage the power of network analysis to unveil and model meaningful structures in (higher) education research. Accordingly, with the goal to strengthen the study of higher education, this chapter depicts the manners in which network principles are merged with geographical, statistical, and qualitative approaches to test innovative, timely, and relevant hypotheses. Examples of these tools are the analysis of dependence among units, wherein connections can be established considering measures of geographical and/or social proximity. The method-bridging properties resulting from network principles are also used to reveal the mathematical and dynamic structure contained in qualitative data. All the analyses can be freely replicated using the R code and data provided herein – some analyses rely on protected data, accordingly those data and its corresponding code are not included in this chapter but the code is available upon request.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
This is also true for time. All the social and spatial analyses can incorporate spatio-temporal analyses, but these analyses are beyond the scope of the chapter due to space limitations. However, readers may see González Canché (2019c) for an example of this methodology applied to a difference and differences framework.
- 2.
In certain cases, like when dealing with the interconnection between social and spatial network analysis as detailed below, the weights of two-mode matrices are ignored, and such weights are treated as presence (cells with values greater that zero) or absence (cells with zero) of relationships before these transformations take place.
- 3.
- 4.
This example is not implying that gender is binary or constant but is simply used as an example of a possible characteristic of actors configuring the network.
- 5.
Visit https://nces.ed.gov/ipeds/datacenter/data/EF2016C.zip to directly access these data.
- 6.
For visualization purposes, when reading this edgelist as a network, analysts should execute an extra line of code g <− simplify(g), where g is the graph retrieved from Eq. (12.6) to remove self-loops or self-selections while keeping all isolated actors in the network.
- 7.
Vectorized figures in color can be accessed in the following link as those figures do not lose resolution and can be zoomed in: https://drive.google.com/file/d/11BBOhUdULZN4Kp-oiXh8gSU1yyV8n_6f/view?usp=sharing
- 8.
Once more, these procedures can be replicated using the code provided in the appendix.
- 9.
Cross-walk refers to a merging procedure where there is no direct link between datasets, but there is one intermediary dataset that contained a common key for this merging process to be conducted.
- 10.
The social network data analyzed herein is provided in the R code appendix available. Although the institution-level data cannot be shared, the code provided in the appendix will enable replication of such analyses when analysts apply these procedures to their own databases.
- 11.
Following Bivand et al. (2013), the zI score for the statistic is computed as: \( {\mathrm{z}}_{\mathrm{I}}=\frac{\mathrm{I}-\mathrm{E}\left[\mathrm{I}\right]}{\sqrt{\mathrm{V}\left[\mathrm{I}\right]}} \), where \( \mathrm{E}\left[\mathrm{I}\right]=\frac{-1}{\left(\mathrm{n}-1\right)},\mathrm{and}\ \mathrm{V}\left[\mathrm{I}\right]=\mathrm{E}\left[{\mathrm{I}}^2\right]-\mathrm{E}{\left[\mathrm{I}\right]}^2 \)
- 12.
See Bivand et al. (2013), for a detailed explanation of the process to deal with spatial dependence of the error terms in SAR.
- 13.
Another example of the use of lagged variables can be seen in González Canché (2014) who captured mean proportion of nonresident students attending neighboring institutions for institution i to test the hypothesis that this lagged indicator should be negatively associated with tuition prices charged to nonresident students at such an institution i. The rationale behind this hypothetical relationship was that, to the extent neighboring institutions attract more nonresident students, a given institution would be losing those students if such an institution makes no changes to the final tuition charges modified to attract nonresident students. A hypothesis that was corroborated. González Canché (2014) also tested for the proportion of neighboring institutions for institution i that were research intensive to test for the hypothesis that these institutions would help institution i increase its tuition prices given a spillover effect (González Canché, 2017d) of being located in close proximity to highly selective colleges who tend to charge higher tuition prices. This hypothesis was also corroborated.
- 14.
These analyses can be replicated with the code provided but the corresponding output table is not included herein due to space limitations.
- 15.
This form of bias becomes a problem when indicators that may be associated with the outcome of interest are not included in the model and therefore become part of the error term (as in the population, not the model residuals realized from our sample). It follows then that if at least one of these omitted indicators is correlated with an observed variable included in our models, then this observed variable will be correlated with the error terms which would bias our estimates.
- 16.
RQDA is a free and powerful version to conduct qualitative analysis. It can be found here http://rqda.r-forge.r-project.org/
- 17.
The qualitative codes were created from the written information contained in a set of interviews and followed a rigorous qualitative coding process. No differentiation of participants’ roles were performed when the codes were generated. After complete transcription of the interviews through open codes (Creswell & Creswell, 2017; S. Lewis, 2015) in NVivo (Charmaz, 2011) and process coding (Bogdan & Biklen, 2007) to highlight practices, interpretation, and activities. The codes identified were compared and refined through two additional rounds of review and deliberation. Redundant codes were collapsed to generate larger categories.
- 18.
Note that human actors are represented in squares and codes in circles. See color figures online as indicated in footnote 7.
- 19.
These exchanges can also be downloaded from this link https://drive.google.com/file/d/1-bXjtlgUJ3qbWTjFG6-dKUtEA522KVRB/view?usp=sharing
- 20.
See color version here https://drive.google.com/file/d/11BBOhUdULZN4Kp-oiXh8gSU1yyV8n_6f/view?usp=sharing
- 21.
Distance considerations are particularly important for community college students who tend to enroll in college much closer than their public four-year counterparts, despite of college availability. For example, in the ELS sample, students have a median and mean home to community college distance of 7.68 and 11.37 miles (s.d. = 12.28 miles) regardless of whether they attended college. Notably, these distances are similar when compared to home to a public four-year college (median = 9.92, mean = 14.85 [s.d. = 15.13] miles). However, compared to their four-year counterparts, community college students attended college within 9.92 (median) and 49 miles (mean) from their high school home (s.d. = 191.65 miles), whereas public four-year students enrolled within a median and mean distances of 62.572 and 130 miles (s.d. = 241.36 miles) from their parents’ home.
References
Bassett, D. S., & Sporns, O. (2017). Network neuroscience. Nature Neuroscience, 20(3), 353.
Bell, M. G., & Iida, Y. (1997). Transportation network analysis. New York: Wiley Online Library.
Bivand, R. S., Pebesma, E. J., Gomez-Rubio, V., & Pebesma, E. J. (2013). Applied spatial data analysis with r (Vol. 747248717). New York: Springer.
Bogdan, R., & Biklen, S. (2007). Qualitative research for education: An introduction to theory and practice. Needham Heights, MA: Allyn and Bacon.
Borgatti, S. P. (2005). Centrality and network flow. Social Networks, 27(1), 55–71.
Borgatti, S. P. (2006). Identifying sets of key players in a social network. Computational & Mathematical Organization Theory, 12(1), 21–34.
Borgatti, S. P., Mehra, A., Brass, D. J., & Labianca, G. (2009). Network analysis in the social sciences. Science, 323(5916), 892–895.
Borgatti, S. P., & Molina, J.-L. (2005). Toward ethical guidelines for network research in organizations. Social Networks, 27(2), 107–117.
Breiger, R. L. (1974). The duality of persons and groups. Social Forces, 53(2), 181–190.
Charmaz, K. (2011). Grounded theory methods in social justice research. The Sage Handbook of Qualitative Research, 4(1), 359–380.
Cliff, A., & Ord, K. (1969). The problem of spatial autocorrelation. In A. J. Scott (Ed.), London papers in regional science (pp. 25–55). London: Pion.
Cliff, A., & Ord, K. (1972). Testing for spatial autocorrelation among regression residuals. Geographical Analysis, 4(3), 267–284.
Cressie, N. A. (2015). Statistics for spatial data. New York: Wiley Online Library.
Creswell, J. W., & Creswell, J. D. (2017). Research design: Qualitative, quantitative, and mixed methods approaches. Los Angeles: Sage Publications.
Csárdi, G., & Nepusz, T. (2006). The igraph software package for complex network research. InterJournal, Complex Systems, 1695(5), 1–9.
Curtis, G. E., & Karacan, T. (2002, December). The nexus among terrorists, narcotics traffickers, weapons proliferators, and organized crime networks in western Europe. Washington, DC: The Library of Congress.
Elijah, A. (1990). Streetwise: Race, class, and change in an urban community. Chicago: University of Chicago.
ESRI. (1998). Shapefile technical description, Jul. 1998. Environmental Systems Research Institute [ESRI], Inc. Retrieved from https://www.esri.com/library/whitepapers/pdfs/shapefile.pdf
Fortunato, S. (2010). Community detection in graphs. Physics Reports, 486(3–5), 75–174.
Freeman, L. C. (1978). Centrality in social networks conceptual clarification. Social Networks, 1(3), 215–239.
González Canché, M. S. (2014). Localized competition in the non-resident student market. Economics of Education Review, 43, 21–35.
González Canché, M. S. (2017a). Community college scientists and salary gap: Navigating socioeconomic and academic stratification in the U.S. higher education system. The Journal of Higher Education, 88(1), 1–32. Retrieved from https://doi.org/10.1080/00221546.2016.1243933, https://doi.org/10.1080/00221546.2016.1243933
González Canché, M. S. (2017b). Financial benefits of rapid student loan repayment: An analytic framework employing two decades of data. The Annals of the American Academy of Political and Social Science, 671(1), 154–182.
González Canché, M. S. (2017c). The heterogeneous non-resident student body: Measuring the effect of out-of-state students’ home-state wealth on tuition and fee price variations. Research in Higher Education, 58(2), 141–183.
González Canché, M. S. (2017d). Measuring universities’ spillover effects on community college students’ educational outcomes. In Association for the study of higher education, 42nd annual conference. Houston, TX.
González Canché, M. S. (2018a). Geographical network analysis and spatial econometrics as tools to enhance our understanding of student migration patterns and benefits in the U.S. higher education network. The Review of Higher Education, 41(2), 169–216.
González Canché, M. S. (2018b). Nearby college enrollment and geographical skills mismatch: (re)conceptualizing student out-migration in the American higher education system. The Journal of Higher Education, 1–43.
González Canché, M. S. (2018c). Reassessing the two-year sector’s role in the amelioration of a persistent socioeconomic gap: A proposed analytical framework for the study of community college effects in the big and geocoded data and quasi-experimental era. In M. B. Paulsen (Ed.), Higher education: Handbook of theory and research: Published under the sponsorship of the association for institutional research (air) and the association for the study of higher education (ASHE) (pp. 175–238). Cham, Switzerland: Springer International Publishing. Retrieved from https://doi.org/10.1007/978-3-319-72490-4_5; https://doi.org/10.1007/978-3-319-72490-4_5
González Canché, M. S. (2018d). Geographical Bias in testing: Is cultural Bias a problem of the past or are we simply not looking in the right space? Diverse Issues in Higher Education. Available from https://diverseeducation.com/article/126527/
González Canché, M. S. (2019a). Challenges and opportunities in the use of big and geocoded data in higher education research and policy. In G. M. & A. Castro-Samoya (Eds.), Contemporary issues in higher education (Ist edn., Chapter 3). New York: Routledge. Retreived from https://www.routledge.com/Contemporary-Issues-in-Higher-Education-1st-Edition/Gasman-Samayoa/p/book/9781138344617.
González Canché, M. S. (2019b). Spatial econometrics and network analysis as means to assess the assumption of independence in higher education research: Interrogating social dependence using spatial econometrics. New Directions for Institutional Research, 179, 11–29.
González Canché, M. S. (2019c). The statistical power of “zooming” in: Applying geographically-based difference in differences using Spatio-temporal Analysis to the study of college aid and access. New Directions for Institutional Research, 179, 71–89.
González Canché, M. S., D’Amico, M. M., Rios-Aguilar, C., & Salas, S. (2014). It’s who you know: Leveraging social networks for college and careers. The Community College Enterprise, 20(1), 17.
González Canché, M. S., & Rios-Aguilar, C. (2015). Critical social network analysis in community colleges: Peer effects and credit attainment. New Directions for Institutional Research, 2014(163), 75–91.
Griffith, D. A. (1993). Advanced spatial statistics for analyzing and visualizing geo-referenced data. International Journal of Geographical Information Science, 7(2), 107–123.
Hunter, D. R., Handcock, M. S., Butts, C. T., Goodreau, S. M., & Morris, M. (2008). ergm: A package to fit, simulate and diagnose exponential-family models for networks. Journal of statistical software, 24(3), nihpa54860.
Jargowsky, P. A., & Tursi, N. O. (2015). Concentrated disadvantage. In J. D. Wright (Ed.), International encyclopedia of the social and behavioral sciences (2nd Ed., pp. 525–530). Oxford, UK: Elsevier. Retrieved from https://www.sciencedirect.com/science/article/pii/B9780080970868321924. https://doi.org/10.1016/B978-0-08-097086-8.32192-4
Kadushin, C. (2005). Who benefits from network analysis: ethics of social network research. Social Networks, 27(2), 139–153.
Kolaczyk, E. D., & Csárdi, G. (2014). Statistical analysis of network data with R (Vol. 65). New York: Springer.
Lazega, E. (2001). The collegial phenomenon: The social mechanisms of cooperation among peers in a corporate law partnership. Oxford, UK: Oxford University Press on Demand.
Lewis, D. (1973). Causation. The Journal of Philosophy, 70, 556–567.
Lewis, S. (2015). Qualitative inquiry and research design: Choosing among five approaches. Health Promotion Practice, 16(4), 473–475.
Liu, B. (2011). Social network analysis. In Web data mining (pp. 269–309). Springer.
Mall, R., Cerulo, L., Bensmail, H., Iavarone, A., & Ceccarelli, M. (2017). Detection of statistically significant network changes in complex biological networks. BMC Systems Biology, 11(1), 32.
Mastrobuoni, G., & Patacchini, E. (2012). Organized crime networks: An application of network analysis techniques to the American mafia. Review of Network Economics, 11(3).
McMillen, D., Singell Jr., L., & Waddell, G. (2007). Spatial competition and the price of college. Economic Inquiry, 45(4), 817–833.
Miller, J. H., & Page, S. E. (2007). Complex adaptive systems. An introduction to computational models of social life. Princeton, NJ: Princeton University Press.
Mills, C. W. (2000). The sociological imagination. New York: Oxford University Press.
Moreno, J. L. (1934). Who shall survive? A new approach to the problem of human interrelations. Philadelphia, PA: Nervous and Mental Disease Publishing Co.
Pacione, M. (1997). The geography of educational disadvantage in Glasgow. Applied Geography, 17(3), 169–192.
Page, L., Brin, S., Motwani, R., & Winograd, T. (1999). The PageRank citation ranking: Bringing order to the web.
Pastor, M. J. (2001). Geography and opportunity. In N. J. Smelser, W. J. Wilson, & M. Faith (Eds.), America becoming: Racial trends and their consequences (Vol. 1, pp. 435–468). Washington, D.C: National Academies Press.
Pavlopoulos, G. A., Secrier, M., Moschopoulos, C. N., Soldatos, T. G., Kossida, S., Aerts, J., et al. (2011). Using graph theory to analyze biological networks. BioData mining, 4(1), 10.
Phipson, B., & Smyth, G. K. (2010). Permutation p-values should never be zero: Calculating exact p-values when permutations are randomly drawn. Statistical Applications in Genetics and Molecular Biology, 9, 1.
Rapino, M. A., & Fields, A. K.. 2013. Mega Commuters in the U.S.: Time and Distance in Defining the Long Commute Using the American Community Survey. Working Paper 2013–03, United States Census Bureau, Atlanta, GA.
Rubin, D. B. (2005). Causal inference using potential outcomes. Journal of the American Statistical Association, (469), 100.
Schabenberger, O., & Gotway, C. A. (2017). Statistical methods for spatial data analysis. CRC press.
Tate IV, W. F. (2008). Geography of opportunity: Poverty, place, and educational outcomes. Educational Researcher, 37(7), 397–411.
Turley, R. N. L. (2009). College proximity: Mapping access to opportunity. Sociology of Education, 82(2), 126–146.
USDAERS. (1993). Codes, rural-urban continuum. US Department of Agriculture. Retrieved from https://www.ers.usda.gov/webdocs/DataFiles/53241/ruca1990.xls?v=0.
USDAERS. (2003). Codes, rural-urban continuum. US Department of Agriculture. Retrieved from https://www.ers.usda.gov/webdocs/DataFiles/53241/ruca00.xls?v=0.
USDAERS. (2013). Codes, rural-urban continuum. US Department of Agriculture. Retrieved from https://www.ers.usda.gov/webdocs/DataFiles/53241/ruca2010.xlsx?v=0.
Wasserman, S., & Faust, K. (1994). Social network analysis: Methods and applications (Vol. 8). Cambridge university press.
Webber, K. L., & González Canché, M. S. (2015). Not equal for all: Gender and race differences in salary for doctoral degree recipients. Research in Higher Education, 56(7), 645–672.
Whitbred, R. (2011). Quadratic assignment procedure (qap). In G. A. Barnett (Ed.), Encyclopedia of social networks (Vol. 1, pp. 733–734). Thousand Oaks, CA: SAGE Publications, Inc.
Zachary, W. W. (1977). An information flow model for conflict and fission in small groups. Journal of Anthropological Research, 452–473.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendices
Appendices
12.1.1 Appendix A: Code for Simulated Example
12.1.2 Appendix B: Code Geographical Network Analysis
12.1.3 Appendix C: Social Dependence Procedures
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
González Canché, M.S. (2019). Geographical, Statistical, and Qualitative Network Analysis: A Multifaceted Method-Bridging Tool to Reveal and Model Meaningful Structures in Education Research. In: Paulsen, M.B., Perna, L.W. (eds) Higher Education: Handbook of Theory and Research. Higher Education: Handbook of Theory and Research, vol 34. Springer, Cham. https://doi.org/10.1007/978-3-030-03457-3_12
Download citation
DOI: https://doi.org/10.1007/978-3-030-03457-3_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03456-6
Online ISBN: 978-3-030-03457-3
eBook Packages: EducationEducation (R0)