Individuals belong to geographically based social networks that include an individual’s ties (family, friends, professional contacts, etc.) in nearby and distant places (Acedo et al., 2017). A set of mapped ties comprises a spatial distribution, i.e., a unique “fingerprint” of geolocated social contacts (e.g., two friends in Rome, six family members in New York, a co-worker in Milan, etc.). This distribution is a natural part of social life, as humans have been “traveling, wandering and friending friends and kin seemingly forever” (Hampton and Wellman, 2003, p. 284).
While studies have estimated social network size (ex. Hill and Dunbar, 2003) and structure (examples abound), relatively less is known about the spatial distribution of an individual’s social ties. For example: in how many places (e.g., cities and towns) does the average person have a social tie? Does this number expand or contract for the individual over time? What is the average distance at which an individual’s ties reside? In addition, longitudinal social network studies have examined contact strength, network benefits, individuals comprising the networks, and networks through life events such as marriage, divorce, retirement, and widowhood (Ertel et al., 2009). Yet, the shift in network geography over time is not as clear.
The answers to these questions can tell us more about an individual’s level of access to social and spatial resources, as having trusted nearby contacts or trusted contacts in particular locations-of-interest leads to the acquisition of a rich variety of resources. In short, it is not just where you live and what you can access nearby, but also what you can access through your social network ties. We situate these questions within a larger framework of the theory of relational geography, which affirms that social networks (and social capital) are spatialized, and that relationships are embedded within a larger spatial system that includes institutions and economic practices (Bathelt and Glückler, 2003). Relational geography theory also asserts that economic actors acquire distant resources across regions and countries in order to reap the benefits of innovation, diffusion, collaboration, collective learning processes and knowledge generation (Bathelt and Glückler, 2005). Whether through new tie acquisition, inter-tie relationship changes, or changes in the spatial configuration of a network, time often changes the variety, diversity, and robustness of an individual’s personal Rolodex of people and places. As an individual moves, their spatial social network will add this new cost of visiting or maintaining communication, seek local alternatives for social life, or restructure in another way.
These life events and strategies to cope with the change of moving and changing geographic social networks are common, but the formal concepts are still abstract and challenging to measure. This serves as a motivation for quantitatively discretizing the spatial distribution of many individuals’ contacts. Mapping these ties over time can help codify complex structures in order to discern the cause and effect of social network life changes and to articulate commonalities of social network geographies across the population. Making this implicit, personalized knowledge more explicit and our analysis more procedural can show how humans may have a common experience. For instance, mapping contacts provides a way to find the typical individual’s balance of nearby and distant ties, and distinguishing this balance is important because nearby and distant ties offer different types of support. Nearby ties can provide help in emergency situations and yield increased power in local governance (Aldrich and Meyer, 2014) while distant friends are helpful when an agent plans a visit or seeks advice (Stafford, 2005).
To further motivate the mapping of geographic social networks, there are also practical travel and health benefits to enumerating contacts’ distances. Mapping tie locales can reveal how far people must travel to see their ties and to calculate the cost, or cost-prohibitive nature, of accessing social support (Carrasco et al., 2009). It can help measure the extent to which an individual is in jeopardy of isolation from known social ties, which can lead to loneliness and adverse health effects (Cacioppo and Patrick, 2008). Maps of contacts can also be used to estimate whether the average person is privy to information about many places (Dabbs et al., 1998; Golledge, 2002) and has opportunities to visit an array of cultural locales (Pultar and Raubal, 2009). Geographic theory supports this notion, through an understanding that contacts share their “insider” knowledge of places.
Engaging with the questions of geographic social network configuration is becoming increasingly facile with digital data sources. In particular, social media applications that allow individuals to track the geographic locations of their contacts provide instant information about the spatial expanse of a user’s contacts (where they move to, their vacation locations, etc.). Most commonly, social media acts as a repository in which users can look up locations from friends’ profiles (on platforms such as Facebook). However, more recently, the locations of large sets of friends has been mapped in real time by apps such as Snapchat’s Snap Maps, allowing users (most often teens and young adults) to peruse a changing heat map annotated with cartoon images of their friends (Juhász and Hochmair, 2018). This mass location sharing creates a “landscape” of many contacts mapped at one time (Næss, 2018). Users most often opt to share location with friends and family, as well as nearby contacts in order to ease the logistics of meeting up (Consolvo et al., 2005). The digital collection of these data encourages better studies not only on of social network size and structure, but also on the distribution of social tie locations.
Motivated by a conceptual systems approach of relational geography and enhanced offerings of online social network data, we perform a case study to measure the geographic distribution of social ties. The objective of this work is to take advantage of digital social network data, and present a proof-of-concept study that illustrates how geographic information systems (GIS) mapping and modeling can help analyze the underlying structure and organization of an individual’s relational geography. In our approach, we map the Facebook friends of a group of 20 volunteer egos. We use two locations for each alter: the location where the alter lived when they met the ego (pre-period) and the alter’s location at the time of the study (post-period). We perform a descriptive analysis of this data set to answer the following research questions:
RQ1. In how many cities and countries do individuals have ties?
RQ2. Do individuals have ties in more locations and at farther distances over time?
RQ3. Do tie locations deviate from the expected distances at which an individual is likely to have ties?
RQ4. Do different types of alter groups (e.g., family, school friends, neighbors, etc.) tend to spread (disperse) over time?
This study illustrates our argument that an individual’s set of places they can access through ties will change over time, as will their ability to access geographic resources through ties. Instead of thinking about the individual’s geographic network as highly personalized and difficult to codify, the case study shows the technology needed to create a working model of geolocated social networks, what measurements and statistics are pertinent, and what visualizations are helpful. As a result, the network provides new insights about how social ties are spread across geographic space. This case study is a non-conventional approach to social network data analysis, as it combines the GIS analysis and measures of spatial network expanse at the individual level. By replicating the procedure described in our case study, others can implement their own networks and collect their own data for future studies.
In the following section, we review major concepts mobilized in this work, and briefly describe our case study approach. We next describe the Facebook data collection process, network measurements, GIS methods and summary statistics. We then describe the results of the 20 networks’ spatial distributions and their change over time. Finally, we contextualize our findings in a wider body of literature, discuss study limitations and conclude.
This section explores previous research describing contacts’ expected and actual distribution over geographic space.
The gravity model is a classic method for estimating a “baseline” of spatial interaction across locales. Interaction can represent number of phone calls, migrants, commuters, number of relationships or other flows that connect two locales. This model computes an expected value of interaction (
This equation has been used to estimate inter-city travel (Ben-Akiva and Lerman, 1985) and retail patronage (Huff, 1963) and has been reparametrized by adding demographic data such as income at a destination city (Greenwood, 1985). The gravity model provides an expectation from which to measure whether real-world inter-city connections deviate from this expected value. Recently, actual interaction data have been used to re-calibrate the coefficient of friction from the default value of two (Krings et al., 2009), and determine where sets of cities are over- or under-connecting in comparison to the gravity estimates (Dugundji et al., 2011; Takhteyev et al., 2012).
Next, survey data and large data sets describing tie distribution in geographic space have shown that distance affects the likelihood of relationships.
Distance plays a role in the likelihood of relationships and relationship maintenance. The likelihood of interaction (or friendship) tends to decrease with each increment of distance between individuals, a longstanding economic geography concept known as distance decay. Recently, large data sets from GPS traces, Location-based social networks (LBSNs), and call data records have confirmed that the probability of having a social tie in a certain location decreases exponentially with distance to that location (Blondel et al., 2008; Lambiotte et al., 2008; Leskovec and Horvitz, 2008; Liben-Nowell et al., 2005; Onnela et al., 2011; Preciado et al., 2012; Scellato et al., 2011), although rates can vary depending on data source (Spiro et al., 2016). Each different function reveals the likelihood of an individual having a contact at certain distances. For instance, in Belgium, this probability drops off significantly at 40 km from the individual’s locale (Lambiotte et al., 2008). In studies of Twitter data, it was shown that 34% of Twitter friends who follow one another live within 25 miles while only 18% of users who do not follow each other, but “mentioned” one another live within that radius (McGee et al., 2011), and that Twitter friendships are more likely to follow flight patterns than Euclidean distance (Takhteyev et al., 2012). Moreover, the average distance of an individual’s contacts has been shown to vary by hometown: a recent Facebook study showed that residents in isolated US areas (such as Eastern Kentucky) had up to 82% of Facebook friends living within 50 miles, whereas residents of in the Western US had fewer than 43% of friends living within the same radius (Bailey et al., 2018).
In addition to big data harnessed from online social networks, studies using standard survey approaches confirm that the accessibility of social support is an important descriptor of a community. For example, in his study of the “spatial dimensions of personal relations”, Fischer (1982) finds that 26% of semi-rural residents’ relatives lived within a 5 minutes’ drive, but only 15% of urbanites’ relatives lived within a 5 minutes’ drive. The General Social Survey reports that 27% of respondents live within a 15-minute drive of their mother, and 12% live 12+ hours away (Smith et al. n.d.). Similar studies of long distance relationships find that 66% of elderly parents live within 30 minutes of an adult child (Lye, 1996; Stafford, 2005). These types of studies are particularly helpful because they distinguish different types of ties.
Survey data and distance decay functions provide rules-of-thumb for where relationship probability declines. Yet, they should be paired with map-based approaches that can communicate a richer portrait of relationships. Our case study aims to dig deeper into personal distributions for individuals by listing individual cities that egos interact with and describing how these dynamics change over time.
In total, 20 volunteers downloaded their friendship networks (totaling 8,549 friends) from
Each volunteer (ego) annotated each of their friends (alters) with an accompanying home location at two time periods (the location from when the ego and alter met, and a current location). Individuals’ ties were mapped in geographic space and analyzed both individually and as a combined group. This study is equipped to respond to our four research questions because it specifically maps individuals’ social ties, calculates distance between ties, associates ties with different cities, and measures tie dispersion over time.
Volunteers were enrolled at The Pennsylvania State University (Penn State) as graduate or undergraduate students at the time of the study. All volunteers resided in State College, PA, a small college town in the northeastern USA, and were recruited to participate in the study through a seminar course project. Participants were required to have a Facebook account with associated “friends” (i.e., alters) and be at least 18 years old. The group ranged in age between 20–40 years old, and included 10 women and 10 men. It was comprised of 17 white and three Asian respondents, and four participants were non-native English speakers. Further information about egos was concealed for privacy reasons.
Within Facebook, each volunteer used the NameGenWeb application (as described in Hogan, 2011) to download their social network as a graph of their friends and friend inter-connections (i.e., if the ego’s friends are friends with one another). NameGenWeb has been used to show that agents in dense networks influence one another’s emotional status (Lin and Qiu, 2012), to find structurally- and semantically-related groups of nodes (Cruz et al., 2013), and to identify social groups and clusters (Brooks et al., 2014).
NameGenWeb created an undirected network of friends, where an ego with
We calculated standard network descriptors, including diameter, density, and average clustering coefficient using the “igraph” library (Csárdi and Nepusz, 2006) in the R statistical computing environment.
Next, volunteers geocoded the locations of each of their friends at two time steps (using recollection and assistance from social media, such as Facebook profiles): the alter’s home location when the relationship began (pre-city) and the alter’s home location at the time of the study (post-city). A few alters met in “cyberspace” or had unknown locations and were removed because, although virtual spaces are geographic (Chen et al., 2013), we required metrics of distance change, i.e., two distinct geographic points, over time. Volunteers assigned longitude and latitude coordinates to alters by geolocating cities in Google Maps, Mapquest or Esri’s global gazetteer.
Each ego and alter were assigned an ID number to preserve anonymity. After anonymizing the data, the geographic coordinates were mapped within the ArcMap GIS software environment. These coordinates were spatially joined to existing shapefiles (i.e., spatial data files) of core-based statistical areas (CBSA) (metropolitan areas) or, if in a rural area, counties. Coordinates outside the USA were assigned to global administrative areas (GADM). As a result, each alter was assigned to a standardized urban center or a US county.
We then calculated the Euclidean distance between the ego’s current location and the locations of each alter’s pre-city and post-city, using the Great Circle method from the R “geosphere” package based on the WGS84 ellipsoid (Hijmans et al., 2017). To examine how social networks spread over time, we found the standard distance (i.e., the standard deviation of the longest linear axis of a point pattern) of each ego’s alters for both time steps. We determined the mean centers, that is, the average center of friend distributions, and reported the extent of the shift in mean centers over time in the ArcMap environment.
The gravity model (as described earlier) is used in this study to predict the places where the egos are likely to have ties. We compared the locations of the alters’ cities to the gravity model for US cities only. First, we found the product of the population of State College and each alter’s locale (using 2000 US census population counts). We next computed the Euclidean distance from the town’s centroid to all other cities’ centroids.
Facebook friendship networks tend to naturally cluster around different areas of a person’s life (Hogan et al., 2007) with significant clustering within universities (Lewis et al., 2008). To find clusters, or modules, within each network, we used the Louvain method for community detection (i.e., modularity calculation) (Blondel et al., 2008) within the Gephi environment (Bastian et al., 2009). To avoid double-counting alters, we removed 44 “overlapping” alters, i.e., nodes that are friends with more than one ego.
For each modularity group, egos identified an institution that the modularity group represents and the year of the group’s inception. We provided a crowdsourced set of labels, including university, secondary school, place-based cultural groups, and non-residential gatherings, such as conferences and vacation travel. Volunteers were invited to label groups using the aforementioned examples but were also able to choose their own labels. A modularity group required three members to be considered, and 90% of egos annotated their groups with a classification. We assume that each unique cluster within a Facebook network represents a specific social context (Brooks et al., 2014).
All 20 egos had a total of 8,549 alters ranging from 123 to 772 per ego (Table 1, Figure 1). The number of edges connecting the nodes ranged from 784 to 12,667. The average degree of friends ranged from 6.9 to 37.7, indicating that some alters had many shared friends in the same network while others did not. Density is defined as the number of edges divided by the total possible edges in a network and is computed as the number of actual edges (e) that exist over the number of possible edges (
Summary statistics and geographic distribution of alters for each ego’s network.
Network characteristics | Geographic characteristics | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Agent | Nodes | Edges | Average degree | Diameter | Density | Clustering coefficient | Modules; modularity | Average distance to friends (post-period) | Standard distance pre-period | Standard distance post-period | Change in standard distance | Change of mean center |
|
376 | 2679 | 12.3 | 10 | 0.038 | 0.36 | 4; 0.61 | 431 | 1468 | 2392 | 924 | 192 |
|
336 | 2533 | 14.5 | 9 | 0.045 | 0.53 | 13; 0.67 | 7827 | 5570 | 5691 | 121 | 667 |
|
601 | 7392 | 23.5 | 11 | 0.041 | 0.49 | 15; 0.62 | 277 | 1592 | 2395 | 803 | 383 |
|
370 | 3686 | 19.4 | 9 | 0.054 | 0.57 | 11; 0.71 | 2567 | 4377 | 3686 | -691 | 597 |
|
573 | 9833 | 33.8 | 10 | 0.06 | 0.47 | 12; 0.51 | 1409 | 2754 | 3727 | 973 | 1873 |
|
232 | 2706 | 22.3 | 5 | 0.10 | 0.53 | 12; 0.37 | 1174 | 2917 | 3903 | 986 | 455 |
|
361 | 2989 | 14.8 | 12 | 0.04 | 0.54 | 12; 0.71 | 215 | 887 | 2283 | 1396 | 368 |
|
437 | 6002 | 27.2 | 8 | 0.06 | 0.46 | 10; 0.51 | 1185 | 627 | 1267 | 640 | 188 |
|
403 | 5022 | 24.7 | 8 | 0.062 | 0.43 | 9; 0.52 | 687 | 3087 | 2383 | –704 | 518 |
|
437 | 2763 | 11.5 | 10 | 0.029 | 0.52 | 13; 0.78 | 1619 | 3726 | 4157 | 431 | 409 |
|
639 | 9784 | 30.5 | 7 | 0.048 | 0.33 | 10; 0.46 | 712 | 2534 | 2893 | 359 | 575 |
|
188 | 1002 | 10.4 | 10 | 0.057 | 0.64 | 6; 0.74 | 697 | 3047 | 2605 | –442 | 129 |
|
201 | 784 | 6.9 | 10 | 0.039 | 0.16 | 16; 0.46 | 8522 | 5727 | 5610 | –117 | 325 |
|
772 | 7143 | 18.2 | 11 | 0.024 | 0.44 | 16; 0.76 | 529 | 3274 | 3357 | 83 | 292 |
|
123 | 1028 | 15.8 | 9 | 0.137 | 0.74 | 9; 0.40 | 6701 | 3588 | 5564 | 1976 | 448 |
|
659 | 11,708 | 34.9 | 8 | 0.054 | 0.50 | 14; 0.52 | 1508 | 3726 | 3552 | –174 | 1150 |
|
727 | 12,667 | 34.5 | 9 | 0.048 | 0.53 | 10; 0.69 | 492 | 1792 | 2951 | 1159 | 520 |
|
227 | 4412 | 37.7 | 6 | 0.172 | 0.55 | 10; 0.23 | 176 | 49 | 1658 | 1609 | 196 |
|
408 | 7639 | 37.0 | 7 | 0.092 | 0.52 | 9; 0.52 | 626 | 1693 | 2929 | 1236 | 441 |
|
566 | 4157 | 14.3 | 14 | 0.026 | 0.53 | 15; 0.75 | 1148 | 1156 | 2167 | 1011 | 560 |
Note: All distances measured in kilometers.
Egos provided a total of 1207 recorded locations: 430 individual places logged in the pre-period and 777 in the post-period. Alters spread over time, producing more places for egos to visit, telecommunicate with, think about, and in which to claim they have social capital (i.e., the benefits that relationships can yield). Accounting for duplicates across the two time periods, there were 942 unique locations (and 265 repeats), which represent the collective places where egos are likely to interact at a personal level. At the time of the study, three egos could see friends at, on average, less than 300 km distance (
Friends have generally dispersed over time (Table 1), at distributions ranging from 83 km (
Along with the change in standard distribution, the mean center of each ego’s friends (i.e., the center point of the distribution) has shifted as well. The mean center of each ego’s set of alters changed as much as 1,873 km (
Due to privacy concerns associated with our small sample size and requests from participants, we did not disclose the distinct countries where international friends are located. Indeed, volunteers were initially hesitant to provide international locational information. For example, a participant with many friends (~100) in a single foreign country was reluctant to publish the name of this country, although many countries had more than 100 alters (often furnished by one ego). When discussing privacy, we sometimes tend to insinuate personal privacy, yet the participants’ primary consideration was their friends’ privacy, especially in non-urban locations. Egos were also uncomfortable listing how many alters they had in a single city if data from no other alter was furnished. We gathered this information on privacy preferences by asking volunteers about their preferences during the data collection process.
Internationally, alters lived in 64 countries (including the USA) in the pre-period and 90 countries in the post-period. 7472 (87%) friends lived in the US during the pre-period, and there was a net gain of 145 alters abroad over time. Excluding the US, five countries had at least 50 alters in the pre-period and eight countries in the post-period, meaning certain countries attracted many individuals. Similarly, ten countries had between 10 and 49 alters in the pre-period and 12 had 10–49 alters in the post-period. None of the top five countries for alters have English as a primary language, but in the following ten, four have English as a primary language. Time yielded fewer South American and Asian-residing alters, and more alters in Australia, North America and Europe.
In the US, participants met alters in 248 cities or counties during the pre-period, and had alters in 426 cities or counties during the post-period. Locations were mostly concentrated in the Eastern Atlantic United States (Table 2). A number of egos had alters in the US outside CBSAs, living in rural areas in Iowa, New York, Georgia and Missouri.
Top 20 most popular domestic alter cities and the number of egos with alters in these locations in the pre-period and post-period.
Pre-city | Alters | Egos | Post-city | Alters | Egos |
---|---|---|---|---|---|
State College, PA | 1384 | 17 | State College, PA | 797 | 18 |
Washington–Arlington–Alexandria, DC–VA–MD–WV | 803 | 12 | Washington–Arlington–Alexandria, DC–VA–MD–WV | 721 | 18 |
Seattle–Tacoma–Bellevue, WA | 310 | 9 | New York–Northern New Jersey–Long Island, NY–NJ–PA | 346 | 17 |
Poughkeepsie–Newburgh–Middletown, NY | 306 | 3 | Boston–Cambridge–Quincy, MA–NH | 276 | 17 |
Chicago–Naperville–Joliet, IL–IN–WI | 288 | 10 | Charlotte–Gastonia–Concord, NC–SC | 239 | 8 |
Boston–Cambridge–Quincy, MA–NH | 245 | 13 | Seattle–Tacoma–Bellevue, WA | 231 | 17 |
Knoxville, TN | 222 | 5 | Chicago–Naperville–Joliet, IL–IN–WI | 224 | 16 |
Cedar Rapids, IA | 205 | 2 | Philadelphia–Camden–Wilmington, PA–NJ–DE–MD | 207 | 15 |
Charlotte–Gastonia–Concord, NC–SC | 199 | 4 | Atlanta–Sandy Springs–Marietta, GA | 193 | 12 |
Atlanta–Sandy Springs–Marietta, GA | 196 | 7 | San Francisco–Oakland–Fremont, CA | 183 | 14 |
San Francisco–Oakland–Fremont, CA | 186 | 8 | Los Angeles–Long Beach–Santa Ana, CA | 133 | 17 |
Williamsport, PA | 180 | 3 | Dallas–Fort Worth–Arlington, TX | 132 | 15 |
Austin–Round Rock, TX | 178 | 5 | Albany–Schenectady–Troy, NY | 129 | 6 |
Philadelphia–Camden–Wilmington, PA–NJ–DE–MD | 173 | 9 | Knoxville, TN | 126 | 5 |
Dallas–Fort Worth–Arlington, TX | 159 | 6 | Austin–Round Rock, TX | 120 | 15 |
Appleton, WI | 125 | 2 | Williamsport, PA | 107 | 4 |
Riverside–San Bernardino–Ontario, CA | 104 | 5 | Cedar Rapids, IA | 92 | 2 |
New York–Northern New Jersey–Long Island, NY–NJ–PA | 63 | 16 | Pittsburgh, PA | 91 | 15 |
Harrisburg–Carlisle, PA | 61 | 3 | Augusta–Richmond County, GA–SC | 82 | 5 |
San Diego–Carlsbad–San Marcos, CA | 45 | 7 | Riverside–San Bernardino–Ontario, CA | 76 | 7 |
Note: Six pre-cities were removed for privacy because only one ego supplied alters from the city.
A “heat map” result shows the loss and gain of alters between the two time periods for the collective group (Figure 4). This figure does not show the raw frequency of alters, but is normalized based on a quadratic kernel function (Silverman, 2018) (akin to a probability density function) that effectively smooths the data within a spatial radius. Alters have moved from Pennsylvania, Washington, DC and the Midwest and now inhabit geographic pockets, notably in the East Coast, North Carolina, Denver, CO, and West Coast cities (Figure 4). This finding is in line with research using 2000 census data that heralds large metropolitan areas, and the USA South and West, as key places for young, college-educated adults (Franklin, 2003). In addition, the places that draw alters, such as Los Angeles, Seattle, San Francisco, and North Carolina’s Research Triangle are information technology hubs (He and Fallah, 2011) with advanced research and job opportunities in the tech sector, a field of expertise for a number of the egos. Incidentally, the seminar course that anchored this study was in the field of information science.
A general gravity model from State College predicts high interaction with following cities (in order): New York, NY, Philadelphia, PA, Washington DC, Harrisburg, PA, Pittsburgh, PA, Baltimore, MD, Lewistown, PA, Altoona, PA, and DuBois, PA. Of the 50 top cities for predicted interaction, 10 of the 20 pre- and post-cities are represented (Table 2).
In general, results followed the gravity model to a small extent. This value correlated with actual interaction at rates of 0.45 (
The differences between actual and predicted ranks of each city further support the proclivity toward large cities. Washington, DC’s pre-city and post-city rank is 2 and gravity rank is 3 out of all cities in the network. Therefore, Washington, DC could be described by the absolute value difference (
In total, 143 modularity groups were identified. We found that, generally, social groups spread over time (Table 3). Groups associated with stronger ties to place and long-term membership (i.e., family, cultural groups) spread less than those associated with transitory institutions (such as universities, co-workers and secondary education). Educational groups, including university and high school, and non-residential gatherings, such as conferences and summer camps, spread over 1500 km over time. These also tend to be the “youngest” groups. Although they had more time to spread, older groups (those with longer average time since formation) spread the least since their inception. This finding matches social network research discoveries that some groups are more stable than others, particularly that family ties endure over extended friendship networks (Ertel et al., 2009). Recreational groups, including place-based cultural groups, did not incur significant spreading, and a review of social group longevity revealed that outdoor, cultural, or sports groups tend to dissolve time (especially for men), while religious groups tend to persist over time (Hyyppä et al., 2008).
Table of group types and changes in standard distances.
Group type | Modules | Avg. standard distance before; after (km) | Difference in standard distance (km) | Avg. years since inception |
---|---|---|---|---|
University | 36 | 2298; 3810 | 1512 | 8.5 |
Professional | 21 | 2468; 3927 | 1460 | 6.6 |
Secondary education | 20 | 527; 2441 | 1914 | 14.2 |
Place-based cultural group | 16 | 1437; 2224 | 788 | 10.7 |
Family | 15 | 1678; 2459 | 781 | 22.2 |
Non-residential gathering | 14 | 2898; 4865 | 1967 | 6.5 |
Non-place-based cultural group | 8 | 2463; 4267 | 1805 | 9.5 |
Other | 3 | 4429; 5410 | 981 | 5.5 |
Neighbors were largely left out of these distinctions – very few egos labeled their modules as “neighbors”. This may be due to a lack of traditional “neighbors” being on Facebook, or that college neighbors (e.g., dormmates) are commonly regarded as friends. Nevertheless, neighbor groups do not tend to persist through time for highly mobile individuals. As Wellman et al. (1997) find, over a decade, 20% of neighbors maintained ties after moving.
Within the framework of relational geography, we conducted a social network mapping case study. In response to our research questions, we found that egos had five to 50+ locations where they have met alters and the average tie distance for individual egos ranged from 83 to 1,976 km. These findings resemble Bailey et al.’s (2018) map of average tie distance in that some egos’ distance corresponds to a typical person in Penn State’s Appalachian region (where most friends live less than 100 km away), while other have much higher distances (typical of large cities and West coast cities). In terms of the individual cities that attracted alters, this geographic shadow points to alters also being young and educated, despite opportunities for Facebook friends to include more than these student volunteers’ peers (e.g., aunts and uncles).
Over time, contacts dispersed geographically. On average, egos had alters in 21 individual cities, counties and countries (depending on how they reported locales) in the pre-period and 38 in the post-period. Egos’ alters lived in a combined 64 countries in the pre-period and in 90 countries in the post-period – 145 alters moved abroad. Given the demographics of the egos, the mechanisms behind the spreading of alters are assumed to be job prospects and opportunities, or returning to a home country, rather than forced migration (political, economic, or otherwise). The notion of growing distances over time aligns with prior research, namely, findings that those who move farther maintain more dispersed networks, and those who move locally keep proximate social networks (Magdol, 2000). A number of egos in this study moved to State College for graduate school, some from international locations.
In the pre-period, about 45% of friend distribution locations were predicted by the gravity model, and this value rose to 56% in the post-period. This group had a distinct lack of social ties around State College, in nearby Pennsylvania cities Lewisburg, Altoona and DuBois, which may be explained by egos’ preference for large cities. The locations of potential interaction may be perpetuated by the network of universities and other institutions as opposed to nearness. We find that the predictive power of the gravity model is slightly less than previous research on the gravity model’s ability to predict inter-city US migration, which has been measured at about 60% (Andris et al., 2011). The increase in predictive power over time is expected, and is attributed to two factors: friends’ migration to larger cities, and that in the pre-period, the egos may not have lived in State College – providing a false center from which to measure interaction distance.
Groups of contacts spread at different rates, but each type of group experienced geographical dispersion. The spreading of education-based groups can be explained by migration research that cites higher-education cohorts’ above-average movement rates where the young and educated produce the highest rates of migration in the USA (Frey, 2005). It also reflects findings that that living far from home, a history of residential movement, and high education levels equate to geographically distributed networks (Viry, 2012).
Accordingly, education level may be the most distinguishing predictor variable of this cohort. As such, this cohort may want to pay special attention to university and professional contacts, and their respective locations in the future, as individuals tend to stay in touch with higher-education ties, especially those that provide career support (Feld et al., 2007), and more education leads to a greater capacity to maintain distant ties (Viry, 2012). In one study, between 25–33% of classmates, co-workers and friends persisted across a decade, and educational attainment predicted the persistence of school/professional ties (Suitor and Keeton, 1997). Collectively, these findings suggest that the groups created from professional and university institutions may play a substantial supporting role for the egos in the future – and, thus, their locations matter.
The research questions in this work were largely conceptual, and the study has a set of limitations. Our sample size was relatively small. This is a highly specialized group of 20 egos that only serves as a proof-of-concept for mapping the social ties for this subset of the population. Recruiting volunteers was difficult due to the newness of geolocated social networks and the lack of previously successful studies to which we could refer potential contributors. As a result, we limit the generalizability of these particular results to the wider population, since our sample is a group of young, mobile university students, some of whom are not native to the region. In the future, examining diverse networks of the elderly, those with different levels of educational achievement, or migrant populations may yield different patterns of spreading behavior – perhaps flows to retirement communities, less spreading in general, or movement within and between ethnic enclaves, respectively.
There were also challenges associated with data creation. With each ego geolocating upwards of 500 highly mobile contacts, there was likely to be human error in tagging locations and relying on Facebook profiles to report the alter’s most current location. In a few cases, egos could not recall where they met or how they knew an alter, and thus, the alter was removed from the data set. However, since egos filtered their own data and validated their modularity groups with their raison d’etre (in the form of “club”, “family”, etc.), digital footprint data quality issues (Lewis, 2015) such as lack of context and over-inclusive sample, were partially remedied. Regarding analysis, although we used modularity to detect social groups, a social group is not always synonymous with a module. Modularity algorithms may omit social group members or may capture individuals who were not in the social group but were grouped together mathematically due to mutual ties with the group’s members.
New research studies could be developed based on the social-spatial perspective proposed here. Future studies could also use data from Facebook or other social networks to create a larger and more diverse set of egos. With a larger data set, we can more reliably measure spatial network regularities and create benchmarks for cross-study comparisons. These measures should include how proximal an individual’s social support is and the variety of places in which an individual has social support, in order to determine whether there is a common spatial rhythm to the spatial distribution of human relationships.
An ideal study would map alters at multiple time steps, perhaps in real time, with technologies that resemble the Snap Maps described above. Egos should be interviewed or surveyed about how access to their own dynamic, visual address book of contacts affects decisions to travel, telecommunicate, maintain friendships, and perceive places, as one may develop different migration or travel plans after reexamining their tie locations. If the social tie has a hidden byproduct of helping the ego explore new places, it may be that a link between two people also indicates a link between two places, and we can explore the validity of this theory under various definitions and conditions. This exercise would also help characterize social networks. For example, although an ego may have a strong, vibrant social network, this network may be limited in terms of the places he or she can be exposed to through these ties. On the other hand, an ego may have a relatively sparse social network, but can access a variety of attractive places through these ties.
Despite the study’s drawbacks, this case study yielded new insights about how ties are distributed and how time affects social network geography. We hope this exploratory research can lead to future studies at larger scales to create new findings about these personalized distributions.