Inferring social ties from geographic coincidences
Open Access
- 8 December 2010
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 107 (52), 22436-22441
- https://doi.org/10.1073/pnas.1006155107
Abstract
We investigate the extent to which social ties between people can be inferred from co-occurrence in time and space: Given that two people have been in approximately the same geographic locale at approximately the same time, on multiple occasions, how likely are they to know each other? Furthermore, how does this likelihood depend on the spatial and temporal proximity of the co-occurrences? Such issues arise in data originating in both online and offline domains as well as settings that capture interfaces between online and offline behavior. Here we develop a framework for quantifying the answers to such questions, and we apply this framework to publicly available data from a social media site, finding that even a very small number of co-occurrences can result in a high empirical likelihood of a social tie. We then present probabilistic models showing how such large probabilities can arise from a natural model of proximity and co-occurrence in the presence of social ties. In addition to providing a method for establishing some of the first quantifiable estimates of these measures, our findings have potential privacy implications, particularly for the ways in which social structures can be inferred from public online records that capture individuals’ physical locations over time.
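The question posed in the abstract, how likely two people are to know each other given a number of co-occurrences, can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a simple latitude/longitude grid and fixed time bins, and the function names, the `cell_deg` and `window_s` parameters, and the input layout are all hypothetical. Fixed bins are a simplification; two visits that fall just either side of a bin boundary are not counted as a co-occurrence.

```python
import math
from collections import defaultdict
from itertools import combinations


def count_cooccurrences(checkins, cell_deg=1.0, window_s=86400):
    """Count, for each pair of users, the number of distinct
    (grid cell, time bin) buckets in which both appear.

    checkins: iterable of (user, lat, lon, unix_time) tuples (assumed layout).
    cell_deg: side of a grid cell in degrees (illustrative default).
    window_s: length of a time bin in seconds (illustrative default).
    """
    # Group users by the space-time bucket each record falls into.
    buckets = defaultdict(set)
    for user, lat, lon, ts in checkins:
        key = (
            math.floor(lat / cell_deg),
            math.floor(lon / cell_deg),
            int(ts // window_s),
        )
        buckets[key].add(user)

    # Every pair sharing a bucket gets one co-occurrence for that bucket.
    pair_counts = defaultdict(int)
    for users in buckets.values():
        for a, b in combinations(sorted(users), 2):
            pair_counts[(a, b)] += 1
    return dict(pair_counts)


def tie_probability_by_count(pair_counts, friends):
    """Empirical fraction of pairs with exactly k co-occurrences
    that are known ties.

    friends: set of (a, b) tuples with a < b (assumed ground truth).
    """
    totals = defaultdict(int)
    ties = defaultdict(int)
    for pair, k in pair_counts.items():
        totals[k] += 1
        if pair in friends:
            ties[k] += 1
    return {k: ties[k] / totals[k] for k in totals}


# Toy data, invented purely for illustration.
checkins = [
    ("a", 10.2, 20.3, 0),
    ("b", 10.7, 20.9, 100),
    ("c", 10.5, 20.5, 50),
    ("a", 30.1, 40.1, 90000),
    ("b", 30.5, 40.5, 90500),
]
friends = {("a", "b")}

pair_counts = count_cooccurrences(checkins)
probs = tie_probability_by_count(pair_counts, friends)
```

Varying `cell_deg` and `window_s` in such a sketch is one way to probe how the estimated likelihood depends on spatial and temporal proximity, which is the second question the abstract raises.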