NSA collects metadata for a large fraction of the cell phone calls placed in the United States. Upon inspection, analysts are allowed to make three hops from a targeted calling account. In other words, they're allowed to inspect the metadata for every contact that the targeted person has made, for every contact that those contacts have made, for every contact that those contacts have made, and finally, for all the contacts that those contacts have called, i.e. three "hops". One person in a community of 10,000 is targeted for having called a grandparent in Yemen.
In addition, there are two toll free hotlines, one for emergency assistance that is called by 214 unique community members, and another for information about various community events that is called by 673 unique community members. The full calling record for the community is listed here.
You are a journalist who has been given the calling record by a whistleblower inside NSA. You know that the targeting has been carried out but you don't know the phone number of the targeted individual. To get a sense of how many people fall under the dragnet when this one person is targeted, find the number of 2-hop connections for the following phone numbers, and report the average:
1 , 17, 793, 1200, 3402
- Each line representing a call between two numbers. The two integers are the "phone numbers" of the two individuals involved in the call.
caller1 caller 2
- The number of unique contacts for each caller has been generated according to the best fit to a real distribution published by Sprint corporation.