(It is assumed that the reader has read the third article of this series)
As promised in the last article, in this article we will derive the exact form of , the degree distribution, for ER network. To derive the exact form of , observe that if a certain node is connected to given other nodes, then probability of this event is where the linking probability as defined in the first article and is the total number of nodes in the network. Let me explain this in some detail. Since the probability that our chosen node is connected to one other node is , the probability that it is not connected to that node is . Out of total nodes which this node can connect to, it is connected to nodes so this gives us factor . This also means that our node is not connected to nodes which gives us factor of . But this calculation we did for 'given' number of nodes. But observe that we can actually choose these nodes from remaining nodes in ways. This gives us:
Just to simplify this expression, let us assume that we there were number of nodes to start with instead of . Then we get:
Does this expression resemble something? Consider expansion of and you will realize that is nothing but a binomial distribution!. Now we want to see what happens if we increase the size of the network. As we increase the number of nodes, in general, we don't expect that average degree of the nodes would increase. For example, if the number of people in the city increases by , we don't expect that average number of friends of person living in that city would double. Thus we arrive at an important conclusion that instead of , we should really keep , the average degree of the network, a parameter to describe ER graph. Hence to study larger and larger ER networks, we first keep average degree of the network constant and then see what happens as the network grows. As was mentioned in previous articles, for ER network, and hence while simulating larger and larger ER networks, we must decrease appropriately so as to keep constant.
Now let us rewrite using as the parameter instead of . After some algebraic manipulations,in the limit of large , we get,
This is called a Poisson distribution. You can see the whole derivation here. Thus we see that ER network has Poisson distribution for degrees and this fact is going to be very important for our future journey.
In the next article we will try to look deep into the structure of social networks and we will discover an extremely important property of them which will ultimately be related to ER network. So be ready! :-)