If I have a set of data point and I want to approximate the distribution of that data set. What methods can be employed to fit the data set with the best most distribution. Whether it be gamma, normal, log normal, exponential, etc. I am trying to find the best distribution and the parameters that optimizes the best fit. What methods are out there to do so?
Here is the data, I am trying to approximate. With a distribution. I generated the data by running a 3,466 binary simulations (1,0) and summing the number of 1's in each simulation. According to probability theory, the sum of the outcomes of a Bernoulli distribution is a binomial. But for the sake of being ignorant, if I didn't know this was binomial, how could I build a function that approximates the data. My end goal is to build an excel function that draws on the inverse of the density function and spits out a random number from the distribution.
x #occurance P(x)
1636 1 0%
1646 2 0%
1656 2 0%
1666 6 1%
1676 13 2%
1686 20 2%
1696 44 5%
1706 61 7%
1716 79 10%
1726 115 14%
1736 120 14%
1746 97 12%
1756 88 11%
1766 81 10%
1776 48 6%
1786 31 4%
1796 13 2%
1806 7 1%
1816 3 0%
1826 0 0%