Also known as MLE.

Given some observations and a known distribution (Bernoulli, Poisson etc. )we would like to estimate the distribution’s parameters from the observed data. We denote the estimated parameter as .

We can use the Likelihood function denoted as .

Let be i.i.d. random variables with probability mass function or density , where .

The log-likelihood is

In practice, would the the pdf of the distribution with parameters.

We use the log likelihood since it makes differentiating easier. To find the Maximum Likelihood Estimator for some distribution and observed value, we can take the first derivative of and set this function to .

This is finding the global maximum of the likelihood function with respect the parameters.