The Gaussian distribution also known as the normal distribution is a common continuous distribution.
Univariate gaussian distribution
The probability density function is given as:
p(x∣μ,σ2)=N(x∣μ,σ2)=2πσ21exp(2σ2−(x−μ)2)
The parameters estimations are:
μ=N1n=1∑Nxn
σ2=N1n=1∑N(xn−μ)2
where the mean (μ) is the location, and the variance (σ2) is the dispersion. The xn denotes the feature value of nth sample, and N is the number of samples in total.
The maximum likelihood can be calculated using optimisation. First assume independence of training samples, and apply natural logs.
The Pearson’s Correlation Coefficient is a measure of the linear correlation between two variables X and Y.
p(xi,xj)=pi,j=σi,iσj,jσi,j
The correlation coefficient p(xi,xj) is obtained by normalising the covariance σi,j by the square root of the
product of the variances σi,i and σj,j, and satisfies −1≤σi,j≤1.
Bayes’ theorem
We can use Bayes’ theorem for continuous data x and discrete class k as: