Main Page

Introduction to Inferential Statistics

What is Inferential Statistics?

Inferential statistics includes the hypothes tests, estimating and predicting about data. We use the samples to generalize subjects about a population the most basic form. In this section we will see about distrubitions as a tool, and how distrubitions made and what types of distrubitions they are.

Distrubitions

Binomial Distrubition

Before diving deep to Binomial distrubitions we should update our knowledge about Binomial Coefficient calculations and probability.

Binomial Distrubitions is used when there exactly two outcomes.Like flipping a coin or a students exam grade has passed or not. We know that in there are some calculations of them in probability.But how does that relation with a "distrubition"? Let's see together.

Let's say we are flipping a coin 7 times.What is the probability of there is no Tails?

We will say heads to: H and tails to: T. And number of T after 7 flips is x.

We are looking for P(x=0)=? right?

Than it is P(HHHHHHH)=?.

What is the probability of there is one tails?

Now in the above equation we don't need any coefficient because we calculated there is no tails.The thing is we don't care about the order of the tails.It can be at the first try or the last try.So it is called combination. Binomial coefficient is the other name of combination and this is the formula.

So if we calculate all the tail probabilities for 7 flips this will be it:

Now these are our probabilites for each 7 flips. Let's show them in a plot and make a distrubition out of this shall we?

Poisson Distrubition

Poisson distrubition is used to show the probability of a given event over a specific period.

Here is the formula:

e is the eulers number=2.71828

λ is the expected value

x is the value we want to see the probability.

P(X) is the probability of x.

So let's say there are 12 fish on average gets caught by a boat in 2 hours.What is the probability of 9 fish caught by that person over next 2 hours?

Gaussian Normal Distrubition

The other two distrubitions we saw were discrete distrubitions.Gaussian Distrubition is arguably one of the most important distrubitions.This is a continuous distrubition. It is one of the best fit for natural phenomenas so it is used commonly in statistics. Gaussian Distrubition is mostly described as a bell curve.And it is symmetrical from center.In the center we have the mean value and you can go to each side 3 standard deviation. So at the right wing the value is Mean+(3xStandard deviation).At the left Mean-(3xStandard deviation).We will se how to use these. Here is the formula:

Here is the distrubition:

What is Z-Table?

Let's say we have a sample of peoples height as a Gaussian Distrubition.Our samples mean is 170cm and standar deviation is 5 cm. If we want to know the percentage of people that 164 < x < 168. How we can calculate it out of Gaussian Distrubition? Answer is simple, we should calcute the area under the bell curve in 164< x < 168 values. Z-Table comes to help in here. Z-Table helps us standardize this area calculations. Here is Z-Table and its formula:

Now let's calculate Z score for this values. Mean=170,Std.dev.=5.We want to know the probability of 164 < x < 168.

If we look for Z=-0.4 and Z=-1.2 values from Z-Table we get, Z(-0.4)=.34458 and Z(-1.2)=.11507

So if we substract the values,the possibility of the magenta colored area(164 < x < 168) is .22951 as percentage the probability is %22.9

My Twitter Account My Github Account My LinkedIn Account