- Probability Definition: Events, Sample Points and Sequencing Events Techniques
- How to calculate probability: Combinations, Permutations, Cardinality
- Random Variables: Discrete and Continuous (pmf, CDF functions)
- [TODO Expected Value, Variance, Standard Deviation, Quartiles][#]
- Probability Distributions: Binomial
- TODO Probability Distributions: Geometric
- Probability Distributions: Hypergeometric
- Probability Distributions: Negative Binomial
- Probability Distributions: Poisson
Probability is the likelihood that an event will occur.
Events The probability of an event
E
is the cardinality of the event|E|
divided by the cardinality of the sample space|S|
(the "universe",S
,) that the event is in.
The Wackerly probability book is great, and describes the sample-point method for calculating probability.
Another technique, after sample point technique, is sequenced events.
Combinations: Order Doesn't Matter
Examples: Out of the set S = {A, B, C}
, a combination set would include AAA
, AAB
, ABC
, .... etc, and ABA = BAA
because order doesn't matter. When order doesn't matter, you don't need to count as many things, e.g. if AAB
is equivalent to ABA
, then those items count as one element of the set, not two.
Permutations: Order Matters
Note that the denominator is smaller than in combinations. Permuations possibilities are much larger because order matters, so we have to count it all.
Examples: Out of the set S= {A, B, C}
, a combination set would include AAA
, AAB
, ABC
, .... etc, and ABA != BAA.
Cardinality is the number of elements in a Set.
TODO - in progress, as this coursework does NOT cover this material to satisfaction.
pmf: Probability "mass" function
A pmf measures the scalar value of a discrete variable, and a PDF (probability density function) measures the probability that a continuous random variable will have a certain range.
Note in R, the "density function," invoked via dhyper(y, r, N-r, n)
, this function measures a discrete random variable's scalar value, such as our hypergeometric example in R; there's a bit of oddness here, since we've used this function for discrete random variables.
Also in R, the "probability distribution function" is invoked via phyper(4, r, N-r, n)
.
CDF
TODO
Expected Value: TODO
Variance: TODO
Standard Deviation: TODO
Quartiles: TODO
To be continued when there is more time :) Essentially, repeated uniform experiments of a series of failures and successes, for example
Distribution:
Using the binomial probability distribution formula, we know that for
the pmf represented by:
for
Mean, Variance (TODO):
TODO - wrt Wackerly
Distribution (TODO)
Mean, Variance (TODO)
Distribution:
For random sampling of sample size
The denominator: counting the number of ways to select a subset of
Then for the numerator, we think of
Mean, Variance (TODO):
Either counting the number of failures, or counting the $r$th trial where the first success occurs.
Distribution (TODO):
Mean, Variance (TODO):
The Poisson probability distribution, used for rare events over a period of time, is also used to approximate the binomial distribution since the binomial distribution converges to the Poisson distribution. The Poisson distribution can approximate the binomial distribution in use cases for: large
The Poisson distribution's probability function is