4.1 Statistics

Statistics

Variable

Mean

Median

Mode

Interquartile range

Descriptive

Variance

Standard deviation

Correlation

Data

Cumulative frequency

Line of best fit

Regression

Modeling is a process of creating a mathematical representation of certain real-world scenarios. This falls under the statistics field, which requires some terminology in prior.

Terminology	Definition
Population	Entire collection of individuals about which we want to draw conclusions
Census	Information collected from the whole population
Sample	Subset of population which could be chosen at random to avoid bias
Survey	Collection of information from a sample
Data	Information about individuals in a population
Categorical variable	Describes a particular quality or characteristics which can be divided into different categories
Numerical variable	Describes a characteristic which has a numerical value that can be counted or measured. There are two types: discrete and continuous.
Discrete variable	Takes exact values and is often a result of counting
Continuous variable	Takes numerical values within a certain range, and is often a result of measuring
Random variable	A variable that is subject to random variations so that it can take on multiple different values, each with an associated probability. Often takes $X$ , $Y$ etc.
Parameter	Numerical quantity that measures some aspect of the population
Statistics	Quantity calculated from data gathered from a sample
Mean	The average of a data set: $µ=\frac{\sum f}{n}$
Median	The middle value in a sorted list of numbers. Given $n$ items, $\tilde{x} = \frac {n+1}2$ th item
Mode	Most frequently occurring data
Interquartile Range IQR	The difference between the upper quartile (75 percentile) and the lower quartile (25 percentile).
Variance $\sigma ^2$	The squared deviation from the mean of a random variable. $\sigma ^2= \frac {\sum _{i=1}^{n}(x_i-µ)^2}{n}$
Standard Deviation $\sigma$	Statistical measurement that analyzes the distance of the data from the mean. This is a square root of the variance. $\sigma =\sqrt{\frac{ \sum _{i=1}^{n}(x_i-µ)^2}{n}}$