Student Reader


In statistics, we try to draw inferences about a population based on a sample, a technique called statistical inference.

The inference is valid if the sample is random, and the bigger the sample the more reliable the inference. Any measure computes on a population is called a population parameter or simply a parameter, such as population size. Any descriptive measure based on a sample is called a sample statistic or simply a statistic. Subjects are the entities whose characteristics are being studied in a particular experiment.

Collection of data elements equals data set. May be of population or sample.

Whether it is one or the other must be conveyed by the data analyst as otherwise it cannot be determined. Variability is the characteristic of dispersion. There are several measures of variability -- population range, either like two to eighteen or sixteen'' population variance, the sigma squared, population variance is based on deviations from the mean. Data elements aka data points are representations of a particular characteristic or variable such as age in Yeats or weight in kilograms. Data elements are measured as numbers of subjects or units of measurement.

Term Overview
Population Size
The number of subjects in a population.
Sample Size
The number of subjects in a sample.
The capital sigma Σ denotes summation.
X Place holder for the measurement under consideration. X1 where x is the variable name and the subscript refers to a particular data element.
Dispersion Dispersion can be measured with the mean absolute deviations (from the mean), but this is less popular than the sample standard deviation. Another method for measuring dispersion in a data set is the range.
Variable Variables can be defined as continuous or discrete; and a discrete variable is either ordinal or nominal (categorical).
Variable Overview
Continuous Any value between the minimum and maximum on a particular measurement scale. Variables like age height weight. The standard data summary consists of the sample size (n), sample mean (x) and sample standard deviation (s); and commonly also the median and quartiles.
Discrete Limited number of values or categories.
Ordinal Limited number of values or categories, but they are ordered... such as symptoms of pain being good fair or bad, of of performance, etc.
Categorical Limited number of values or categories and not ordered, such as gender or voting preference.

Numerical Methods: Distribution


n biostatistics, statisticians apply their techniques to health-related fields. As in other fields, statistics helps health professional formulate research strategies, organize research data, understand the data, draw conclusions and ascertain how confident they may be in these conclusions. This makes biostatistics an integral part of health care from research and development, to making decisions about implementation based on risk knowledge that has been drawn from inferences.