## Research concepts: Confidence interval of a proportion

Data which exist in categories that only have 2 possible values are known as binary data. “Yes” or “No” survey responses, dead or alive, male or female, etc. are examples of binary values. These data can be expressed as proportions (e.g. the proportion of male students in a class), are known as binomial variables, and follow a binomial distribution. The

## Statistics you are interested in: simple linear regression – part 3

In the first and second posts of this series, we performed simple linear regression of a continuous outcome on a single continuous predictor, but we also learned it is possible to include binary or categorical predictors in such regression models. How is this be done? The hsb2.csv dataset we have been using also contains the variable female where male participants

## Statistics you are interested in: simple linear regression – part 2

In the previous post, we performed simple linear regression of science scores on reading scores from 200 students using ordinary least squares (OLS) estimation. This was done using Python’s Statsmodels package. What does the OLS output show and how should it be interpreted? Here is the figure of the individual subject data and the line of best fit, as well