Learn fundamental concepts in data analysis and statistical inference, focusing on one and two independent samples.

Loading...

From the course by Johns Hopkins University

Mathematical Biostatistics Boot Camp 2

48 ratings

Learn fundamental concepts in data analysis and statistical inference, focusing on one and two independent samples.

From the lesson

Hypothesis Testing

In this module, you'll get an introduction to hypothesis testing, a core concept in statistics. We'll cover hypothesis testing for basic one and two group settings as well as power. After you've watched the videos and tried the homework, take a stab at the quiz.

- Brian Caffo, PhDProfessor, Biostatistics

Bloomberg School of Public Health

Hi, my name is Brian Caffo, and this is mathematical biostatistics boot

camp lecture three, where we're going to talk about two sample tasks.

Okay, so today were going to first talk about matched

data, then were going to talk And matched data brings up

a kind of an interesting immediate discussion on the topic

of regression to the mean, which has a very famous history.

And then we're going to talk about two independent groups.

So, when you're comparing two groups, one of the first things you

want to ascertain is whether or not the data are paired or not.

And so it's, it's a little less.

A little less clear than you might think at

first so, so the most obvious instance where data

are paired, is when the observations are both on

the same subject so take as an example a

trial where you have a treatment, say some medication.

in one case imagine if you randomize that treatment to one group of

treatment and randomize say a placebo to the other, to another group of people.

And that's clearly not paired.

You have one group of people that received the treatment

and another group that received the other treat, the placebo.

Instance where its obviously paired would be, imagine if you, in some random order

for every person, gave them the treatment and had maybe a washout period and

then gave them the placebo where for some people you gave them the placebo first.

And then had a wash out period and then gave them the treatment.

In that case its obviously paired because you had

the same person recieving both the treatment and the control.

So, in both those cases the question would be to compare the

treatment versus the placebo but in one case you would have two

independent groups of separate people and in the other case you would have.

each person receiving both, so in the, in the, in the latter case the

would be paired and in some cases pairing is maybe a little less obvious, so

for example what might happen is you're

not looking at something that's assigned and

let's say you want to compare, let's say, people with high blood pressure to people

with low blood pressure.

[SOUND]

And, You, you wanted to compare, some other characteristic.

let's say, there, We,

[INAUDIBLE]

let's see. We used sleep as an example the other day.

So imagine if you wanted to, to compare

the low blood pressure people to the high blood.

[NOISE]

Pressure people and compare their RDI, or their respiratory disturbance index.

well that seems like it would be unpaired

because you would have the low blood pressure people

would be one group of people and the high

blood pressure people would be another group of people.

but they could be paired because often what

people do in these kind of experiments is they

get say their low blood pressure people, and

then for every person in that, in that group,

They then try to match in terms of age and

gender and other characteristics, try to closely match for every person.

Another person in the high blood pressure group.

In that case it would be sort of softly-paired in that,

that they would've, wouldn't be the exact same person of course 'cause

that person that, that, they won't give an instance, can't have

low blood pressure and high blood pressure True at the same time

but what you could have is where a, a

person one from the low blood person group is very

closely matched in terms of age and other characteristics to

and weight or whatever to the high blood pressure person.

And then it's, it's they are paired, they're just paired in a different way.

so that that's called matching.

When you, when you try to take for every person in one group

find another person in another group that has very similar

characteristics for all these other variables that you're maybe not

interested in but you think could contaminate the comparison, that

process is called matching and its a very effective way.

To to deal with a variable as I mentioned

variables you think might confound the relationship that you want one to look at.

So in any case the, the, the discussion in this slide for today is

to simply you know, to distinguish between

two different types of two group comparisons.

The two main types of two group comparisons.

One incidence is where observations are

paired, either by matching or something else.

OK, so lets consider the instance where the observations are paired and this is

the easy case because then everything reduces

down to basically a one sample problem.

So, when the observations are paired, one strategy is

to take the difference between the paired observations and

do a one-sample t-test of the null hypothesis that

the population mean difference is zero versus the alternative,

that the pop, population mean difference is non-zero

or, or one of the other two alternatives.

and then that would be a, very straightforward thing to do.

you're test statistic would just be the ordinary

one sample test statistic, the average of the distances.

the, minus the hypothesized mean, you, typically 0.

divided by the standard deviation of the differences.

Divided by the square root of the number of pairs of observations.

So, so you want to make sure that your calc, you put in the number of pairs of

observations in there, not the number of total number of observations.

Because when you start out you end up with 2n observations.

one for the first measurement, and the other

for the second measurement, so when you subtract them

you wind up with a half the

number of observations that you typically started with.

so at any rate, the, the, the.

The principal way to, to do this is to

simply take the differences, throw out the data, and

treat the differences as if they were a one-sample

test, and that's the, the ordinary paired two-group t-test.

Coursera provides universal access to the world’s best education,
partnering with top universities and organizations to offer courses online.