Test for differences by outcome level

Post Body

Hi. I've been thinking about this for way too long and have reached R's equivalent of writer's block. I have a dataset of continuous, ordinal and categorical variables with an ordinal outcome that has four levels, and the outcome of interest is rare (5%). I want to subset the data by outcome level, then take samples of one predictor from each subset and compare them. For example, let's say I sample 100 observations of a continuous variable at outcome levels 1 and 2, so I'll have two vectors from the same variable with different outcomes. I now want to compare those vectors with each other to see if their distributions vary by outcome. Am I causing myself trouble by doing this with a t-test? Does bootstrapping this process and using the mean p-value across all bootstrap replicates affect my type I error rate?
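For concreteness, here is a minimal sketch of the subset-then-sample step described above, written in Python with only the standard library rather than R. The column names `x` and `outcome`, the helper names, and the simulated data are all made up for illustration. It computes Welch's t statistic only (no p-value), as one possible form of the t-test mentioned in the post.

```python
import random
import statistics
from math import sqrt

def sample_by_outcome(rows, predictor, outcome, level, n, rng):
    """Draw n values of `predictor`, without replacement, from rows at one outcome level."""
    values = [r[predictor] for r in rows if r[outcome] == level]
    return rng.sample(values, n)

def welch_t(a, b):
    """Welch's t statistic for two independent samples (variances need not be equal)."""
    se = sqrt(statistics.variance(a) / len(a) + statistics.variance(b) / len(b))
    return (statistics.mean(a) - statistics.mean(b)) / se

# Simulated stand-in data: 500 rows at each of outcome levels 1 and 2 (hypothetical).
rng = random.Random(0)
rows = [{"outcome": level, "x": rng.gauss(mu, 1.0)}
        for level, mu in ((1, 0.0), (2, 0.5))
        for _ in range(500)]

a = sample_by_outcome(rows, "x", "outcome", 1, 100, rng)  # 100 values at level 1
b = sample_by_outcome(rows, "x", "outcome", 2, 100, rng)  # 100 values at level 2
t_stat = welch_t(a, b)
```

Repeating the two `sample_by_outcome` calls inside a loop would give one statistic per resample, which is the repeated-sampling setup the question asks about.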

I want to do the same thing with the categorical variables as well, comparing counts of each factor's levels between subsets. Will Fisher's exact test work on both the nominal and ordinal data? If not, what test should I read up on? Would bootstrapping here cause issues?
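As an illustration of the count comparison described above, the following sketch cross-tabulates a factor against the outcome and computes a two-sided Fisher's exact p-value for the 2x2 case only. It is Python with the standard library; the column names `group` and `outcome`, the function names, and the toy counts are assumptions for the example. Note that the test works purely on counts, so it treats the levels as unordered labels.

```python
from collections import Counter
from math import comb

def crosstab(rows, factor, outcome):
    """Count rows for each (outcome level, factor level) pair."""
    return Counter((r[outcome], r[factor]) for r in rows)

def fisher_exact_2x2(a, b, c, d):
    """Two-sided Fisher's exact p-value for the table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)

    def prob(k):
        # Hypergeometric probability of k in the top-left cell, margins fixed.
        return comb(row1, k) * comb(row2, col1 - k) / total

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum over all tables that are no more probable than the observed one
    # (small relative tolerance guards against floating-point ties).
    return sum(p for p in (prob(k) for k in range(lo, hi + 1))
               if p <= p_obs * (1 + 1e-7))

# Toy data (hypothetical): a two-level factor at outcome levels 1 and 2.
rows = ([{"outcome": 1, "group": "a"}] * 3 + [{"outcome": 1, "group": "b"}] * 1 +
        [{"outcome": 2, "group": "a"}] * 1 + [{"outcome": 2, "group": "b"}] * 3)
table = crosstab(rows, "group", "outcome")
p_value = fisher_exact_2x2(table[(1, "a")], table[(1, "b")],
                           table[(2, "a")], table[(2, "b")])
```

Factors with more than two levels need the general r x c version of the test, which this sketch does not cover.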

I think the samples could be considered independent because they come from people, and each person appears only once in the dataset. My thought with the bootstrapping is to create a new dataset with equal counts of each outcome level for easier model training, then use the original data as my test set. Kind of like over- and undersampling at the same time, I guess. I'm curious to hear what you think. Thank you!
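The balanced resampling idea above could be sketched roughly as follows, again in Python with only the standard library. The function name `balanced_resample`, the `id` and `outcome` fields, and the level counts (a rare level at 5% of 1,000 rows) are made up for illustration.

```python
import random
from collections import Counter, defaultdict

def balanced_resample(rows, outcome, n_per_level, rng):
    """Draw n_per_level rows with replacement from each outcome level."""
    by_level = defaultdict(list)
    for r in rows:
        by_level[r[outcome]].append(r)
    balanced = []
    for level in sorted(by_level):
        # Sampling with replacement oversamples small levels and
        # undersamples large ones in a single step.
        balanced.extend(rng.choices(by_level[level], k=n_per_level))
    rng.shuffle(balanced)
    return balanced

# Hypothetical data: four outcome levels, level 1 is the rare one (50 of 1,000).
rng = random.Random(42)
counts = {1: 50, 2: 300, 3: 350, 4: 300}
rows = [{"id": f"{level}-{i}", "outcome": level}
        for level, n in counts.items()
        for i in range(n)]

train = balanced_resample(rows, "outcome", 200, rng)
level_counts = Counter(r["outcome"] for r in train)
```

In this setup every row in `train` is a copy of a row in `rows`, so if the original data serves as the test set, the training and test rows overlap, and the 200 draws from the rare level necessarily repeat its 50 rows.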

Author

PhoenixRising256

Subreddit

r/AskStatistics

Post Details

Posted: 2 years ago