BIOSTATISTICS
NOTES ABOUT DATA:
2019 BRFSS SPSS Data File.sav These data are from the Behavioral Risk Factor Surveillance System (BRFSS). The BRFSS collects state data about U.S. residents regarding their health-related risk behaviors, chronic health conditions, and use of preventive services. Be sure to read the Background section of the 2019 BRFSS Overview for more details so you get a little better idea of the BRFSS and how the data are used. Investigators all over the country use these data to conduct research about many different characteristics and how they affect health outcomes. The data file for this project is not the complete data set. There are over 250 variables in the complete data set. I narrowed it down to the few variables I want you to use for this project and simplified coding for the sake of your sanity and to best demonstrate your learning of concepts.
INSTRUCTIONS: (Please read each question thoroughly)
You are a statistician who is tasked with helping a researcher who is interested in determining what characteristics influence a person to report poor health. Using the BRFSS, the researcher find that there are a few variables that can help her answer that question. She first asks if you can conduct some analyses to determine what characteristics predict someone reporting that they had at least one day in the last day where they reported poor physical health (PHYSHLTH_YES_NO) in the last 30 days. In addition, for those who reported at least one day of poor physical health, she is also interested in determining what influences the reported number of days of poor physical health (PHYSHLTH_DAYS). Among other characteristics, the investigator is primarily interested in determining whether veteran status (variable name: VETERAN), and adverse childhood experiences (ACES) (variable name: ACES_Score) influence these two dependent variables. Because there are a range of confounding variables to consider, the researcher also collected data about sex, health insurance, marital status, education, home ownership, income, age, smoking, alcohol use, and exercise, among others. Your job is to help the researcher answer her research questions.
1. Using the graphing options in SPSS, choose two appropriate graphical display options to describe PHYSHLTH_DAYS. You should be able to describe whether this variable is normally distributed, and whether there are outliers in the data using the two display options you choose. Copy and paste your graphs/charts below and for each, provide an interpretation of the graph, and explain why you chose that option.
The selected graphical options are the histogram with normality curve and dot plot.
The Histogram with Normality Curve
Figure 1.1 above shows a histogram with a normal distribution curve. The histogram was selected because it provides a view of the central tendency, spread, and shape of the data set, including the presence of outliers. By showing the shape of the dataset, the histogram will provide an at-a-glance view of whether or not the dataset presents a normal distribution. The dataset presents a normal distribution as evidenced by the single-peaked bell-shaped normality curve, with observations spread out symmetrically around the mean. No outliers are evident from the distribution.
Figure 1.1
The Dot Plot
Figure 2.1 above presents a dot plot. The dot plot, like the histogram, presents a view of the frequency distributions of the different data points in the dataset. However, the dot plot provides information on the frequency of individual values, and not a range of values like the histogram. The dots appear as complete bars due to the large number of values attached to each data point. Longer bars represent higher frequencies. Thus, since it focuses on individual data points, the dot plot provides a more effective way of assessing whether outliers exist in the data set than the histogram. Outliers are data points that can be termed either extremely high or extremely low as compared to the rest of the data point or the nearest data point. The dot plot shows that there are no outliers in the data set.
Figure 2.1
2. The variable PHYSHLTH_YES_NO is a categorical, binary, nominal variable (Either people report poor physical health (Yes=1), or they do not (No=0)). Based on this categorical variable, use the appropriate statistical test to determine if there is a difference in ACES_Score, and ALCOHOL between the groups who report poor physical health. You will be doing two hypothesis testsone for ACES_Score, and one for ALCOHOL. For each test, conduct a formal hypothesis test to answer this question (choose the appropriate statistical test, explain why you chose it, write out your null and alternative hypotheses, run the test, and interpret the results). Include appropriate output from SPSS to show what you did.
To test whether there is a difference in ACES score between the two groups: YES and NO, the independent samples t-test will be used. The independent samples t-test will answer this question by comparing the means of the two independent groups in regard to the ACES score to determine whether the ACES mean score for the group that reports YES (poor physical health) differs significantly from that which reports NO (good physical health). The independent samples t-test is appropriate because the data meets the following requirements: i) The dependent variable ACES score is a continuous ratio variable, ii) the independent variable PHYSHLTH_YES_NO is a categorical variable with only two categories (Yes and No), and iii) the groups or categories are independent and hence, a participant cannot be in both groups. The null and alternative hypotheses for the independent samples t-test are:
H0: ACES SCOREYES ACES SCORENO = 0 (the difference of the means is equal to 0)
H1: ACES SCOREYES ACES SCORENO ? 0 (the difference of the means is not equal to 0)
Before running the t-test it is advisable to run a comparison box plot to obtain an idea of what to expect in the test. The box plot is as presented below. If the means/variances of the two groups or categories in regard to ACES score were equal, the box plots would have equal lengths.
Figure 2.1
From the box plots in figure 2.1, it is evident that the variances for the two categories are quite different as the spread of observations for the YES category is greater than that of the NO category. This suggests that the two groups or categories differ by ACES score. The next step is to run the independent samples t-test to check whether the difference between the groups is significant. Results of the t-test are presented in tables 2.1 and 2.2 below:
Table 2.1
Group Statistics
Did you have any days in the last month when your physical health was not good?
N
Mean
Std. Deviation
Std. Error Mean
Total Adverse Childhood Experiences Score (ACES)
No
58968
1.4565
1.96211
.00808
Yes
37273
2.1046
2.40083
.01244
Table 2.2
Independent Samples Test
Levene's Test for Equality of Vriances
t-test for Equality of Means
F
Sig.
t
df
Sig. (2-tailed)
Mean Difference
Std. Error Difference
95% Confidence Interval of the Difference
Lower
Upper
Total Adverse Childhood Experiences Score (ACES)
Equal variances assumed
1954.881
.000
-45.707
96239
.000
-.64807
.01418
-.67586
-.62028
Equal variances not assumed
-43.700
67753.523
.000
-.64807
.01483
-.67714
-.61900
From the group statistics table 2.1, 58,968 participants reported good health, while 37,273 reported poor physical health. The mean ACES score for the YES (poor physical health) group is 2.10, while that of the NO group is 1.46. Table 2.2 presents the results of the t-test. The Levenes test for equality of variances yields a significant p value of p < 0.001. Hence, we reject the null of the Levenes test and conclude that the variance in ACES score for the group that reports poor physical health (YES) is significantly different from that which reports good physical health. This implies that we need to consider the Equal Variances Not Assumed row in interpreting the t-test results. The negative t-value of 43.7 indicates that the mean ACES score for the first group (NO good physical health) is lower than that of the second group (YES- poor physical health). The associated p-value (p<0.001) is less than the significance level of p =0.05, indicating that the difference in ACES score between the two groups is significant. Thus, we reject the null hypothesis and conclude that ACES scores between people who report poor physical health and those who report good physical health are statistically different.
To test whether there is a difference in alcohol use over the last 30 days between the two groups, the independent samples t-test is still appropriate. The independent sample t-test will be used to determine whether alcohol consumption for the group that reports YES (poor physical health) differs significantly from that which reports NO (good physical health). The independent samples t-test is appropriate for this question because the data meets the following requirements: i) The dependent variable ALCOHOL is a continuous ratio variable, ii) the independent variable PHYSHLTH_YES_NO is a categorical variable with only two categories (Yes and No), and iii) the groups or categories are independent and hence, a participant cannot be in both groups. The null and alternative hypotheses for the independent samples t-test are:
H0: ALCOHOLYES ALCOHOLNO = 0 (the difference of the means is equal to 0)
H1: ALCOHOLYES ALCOHOLNO ? 0 (the difference of the means is not equal to 0)
A comparison box plot on the two variables looks like as shown in figure 2.2 below:
Figure 2.2
From the box plots, the spread of observations for the two categories are almost equal as shown by the fact that the box plots are of almost equal length. This suggests that people in the two categories may not significantly differ by alcohol usage. The next step is to run the independent samples t-test to verify the results of the box plots.
Results of the t-test are presented below:
Table 2.3
Group Statistics
Did you have any days in the last month when your physical health was not good?
N
Mean
Std. Deviation
Std. Error Mean
During the last 30 days what is the largest number of alcoholic drinks you had on any occasion?
No
232486
1.75
2.930
.006
Yes
145011
1.53
2.989
.008
Table 2.4.
Independent Samples Test
Levene's Test for Equality of Variances
t-test for Equality of Means
F
Sig.
t
df
Sig. (2-tailed)
Mean Difference
Std. Error Difference
95% Confidence Interval of the Difference
Lower
Upper
During the last 30 days what is the largest number of alcoholic drinks you had on any occasion?
Equal variances assumed
28.308
.000
21.439
377495
.000
.212
.010
.192
.231
Equal variances not assumed
21.340
303023.157
.000
.212
.010
.192
.231
From the group statistics table 2.3, 32,486 reported good health, while 145,011 reported poor physical health. The mean number of alcoholic drinks score for the YES (poor physical health) group is 1.53, while the mean score for the NO group is 1.475. Table 2.4 presents the results of the t-test. The Levenes test for equality of variances yields a significant p value of p < 0.001 and hence we reject the null of the Levenes test and conclude that the variance in the number of alcoholic drinks consumed for the group that reports poor physical health (YES) is significantly different from that which reports good physical health. This implies that we need to consider the Equal Variances Not Assumed row in interpreting the t-test results. The positive t-value of 21.34 indicates that the mean number of alcoholic drinks for the first group (NO good physical health) is higher than that of the second group (YES- poor physical health). The associated p-value (p<0.001) is less than the significance level of p =0.05, indicating that the difference in the number of alcoholic drinks between the two groups is significant. Thus, we reject the null hypothesis and conclude that the number of alcoholic drinks consumed by people who report poor physical health and those who report good physical health are statistically different.
3. The variable PHYSHLTH_YES_NO is a categorical, binary, nominal variable. Either people report poor physical health (Yes=1), or they do not (No=1). Based on this categorical variable, use the appropriate statistical test to determine if there is an association between PHYSHLTH_YES_NO and VETERAN. Conduct a formal hypothesis test to answer this question (choose the appropriate statistical test, explain why you chose it, write out your null and alternative hypotheses, run the test, and interpret the results). Include appropriate output from SPSS to show what you did. In addition, answer the following question: Is there a difference in the rate of reporting poor physical health between people who are veterans and who are not? How do you know?
ANSWER...
…in this case PHYSHLTH_DAYS, is a continuous variable, and the independent variables are either continuous or categoricalTable 6.1
Model Summary
Model
R
R Square
Adjusted R Square
Std. Error of the Estimate
1
.394a
.155
.154
10.646
a. Predictors: (Constant), Body Mass Index, Own or Rent Home (Housing Status), Current employment status, Respondent Sex, Urban or Rural Location, During the past month, other than your regular job, did you participate in any physical exercise?, Do you have any type of health insurance?, Marital Status, Total Adverse Childhood Experiences Score (ACES), During the last 30 days what is the largest number of alcoholic drinks you had on any occasion?, Education, How often do you smoke cigarettes?, Are you a veteran?, Is your annual household income from all sources:, Age categories
From the R square values presented in the model summary (Table 6.1) above, the independent variables in the model account for 15.5 percent of the variations in the dependent variable PHYSHLTH_DAYS. The correlation value of 0.394 indicates a relatively weak correlation betwee the dependent variable and the independent variables.
Table 6.2
ANOVAa
Model
Sum of Squares
df
Mean Square
F
Sig.
1
Regression
309155.086
15
20610.339
181.839
.000b
Residual
1682820.827
14847
113.344
Total
1991975.913
14862
a. Dependent Variable: If you had any days in the last month when your physical health was not good, how many days was it not good?
b. Predictors: (Constant), Body Mass Index, Own or Rent Home (Housing Status), Current employment status, Respondent Sex, Urban or Rural Location, During the past month, other than your regular job, did you participate in any physical exercise?, Do you have any type of health insurance?, Marital Status, Total Adverse Childhood Experiences Score (ACES), During the last 30 days what is the largest number of alcoholic drinks you had on any occasion?, Education, How often do you smoke cigarettes?, Are you a veteran?, Is your annual household income from all sources:, Age categories
The ANOVA table (Table 6.2) reports how well the regression equation predicts the dependent variable PHYSHLTH_DAYS. The regression model yields a statistical significance of p < 0.001, which is less than 0.05, suggesting that the model is a good fit and significantly predicts the dependent variable PHYSHLTH_DAYS.
Table 6.3
Results of the linear regression are presented in table 6.3 below. Generally, ones sex, marital status, education level, employment status, household income level, smoking status, alcohol consumption, frequency of physical exercise, ACES score, age, and BMI significantly predict their self-reported number of days of poor physical health, with p-values of p < 0.05. At the same time, ones health insurance status, veteran status, house ownership status, and living in an urban and rural area do not significantly influence the self-reported number of days of poor physical health. The specific interpretation of each independent variable is presented below the table:
Coefficientsa
Model
Unstandardized Coefficients
Standardized Coefficients
t
Sig.
B
Std. Error
Beta
1
(Constant)
15.185
.719
21.133
.000
Respondent Sex
-1.169
.201
-.050
-5.810
.000
Do you have any type of health insurance?
.441
.327
.011
1.349
.177
Marital Status
.516
.135
.032
3.822
.000
Education
-.452
.102
-.037
-4.413
.000
Own or Rent Home (Housing Status)
.181
.213
.007
.849
.396
Are you a veteran?
-.057
.258
-.002
-.221
.825
Current employment status
-4.082
.223
-.169
-18.335
.000
Is your annual household income from all sources:
-1.318
.096
-.131
-13.762
.000
How often do you smoke cigarettes?
.876
.196
.037
4.468
.000
During the last 30 days what is the largest number of alcoholic drinks you had on any occasion?
-.161
.026
-.049
-6.101
.000
During the past month, other than your regular job, did you participate in any physical exercise?
-4.197
.185
-.177
-22.690
.000
Total Adverse Childhood Experiences Score (ACES)
.284
.036
.064
7.788
.000
Urban or Rural Location
-.235
.234
-.008
-1.005
.315
Age categories
.495
.088
.059
5.643
.000
Body Mass Index
.048
.012
.030
3.898
.000
a. Dependent Variable: If you had any days in the last month when your physical health was not good, how many days was it not good?
Sex: sex is a significant predictor of self-reported number of days of poor physical health (p<0.001). Holding all other variables constant, males report 1.2 more days of poor physical health than females.
Insurance Status: there is no evidence that insurance status predicts self-reported number of days of poor physical health (p = 0.177).
Marital Status: marital status is a significant predictor of self-reported number of days of poor physical health (p<0.001). Married individuals report 0.52 less days of poor physical health than their divorced, widowed, and separated counterparts.
Education: ones education level is a significant predictor of self-reports of self-reported number of days of poor physical health (p<0.001). Holding other variables constant, a unit increase in the level of education reduces the self-reported days of poor physical health by 0.452 units or 45.2 percent.
Home Ownership Status: there is no evidence that home ownership predicts self-reported number of days of poor physical health (p = 0.396).
Veteran Status: there is no evidence that veteran status predicts self-reported number of days of poor physical health (p = 0.825).
Employment Status: employment status is a significant predictor of self-reported number of days of poor physical health (p<0.001). Ceteris paribus, people employed for wages report four times less the number of days of poor physical health as compared to their unemployed counterparts
Household Income: household income is a significant predictor of self-reported number of days of poor physical health (p<0.001). Ceteris paribus, a unit increase in the level of household income reduces 1.3 times the self-reported number of days of poor physical health.
Smoking: smoking behavior is a significant predictor of self-reported number of days of poor physical health (p<0.001). Ceteris paribus, a unit increase in the frequency of smoking increases 0.876 times the self-reported number of days of poor physical health.
Alcohol Use: alcohol consumption is a significant predictor of self-reported number of days of poor physical health (p<0.001). Ceteris paribus, a unit increase in the number of days of poor physical health reduces 0.16 times or by 16 percent the number of alcoholic drinks one consumes.
Exercise: physical exercise is a significant predictor of self-reported number of days of poor physical health (p<0.001). A unit increase in the frequency of exercising reduces 4.2 times the self-reported number of days of poor physical health.
ACES Score: ACES score is a significant predictor of self-reported number of days of poor physical health (p<0.001). A unit increase in the ACES score increases 0.284 times or by 28.4 percent the self-reported number of days of poor physical health.
Living in Urban/Rural Area: There is no evidence that living in an urban or rural area predicts the number of days of poor physical health (p = 0.315)
Age: Age is a significant predictor of self-reports of number of days of poor physical health (p<0.001). Ceteris paribus, a unit increase in age increases…
care in the situation of Mr. and Mrs. P would be holistic in nature, grounded in a philosophy of caring. There are serious existential issues at stake, as Mr. P has wondered why God has not "taken him" already, while Mrs. P may be suffering from depression given her inability to leave the house or handle the life affairs like paying the bills. Therefore, a recommended treatment plan would
Antipsychotic Medication and the Physical Health Problems of the Patient With Mental Illness More and more attention is now being given to the mental disorders especially in U.S. And due to this increase in attention an increase has also been noticed in the treatment of these mental health issues (Zuvekas, 2005). About 30% of the total U.S. population that is between the ages of 18-52 is being affected by mental health
Psychological Health Unlike physical health, psychological health cannot be measured by a machine or screened for via a blood test. However, psychological health is absolutely critical to the well-being of patients and cannot be disregarded by a responsible health practitioner. Mental health is composed of emotional health (compassion for others and an understanding that the self is not the center of the universe and a balanced perspective on the world); mental
Temperament Correlate to Physical health and Longevity? This research project explores if an individual's temperament is correlated with longevity or physical health. Temperament, "…refers to those aspects of an individual's personality…" (Kagan, 2005). The traits of temperament are considered more innate, and less learned. If a specific temperament could be labeled as associated with increased physical health or longevity, perhaps researchers could aid individuals who become ill earlier. This research
Spirituality The Effects of Spirituality: The Mental and Physical Aspects of Spirituality for the Elderly The relationship of spirituality to aging is not a new idea. Society, as a whole, has been thinking about spirituality and its effects on aging since the first recording of religious history. The relationship is examined by scientists, medical doctors, spiritual healers, and individuals alike. The effects of spirituality on the elderly population have been shown to be
Our semester plans gives you unlimited, unrestricted access to our entire library of resources —writing tools, guides, example essays, tutorials, class notes, and more.
Get Started Now