Marital Status of Women

Independent Project

Frequency distribution of a variable and bar graph of the same variable

The frequency table created from the Independent Project Data shows the marital status of the women in our sample. The nominal variable is “marital status,” and its five categories are widowed, separated, never married, married, and divorced. Of the 972 women in this sample, 63% have never been married, 15.6% are divorced, 12% are separated, 6% are married, and 2.6% are widowed. The most frequent response (the mode) is “Never married,” which occurs 613 times, a relative frequency of 63%.

The least frequent response is “Widowed,” which occurs 25 times, a relative frequency of 2.6%. The bar graph shows the same pattern visually: “Never married” is the most frequent response and “Widowed” is the least frequent response.
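As a quick check on the relative frequencies above, here is a minimal sketch that recomputes them from the two counts given in the text (613 and 25) and the stated sample size of 972. The function name `relative_frequency` is my own and is only for illustration.

```python
# Counts reported in the text; n is the stated sample size.
n = 972
counts = {"Never married": 613, "Widowed": 25}


def relative_frequency(count, total):
    """Return the share of the sample in one category, as a percentage."""
    return 100 * count / total


# Round to one decimal place, the same precision used for "2.6%" in the text.
percentages = {
    label: round(relative_frequency(count, n), 1)
    for label, count in counts.items()
}
print(percentages)
```

The rounded values agree with the 63% and 2.6% reported in the frequency table.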

Descriptives of a continuous variable: mean, median, skewness, kurtosis, standard deviation and graph of that variable

The continuous variable that I will be using for this question is age.

From our sample of n = 972 subjects, the mean age is 36.6 years, the median is 37 years, and the mode is 41 years. The median tells us that when all the values are lined up in order from least to greatest, the middle value is 37. The mode tells us that 41 is the age that occurs most often among subjects. The mean tells us that if all the values in the sample are added and then divided by 972, the result is 36.624486. The median (37) is greater than the mean (36.6), which means that the distribution is slightly negatively skewed. The mode is greater than both the mean and the median; this also indicates that the histogram is negatively skewed, which is supported by the skewness value of -0.36.

The distribution is not perfectly normal, since the mean, median, and mode do not share the same value. A negative skew means that most of the values are concentrated on the right side of the distribution while the left tail is longer. The distribution can be described as platykurtic because the kurtosis value is less than zero (-0.395). The lower quartile is 33, meaning that 25% of values are at or below the age of 33. The upper quartile is 41, meaning that 75% of values are at or below the age of 41. The standard deviation of 6.28 means that ages typically deviate from the mean by about 6.28 years. The distribution is unimodal because there is only one peak.
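To make the descriptive measures above concrete, the following is a rough sketch of how they can be computed. The `ages` list is hypothetical and is not the project data, and `describe` is a helper name of my own. It uses the simple moment-based formulas for skewness and excess kurtosis; statistical software often applies a small-sample correction, so its output may differ slightly.

```python
import statistics


def describe(values):
    """Return basic descriptives using moment-based skewness and kurtosis."""
    n = len(values)
    mean = statistics.mean(values)
    # Central moments (divide by n, no small-sample correction).
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return {
        "mean": mean,
        "median": statistics.median(values),
        "mode": statistics.mode(values),
        "sd": statistics.stdev(values),      # sample standard deviation
        "skewness": m3 / m2 ** 1.5,          # negative: longer left tail
        "excess_kurtosis": m4 / m2 ** 2 - 3,  # negative: platykurtic
    }


# Hypothetical ages chosen so that mean < median < mode, as in the essay.
ages = [29, 33, 35, 37, 38, 41, 41]
summary = describe(ages)
for name, value in summary.items():
    print(name, round(value, 3))
```

In this made-up sample the mean is below the median, which is below the mode, and the skewness comes out negative, which is the same pattern described for the age variable.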

The chi-square test results are statistically significant because the p-value of 0.0121 is less than 0.05. In addition, the chi-square value of 6.30 is greater than the critical value of 3.84, which also indicates statistical significance. Based on these results, we can reject the null hypothesis and accept the alternative hypothesis that there is a relationship between poverty level and smoking status. The data also show that 50.78% of the subjects are non-smokers regardless of their poverty status. Conversely, 49.22% of the subjects are smokers regardless of their poverty level. There are 91 subjects who are both above the poverty line and smokers; this amounts to 9.41% of subjects. In the smokers group, which consists of 476 subjects, only 19.12% are above the poverty line. Among subjects below the poverty line, a little more than half (51.4%) are smokers.
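The link between the chi-square value and the p-value can be checked with a short sketch. It assumes the test has 1 degree of freedom (a 2 x 2 table of smoking status by poverty level), which is what the critical value of 3.84 implies. The function name `chi_square_p_value_df1` is my own.

```python
import math


def chi_square_p_value_df1(statistic):
    """Upper-tail p-value of a chi-square statistic with 1 degree of freedom.

    With 1 degree of freedom the statistic is a squared standard normal
    value, so the tail area can be written with the complementary error
    function.
    """
    return math.erfc(math.sqrt(statistic / 2))


chi_square = 6.30      # reported in the text
critical_value = 3.84  # reported in the text (alpha = 0.05, assumed df = 1)
alpha = 0.05

p_value = chi_square_p_value_df1(chi_square)
reject_null = chi_square > critical_value and p_value < alpha
print(round(p_value, 4), reject_null)
```

Both checks point the same way: the statistic is above the critical value and the p-value is below 0.05, so the null hypothesis is rejected.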

Comparison of the effect of three or more groups (single variable) on a single continuous variable

The independent variable is marital status, a nominal variable with five groups (divorced, married, never married, separated, and widowed). The dependent variable is mental health status, which is treated as a continuous, interval/ratio-level variable.

Null Hypothesis: There is no difference in mean mental health status among the five marital status groups.

Alternative Hypothesis: There is a difference in mean mental health status among the five marital status groups; at least one group mean differs from the others.

Explanation: Since the p-value, 0.1784, is greater than 0.05, the results are not statistically significant and we fail to reject the null hypothesis. In addition, the F-statistic of 1.5765918 is below the critical value of 7.71, which also indicates that the results are not statistically significant. Therefore, we do not have evidence that the subjects' marital status is related to their mental health status.
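For readers who want to see where an F-statistic comes from, here is a minimal sketch of a one-way ANOVA calculation together with the decision rule used above. The three small groups are hypothetical and are not the project data, and the names `one_way_anova_f` and `decision` are my own.

```python
def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    group_means = [sum(g) / len(g) for g in groups]

    # Variation of the group means around the grand mean.
    ss_between = sum(
        len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means)
    )
    # Variation of individual scores around their own group mean.
    ss_within = sum(
        (x - m) ** 2 for g, m in zip(groups, group_means) for x in g
    )

    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


def decision(p_value, alpha=0.05):
    """Reject the null hypothesis only when the p-value is below alpha."""
    return "reject H0" if p_value < alpha else "fail to reject H0"


# Hypothetical scores for three groups, for illustration only.
example_groups = [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
f_stat, df_b, df_w = one_way_anova_f(example_groups)
print(f_stat, df_b, df_w)

# Decision for the p-value reported in the text.
print(decision(0.1784))
```

Applied to the reported p-value of 0.1784, the rule gives "fail to reject H0", which matches the conclusion for the marital status groups.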

Scatterplot of two continuous variables

The scatterplot represents two continuous variables among subjects: BMI values (x-axis) and Physical Health Score (y-axis). The independent variable is Body Mass Index (BMI) and the dependent variable is Physical Health Score. The graph appears to show a very weak relationship, or almost none, between BMI and Physical Health Score. Because the relationship is so weak, it is difficult to tell from the plot alone whether it is positive or negative. However, many of the points on the scatterplot appear to be clustered in the top left area (circled in red), where the Physical Health Score values are between 45 and 60 on the y-axis and the BMI values are between 15 and 35 on the x-axis.

Correlation between the two continuous variables from #5 above

Null Hypothesis: The correlation coefficient “R” is equal to “0,” which means that there is no linear relationship between the two variables: Physical Health Score and Body Mass Index (BMI).

Alternative Hypothesis: The correlation coefficient “R” is not equal to “0,” which means that there is a linear relationship between the two variables: Physical Health Score and Body Mass Index (BMI).

The R (correlation coefficient) is -0.125, and the negative sign indicates an inverse relationship between the two variables. The strength of the relationship is given by the absolute value of R, which is 0.125. A value this close to zero indicates a very weak relationship between the two variables. Furthermore, the R-squared value is 0.0154 (1.5%), meaning that BMI explains only about 1.5% of the variation in Physical Health Score.
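The relationship between R and R-squared can be illustrated with a short sketch. The `bmi` and `score` lists are hypothetical values, not the project data, and `pearson_r` is a helper name of my own. Squaring the rounded R of -0.125 gives 0.015625, slightly above the reported 0.0154, which would be consistent with R having been rounded to three decimals.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# R reported in the text; R-squared is the share of variance explained.
r_reported = -0.125
r_squared = r_reported ** 2
print(r_squared)

# Hypothetical BMI values and physical health scores, for illustration only.
bmi = [18, 22, 27, 31, 36]
score = [55, 52, 54, 49, 50]
r_example = pearson_r(bmi, score)
print(round(r_example, 3), round(r_example ** 2, 3))
```

The sign of R gives the direction of the relationship, while R-squared ignores the sign and reports only how much of the variation is shared.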