加载中...
加载中...
Assessment for Unit 9: Inference for Quantitative Data: Slopes
Select the one best answer for each question.
1. A statistician is performing a linear regression analysis to predict the price of used cars based on their age. After calculating the least-squares regression line, the statistician constructs a residual plot. The plot displays a distinct fan shape, where the residuals are tightly clustered around zero for newer cars but become increasingly spread out for older cars. Based on this residual plot, which condition for inference for the slope of the regression line has most likely been violated?
2. A marine biologist is investigating whether there is a positive linear relationship between the water temperature (in degrees Celsius) and the growth rate of a specific coral species (in cm per year). She collects data from 30 randomly selected sites. Let $\beta$ represent the true slope of the population regression line relating water temperature to growth rate. Which of the following pairs of hypotheses should the biologist test?
3. A study was conducted to determine the relationship between the number of hours studied and the score on a final exam for a sample of 22 students. The computer output for the least-squares regression analysis is shown below. | Predictor | Coef | SE Coef | T-Value | P-Value | | :--- | :--- | :--- | :--- | :--- | | Constant | 55.32 | 4.12 | 13.43 | 0.000 | | Hours | 2.48 | 0.65 | 3.82 | 0.001 | Which of the following is the correct expression for a 95% confidence interval for the slope of the population regression line? (Assume the critical value $t^*$ for 20 degrees of freedom is 2.086 and for 21 degrees of freedom is 2.080.)
4. In a linear regression analysis predicting monthly heating costs from the average outdoor temperature, the standard error of the slope ($SE_b$) was calculated to be 0.15. Which of the following is the best interpretation of this value?
5. A researcher performs a t-test for the slope of a regression line to predict reaction time ($y$) from sleep duration ($x$). The regression equation is $\hat{y} = 1.2 - 0.08x$. The t-statistic for the slope is calculated to be $t = -2.15$ based on a sample of size $n=20$. Using a significance level of $\alpha = 0.05$, which of the following is the correct conclusion regarding the null hypothesis $H_0: \beta = 0$ against the two-sided alternative $H_a: \beta \neq 0$?
Refer to the figure below.
6. A statistics student performed a least-squares regression analysis to predict the price of used cars (in dollars) based on their age (in years). The student generated a residual plot from the data, which is described in the image cue below. Based on this residual plot, which condition for inference for the slope of the regression model is most clearly violated?
7. A marine biologist wants to estimate the relationship between the length of a specific species of fish (in cm) and its weight (in grams). A random sample of 18 fish is selected, and a linear regression analysis is performed. The computer output is shown below. | Predictor | Coef | SE Coef | T | P | | :--- | :--- | :--- | :--- | :--- | | Constant | -15.2 | 4.1 | -3.71 | 0.002 | | Length | 3.4 | 0.6 | 5.67 | 0.000 | Which of the following represents the margin of error for a 95% confidence interval for the slope of the population regression line?
8. Data were collected from a random sample of 25 students to investigate the relationship between the number of hours spent studying for a final exam (x) and the score on the exam (y). The slope of the least-squares regression line was calculated to be b = 4.2 with a standard error of the slope SE_b = 1.1. Which of the following is the correct 99% confidence interval for the slope of the population regression line?
9. A 95% confidence interval for the slope of the regression line relating the amount of fertilizer applied (in kg) to the yield of corn (in bushels) is calculated to be (1.2, 2.8). Which of the following is the best interpretation of this interval?
10. A marine biologist is studying the relationship between the length (in centimeters) and the weight (in kilograms) of a specific species of fish. A random sample of 30 fish is selected, and a least-squares regression analysis is performed on the data. The computer output for the regression analysis is shown below. Predictor | Coef | SE Coef | T | P --- | --- | --- | --- | --- Constant | -0.54 | 0.23 | -2.35 | 0.026 Length | 0.42 | 0.08 | 5.25 | 0.000 Which of the following is the correct 95% confidence interval for the slope of the population regression line assuming all conditions for inference are met? (Use t* = 2.048)
11. A statistics student computed a 90 percent confidence interval for the slope of the least-squares regression line predicting final exam score from hours of study. The resulting interval was (1.5, 4.5). Based on this interval, which of the following claims is supported?
12. Researchers are investigating the relationship between tree density (trees per acre) and soil nutrient content. They construct a 95 percent confidence interval for the slope of the regression line based on a sample of 20 plots. If the researchers want to decrease the width of this confidence interval in a future study, which of the following actions would be most effective?
13. A linear regression analysis was performed to predict the sale price of homes based on their square footage. A 95 percent confidence interval for the slope of the regression line was calculated to be (120, 160). Which of the following is the correct interpretation of the 95 percent confidence level in this context?
14. A real estate agent wants to determine if there is a positive linear relationship between the square footage of a house (explanatory variable) and its selling price (response variable) in a specific city. The agent collects a random sample of 50 recent home sales and performs a linear regression analysis. Which of the following pairs of hypotheses is appropriate for the test?
15. A statistician is checking the conditions for inference for a regression of monthly heating costs on average daily temperature. The scatterplot of the data appears roughly linear. However, a plot of the residuals versus the average daily temperature shows that the points are tightly clustered around the horizontal axis for low temperatures but become increasingly spread out as the temperature increases (a 'fan' or 'cone' shape). Which condition for inference has likely been violated?
16. Researchers performed a linear regression to predict the weight of a fish based on its length. To verify the conditions for inference, they generated several graphs. Which of the following graphs is necessary to check the condition that, for any fixed length, the weights of the fish are normally distributed?
17. A researcher is conducting a t-test for the slope of a regression line relating the amount of fertilizer used (x) to the yield of corn (y). The computer output for the regression analysis is examined. A residual plot of residuals versus fitted values displays a clear U-shaped pattern. Based on this plot, which of the following conclusions is correct regarding the validity of the t-test?
18. A marine biologist is studying the relationship between the length of a certain species of fish (in centimeters) and its weight (in grams). A random sample of 20 fish is selected, and a linear regression analysis is performed on the data. The computer output for the regression analysis is shown below. | Predictor | Coef | SE Coef | T | P | | :--- | :--- | :--- | :--- | :--- | | Constant | -15.4 | 4.2 | -3.67 | 0.002 | | Length | 24.8 | 6.2 | ? | ? | Which of the following is the value of the test statistic for the hypothesis test $H_0: \beta = 0$ versus $H_a: \beta \neq 0$, where $\beta$ represents the slope of the population regression line?
19. A researcher is investigating the relationship between the number of hours spent studying for a final exam and the score on that exam. A regression analysis on data from a random sample of students produces a test statistic of $t = 2.15$ and a p-value of $0.042$ for the slope of the regression line. Which of the following is the correct interpretation of this p-value?
20. An economist wants to test if there is a positive linear relationship between a country's GDP per capita and its happiness index. She collects data from 28 randomly selected countries and performs a linear regression. The computer output reports a t-statistic for the slope of $t = 1.85$ and a two-sided p-value of $0.076$. If the economist tests the hypotheses $H_0: \beta = 0$ versus $H_a: \beta > 0$ at the significance level $\alpha = 0.05$, which of the following is the correct decision and justification?
21. A statistics student performs a t-test for the slope of a least-squares regression line based on a sample of size $n=32$. The null hypothesis is that the true population slope is zero ($H_0: \beta = 0$). Which of the following describes the sampling distribution of the test statistic $t = \frac{b}{SE_b}$ if the null hypothesis is true and the conditions for inference are met?
22. A marine biologist is studying the relationship between the length of a certain species of fish (in centimeters) and its weight (in grams). A random sample of 18 fish is selected, and a linear regression analysis is performed on the data. The computer output for the regression is shown below. Predictor | Coef | SE Coef | T-Value | P-Value --- | --- | --- | --- | --- Constant | -120.5 | 24.3 | -4.96 | 0.000 Length | 28.4 | 1.8 | 15.78 | 0.000 Which of the following is the correct 95% confidence interval for the slope of the population regression line, assuming all conditions for inference are met?
23. An economist wants to determine if there is a linear relationship between the unemployment rate and the inflation rate in a specific country over the last 30 years. A least-squares regression line is fitted to the data, and a hypothesis test for the slope is conducted. The hypotheses are H_0: β = 0 versus H_a: β ≠ 0. The test yields a t-statistic of t = -1.85 and a p-value of 0.075. Using a significance level of α = 0.05, which of the following is the correct conclusion?
Refer to the figure below.
24. A student collects data on the height (in inches) and vertical jump (in inches) of 25 athletes. A regression analysis is performed, and the student wishes to calculate a confidence interval for the slope of the regression line. A residual plot of the data is shown below. Based on the residual plot, which condition for inference is most clearly violated?
25. Researchers analyzed the relationship between the amount of fertilizer applied (x) and the yield of corn (y). The computer output provided a standard error of the slope (SE_b) of 0.45. Which of the following best interprets this value?