Part A
Given the <span>table which shows the average cost, in dollars, of a movie ticket in the United States for different years.
</span>
![\begin{center} \begin{tabular} {|c|c|} Year&Cost\\[1ex] 1948&0.36\\ 1963&0.86\\ 1974&1.89\\ 1978&2.34\\ 1985&3.55\\ 1991&4.21\\ 1997&4.59\\ 2005&6.41\\ 2010&7.89 \end{tabular} \end{center}](https://tex.z-dn.net/?f=%5Cbegin%7Bcenter%7D%0A%5Cbegin%7Btabular%7D%0A%7B%7Cc%7Cc%7C%7D%0AYear%26Cost%5C%5C%5B1ex%5D%0A1948%260.36%5C%5C%0A1963%260.86%5C%5C%0A1974%261.89%5C%5C%0A1978%262.34%5C%5C%0A1985%263.55%5C%5C%0A1991%264.21%5C%5C%0A1997%264.59%5C%5C%0A2005%266.41%5C%5C%0A2010%267.89%0A%5Cend%7Btabular%7D%0A%5Cend%7Bcenter%7D)
We can form our regression table by rewriting the year column with 1948 represented 0 and the other years represented by the number of years after 1948.
![\begin{center} \begin{tabular} {|c|c|c|c|} Year(X)&Cost(Y)&X^2&XY\\[1ex] 0&0.36&0&0\\ 15&0.86&225&12.9\\ 26&1.89&676&49.14\\ 30&2.34&900&70.2\\ 37&3.55&1,369&131.35\\ 43&4.21&1,849&181.03\\ 49&4.59&2,401&224.91\\ 57&6.41&3,249&365.37\\ 62&7.89&3,844&489.18\\ \sum X=319&\sum Y=32.1&\sum X^2=14,513&\sum XY=1,524.08\end{tabular} \end{center}](https://tex.z-dn.net/?f=%5Cbegin%7Bcenter%7D%20%5Cbegin%7Btabular%7D%20%7B%7Cc%7Cc%7Cc%7Cc%7C%7D%20Year%28X%29%26Cost%28Y%29%26X%5E2%26XY%5C%5C%5B1ex%5D%200%260.36%260%260%5C%5C%0A15%260.86%26225%2612.9%5C%5C%0A26%261.89%26676%2649.14%5C%5C%0A30%262.34%26900%2670.2%5C%5C%0A37%263.55%261%2C369%26131.35%5C%5C%0A43%264.21%261%2C849%26181.03%5C%5C%0A49%264.59%262%2C401%26224.91%5C%5C%0A57%266.41%263%2C249%26365.37%5C%5C%0A62%267.89%263%2C844%26489.18%5C%5C%0A%5Csum%20X%3D319%26%5Csum%20Y%3D32.1%26%5Csum%20X%5E2%3D14%2C513%26%5Csum%20XY%3D1%2C524.08%5Cend%7Btabular%7D%20%5Cend%7Bcenter%7D)
The equation of the least square regression line is given by

, where

and

Thus, we have

and

Therefore, the equation of the least square regression line is given by

.
Part B
To find the correlation coefficient (r), we need an additional column to our table above, the column for Y^2.
![\begin{center} \begin{tabular} {|c|} Y^2\\ [1ex] 0.1296\\0.7396\\3.5721\\5.4756\\12.6025\\17.7241\\21.0681\\41.0881\\62.2521\\ \sum Y^2=164.6518 \end{tabular} \end{center}](https://tex.z-dn.net/?f=%5Cbegin%7Bcenter%7D%0A%5Cbegin%7Btabular%7D%0A%7B%7Cc%7C%7D%0AY%5E2%5C%5C%20%5B1ex%5D%0A0.1296%5C%5C0.7396%5C%5C3.5721%5C%5C5.4756%5C%5C12.6025%5C%5C17.7241%5C%5C21.0681%5C%5C41.0881%5C%5C62.2521%5C%5C%20%5Csum%20Y%5E2%3D164.6518%0A%5Cend%7Btabular%7D%0A%5Cend%7Bcenter%7D)
The correlation coefficient (r) is given by
![r= \frac{n(\sum XY)-(\sum X)(\sum Y)}{\sqrt{[n(\sum X^2)-(\sum X)^2][n(\sum Y^2)-(\sum Y)^2]}} \\ \\ = \frac{9(1,524.08)-(319)(32.1)}{\sqrt{[9(14,513)-(319)^2][9(164.6518-(32.1)^2]}} \\ \\ = \frac{13,716.72-10,239.9}{\sqrt{(130,617-101,761)(1,481.8662-1,030.41}} \\ \\ = \frac{3,476.82}{\sqrt{(28,856)(451.4562)}} = \frac{3,476.82}{\sqrt{13,027,220.11}} \\ \\ = \frac{3,476.82}{3,609.32} =0.963](https://tex.z-dn.net/?f=r%3D%20%5Cfrac%7Bn%28%5Csum%20XY%29-%28%5Csum%20X%29%28%5Csum%20Y%29%7D%7B%5Csqrt%7B%5Bn%28%5Csum%20X%5E2%29-%28%5Csum%20X%29%5E2%5D%5Bn%28%5Csum%20Y%5E2%29-%28%5Csum%20Y%29%5E2%5D%7D%7D%20%20%5C%5C%20%20%5C%5C%20%3D%20%5Cfrac%7B9%281%2C524.08%29-%28319%29%2832.1%29%7D%7B%5Csqrt%7B%5B9%2814%2C513%29-%28319%29%5E2%5D%5B9%28164.6518-%2832.1%29%5E2%5D%7D%7D%20%20%5C%5C%20%20%5C%5C%20%3D%20%5Cfrac%7B13%2C716.72-10%2C239.9%7D%7B%5Csqrt%7B%28130%2C617-101%2C761%29%281%2C481.8662-1%2C030.41%7D%7D%20%20%5C%5C%20%20%5C%5C%20%3D%20%5Cfrac%7B3%2C476.82%7D%7B%5Csqrt%7B%2828%2C856%29%28451.4562%29%7D%7D%20%3D%20%5Cfrac%7B3%2C476.82%7D%7B%5Csqrt%7B13%2C027%2C220.11%7D%7D%20%20%5C%5C%20%20%5C%5C%20%3D%20%5Cfrac%7B3%2C476.82%7D%7B3%2C609.32%7D%20%3D0.963)
Since r = 0.963 which is very close to positive 1, thus, there exist a strong positive association between the two variables, i.e. between the year and the average cost of a movie ticket. In other words, as the year increases, the average cost of movie ticket increases.
Part 3


is called the coefficient of determination and is a measure of the closeness of the data points are to the regression line. The closer the value of

to 1, the closer are the data points to the regression line.
Since we obtained

, which is very close to 1, this means that the data points are close to the regression line.
Part D
We obtained the equation of the least square regression line as

, where y is the average cost of movie ticket in x years after 1948.
1995 is 1995 - 1948 = 47 years after 1948, thus, the predicted average cost of movie ticket in 1995 is given by