

ID: 3251763 • Letter: P

Question

Please answer the following questions.

The movie data contains several numerical columns, like metascore, rating, box office gross, etc. We can manually examine these pairwise relationships like we did above. That would be 30 sets of the commands above. BORING! Luckily, pandas provides a very easy way of calculating all the pairwise relationships in one go. Each cell in the table below represents the correlation coefficient between the variable denoted in the row and the one denoted in the column. Find the correlation coefficient between year and votes and see that it is the same as we calculated above.

movieData.corr()

ANSWER THE FOLLOWING QUESTIONS (double click this cell to edit):

Why is the diagonal of the table all 1.000?
ANSWER HERE:

Notice that the table is symmetric around the diagonal. Meaning, the value at row 2 column 3 is the same as row 3 column 2. Explain why this is the case.
ANSWER HERE:

Find the largest (most positive) off-diagonal (not 1.0) correlation coefficient. What are the two variables, and briefly explain why they might be strongly correlated?

Explanation / Answer

The correlation coefficient of a variable with itself is always 1, because a variable is perfectly linearly related to itself. Each diagonal cell holds the correlation coefficient of a variable with itself, so every diagonal entry is 1.
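
A minimal sketch of this point, assuming a small made-up DataFrame named `toy` as a stand-in for `movieData` (the column names follow the question, but the values are invented for illustration):

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for movieData; these values are made up.
toy = pd.DataFrame({
    "year": [1995, 2001, 1988, 2010, 2005, 1999, 2015, 1992],
    "rating": [5.8, 6.5, 7.4, 7.9, 6.1, 8.5, 5.2, 7.0],
    "votes": [12000, 45000, 150000, 300000, 8000, 90000, 20000, 60000],
})

corr = toy.corr()  # pairwise Pearson correlation of all numeric columns

# Each diagonal cell is corr(X, X) = cov(X, X) / (std(X) * std(X)) = 1.
diagonal = np.diag(corr.to_numpy())
print(diagonal)
```

Any non-constant numeric column gives the same result on the diagonal, regardless of its values or units.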

--------------

The correlation coefficient between X and Y is always equal to the correlation coefficient between Y and X; that is, the order of the variables does not matter when computing it. So the table is symmetric about the diagonal.
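
The symmetry can be checked directly. The sketch below again assumes a made-up DataFrame (`toy`, a hypothetical stand-in for `movieData`) and compares the correlation computed in both orders with the corresponding cells of the table:

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for movieData; these values are made up.
toy = pd.DataFrame({
    "year": [1995, 2001, 1988, 2010, 2005, 1999, 2015, 1992],
    "rating": [5.8, 6.5, 7.4, 7.9, 6.1, 8.5, 5.2, 7.0],
    "votes": [12000, 45000, 150000, 300000, 8000, 90000, 20000, 60000],
})

corr = toy.corr()

# Pairwise correlation computed in both orders.
r_year_votes = toy["year"].corr(toy["votes"])
r_votes_year = toy["votes"].corr(toy["year"])

# The table equals its own transpose, i.e. it is symmetric.
is_symmetric = np.allclose(corr.to_numpy(), corr.to_numpy().T)

print(r_year_votes, r_votes_year, is_symmetric)
print(corr.loc["year", "votes"], corr.loc["votes", "year"])
```

The same lookup, `corr.loc["year", "votes"]`, is one way to read a single pair out of the full table, as the question asks for year and votes.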

------------------

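The largest positive off-diagonal correlation is between metascore and rating: at 0.766178, it is the highest off-diagonal value in the table. A plausible explanation is that both measure how well a movie was received (metascore typically aggregates critic reviews, while rating presumably reflects viewer scores), so movies that score well on one tend to score well on the other.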
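
Rather than scanning the table by eye, the largest off-diagonal entry can be located programmatically. This is only a sketch: the helper `largest_off_diagonal` and the DataFrame `toy` are hypothetical, and the data is made up, so the printed value will not match the 0.766178 from the real table.

```python
import numpy as np
import pandas as pd

def largest_off_diagonal(corr):
    """Return (row, column, value) of the most positive off-diagonal entry."""
    values = corr.to_numpy(dtype=float, copy=True)
    # Exclude the diagonal (always 1.0) by setting it to -inf.
    np.fill_diagonal(values, -np.inf)
    # nanargmax ignores NaN cells, which appear if a column is constant.
    i, j = np.unravel_index(np.nanargmax(values), values.shape)
    return corr.index[i], corr.columns[j], values[i, j]

# Hypothetical stand-in for movieData; these values are made up.
toy = pd.DataFrame({
    "year": [1995, 2001, 1988, 2010, 2005, 1999, 2015, 1992],
    "metascore": [45, 60, 72, 81, 55, 90, 38, 67],
    "rating": [5.8, 6.5, 7.4, 7.9, 6.1, 8.5, 5.2, 7.0],
    "votes": [12000, 45000, 150000, 300000, 8000, 90000, 20000, 60000],
})

row, col, value = largest_off_diagonal(toy.corr())
print(row, col, value)
```

Because the table is symmetric, the maximum appears twice (once on each side of the diagonal); the helper simply returns the first occurrence.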