<html> <head> <title>P.Mean: What is the effect of an unmeasured covariate? (created 2009-06-09)</title> </head> <body> <h4>What is the effect of an unmeasured covariate? (created 2009-06-09)</h4> <p>This page is moving to a <a href="http://new.pmean.com/unmeasured-covariate/">new website</a>.</p> <blockquote> <p>Suppose you want to conduct an analysis of covariance, but you have data on some, but not all, of the covariates. What do you miss out on because of the unmeasured covariate? To understand this, we need to venture into the world of partitioned matrices. If you have a symmetric matrix of the form</p> <p><img border="0" src="images/part01.gif" width="75" height="60"></p> <p>then </p> <p><img border="0" src="images/part02.gif" width="178" height="63">.</p> <p>The inverse of this matrix is</p> <p><img border="0" src="images/part03.gif" width="446" height="63"></p> <p>where</p> <p><img border="0" src="images/part04.gif" width="181" height="31"></p> <p>and</p> <p><img border="0" src="images/part05.gif" width="173" height="31"></p> <p>represent the matrices that project a vector onto the space perpendicular to the column spaces of A and B, respectively. These results can be found on the Wikipedia page on the block matrix pseudoinverse:</p> <ul> <li><a href="http://en.wikipedia.org/wiki/Block_matrix_pseudoinverse"> en.wikipedia.org/wiki/Block_matrix_pseudoinverse</a> </li> </ul> <p>The formula for the regression coefficients is</p> <p><img border="0" src="images/part10.gif" width="146" height="40"></p> <p>which, when partitioned, equals</p> <p><img border="0" src="images/part11.gif" width="265" height="70">.</p> <p>There are two special cases to consider. 
If the unmeasured covariate is balanced across levels of A, then</p> <p><img border="0" src="images/part12.gif" width="78" height="31"></p> <p>and if the unmeasured covariate is uncorrelated with the response y, then</p> <p><img border="0" src="images/part13.gif" width="71" height="31">.</p> <p>If both of these conditions are met, then the regression coefficients for the partitioned case would be</p> <p><img border="0" src="images/part14.gif" width="221" height="70"></p> <p>which is equivalent to using only the information in A. If only the first condition is met, then the regression coefficients</p> <p>A test for the effectiveness of the statistical adjustment could be made if B were known in a random subset of the data. This could occur in a situation where B is not truly unknown, but rather is very expensive to measure. There would not be sufficient budget to measure B for all cases, but it could be measured for a randomly selected subset of cases. I will detail those results in a future webpage.</p> </blockquote> </body> </html>