Cikomyr
2013-05-02, 04:45 AM
Hi people. My Statistics classes are long behind me, and this rust is currently biting me in the ass in my current essay research. I was hoping some of you could help me out with something.
We are currently running multivariate regression analysis. Thing is, I started to suspect some of our variables might not be valid to consider. In our 296 observation for each variables, we have encountered quite large number of "0" (more than half of the observation).
Each of these variables are based on a scale of 1 to 10. They came out as such:
Var Mean StDev Min Max
1 1.5 2.6 0 9
2 1.0 2.5 0 10
3 0.8 2.3 0 10
4 4.2 2.4 0 10
5 3.4 3.3 0 10
6 1.5 3.4 0 10
We will run correlation analysis of these with a much more bell-curved statistic. Here is my question:
Could the heavy-loading of results of "0" invalidate the use of variables in a regression analysis? Isn't there a way to determine if a statistic is statistically analyzable or not? Something like Mean/StDev?
Since for at least 4 of my 6 variables, the StDev is much bigger than the mean, and there is an absolute minimum of 0, I am worried that the data collected might be erroneous.
We are currently running multivariate regression analysis. Thing is, I started to suspect some of our variables might not be valid to consider. In our 296 observation for each variables, we have encountered quite large number of "0" (more than half of the observation).
Each of these variables are based on a scale of 1 to 10. They came out as such:
Var Mean StDev Min Max
1 1.5 2.6 0 9
2 1.0 2.5 0 10
3 0.8 2.3 0 10
4 4.2 2.4 0 10
5 3.4 3.3 0 10
6 1.5 3.4 0 10
We will run correlation analysis of these with a much more bell-curved statistic. Here is my question:
Could the heavy-loading of results of "0" invalidate the use of variables in a regression analysis? Isn't there a way to determine if a statistic is statistically analyzable or not? Something like Mean/StDev?
Since for at least 4 of my 6 variables, the StDev is much bigger than the mean, and there is an absolute minimum of 0, I am worried that the data collected might be erroneous.