"High Quality Random Numbers" Data & Analysis
Posted: Fri Oct 01, 2010 6:57 pm
Alright, I've been monitoring my dice rolls for a little bit and decided something is up, so I decided to analyze it.
If you want the brief summary, look at the Conclusions section of my post.
Data
Now I know that by uploading this, I'm going to get a few "Your sample size is too small." comments. However, my analysis takes that into account, so hold your horses.
Analysis
Looking at this, I was struck by the discrepancies in my dice rolls. I'm rolling +1.31% in my 3v2s, and -3.88% in my 3v1s. I decided to analyze both numbers using the simple Chi-Square test. If you're unfamiliar with it, the Chi-Square test is a very simple, widely-accepted test to see if variations between expected and observed results can be attributed to just random chance, or if there is something else at play. First, I need to have a null hypothesis: There is no significant difference between my observed dice outcomes and my expected dice outcomes. I am going to go with the majority of the science community and define a significant difference as a p value < 0.05. If p > 0.05, I accept my null hypothesis. If p < 0.05, I reject it and accept that there must be something besides random chance affecting my dice outcomes. With that out of the way, on to the calculations.
If you don't understand how chi-square values are calculated, take a statistics class. For those that know how this works, feel free to check my numbers. To find the chi square volume I'm taking (Observed - Expected)^2/Expected for each outcome and adding them together.
3v2 Calculations
Attacker Wins: (1385-1432)^2/1432 = 1.54
Tie: (1206-1208)^2/1208 = 0.003
Attacker Loses: (1008-1053)^2/1053 = 1.92
Add these together, and my chi-square value is 3.46. Since there are 3 outcomes, I have 2 degrees of freedom. Looking at a chi-square chart, my chi-square value falls between p values of 0.1 and 0.2. Since p > 0.05, we accept the null hypothesis that there is no significant difference between the expected dice outcomes and the observed dice outcomes. Alright, we can accept these as possibly random. They are clearly in my favor, but they still may be random. Now let's move on to 3v1.
3v1 Calculations
Attacker Wins: (1199-1273)^2/1273 = 4.30
Attacker Loses: (732-657)^2/657 = 8.33
Add these together and my chi-square value is 12.63 Since there are 2 possible outcomes in a 3v1 match, I have 1 degree of freedom. Looking at my chi-square chart, my chi-square value falls AFTER p values of 0.001. Since p < 0.05, we can conclude that there is a significant difference between my observed and expected results. FURTHERMORE: Based on the fact that p < 0.001, there is less than a 0.1% chance that these results can be attributed to chance alone.
Conclusions
1. There is no significant difference between the expected and observed results of my 3v2 battles. However, there is only a 10-20% chance that the differences can be attributed to random chance- this is not enough to be significant, but it is interesting.
2. There IS a significant difference between the expected and observed results of my 3v1 battles. Furthermore, the difference is great enough that there is a 99.9% chance that something OTHER THAN RANDOM CHANCE is affecting the results of my battles.
3. I have way too much time on my hands.
Now, I'm sure someone is going to find issues with this. These might be:
Your sample size is too small: Granted, it's not 100,000 rolls, but it is more than enough to do a chi-square test on, and chi-square tests take sample size into consideration. The lower limit of accuracy for this test is when you have expected values of less than 5.
Chi-Square is not a valid test: Says who? The vast majority of the scientific community uses it, and for this little project it works fine. Admins contend that all of the results of the dice rolls are purely random, and this tests that. If you feel like you have a better test, feel free to run it.
You did something wrong: Point it out and I'll take a look. I've checked my numbers, but it's not beyond possibility that I've messed up somewhere.
Anyways, hope you've enjoyed this and it does some good.
If you want the brief summary, look at the Conclusions section of my post.
Data
Now I know that by uploading this, I'm going to get a few "Your sample size is too small." comments. However, my analysis takes that into account, so hold your horses.
Analysis
Looking at this, I was struck by the discrepancies in my dice rolls. I'm rolling +1.31% in my 3v2s, and -3.88% in my 3v1s. I decided to analyze both numbers using the simple Chi-Square test. If you're unfamiliar with it, the Chi-Square test is a very simple, widely-accepted test to see if variations between expected and observed results can be attributed to just random chance, or if there is something else at play. First, I need to have a null hypothesis: There is no significant difference between my observed dice outcomes and my expected dice outcomes. I am going to go with the majority of the science community and define a significant difference as a p value < 0.05. If p > 0.05, I accept my null hypothesis. If p < 0.05, I reject it and accept that there must be something besides random chance affecting my dice outcomes. With that out of the way, on to the calculations.
If you don't understand how chi-square values are calculated, take a statistics class. For those that know how this works, feel free to check my numbers. To find the chi square volume I'm taking (Observed - Expected)^2/Expected for each outcome and adding them together.
3v2 Calculations
Attacker Wins: (1385-1432)^2/1432 = 1.54
Tie: (1206-1208)^2/1208 = 0.003
Attacker Loses: (1008-1053)^2/1053 = 1.92
Add these together, and my chi-square value is 3.46. Since there are 3 outcomes, I have 2 degrees of freedom. Looking at a chi-square chart, my chi-square value falls between p values of 0.1 and 0.2. Since p > 0.05, we accept the null hypothesis that there is no significant difference between the expected dice outcomes and the observed dice outcomes. Alright, we can accept these as possibly random. They are clearly in my favor, but they still may be random. Now let's move on to 3v1.
3v1 Calculations
Attacker Wins: (1199-1273)^2/1273 = 4.30
Attacker Loses: (732-657)^2/657 = 8.33
Add these together and my chi-square value is 12.63 Since there are 2 possible outcomes in a 3v1 match, I have 1 degree of freedom. Looking at my chi-square chart, my chi-square value falls AFTER p values of 0.001. Since p < 0.05, we can conclude that there is a significant difference between my observed and expected results. FURTHERMORE: Based on the fact that p < 0.001, there is less than a 0.1% chance that these results can be attributed to chance alone.
Conclusions
1. There is no significant difference between the expected and observed results of my 3v2 battles. However, there is only a 10-20% chance that the differences can be attributed to random chance- this is not enough to be significant, but it is interesting.
2. There IS a significant difference between the expected and observed results of my 3v1 battles. Furthermore, the difference is great enough that there is a 99.9% chance that something OTHER THAN RANDOM CHANCE is affecting the results of my battles.
3. I have way too much time on my hands.
Now, I'm sure someone is going to find issues with this. These might be:
Your sample size is too small: Granted, it's not 100,000 rolls, but it is more than enough to do a chi-square test on, and chi-square tests take sample size into consideration. The lower limit of accuracy for this test is when you have expected values of less than 5.
Chi-Square is not a valid test: Says who? The vast majority of the scientific community uses it, and for this little project it works fine. Admins contend that all of the results of the dice rolls are purely random, and this tests that. If you feel like you have a better test, feel free to run it.
You did something wrong: Point it out and I'll take a look. I've checked my numbers, but it's not beyond possibility that I've messed up somewhere.
Anyways, hope you've enjoyed this and it does some good.
