debun
Mechanical
- Jul 29, 2008
- 34
One of the advantages of the Weibull is you can form a distribution with a much smaller sample size than say a histogram.
I experimented with using only the first 5 data points from a sample set of 26 to do a Weibull analysis. The first 5 points fit a 2 parameter Weibull and gave an R^2 values of 0.99. The second data set (remaining 21 points) changes to a 3 parameter Weibull with an R^2 value somewhere in the neighborhood of 0.976. So my questions are
1) What is the minimum sample size you need to accurately represent the population? For example from this sample size X the PDF is within some metric (std deviations, % etc) of the true PDF. Clearly R^2 value alone isn’t a good metric
2) I would like to plot the remaining 21 data points over the CDF calculated from my first 5 points. I have used 2 methods. Simply calculating the MR and plotting test cycle vs MR. The second using the Beta and Eta to calculate the CDF percent and plotting test cycle vs CDF percentile. Is there a good way to represent predicted CDF vs test data?
Here is my data set.
154173
171158
83431
201778
117578
192083
136262
149487
148009
98317
69798
94195
62548
103574
108364
132377
143047
85272
95760
214166
289237
161265
172490
99972
117440
89717
I experimented with using only the first 5 data points from a sample set of 26 to do a Weibull analysis. The first 5 points fit a 2 parameter Weibull and gave an R^2 values of 0.99. The second data set (remaining 21 points) changes to a 3 parameter Weibull with an R^2 value somewhere in the neighborhood of 0.976. So my questions are
1) What is the minimum sample size you need to accurately represent the population? For example from this sample size X the PDF is within some metric (std deviations, % etc) of the true PDF. Clearly R^2 value alone isn’t a good metric
2) I would like to plot the remaining 21 data points over the CDF calculated from my first 5 points. I have used 2 methods. Simply calculating the MR and plotting test cycle vs MR. The second using the Beta and Eta to calculate the CDF percent and plotting test cycle vs CDF percentile. Is there a good way to represent predicted CDF vs test data?
Here is my data set.
154173
171158
83431
201778
117578
192083
136262
149487
148009
98317
69798
94195
62548
103574
108364
132377
143047
85272
95760
214166
289237
161265
172490
99972
117440
89717