Weight Analysis

Antiochos VII

Antioch Mint – Comparison with the Data Samples in Part I.a

Contents

  1. Basic characteristics
  2. Test of differences

1. Basic characteristics

Basic descriptive statistics of both samples and box-percentile plots are presented in Figures 1 and 2. Histograms are presented in Figure 3, kernel density estimations are presented in Figures 4 and 5, and empirical cumulative distribution functions are presented in Figure 6.

Notes: The bandwidth of the Gaussian kernel was computed as hGauss = 0.9 × min(σ, SIQR) × n-1/5, where σ is the standard deviation, SIQR is the standardised interquartile range (i.e. the interquartile range of the data sample divided by the interquartile range of the standard normal density) and n is the number of observations; see Silverman, Density Estimation for Statistics and Data Analysis, p. 48, formula (3.31). The bandwidth of the Epanechnikov kernel was chosen subjectively in the range from 0.75×hGauss to 1.25×hGauss.

The box-percentile plot is a modified version of the well-known boxplot. At any height the width of the irregular “box” is proportional to the percentile of that height, up to the 50th percentile, and above the 50th percentile the width is proportional to 100 minus the percentile. Thus, the width at any given height is proportional to the percent of observations that are more extreme in that direction. As in boxplots, the median, 25th, and 75th percentiles are marked with line segments across the box. For details see Esty and Banfield, The Box-Percentile Plot.

Fig. 1: Descriptive statistics
Fig. 1: Descriptive statistics

Fig. 2: Box-percentile plots
Fig. 2: Box-percentile plots

Fig. 3: Histograms
Fig. 3: Histograms

Fig. 4: Probability density estimations, Epanechnikov kernel
Fig. 4: Probability density estimations, Epanechnikov kernel

Fig. 5: Probability density estimations, Gaussian kernel
Fig. 5: Probability density estimations, Gaussian kernel

Fig. 6: Empirical cumulative distribution functions
Fig. 6: Empirical cumulative distribution functions

2. Test of differences

The Kolmogorov-Smirnov test was used to test the hypothesis that both distributions are the same. The test statistic of 0.090 is less than the critical value of 0.213 for a 95% level test (the asymptotic p-value is 88.4%). The Wilcoxon rank sum test gives the same conclusion. It could be expected that the distribution of Houghton’s data might be shifted to the left with respect to the distribution of data examined in Part Ia (collected with regard to the state of preservation). However, the Kolmogorov-Smirnov one-sided test gives similar result: the asymptotic p-value is 50.5%. Thus, at the 95% level of significance, we cannot reject the hypothesis that the weight distributions of these two data samples are the same.

References:

Esty, Warren W.; Banfield, Jeffrey D.: The Box-Percentile Plot. Journal of Statistical Software, Volume 8, Number 17, 2003, pp. 1-14.
Silverman, B.W.: Density Estimation for Statistics and Data Analysis. Chapman and Hall, London - New York - Tokyo - Melbourne - Madras, 1993 (reprint of the first edition published in 1986).
www.SFAGN.info  •  October 6, 2003 – December 11, 2005 Top of Page