4-dimensional CSFs: spatial frequency, luminance, size and eccentricity

Fitting error

Dataset	Fitting error				Sensitivity adjustment
Dataset	stelaCSF	VDP CSF	Rovamo 1995 CSF	FovVideoVDP CSF	stelaCSF	VDP CSF	Rovamo 1995 CSF	FovVideoVDP CSF
Average training	4.08 [dB]	8.76 [dB]	5.63 [dB]	6.36 [dB]	N/A	N/A	N/A	N/A
modelfest	3.26 [dB]	3.69 [dB]	3.44 [dB]	2.21 [dB]	1.000	1.000	1.000	1.000
hdrvdp_csf	3.26 [dB]	8.72 [dB]	6.35 [dB]	3.15 [dB]	1.255	1.675	0.957	0.990
hdr_csf	3.40 [dB]	5.94 [dB]	5.15 [dB]	3.97 [dB]	0.719	0.487	0.615	0.713
rovamo1993	2.77 [dB]	6.38 [dB]	1.91 [dB]	4.81 [dB]	1.636	2.577	1.888	1.506
virsu1979	5.11 [dB]	9.73 [dB]	6.24 [dB]	7.91 [dB]	1.210	1.373	0.906	0.995
virsu1982	3.33 [dB]	8.72 [dB]	5.12 [dB]	7.84 [dB]	0.710	0.375	0.507	0.872
anderson1991	6.31 [dB]	12.53 [dB]	7.81 [dB]	10.58 [dB]	1.006	1.848	2.516	1.624

Model comparison statistics

Model	Sum of Square Errors (SS)	Degrees of freedom (df)	F-test		AIC
Model	Sum of Square Errors (SS)	Degrees of freedom (df)	F-statistic	p-value	AIC
stelaCSF (Reference Model)	18.032	366	N/A	N/A	-1157.08
VDP CSF	81.371	381	85.7042	0.0000 ✓	-594.893
Rovamo 1995 CSF	34.135	379	25.141	0.0000 ✓	-932.288
FovVideoVDP CSF	48.077	385	32.0947	0.0000 ✓	-809.698

We use AIC and F-test to test whether the difference in fitting error is statistically significant at alpha=0.05 level. Both statistical metrics take the number of optimized parameters into account.

F-test: For F-test, we compare the fitting results from stelaCSF with those of other models. The F-static is calculated using the residual sum of squares and degrees of freedom (number of data points - number of optimized parameters) from both models. The corresponding p-value indicates whether or not the null hypothesis is rejected, where H₀: the stelaCSF does not provide significant better fit than the other model. The p-values less than 0.05 indicates that stelaCSF provides a better fit to the data at the significance level of 0.05 (marked with ✓). We performed the F-test for all individual datasets as well as for all datasets combined. For smaller datasets, where the number of data points are comparable to the number of model parameters, F-test can not provide any results since it indicates there is more variance within the models' fits than between.

AIC: Akaike information criterion is a statistical estimator of prediction error and relative quality of the models, which accounts for the number of parameters of each model. The model with the lower AIC score is considered to be better and with a good balance of error value and the number of parameters.

The sensitivity adjustment column contains a multiplier that is used to adjust the sensitivity of each datasets. It corresponds to s_d in the paper (Eq. 18).

Model parameters

stelaCSF

p.ach_sust.S_max = [ 52.4592 3.81915 0.222862 7.27366e-07 1.15157e+10 ]; p.ach_sust.f_max = [ 1.52743 14.5391 0.257662 ]; p.ach_sust.bw = 0.0163998; p.ach_sust.a = 0.00677941; p.ach_trans.S_max = [ 1.17739 57.6281 ]; p.ach_trans.f_max = 0.00220211; p.ach_trans.bw = 2.07562; p.ach_trans.a = 0.000273289; p.sigma_trans = 0.141447; p.sigma_sust = 32.2325; p.ecc_drop = 0.0322032; p.ecc_drop_nasal = 0.0199422; p.ecc_drop_f = 0.0243679; p.ecc_drop_f_nasal = 0.0162802;

VDP CSF

p.P = 256.183; p.ob = 1.00787; p.k = 0.300253; p.epsilon = 1.19203; p.a_l_m = 38.4614; p.b_l_m = 1.64298;

Rovamo 1995 CSF

p.ach_sust.S_0 = 38.6286; p.ach_sust.f_max = 9.73821; p.ach_sust.f0 = 0.503634; p.ach_trans.S_0 = 35836.4; p.ach_trans.f_max = 13.8597; p.ach_trans.f0 = 9.23544e-05; p.cm_e2 = 1.43973; p.cm_e2_nasal = 2.83047;

FovVideoVDP CSF

p.S_0 = 2.12826; p.k_cm = 1.02008;