Data, Estimation Strategy, and Empirical Design
The analysis uses a monthly casino-level dataset built to measure fluctuations in mass-market table-game drop. The unit of observation is the casino-month. The preferred sample covers at least eight years of monthly observations; ten to fifteen years gives a stronger basis for identifying seasonality, macroeconomic sensitivity, and volatility persistence.
The dependent variable is monthly mass-market drop. VIP, junket, premium credit play, and commission baccarat are excluded unless modeled separately, since mass-market cash play responds to different behavioral and financial conditions than VIP play. Drop is also distinct from actual win and theoretical win: drop measures gaming volume and patron liquidity, actual win measures realized casino outcome, and theoretical win measures expected revenue given game rules, house advantage, average wager, and time played.
The dataset includes the following monthly variables:
| Variable | Description | Expected Use |
|---|---|---|
| Mass-market drop | Monthly table-game drop from non-VIP play | Main dependent variable |
| CPI | Consumer price index | Inflation adjustment |
| Unemployment rate | Labor-market condition | Macroeconomic stress indicator |
| Consumer confidence | Household sentiment | Demand-side control |
| Tourism arrivals | Visitor flow | Market-demand control |
| Promotion spend | Monthly casino marketing spend | Property-level demand stimulus |
| Active table count | Number of operating tables | Capacity control |
| Average table minimum | Average minimum bet level | Access and pricing control |
| Operating days | Number of days open in the month | Exposure adjustment |
| Closure dummy | Full or partial closure month | Structural disruption control |
| Competitor event dummy | Major competitor opening or campaign | External market shock |
| Regulatory change dummy | Tax, smoking, licensing, or operating-rule change | Structural-break control |
| Major event dummy | Holiday, festival, convention, or major local event | Calendar-demand control |
Nominal drop is adjusted for inflation:
[ RealDrop_t = \frac{NominalDrop_t}{CPI_t} \times 100 ]
The main modeling variable is the monthly change in log real drop:
[ z_t = \Delta \ln(RealDrop_t) ]
This transformation lets the model read changes as approximate real percentage movements and reduces scale effects. It is not applied automatically, though. The choice among log levels, first differences, and seasonal differences is settled by the data. The Box-Jenkins approach is well suited to this because it starts from the observed structure of the series rather than imposing a model in advance (Box, Jenkins, Reinsel, & Ljung, 2015).
Closure months need separate handling. A closed casino does not generate an ordinary missing observation; it generates an observation constrained by regulation or physical non-operation. Such months are either controlled with closure indicators or dropped in robustness checks. The same caution applies to changes in accounting rules, table classification, CMS reporting, and player-segmentation definitions.
Descriptive Statistics
The first table describes the scale and distribution of the data before any modeling.
| Variable | Obs. | Mean | Median | Std. Dev. | Min | Max | Skewness | Kurtosis |
|---|---|---|---|---|---|---|---|---|
| Nominal mass-market drop | - | - | - | - | - | - | - | - |
| Real mass-market drop | - | - | - | - | - | - | - | - |
| Log real drop growth | - | - | - | - | - | - | - | - |
| CPI | - | - | - | - | - | - | - | - |
| Unemployment rate | - | - | - | - | - | - | - | - |
| Consumer confidence | - | - | - | - | - | - | - | - |
| Tourism arrivals | - | - | - | - | - | - | - | - |
| Promotion spend | - | - | - | - | - | - | - | - |
| Active table count | - | - | - | - | - | - | - | - |
| Average table minimum | - | - | - | - | - | - | - | - |
The discussion that follows should establish whether drop is heavily skewed, whether there are extreme months, whether promotional spend clusters in particular periods, and whether tourism or event variables show clear seasonal patterns. Unusual months often reflect real business events rather than statistical noise, and the descriptive table is where those candidates first appear.
Stationarity Testing
Casino drop is typically trended, seasonal, and exposed to breaks in the operating environment, so unit-root testing precedes estimation of the time-series model. The study reports both Augmented Dickey-Fuller and Phillips-Perron tests. Dickey and Fuller (1979) provide the standard framework for testing autoregressive series for unit roots, and Phillips and Perron (1988) offer a complementary test that is more robust to serial correlation and heteroskedasticity.
| Series | ADF Statistic | ADF p-value | PP Statistic | PP p-value | Deterministic Terms | Decision |
|---|---|---|---|---|---|---|
| Nominal drop level | - | - | - | - | Intercept + trend | - |
| Real drop level | - | - | - | - | Intercept + trend | - |
| Log real drop | - | - | - | - | Intercept + trend | - |
| First difference log real drop | - | - | - | - | Intercept | - |
| Seasonal difference log real drop | - | - | - | - | Intercept | - |
If the level series cannot reject the unit-root null while log real drop growth is stationary, the differenced log specification is the right one to carry forward. Where seasonal structure persists after first differencing, seasonal ARMA terms or monthly dummies are retained.
Mean Equation: SARIMAX Specification
The conditional mean of real mass-market drop growth is estimated with a SARIMAX model that combines seasonal autoregressive behavior with macroeconomic and property-level controls:
[ z_t = \mu + \beta_1 Unemployment_t + \beta_2 ConsumerConfidence_t + \beta_3 TourismArrivals_t + \beta_4 PromotionSpend_t + \beta_5 ActiveTables_t + \beta_6 AverageTableMinimum_t + \lambda’ D_t + u_t ]
The residual term follows a seasonal ARMA process:
[ \Phi_P(L^{12}),\phi_p(L),u_t = \Theta_Q(L^{12}),\theta_q(L),\epsilon_t ]
Seasonal time-series modeling has precedent in casino-revenue forecasting. Cargill and Eadington (1978) showed that Nevada gaming revenues contained identifiable time characteristics suitable for Box-Jenkins forecasting. This study carries that logic from aggregate gaming revenue down to mass-market drop and adds macroeconomic and operational covariates.
| Variable | Coefficient | Std. Error | z-statistic | p-value | Expected Sign | Interpretation |
|---|---|---|---|---|---|---|
| Constant | - | - | - | - | - | Baseline growth |
| Unemployment rate | - | - | - | - | Negative | Labor-market pressure |
| Consumer confidence | - | - | - | - | Positive | Household sentiment |
| Tourism arrivals | - | - | - | - | Positive | Visitor demand |
| Promotion spend | - | - | - | - | Positive | Marketing stimulus |
| Active table count | - | - | - | - | Positive | Capacity availability |
| Average table minimum | - | - | - | - | Ambiguous | Price/access effect |
| Closure dummy | - | - | - | - | Negative | Operating disruption |
| Competitor event dummy | - | - | - | - | Negative | Market-share pressure |
| AR(1) | - | - | - | - | - | Short-run persistence |
| MA(1) | - | - | - | - | - | Shock correction |
| Seasonal AR(12) | - | - | - | - | - | Annual persistence |
| Seasonal MA(12) | - | - | - | - | - | Annual shock correction |
The coefficients carry no causal claim. A negative unemployment coefficient would not show that unemployment lowers casino drop; it would show that, conditional on the other variables in the model, higher unemployment coincides with weaker real drop growth. A causal reading would require a stronger identification strategy.
Residual Diagnostics
The SARIMAX residuals are checked before any volatility model is fit. A GARCH model cannot be allowed to paper over a poorly specified mean equation: if residual autocorrelation remains, the mean equation is revised first.
The Ljung-Box test evaluates remaining serial correlation in the residuals (Ljung & Box, 1978). The ARCH-LM test then asks whether squared residuals are serially correlated, following Engle’s original ARCH framework (Engle, 1982).
| Diagnostic Test | Lag | Test Statistic | p-value | Null Hypothesis | Decision |
|---|---|---|---|---|---|
| Ljung-Box residual test | 12 | - | - | No residual autocorrelation | - |
| Ljung-Box residual test | 24 | - | - | No residual autocorrelation | - |
| ARCH-LM test | 6 | - | - | No ARCH effects | - |
| ARCH-LM test | 12 | - | - | No ARCH effects | - |
| Jarque-Bera normality test | - | - | - | Normal residuals | - |
A rejection of the ARCH-LM null means the residual variance is not constant, which is what justifies moving to GARCH-family models.
Conditional Volatility Models
The baseline volatility model is GARCH(1,1):
[ \epsilon_t = \sigma_t \eta_t ]
[ \sigma_t^2 = \omega + \alpha,\epsilon_{t-1}^{2} + \beta,\sigma_{t-1}^{2} ]
Engle (1982) introduced ARCH models for time-varying conditional variance, and Bollerslev (1986) generalized them through GARCH, letting volatility depend on both recent shocks and its own past. The fit to casino drop is natural: unusual months rarely arrive alone, and a single large shock can leave several months of elevated uncertainty behind it.
| Parameter | Estimate | Std. Error | z-statistic | p-value | Interpretation |
|---|---|---|---|---|---|
| (\omega) | - | - | - | - | Long-run variance constant |
| (\alpha) | - | - | - | - | Immediate shock effect |
| (\beta) | - | - | - | - | Volatility persistence |
| (\alpha + \beta) | - | - | - | - | Total persistence |
| Student-t degrees of freedom | - | - | - | - | Tail thickness |
An EGARCH model is also estimated to test whether negative shocks move future volatility differently from positive ones:
[ \ln(\sigma_t^2) = \omega + \alpha\left( \left|\frac{\epsilon_{t-1}}{\sigma_{t-1}}\right| - E|\eta_{t-1}| \right) + \gamma,\frac{\epsilon_{t-1}}{\sigma_{t-1}} + \beta,\ln(\sigma_{t-1}^{2}) ]
Nelson’s EGARCH model permits asymmetric volatility responses without the non-negativity restrictions of standard GARCH (Nelson, 1991). The parameter of interest is (\gamma). A negative, significant (\gamma) would mean that unexpectedly weak drop months raise subsequent uncertainty more than unexpectedly strong ones.
| Parameter | Estimate | Std. Error | z-statistic | p-value | Interpretation |
|---|---|---|---|---|---|
| (\omega) | - | - | - | - | Variance intercept |
| (\alpha) | - | - | - | - | Magnitude effect |
| (\gamma) | - | - | - | - | Asymmetric shock effect |
| (\beta) | - | - | - | - | Persistence |
| Student-t degrees of freedom | - | - | - | - | Fat-tail adjustment |
Forecast Comparison
Forecast accuracy is tested with rolling-origin validation. Random train-test splits are wrong for time-series forecasting because they let future information leak into training. Under rolling-origin validation, the model is estimated on the information available up to a given month, used to forecast the next month, and re-estimated as new data arrive.
| Model | RMSE | MAE | sMAPE | 80% Interval Coverage | 95% Interval Coverage | AIC | BIC | Practical Use |
|---|---|---|---|---|---|---|---|---|
| Seasonal naive | - | - | - | - | - | - | - | Basic annual benchmark |
| 12-month moving average | - | - | - | - | - | - | - | Simple smoothing |
| SARIMA | - | - | - | - | - | - | - | Seasonal structure |
| SARIMAX | - | - | - | - | - | - | - | Macro and operational controls |
| SARIMAX-GARCH, Gaussian | - | - | - | - | - | - | - | Volatility clustering |
| SARIMAX-GARCH, Student-t | - | - | - | - | - | - | - | Fat-tailed shocks |
| SARIMAX-EGARCH, Student-t | - | - | - | - | - | - | - | Asymmetric volatility |
Lowest RMSE alone does not pick the preferred model. A volatility model may add little to point forecasts while sharpening the prediction intervals considerably, and for casino management that interval gain often matters more than a marginal improvement in average accuracy.
Robustness Checks
The main results are tested against alternative specifications.
Alternative Lag Structures
| Model | Nonseasonal Order | Seasonal Order | AIC | BIC | Ljung-Box p-value | ARCH-LM p-value | Decision |
|---|---|---|---|---|---|---|---|
| A | SARIMAX(1,0,1) | (1,0,1,12) | - | - | - | - | - |
| B | SARIMAX(2,0,1) | (1,0,1,12) | - | - | - | - | - |
| C | SARIMAX(1,0,2) | (1,0,1,12) | - | - | - | - | - |
| D | SARIMAX(1,0,1) | (0,0,1,12) | - | - | - | - | - |
| E | SARIMAX(1,0,1) | (1,0,0,12) | - | - | - | - | - |
Alternative Macroeconomic Variables
| Baseline Variable | Alternative Variable | Reason for Test |
|---|---|---|
| Unemployment rate | Real income growth | Measures purchasing power |
| CPI inflation | Real wage growth | Captures household pressure |
| Consumer confidence | Consumer expectations index | Forward-looking sentiment |
| Tourism arrivals | Hotel occupancy | Visitor-market proxy |
| Tourism arrivals | Airport arrivals | Travel-flow proxy |
| Promotion spend | Promotion redemption value | Measures realized incentive use |
| Active table count | Table-hours open | More precise capacity measure |
Innovation Distribution
| Model | Innovation Distribution | Log-Likelihood | AIC | BIC | 95% Coverage | Tail Performance |
|---|---|---|---|---|---|---|
| GARCH(1,1) | Gaussian | - | - | - | - | - |
| GARCH(1,1) | Student-t | - | - | - | - | - |
| EGARCH(1,1) | Gaussian | - | - | - | - | - |
| EGARCH(1,1) | Student-t | - | - | - | - | - |
Student-t innovations deserve close attention, since casino drop can carry extreme observations from closures, major holidays, large events, or economic shocks.
Pre-Shock and Post-Shock Subsamples
The model is also estimated separately across major regimes. Candidate splits include pre-pandemic versus post-pandemic, pre-recession versus recession and recovery, pre-competitor opening versus post-competitor opening, and pre- versus post-regulatory change.
| Subsample | Period | Mean Forecast Error | Volatility Persistence | Macroeconomic Sensitivity | Interpretation |
|---|---|---|---|---|---|
| Pre-shock | - | - | - | - | Stable regime |
| Shock period | - | - | - | - | Disrupted regime |
| Post-shock | - | - | - | - | Recovery or new normal |
Casino demand can shift permanently after major events, and a model fitted to the old regime may simply fail to describe the new one. That is what this split is built to expose.
Original Contribution: Drop-at-Risk
This study introduces Drop-at-Risk as an operating metric for casino management. It borrows the logic of risk quantiles but does not measure financial trading loss; it measures the lower tail of expected patron liquidity entering the gaming floor.
Let (\mathcal{F}t) represent all information available at time (t): past drop, macroeconomic conditions, tourism data, promotional activity, table capacity, calendar events, and known disruptions. Let (Y{t+h}) be mass-market drop at forecast horizon (h). The (q)-level Drop-at-Risk is:
[ DaR_{q,t+h} = Q_q!\left(Y_{t+h} \mid \mathcal{F}_t\right) ]
A 5% one-month DaR forecast answers an operational question: if next month comes in worse than expected but still within a statistically plausible range, how low could mass-market drop fall?
The upper-tail version, Drop Capacity-at-Risk, answers the reverse:
[ DCaR_{q,t+h} = Q_{1-q}!\left(Y_{t+h} \mid \mathcal{F}_t\right) ]
If next month runs stronger than expected, how much staffing, table capacity, chip inventory, and cage liquidity might the property need on hand?
Backtesting Drop-at-Risk
DaR is backtested the way any risk forecast should be. If the model issues a 5% DaR estimate, actual drop should fall below that threshold roughly 5% of the time.
[ I_t = \begin{cases} 1, & Y_t < DaR_{q,t} \ 0, & Y_t \geq DaR_{q,t} \end{cases} ]
[ \hat{v} = \frac{1}{T}\sum_{t=1}^{T} I_t ]
Violations that occur too often mean the model understates downside risk; violations that occur too rarely suggest it is too conservative.
Management Application
| Forecast Component | Value | Management Use |
|---|---|---|
| Expected mass-market drop | - | Budget baseline |
| 5% Drop-at-Risk | - | Downside labor and promotion planning |
| 95% Drop Capacity-at-Risk | - | Upside liquidity and table-capacity planning |
| Volatility regime | - | Staffing flexibility |
| Recommended cage posture | - | Cash and chip reserve planning |
| Recommended promotion posture | - | Marketing spend discipline |
DaR is useful because it reframes the management question. Rather than asking only what drop is expected to be, the property asks what range it has to be ready for, and that question maps directly onto floor operations, cage planning, and executive control.
Results Language
Empirical results are reported only after estimation. Until then the paper makes no claim that any coefficient is significant or that one model outperforms another.
The analysis proceeds in five stages. Descriptive statistics and seasonal decomposition describe the distribution and annual structure of mass-market drop. ADF and Phillips-Perron tests fix the appropriate transformation of the dependent variable. A SARIMAX model estimates the conditional mean of real drop growth from seasonality, macroeconomic variables, operating controls, and intervention dummies. Residual diagnostics test whether the mean equation has removed serial dependence and whether ARCH effects remain. GARCH and EGARCH models then estimate the conditional variance and test whether negative drop shocks produce asymmetric volatility.
Model performance is judged through rolling-origin validation. The SARIMAX-GARCH and SARIMAX-EGARCH models are compared against seasonal naive, moving-average, SARIMA, and homoskedastic SARIMAX benchmarks, with selection resting on point-forecast accuracy, interval coverage, residual diagnostics, and operational usefulness.
The contribution is not confined to forecasting accuracy. The broader contribution is a risk-sensitive method for turning casino drop forecasts into staffing, table-limit, promotional, and cage-liquidity decisions.
Conclusion
Mass-market casino drop is more than a seasonal revenue indicator. It is a measure of patron-side liquidity, operating pressure, and discretionary gaming demand. A property that forecasts only the expected level of drop has solved only half the problem, because the uncertainty around that expectation is what decides whether it ends up overstaffed, short of cash, slow to open tables, too aggressive with promotions, or caught flat by a strong weekend.
This paper develops a time-series approach that separates the expected movement of mass-market drop from the volatility around it. The SARIMAX component captures seasonality, macroeconomic exposure, operating capacity, promotions, and structural breaks. The GARCH component captures volatility clustering. The EGARCH extension tests whether unexpectedly weak months generate more future uncertainty than unexpectedly strong ones.
The original contribution is Drop-at-Risk, which converts a statistical forecast into a working operating range. It lets executives plan not just for the most likely month but for plausible downside and upside scenarios, which is what makes the model useful on the floor as well as on paper: labor planning, table deployment, promotion control, and cage liquidity all draw on the same forecast.
A complete empirical version of this study requires monthly casino drop data, CPI, unemployment, consumer confidence, tourism arrivals, promotion records, table count, closure indicators, competitor-event coding, and regulatory-change controls, together with full coefficient estimation, residual diagnostics, volatility modeling, rolling-origin validation, and robustness checks. With those in place, the work moves from a methodological proposal to a complete empirical study of mass-market casino demand risk.
References
Akaike, H. (1974). “A New Look at the Statistical Model Identification.” IEEE Transactions on Automatic Control, 19(6), 716-723.
Bollerslev, T. (1986). “Generalized Autoregressive Conditional Heteroskedasticity.” Journal of Econometrics, 31(3), 307-327.
Box, G. E. P., Jenkins, G. M., Reinsel, G. C., & Ljung, G. M. (2015). Time Series Analysis: Forecasting and Control. 5th ed. Wiley.
Cargill, T. F., & Eadington, W. R. (1978). “Nevada’s Gaming Revenues: Time Characteristics and Forecasting.” Management Science, 24(12), 1221-1230.
Dickey, D. A., & Fuller, W. A. (1979). “Distribution of the Estimators for Autoregressive Time Series with a Unit Root.” Journal of the American Statistical Association, 74(366), 427-431.
Engle, R. F. (1982). “Autoregressive Conditional Heteroscedasticity with Estimates of the Variance of United Kingdom Inflation.” Econometrica, 50(4), 987-1007.
Horvath, C., & Paap, R. (2012). “The Effect of Recessions on Gambling Expenditures.” Journal of Gambling Studies, 28(4), 703-717.
Ljung, G. M., & Box, G. E. P. (1978). “On a Measure of Lack of Fit in Time Series Models.” Biometrika, 65(2), 297-303.
Nelson, D. B. (1991). “Conditional Heteroskedasticity in Asset Returns: A New Approach.” Econometrica, 59(2), 347-370.
Phillips, P. C. B., & Perron, P. (1988). “Testing for a Unit Root in Time Series Regression.” Biometrika, 75(2), 335-346.