The goal of this paper is to work out a poststratification method for estimating in-time indicators for international end-to-end delivery processes when
collected data cannot cover all the strata making up the logistics universe to be surveyed,
the true weights of the strata - needed to correct biases in representativeness caused by disproportionate sampling and incomplete coverage - are unknown but can be inferred from marginal subtotals related to stratification criteria considered separately, rather than jointly, and conditional on each end of the delivery journey: outbound- versus inbound-specific. Within this perspective, poststratification is used here to mean a statistical correction of measurements derived from incomplete stratified samples, an ex-post calibration aimed at yielding more accurate estimates based on an analysis of the data. Thus, we tackle instances where ex-ante assignment to strata is not a problem, but when surveying all strata is out of the question.
For that purpose, an econometric model is designed
to link the discrete transport lead times, counted in days, of tested items to the specifics of their material characteristics (e.g. size/weight), as well as those of the routes they take through the distribution network (e.g. origin and destination zones),
and provide performance predictions for each of the strata, covered as well as non-covered.
Benchmarking the multinomial cumulative logit regression against the negative binomial one reveals that delivery time had better be treated as an ordinal categorical system’s response, rather than as a ratio-scaled count.
The model-based fitted and extrapolated estimates are then used as inputs to the ex-post weighting stage, which produces robust point- and interval-estimates of aggregate key performance indicators (KPIs) through bootstrapping. Simple linear programs provide two extreme weighting sets, one per country-to-country path: the first minimizes the KPIs’ values, while the second maximizes them.
Probabilities of delivery within deadlines summarize distributions of delivery times better than their means and standard deviations, because logistical efforts to cut transit by one day must be enhanced more and more as it gets shortened. Three types of graphs are proposed to help visualize this exponential increase in the service quality required. The applicability of the methodology developed is demonstrated on the 2023 database of the International Post Corporation.In this case, the imprecision of the KPI estimates depends much more on the uncertainty caused by disturbances occurring during the first- and last-miles, than on the imperfection of the information about the real weights of the strata.
Bultez, A., & Seghers, B. (2025). Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis, 4(2), 103. doi:10.58567/jea04020006
ACS Style
Bultez, A.; Seghers, B. Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis, 2025, 4, 103. doi:10.58567/jea04020006
AMA Style
Bultez A, Seghers B. Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis; 2025, 4(2):103. doi:10.58567/jea04020006
Chicago/Turabian Style
Bultez, Alain; Seghers, Bert 2025. "Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times" Journal of Economic Analysis 4, no.2:103. doi:10.58567/jea04020006
Share and Cite
ACS Style
Bultez, A.; Seghers, B. Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis, 2025, 4, 103. doi:10.58567/jea04020006
AMA Style
Bultez A, Seghers B. Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis; 2025, 4(2):103. doi:10.58567/jea04020006
Chicago/Turabian Style
Bultez, Alain; Seghers, Bert 2025. "Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times" Journal of Economic Analysis 4, no.2:103. doi:10.58567/jea04020006
APA style
Bultez, A., & Seghers, B. (2025). Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis, 4(2), 103. doi:10.58567/jea04020006
Article Metrics
Article Access Statistics
References
Abdel-Aty, M. A., and Radwan, A. E. (2000). Modeling traffic accident occurrence and involvement. Accident Analysis & Prevention, 32(5), 633-642. https://doi.org/10.1016/S0001-4575(99)00094-9
Abts, K. C., Ivy, J. A., and DeWoody, J. A. (2018). Demographic, environmental and genetic determinants of mating success in captive koalas (Phascolarctos cinereus). Zoo Biology, 37(6), 416–433. https://doi.org/10.1002/zoo.21457
Agresti, A. (2002). Categorical Data Analysis, John Wiley & Sons, Inc. https://doi.org/10.1002/0471249688
Agresti, A. (2007, 2019). An Introduction to Categorical Data Analysis, John Wiley & Sons, Inc.Alam, N., and Lee Ng, S. (2014). Banking mergers - an application of matching strategy. Review of Accounting and Finance, 13(1), 2-23. https://doi.org/10.1108/RAF-12-2012-0124
Amrhein, V., Greenland, S., and McShane, B. (2019). Retire statistical significance. Nature, 567, 305–307. https://doi.org/10.1038/d41586-019-00857-9
Anderson, J. A., and Philips, P. R. (1981). Regression, discrimination and measurement models of ordered categorical variables. Journal of the Royal Statistical Society. Series C (Applied Statistics), 30(1), 22–31. https://doi.org/10.2307/2346654
Armstrong, B. G., and Sloan, M. (1989). Ordinal Regression Models for Epidemiologic Data. American Journal of Epidemiology, 129(1), 191–204. https://doi.org/10.1093/oxfordjournals.aje.a115109
Anderson, J. A., and Philips, P. R. (1981). Regression, discrimination and measurement models of ordered categorical variables. Journal of the Royal Statistical Society. Series C (Applied Statistics), 30(1), 22–31. https://doi.org/10.2307/2346654
Austin, P. C., and Leckie, L. (2020) Bootstrapped inference for variance parameters, measures of heterogeneity and random effects in multilevel logistic regression models, Journal of Statistical Computation and Simulation, 90(17), 3175-3199. https://doi.org/10.1080/00949655.2020.1797738
Barmby, T., Nolan, M., and Winkelmann, R. (2001). Contracted workdays and absence. The Manchester School, 69(3), 269-275. https://doi.org/10.1111/1467-9957.00247
Bermúdez, L., Karlis, D., and Santolino, M. (2018). A discrete mixture regression for modeling the duration of nonhospitalization medical leave of motor accident victims. Accident Analysis and Prevention, 121, 157–165. https://doi.org/10.1016/j.aap.2018.09.006
Betensky, R. (2019). The p-value requires context, not a threshold. The American Statistician, 73(1), 115–117. https://doi.org/10.1080/00031305.2018.1529624
Boysen, N., Fedtke, S., and Schwerdfeger, S. (2021). Last-mile delivery concepts: a survey from an operational research perspective. OR Spectrum, 43, 1–58. https://doi.org/10.1007/s00291-020-00607-8
Brusco, M. (2022). Logistic Regression via Excel Spreadsheets: Mechanics, Model Selection, and Relative Predictor Importance. INFORMS Transactions on Education, 23(1), 1-11. https://doi.org/10.1287/ited.2021.0263
Bultez, A., and Naert, P. (1979). Does Lag Structure Really Matter in Optimi¬zing Advertising Expenditures? Management Science, 25(5). 454-465. https://doi.org/10.1287/mnsc.32.2.182
Bultez, A., Derbaix, C., and Herrmann, J.-L. (2022). Statistically significant? Let us recognize that estimates of tested effects are uncertain. Recherche et Applications En Marketing (English Edition), 37(1), 82-105. https://doi.org/10.1177/20515707211040743
Bultez, A., and Herrmann, J.-L. (2025). Value added to marketing research diagnoses by add-ons to p-values. Journal of Marketing Analytics, forthcoming. https://doi.org/10.1057/s41270-024-00351-w
Bultez, A., Laurent, G., and Lemay, L. (2025). Quantifying relationships between ordinal categorical variables. Application to metrics tracked by satisfaction barometers. Recherche et Applications En Marketing (English Edition), forthcoming. https://www.researchgate.net/publication/388217974_
Calabrese, R., Marra, G., and Osmetti, S. A. (2016). Bankruptcy prediction of small and medium enterprises using a flexible binary generalized extreme value model. The Journal of the Operational Research Society, 67(4), 604-615. https://doi.org/10.1057/jors.2015.64
Calin-Jageman, R. J., and Cumming, G. (2019). The New Statistics for Better Science: Ask How Much, How Uncertain, and What Else Is Known. The American Statistician, 73(sup 1), 271-280. https://doi.org/10.1080/00031305.2018.1518266
Cai, B., and Shimizu, I. (2014). Negative Binomials Regression Model in Analysis of Wait Time at Hospital Emergency Department. Proceedings of the American Statistical Association (April), 0:4262. PMCID: PMC7183738 https://pubmed.ncbi.nlm.nih.gov/32336961/
Carrubba, C., Friedman, B., Martin, A. D., and Vanberg, G. (2012). Who Controls the Content of Supreme Court Opinions? American Journal of Political Science, 56(2), 400–441. https://doi.org/10.1111/j.1540-5907.2011.00557
Castillo, V. E., Mollenkopf, D. A., Bell, J. E., and Bozdogan, H. (2018). Supply Chain Integrity: A Key to Sustainable Supply Chain Management. Journal of Business Logistics, 39(1), 38–56. https://doi.org/10.1111/jbl.12176
Caulkins, J.P., Barnett, A., Larkey, P.D., Yuan, Y., and Goranson, J. (1993). The On-Time Machines: Some Analyses of Airline Punctuality. Operations Research, 41(4), 710-720. https://doi.org/10.1287/opre.41.4.710
CEN: EUROPEAN COMMITTEE FOR STANDARDIZATION. Technical Committee CEN/TC 331, EN 13850 (2020), Postal services - Quality of services -Measurement of the transit time of end-to-end services for single piece priority mail and first class mail. https://www.cencenelec.eu/about-cen/
Dalla V. L., Leisen, F., Rossini, L., and Zhu, W. (2020). Bayesian analysis of immigration in Europe with generalized logistic regression. Journal of Applied Statistics, 47(3), 424-438. https://doi.org/10.1080/02664763.2019.1642310
De Haan, E., Verhoef, P., and Wiesel, T. (2015). The Predictive Ability of Different Customer Feedback Metrics for Retention. International Journal of Research in Marketing, 32(2), 195-206. https://doi.org/10.1016/j.ijresmar.2015.02.004
Denuit, M., Hainaut, D., and Trufin, J. (2019). Effective Statistical Learning Methods for Actuaries I: GLMs and Extensions, Springer Nature Switzerland AG. https://link.springer.com/book/10.1007/978-3-030-25820-7
Diekmann, A., Bruderer, E. H., Hartmann, J., Kurz, K., Liebe, U., and Preisendörfer, P. (2022), Environmental Inequality in Four European Cities: A Study Combining Household Survey and Geo-Referenced Data. European Sociological Review, 39(1), 44-66. https://doi.org/10.1093/esr/jcac028
Dinler, N., and Rankin, W. B. (2020). Increasing Airports' On-Time Arrival Performance Through Airport Capacity and Efficiency Indicators. International Journal of Aviation, Aeronautics, and Aerospace, 7(1). https://doi.org/10.15394/ijaaa.2020.1444
Efron, B., and Tibshirani, R. (1994). An Introduction to Bootstrap. Chapman & Hall /CRC (New York). https://doi.org/10.1201/9780429246593
Efron, B., and Tibshirani, R. (1986). Bootstrap Methods for Standard Errors, Confidence Intervals, and Other Measures of Statistical Accuracy. Statistical Science 1(1), 54-75. https://doi.org/10.1214/ss/1177013815
Ehrenberg, A. S. C. (1959). The pattern of consumer purchases. Journal of the Royal Statistical Society, Series C (Applied Statistics), 8(1), 26-41. https://doi.org/10.2307/2985810
Farid, A., and Ksaibati, K. (2021). Modeling severities of motorcycle crashes using random parameters. Journal of Traffic and Transportation Engineering, 8(2), 225-236. https://doi.org/10.1016/j.jtte.2020.01.001
Guadagni, P. M., and Little, J. D. C. (1983). A Logit Model of Brand Choice Calibrated on Scanner Data. Marketing Science, 2(3), 203-238. https://doi.org/10.1287/mksc.2.3.203
Harrell, F. E. Jr. (2015), Regression Modeling Strategies With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis, Springer International Publishing (Switzerland). https://link.springer.com/book/10.1007/978-3-319-19425-7
Hilbe, J. M. (2011). Negative Binomial Regression, Cambridge University Press. https://doi.org/10.1017/CBO9780511973420
Hurlbert, S. H., Levine, R. A., and Utts. J. (2019). Coup de Grâce for a tough old bull: ‘Statistically significant’ expires. The American Statistician, 73: 352–357. https://doi.org/10.1080/00031305.2018.1543616
Hoang, V.-N., and Watson, J. (2022), Teaching binary logistic regression modeling in an introductory business analytics course. Decision Sciences Journal of Innovative Education 20(4), 201–211. https://doi.org/10.1111/dsji.12274
IPC (March 2024). INTERNATIONAL MAIL QUALITY OF SERVICE MONITORING: UNEX™ CEN 2023, 1-10. https://www.ipc.be/services/operational-performance-services/unex/results
Khan, R. J., Gebreab, S. Y., Sims, M., Riestra, P., Xu1, R., and Sharon K Davis, S. K. (2015). Prevalence, associated factors and heritabilities of metabolic syndrome and its individual components in African Americans: the Jackson Heart Study. British Medical Journal Open, 5(10). https://bmjopen.bmj.com/content/5/10/e008675
Kim, Y., Choi, Y. K., and Emery, S. (2013). Logistic Regression With Multiple Random Effects: A Simulation Study of Estimation Methods and Statistical Packages. The American Statistician 67(3), 171-182. http://www.jstor.org/stable/24591462
Kiernan, K., Tao, J., and Gibbs, P. (2012). Tips and Strategies for Mixed Modeling with SAS/STAT® Procedures. Paper 332-2012, SAS Global Forum Proceedings, SAS Institute Inc., Cary, NC, USA. https://support.sas.com/resources/papers/proceedings12/332-2012.pdf
McCullagh, P. (1980). Regression Models for Ordinal Data. Journal of the Royal Statistical Society, Series B (Methodological), 42(2), 109–142. https://doi.org/10.1111/j.2517-6161.1980.tb01109.x
Mood, C. (2010). Logistic Regression: Why We Cannot Do What We Think We Can Do, and What We Can Do About it. European Sociological Review, 26(1), 67–82. https://doi.org/10.1093/esr/jcp006
Mulder, J., and Wagenmakers, E-J. (eds) (2016). Bayes factors for testing hypotheses in psychological research: Practical relevance and new developments. The Journal of Mathematical Psychology, 72(June), 1–220. https://doi.org/10.1016/j.jmp.2016.01.002
Norton, E. C., Dowd, B. E., and Matthew L. Maciejew, M. L. (2018). Odds Ratios - Current Best Practice and Use, Journal of the American Medical Association, 320(1), 84-85. https://pubmed.ncbi.nlm.nih.gov/29971384/
Papadopoulos, A., Roland, B., and Stark, R. B. (2021). Does Home Health Care Increase the Probability of 30-Day Hospital Readmissions? Interpreting Coefficient Sign Reversals, or Their Absence, in Binary Logistic Regression Analysis. The American Statistician, 75(2), 173-184. https://doi.org/10.1080/00031305.2019.1704873
Peng, C.-Y. J., So, T.-S. H., Stage, F. K., Edward P. St., and John, E. P. (2002). The Use and Interpretation of Logistic Regression in Higher Education Journals: 1988-1999. Research in Higher Education, 43(3), 259-293. https://doi.org/10.1023/A:1014858517172
Pritchard, J. P., Slovic, A. D., Giannotti, M., Geurs, K., Nardocci, A., Hagen-Zanker, A., Diego, B., Tomasiello, D. B., and Kumar, P. (2021). Satisfaction with travel, ideal commuting, and accessibility to employment. Journal of Transport and Land Use, 14(1), 995-1017. http://dx.doi.org/10.5198/jtlu.2021.1835
Rao, J. N. K., Molina, I. (2015). Small Area Estimation, John Wiley & Sons, ISBN: 978-1-118-73578-7
Ross, M. L., Lawston, A. N., Lowsky, L. O., and Hackman, C. L. (2022). What Factors Predict COVID-19 Vaccine Uptake Intention in College Students? American Journal of Health Education, 53(4), 237–247. https://doi.org/10.1080/19325037.2022
Shaban, T. F., and Alkawareek, M. Y. (2022). Prediction of qualitative antibiofilm activity of antibiotics using supervised machine learning techniques. Computers in Biology and Medicine, 140, 1-9, available online. https://doi.org/10.1016/j.compbiomed.2021.105065
Simatupang, T. M., Wright, A. C., and Sridharan, R. (2002). The knowledge of coordination for supply chain integration. Business Process Management Journal, 8(3) 289-308. https://doi.org/10.1108/14637150210428989
Solow, R. M. (1960). On a Family of Lag Distributions. Econometrica, 28(2), 393-406. https://doi.org/10.2307/1907729
Stevens, S. S. (1946). On the theory of scales of measurement. Science, 103 (2684), 677-680. https://doi.org/10.1126/science.103.2684.677
Stoklosa, J., Blakey, R. V., and Hui, F. C. K. (2022) An Overview of Modern Applications of Negative Binomial Modelling in Ecology and Biodiversity Diversity. 14(320), 1-25. https://doi.org/10.3390/d14050320
Sugasawa, S., and Kubokawa, T. (2023). Mixed-Effects Models and Small Area Estimation, Springer Nature Singapore Pte Ltd. https://doi.org/10.1007/978-981-19-9486-9
UPU: Universal Postal Union. (2023). State of the Postal Sector 2023. A Hyper-Collaborative Path to Postal Development, Berne (CH). https://www.upu.int/UPU/media/upu/publications/State-of-the-Postal-Sector-2023.pdf
Uusitalo, J., Ylhäisi, O., Rummukainen, H., and Makkonen, M. (2018) Predicting probability of A-quality lumber of Scots pine (Pinus sylvestris L.) prior to or concurrently with logging operation, Scandinavian Journal of Forest Research, 33(5), 475-483. https://doi.org/10.1080/02827581.2018.1461922
Wang, K.-S., Owusu, D., Yue, P. Y., and Xie, C. (2016). Bayesian logistic regression in detection of gene–steroid interaction for cancer at PDLIM5 locus. Journal of Genetics, 95(2), 331-340. https://doi.org/10.1007/s12041-016-0642-1
Wasserstein, R. L., and Lazar, N. A. (2016). The ASA Statement on p-Values: Context, Process, and Purpose. The American Statistician, 70(2), 129-133. https://doi.org/10.1080/00031305.2016.1154108
White, R. E., Pearson, J. N., and Wilson, J. R. (1999). JIT Manufacturing: A Survey of Implementations in Small and Large U.S. Manufacturers. Management Science, 45(1),1-15.
Winkelmann, R. (2008). Econometric Analysis of Count Data. Springer-Verlag (Berlin). https://link.springer.com/book/10.1007/978-3-540-78389-3
Wolter, K., Jergovic, D., Moore, W., Murphy, J., and O'Muircheartaigh, C. (2003). Reliability of the Uncertified Ballots in the 2000 Presidential Election in Florida. The American Statistician, 57(1), 1-14. https://doi.org/10.1198/0003130031144
Yang, Y., Hu, X., and Jiang, H. (2022). Group penalized logistic regressions predict up and down trends for stock prices. North American Journal of Economics and Finance, 59(January), 1-15. https://doi.org/10.1016/j.najef.2021.101564
Yirga, A. A., Melesse, S. F., Mwambi, H. G., and Ayele, D. G. (2020). Negative binomial mixed models for analyzing longitudinal CD4 count data. Nature, Scientific Reports, 10(16742), open access. https://doi.org/10.1038/s41598-020-73883-7
Zellner, A. (2002). Keep it sophisticatedly simple. In: Zellner A, Keuzenkamp HA, McAleer M, eds. Simplicity, Inference and Modelling: Keeping It Sophisticatedly Simple, Cambridge University Press: 242-262. https://www.cambridge.org/core/books/simplicity-inference-and-modelling/A03DB5BC14FFA1662EBCE2 5316F460FE
Zhang, F., Sun, B., Diao, X., Zhao, W., and Shu, T. (2021). Prediction of adverse drug reactions based on knowledge graph embedding. BMC Medical Informatics and Decision Making, 21(38), 1-11. https://doi.org/10.1186/s12911-021-01402-3