Journal Browser
Journal Insights

Frequency: Quarterly

Time to first decision: 2.4 Weeks

Submission to publication: 10 Weeks

Acceptance rate: 26 %

E-ISSN: 2811-0943

Journal Citations

Abstracting and Indexing

徽标低可信度描述已自动生成

Members

徽标, 公司名称描述已自动生成图示中度可信度描述已自动生成

徽标描述已自动生成图标描述已自动生成

Related Journal
Open Access Journal Article

Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times

by Alain Bultez a,*  and  Bert Seghers b
a
Louvain School of Management, Catholic University of Louvain, FUCaM, Mons, Belgium
b
International Post Corporation, Brussels, Belgium
*
Author to whom correspondence should be addressed.
JEA  2025 4(2):103; https://doi.org/10.58567/jea04020006
Received: 3 November 2024 / Accepted: 23 December 2024 / Published Online: 13 February 2025

Abstract

The goal of this paper is to work out a poststratification method for estimating in-time indicators for international end-to-end delivery processes when

  • collected data cannot cover all the strata making up the logistics universe to be surveyed,
  • the true weights of the strata - needed to correct biases in representativeness caused by disproportionate sampling and incomplete coverage - are unknown but can be inferred from marginal subtotals related to stratification criteria considered separately, rather than jointly, and conditional on each end of the delivery journey: outbound- versus inbound-specific. Within this perspective, poststratification is used here to mean a statistical correction of measurements derived from incomplete stratified samples, an ex-post calibration aimed at yielding more accurate estimates based on an analysis of the data. Thus, we tackle instances where ex-ante assignment to strata is not a problem, but when surveying all strata is out of the question.

For that purpose, an econometric model is designed

  • to link the discrete transport lead times, counted in days, of tested items to the specifics of their material characteristics (e.g. size/weight), as well as those of the routes they take through the distribution network (e.g. origin and destination zones),
  • and provide performance predictions for each of the strata, covered as well as non-covered.

Benchmarking the multinomial cumulative logit regression against the negative binomial one reveals that delivery time had better be treated as an ordinal categorical system’s response, rather than as a ratio-scaled count.

The model-based fitted and extrapolated estimates are then used as inputs to the ex-post weighting stage, which produces robust point- and interval-estimates of aggregate key performance indicators (KPIs) through bootstrapping. Simple linear programs provide two extreme weighting sets, one per country-to-country path: the first minimizes the KPIs’ values, while the second maximizes them. 

Probabilities of delivery within deadlines summarize distributions of delivery times better than their means and standard deviations, because logistical efforts to cut transit by one day must be enhanced more and more as it gets shortened. Three types of graphs are proposed to help visualize this exponential increase in the service quality required. The applicability of the methodology developed is demonstrated on the 2023 database of the International Post Corporation. In this case, the imprecision of the KPI estimates depends much more on the uncertainty caused by disturbances occurring during the first- and last-miles, than on the imperfection of the information about the real weights of the strata. 


Copyright: © 2025 by Bultez and Seghers. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (Creative Commons Attribution 4.0 International License). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
Cite This Paper
APA Style
Bultez, A., & Seghers, B. (2025). Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis, 4(2), 103. doi:10.58567/jea04020006
ACS Style
Bultez, A.; Seghers, B. Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis, 2025, 4, 103. doi:10.58567/jea04020006
AMA Style
Bultez A, Seghers B. Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis; 2025, 4(2):103. doi:10.58567/jea04020006
Chicago/Turabian Style
Bultez, Alain; Seghers, Bert 2025. "Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times" Journal of Economic Analysis 4, no.2:103. doi:10.58567/jea04020006

Share and Cite

ACS Style
Bultez, A.; Seghers, B. Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis, 2025, 4, 103. doi:10.58567/jea04020006
AMA Style
Bultez A, Seghers B. Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis; 2025, 4(2):103. doi:10.58567/jea04020006
Chicago/Turabian Style
Bultez, Alain; Seghers, Bert 2025. "Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times" Journal of Economic Analysis 4, no.2:103. doi:10.58567/jea04020006
APA style
Bultez, A., & Seghers, B. (2025). Model-Based Poststratification of Measurements that Imperfectly Cover the Universe Studied: The Case of Postal Delivery Times. Journal of Economic Analysis, 4(2), 103. doi:10.58567/jea04020006

Article Metrics

Article Access Statistics

References

  1. Abdel-Aty, M. A., and Radwan, A. E. (2000). Modeling traffic accident occurrence and involvement. Accident Analysis & Prevention, 32(5), 633-642. https://doi.org/10.1016/S0001-4575(99)00094-9
  2. Abts, K. C., Ivy, J. A., and DeWoody, J. A. (2018). Demographic, environmental and genetic determinants of mating success in captive koalas (Phascolarctos cinereus). Zoo Biology, 37(6), 416–433. https://doi.org/10.1002/zoo.21457
  3. Agresti, A. (2002). Categorical Data Analysis, John Wiley & Sons, Inc. https://doi.org/10.1002/0471249688
  4. Agresti, A. (2007, 2019). An Introduction to Categorical Data Analysis, John Wiley & Sons, Inc.Alam, N., and Lee Ng, S. (2014). Banking mergers - an application of matching strategy. Review of Accounting and Finance, 13(1), 2-23. https://doi.org/10.1108/RAF-12-2012-0124
  5. Amrhein, V., Greenland, S., and McShane, B. (2019). Retire statistical significance. Nature, 567, 305–307. https://doi.org/10.1038/d41586-019-00857-9
  6. Anderson, J. A., and Philips, P. R. (1981). Regression, discrimination and measurement models of ordered categorical variables. Journal of the Royal Statistical Society. Series C (Applied Statistics), 30(1), 22–31. https://doi.org/10.2307/2346654
  7. Armstrong, B. G., and Sloan, M. (1989). Ordinal Regression Models for Epidemiologic Data. American Journal of Epidemiology, 129(1), 191–204. https://doi.org/10.1093/oxfordjournals.aje.a115109
  8. Anderson, J. A., and Philips, P. R. (1981). Regression, discrimination and measurement models of ordered categorical variables. Journal of the Royal Statistical Society. Series C (Applied Statistics), 30(1), 22–31. https://doi.org/10.2307/2346654
  9. Austin, P. C., and Leckie, L. (2020) Bootstrapped inference for variance parameters, measures of heterogeneity and random effects in multilevel logistic regression models, Journal of Statistical Computation and Simulation, 90(17), 3175-3199. https://doi.org/10.1080/00949655.2020.1797738
  10. Barmby, T., Nolan, M., and Winkelmann, R. (2001). Contracted workdays and absence. The Manchester School, 69(3), 269-275. https://doi.org/10.1111/1467-9957.00247
  11. Bermúdez, L., Karlis, D., and Santolino, M. (2018). A discrete mixture regression for modeling the duration of nonhospitalization medical leave of motor accident victims. Accident Analysis and Prevention, 121, 157–165. https://doi.org/10.1016/j.aap.2018.09.006
  12. Betensky, R. (2019). The p-value requires context, not a threshold. The American Statistician, 73(1), 115–117. https://doi.org/10.1080/00031305.2018.1529624
  13. Boysen, N., Fedtke, S., and Schwerdfeger, S. (2021). Last-mile delivery concepts: a survey from an operational research perspective. OR Spectrum, 43, 1–58. https://doi.org/10.1007/s00291-020-00607-8
  14. Brusco, M. (2022). Logistic Regression via Excel Spreadsheets: Mechanics, Model Selection, and Relative Predictor Importance. INFORMS Transactions on Education, 23(1), 1-11. https://doi.org/10.1287/ited.2021.0263
  15. Bultez, A., and Naert, P. (1979). Does Lag Structure Really Matter in Optimi¬zing Advertising Expenditures? Management Science, 25(5). 454-465. https://doi.org/10.1287/mnsc.32.2.182
  16. Bultez, A., Derbaix, C., and Herrmann, J.-L. (2022). Statistically significant? Let us recognize that estimates of tested effects are uncertain. Recherche et Applications En Marketing (English Edition), 37(1), 82-105. https://doi.org/10.1177/20515707211040743
  17. Bultez, A., and Herrmann, J.-L. (2025). Value added to marketing research diagnoses by add-ons to p-values. Journal of Marketing Analytics, forthcoming. https://doi.org/10.1057/s41270-024-00351-w
  18. Bultez, A., Laurent, G., and Lemay, L. (2025). Quantifying relationships between ordinal categorical variables. Application to metrics tracked by satisfaction barometers. Recherche et Applications En Marketing (English Edition), forthcoming. https://www.researchgate.net/publication/388217974_
  19. Calabrese, R., Marra, G., and Osmetti, S. A. (2016). Bankruptcy prediction of small and medium enterprises using a flexible binary generalized extreme value model. The Journal of the Operational Research Society, 67(4), 604-615. https://doi.org/10.1057/jors.2015.64
  20. Calin-Jageman, R. J., and Cumming, G. (2019). The New Statistics for Better Science: Ask How Much, How Uncertain, and What Else Is Known. The American Statistician, 73(sup 1), 271-280. https://doi.org/10.1080/00031305.2018.1518266
  21. Cai, B., and Shimizu, I. (2014). Negative Binomials Regression Model in Analysis of Wait Time at Hospital Emergency Department. Proceedings of the American Statistical Association (April), 0:4262. PMCID: PMC7183738 https://pubmed.ncbi.nlm.nih.gov/32336961/
  22. Carrubba, C., Friedman, B., Martin, A. D., and Vanberg, G. (2012). Who Controls the Content of Supreme Court Opinions? American Journal of Political Science, 56(2), 400–441. https://doi.org/10.1111/j.1540-5907.2011.00557
  23. Castillo, V. E., Mollenkopf, D. A., Bell, J. E., and Bozdogan, H. (2018). Supply Chain Integrity: A Key to Sustainable Supply Chain Management. Journal of Business Logistics, 39(1), 38–56. https://doi.org/10.1111/jbl.12176
  24. Caulkins, J.P., Barnett, A., Larkey, P.D., Yuan, Y., and Goranson, J. (1993). The On-Time Machines: Some Analyses of Airline Punctuality. Operations Research, 41(4), 710-720. https://doi.org/10.1287/opre.41.4.710
  25. CEN: EUROPEAN COMMITTEE FOR STANDARDIZATION. Technical Committee CEN/TC 331, EN 13850 (2020), Postal services - Quality of services -Measurement of the transit time of end-to-end services for single piece priority mail and first class mail. https://www.cencenelec.eu/about-cen/
  26. Dalla V. L., Leisen, F., Rossini, L., and Zhu, W. (2020). Bayesian analysis of immigration in Europe with generalized logistic regression. Journal of Applied Statistics, 47(3), 424-438. https://doi.org/10.1080/02664763.2019.1642310
  27. De Haan, E., Verhoef, P., and Wiesel, T. (2015). The Predictive Ability of Different Customer Feedback Metrics for Retention. International Journal of Research in Marketing, 32(2), 195-206. https://doi.org/10.1016/j.ijresmar.2015.02.004
  28. Denuit, M., Hainaut, D., and Trufin, J. (2019). Effective Statistical Learning Methods for Actuaries I: GLMs and Extensions, Springer Nature Switzerland AG. https://link.springer.com/book/10.1007/978-3-030-25820-7
  29. Diekmann, A., Bruderer, E. H., Hartmann, J., Kurz, K., Liebe, U., and Preisendörfer, P. (2022), Environmental Inequality in Four European Cities: A Study Combining Household Survey and Geo-Referenced Data. European Sociological Review, 39(1), 44-66. https://doi.org/10.1093/esr/jcac028
  30. Dinler, N., and Rankin, W. B. (2020). Increasing Airports' On-Time Arrival Performance Through Airport Capacity and Efficiency Indicators. International Journal of Aviation, Aeronautics, and Aerospace, 7(1). https://doi.org/10.15394/ijaaa.2020.1444
  31. Efron, B., and Tibshirani, R. (1994). An Introduction to Bootstrap. Chapman & Hall /CRC (New York). https://doi.org/10.1201/9780429246593
  32. Efron, B., and Tibshirani, R. (1986). Bootstrap Methods for Standard Errors, Confidence Intervals, and Other Measures of Statistical Accuracy. Statistical Science 1(1), 54-75. https://doi.org/10.1214/ss/1177013815
  33. Ehrenberg, A. S. C. (1959). The pattern of consumer purchases. Journal of the Royal Statistical Society, Series C (Applied Statistics), 8(1), 26-41. https://doi.org/10.2307/2985810
  34. Farid, A., and Ksaibati, K. (2021). Modeling severities of motorcycle crashes using random parameters. Journal of Traffic and Transportation Engineering, 8(2), 225-236. https://doi.org/10.1016/j.jtte.2020.01.001
  35. Guadagni, P. M., and Little, J. D. C. (1983). A Logit Model of Brand Choice Calibrated on Scanner Data. Marketing Science, 2(3), 203-238. https://doi.org/10.1287/mksc.2.3.203
  36. Harrell, F. E. Jr. (2015), Regression Modeling Strategies With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis, Springer International Publishing (Switzerland). https://link.springer.com/book/10.1007/978-3-319-19425-7
  37. Hilbe, J. M. (2011). Negative Binomial Regression, Cambridge University Press. https://doi.org/10.1017/CBO9780511973420
  38. Hurlbert, S. H., Levine, R. A., and Utts. J. (2019). Coup de Grâce for a tough old bull: ‘Statistically significant’ expires. The American Statistician, 73: 352–357. https://doi.org/10.1080/00031305.2018.1543616
  39. Hoang, V.-N., and Watson, J. (2022), Teaching binary logistic regression modeling in an introductory business analytics course. Decision Sciences Journal of Innovative Education 20(4), 201–211. https://doi.org/10.1111/dsji.12274
  40. IPC (March 2024). INTERNATIONAL MAIL QUALITY OF SERVICE MONITORING: UNEX™ CEN 2023, 1-10. https://www.ipc.be/services/operational-performance-services/unex/results
  41. Khan, R. J., Gebreab, S. Y., Sims, M., Riestra, P., Xu1, R., and Sharon K Davis, S. K. (2015). Prevalence, associated factors and heritabilities of metabolic syndrome and its individual components in African Americans: the Jackson Heart Study. British Medical Journal Open, 5(10). https://bmjopen.bmj.com/content/5/10/e008675
  42. Kim, Y., Choi, Y. K., and Emery, S. (2013). Logistic Regression With Multiple Random Effects: A Simulation Study of Estimation Methods and Statistical Packages. The American Statistician 67(3), 171-182. http://www.jstor.org/stable/24591462
  43. Kiernan, K., Tao, J., and Gibbs, P. (2012). Tips and Strategies for Mixed Modeling with SAS/STAT® Procedures. Paper 332-2012, SAS Global Forum Proceedings, SAS Institute Inc., Cary, NC, USA. https://support.sas.com/resources/papers/proceedings12/332-2012.pdf
  44. McCullagh, P. (1980). Regression Models for Ordinal Data. Journal of the Royal Statistical Society, Series B (Methodological), 42(2), 109–142. https://doi.org/10.1111/j.2517-6161.1980.tb01109.x
  45. Mood, C. (2010). Logistic Regression: Why We Cannot Do What We Think We Can Do, and What We Can Do About it. European Sociological Review, 26(1), 67–82. https://doi.org/10.1093/esr/jcp006
  46. Mulder, J., and Wagenmakers, E-J. (eds) (2016). Bayes factors for testing hypotheses in psychological research: Practical relevance and new developments. The Journal of Mathematical Psychology, 72(June), 1–220. https://doi.org/10.1016/j.jmp.2016.01.002
  47. Norton, E. C., Dowd, B. E., and Matthew L. Maciejew, M. L. (2018). Odds Ratios - Current Best Practice and Use, Journal of the American Medical Association, 320(1), 84-85. https://pubmed.ncbi.nlm.nih.gov/29971384/
  48. Papadopoulos, A., Roland, B., and Stark, R. B. (2021). Does Home Health Care Increase the Probability of 30-Day Hospital Readmissions? Interpreting Coefficient Sign Reversals, or Their Absence, in Binary Logistic Regression Analysis. The American Statistician, 75(2), 173-184. https://doi.org/10.1080/00031305.2019.1704873
  49. Peng, C.-Y. J., So, T.-S. H., Stage, F. K., Edward P. St., and John, E. P. (2002). The Use and Interpretation of Logistic Regression in Higher Education Journals: 1988-1999. Research in Higher Education, 43(3), 259-293. https://doi.org/10.1023/A:1014858517172
  50. Pritchard, J. P., Slovic, A. D., Giannotti, M., Geurs, K., Nardocci, A., Hagen-Zanker, A., Diego, B., Tomasiello, D. B., and Kumar, P. (2021). Satisfaction with travel, ideal commuting, and accessibility to employment. Journal of Transport and Land Use, 14(1), 995-1017. http://dx.doi.org/10.5198/jtlu.2021.1835
  51. Rao, J. N. K., Molina, I. (2015). Small Area Estimation, John Wiley & Sons, ISBN: 978-1-118-73578-7
  52. Ross, M. L., Lawston, A. N., Lowsky, L. O., and Hackman, C. L. (2022). What Factors Predict COVID-19 Vaccine Uptake Intention in College Students? American Journal of Health Education, 53(4), 237–247. https://doi.org/10.1080/19325037.2022
  53. Shaban, T. F., and Alkawareek, M. Y. (2022). Prediction of qualitative antibiofilm activity of antibiotics using supervised machine learning techniques. Computers in Biology and Medicine, 140, 1-9, available online. https://doi.org/10.1016/j.compbiomed.2021.105065
  54. Simatupang, T. M., Wright, A. C., and Sridharan, R. (2002). The knowledge of coordination for supply chain integration. Business Process Management Journal, 8(3) 289-308. https://doi.org/10.1108/14637150210428989
  55. Solow, R. M. (1960). On a Family of Lag Distributions. Econometrica, 28(2), 393-406. https://doi.org/10.2307/1907729
  56. Stevens, S. S. (1946). On the theory of scales of measurement. Science, 103 (2684), 677-680. https://doi.org/10.1126/science.103.2684.677
  57. Stoklosa, J., Blakey, R. V., and Hui, F. C. K. (2022) An Overview of Modern Applications of Negative Binomial Modelling in Ecology and Biodiversity Diversity. 14(320), 1-25. https://doi.org/10.3390/d14050320
  58. Sugasawa, S., and Kubokawa, T. (2023). Mixed-Effects Models and Small Area Estimation, Springer Nature Singapore Pte Ltd. https://doi.org/10.1007/978-981-19-9486-9
  59. UPU: Universal Postal Union. (2023). State of the Postal Sector 2023. A Hyper-Collaborative Path to Postal Development, Berne (CH). https://www.upu.int/UPU/media/upu/publications/State-of-the-Postal-Sector-2023.pdf
  60. Uusitalo, J., Ylhäisi, O., Rummukainen, H., and Makkonen, M. (2018) Predicting probability of A-quality lumber of Scots pine (Pinus sylvestris L.) prior to or concurrently with logging operation, Scandinavian Journal of Forest Research, 33(5), 475-483. https://doi.org/10.1080/02827581.2018.1461922
  61. Wang, K.-S., Owusu, D., Yue, P. Y., and Xie, C. (2016). Bayesian logistic regression in detection of gene–steroid interaction for cancer at PDLIM5 locus. Journal of Genetics, 95(2), 331-340. https://doi.org/10.1007/s12041-016-0642-1
  62. Wasserstein, R. L., and Lazar, N. A. (2016). The ASA Statement on p-Values: Context, Process, and Purpose. The American Statistician, 70(2), 129-133. https://doi.org/10.1080/00031305.2016.1154108
  63. White, R. E., Pearson, J. N., and Wilson, J. R. (1999). JIT Manufacturing: A Survey of Implementations in Small and Large U.S. Manufacturers. Management Science, 45(1),1-15.
  64. Winkelmann, R. (2008). Econometric Analysis of Count Data. Springer-Verlag (Berlin). https://link.springer.com/book/10.1007/978-3-540-78389-3
  65. Wolter, K., Jergovic, D., Moore, W., Murphy, J., and O'Muircheartaigh, C. (2003). Reliability of the Uncertified Ballots in the 2000 Presidential Election in Florida. The American Statistician, 57(1), 1-14. https://doi.org/10.1198/0003130031144
  66. Yang, Y., Hu, X., and Jiang, H. (2022). Group penalized logistic regressions predict up and down trends for stock prices. North American Journal of Economics and Finance, 59(January), 1-15. https://doi.org/10.1016/j.najef.2021.101564
  67. Yirga, A. A., Melesse, S. F., Mwambi, H. G., and Ayele, D. G. (2020). Negative binomial mixed models for analyzing longitudinal CD4 count data. Nature, Scientific Reports, 10(16742), open access. https://doi.org/10.1038/s41598-020-73883-7
  68. Zellner, A. (2002). Keep it sophisticatedly simple. In: Zellner A, Keuzenkamp HA, McAleer M, eds. Simplicity, Inference and Modelling: Keeping It Sophisticatedly Simple, Cambridge University Press: 242-262. https://www.cambridge.org/core/books/simplicity-inference-and-modelling/A03DB5BC14FFA1662EBCE2 5316F460FE
  69. Zhang, F., Sun, B., Diao, X., Zhao, W., and Shu, T. (2021). Prediction of adverse drug reactions based on knowledge graph embedding. BMC Medical Informatics and Decision Making, 21(38), 1-11. https://doi.org/10.1186/s12911-021-01402-3