The study provides an in-depth analysis of COVID-19 infections in Kenya, aiming to model the non-linear trajectory of daily cases. The research explores two statistical techniques: fractional polynomials and linear splines, to fit the growth of infection rates over time. COVID-19, which first appeared in Kenya in March 2020, exhibited fluctuating trends in daily infections. The study utilizes infection data collected from March 13, 2020, to June 6, 2021. Descriptive statistics and exploratory data analysis revealed significant variability in daily cases, with the infection trajectory characterized by multiple waves. Fractional polynomial models, known for their flexibility in fitting non-linear relationships, were evaluated at varying degrees to identify the best model for COVID-19 incidence trends. The analysis showed that a second-degree fractional polynomial with powers (1, 2) provided the most accurate fit for the data. The closed test algorithm was applied to confirm the model's suitability. Additionally, linear spline models were employed, partitioning the data into segments and fitting linear splines at each knot point. The model with 19 knots demonstrated superior performance based on Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC), outperforming the fractional polynomial model. The comparison of the two methods concluded that linear splines provided a more precise fit for the infection data, capturing the complex nature of COVID-19's spread in Kenya. The study's findings offer critical insights into the infection dynamics and can aid policymakers in resource allocation and mitigation planning during pandemics. The study recommends further analysis by incorporating more covariates and extending the models to other countries for a comparative understanding of pandemic management strategies.
Published in | International Journal of Statistical Distributions and Applications (Volume 10, Issue 4) |
DOI | 10.11648/j.ijsd.20241004.11 |
Page(s) | 78-88 |
Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
Copyright |
Copyright © The Author(s), 2024. Published by Science Publishing Group |
COVID-19, Fractional Polynomials, Linear Splines, Continous Data, Knots Placement, Best Fitting Model, Multivariate Regression Models
[1] | Xu, B., Gutierrez, B., Mekaru, S. et al. (2020) “Epidemiological data from the COVID-19 outbreak, real-time case information”, Sci Data, 7(106) |
[2] | Read, J. M. et al. (2020). Novel coronavirus 2019- nCoV: early estimation of epidemiological parameters and epidemic predictions. MedRxiv, pp. 1-11. |
[3] | Gilbert, M. et al. (2020). Preparedness and vulnerability of African countries against importations of COVID-19: a modelling study. The Lancet, 395(10227), pp. 871-877. |
[4] | Anastassopoulou, C., Russo, L., Tsakris, A., Siettos, C. (2020). Data-based analysis, modeling and forecasting of the COVID-19 outbreak, PLoS ONE, 15(3), e0230405 |
[5] | Keeling, M. J. & Rohani, P. (2008).Modeling Infectious Diseases in Humans and Animals, Princeton University Press. |
[6] |
Ministry of Health. (2020). Kenya COVID-19 cases hits 1,161 Nairobi , Friday May 22, 2020 - MINISTRY OF HEALTH. Retrieved 24 May 2020, from
https://www.health.go.ke/kenya-covid-19-cases- hits-1161-nairobi-friday-may-22-2020/ |
[7] |
Winn, H. (2020). How Kenya is tracking against our COVID-19 model. Retrieved 24 May 2020, from
https://rescue.co/update-5th-may-how-kenya-is-tracking- against-our-covid-19-model/ |
[8] | Nanyingi, M. (2020). Predicting COVID-19: what applying a model in Kenya would look like. Retrieved 25 May 2020, from |
[9] | Tilling, K., Macdonald-Wallis, C., Lawlor, D., Hughes, R., & Howe, L. (2014). Modelling Childhood Growth Using Fractional Polynomials and Linear Splines. Annals Of Nutrition And Metabolism, 65(2-3), 129-138. |
[10] | Wong, E., Wang, B., Garrison, L., Alfonso-Cristancho, R., Flum, D., Arterburn, D., & Sullivan, S. (2011). Examining the BMI-mortality relationship using fractional polynomials. BMC Medical Research Methodology, 11(1). |
[11] | Royston, P. (2017). Model selection for univariable fractional polynomials. The Stata Journal, 17(3), 619- 629. |
[12] | Baneshi, M., Nakhaee, F., & Law, M. (2020). On the Use of Fractional Polynomial Models to Assess Preventive Aspect of Variables: An Example in Prevention of Mortality Following HIV Infection. International Journal Of Preventive Medicine, 4(4), 414-419. |
[13] | Duong, H & Volding, D. (2014). Modelling continuous risk variables: Introduction to fractional polynomial regression. Vietnam Journal of Science, (1). 1-5. |
[14] | Kahan, B., Rushton, H., Morris, T., & Daniel, R. (2016). A comparison of methods to adjust for continuous covariates in the analysis of randomised trials. BMC Medical Research Methodology, 16(1). |
[15] | Harrell Jr FE. (2001). Regression modeling strategies: with applications to linear models, logistic regression, and survival analysis. New York: Springer. |
[16] | Binder, H., Sauerbrei, W., & Royston, P. (2012). Comparison between splines and fractional polynomials for multivariable model building with continuous covariates: a simulation study with continuous response. Statistics In Medicine, 32(13), 2262-2277. |
[17] | Sauerbrei, W., Royston, P., & Binder, H. (2007). Selection of important variables and determination of functional form for continuous predictors in multivariable model building. Statistics In Medicine, 26(30), 5512- 5528. |
[18] | Simonsen, L., Spreeuwenberg, P., Lustig, R., Taylor, R., Fleming, D., & Kroneman, M. et al. (2013). Global Mortality Estimates for the 2009 Influenza Pandemic from the GLaMOR Project: A Modeling Study. Plos Medicine, 10(11), e1001558. |
[19] | Royston, P., Ambler, G., & Sauerbrei, W. (1999). The use of fractional polynomials to model continuous risk variables in epidemiology. International journal of epidemiology, 28(5), 964-974. |
[20] | Hansen, M. H., Huang, J., Kooperberg, C., Stone, C. J., & Truong, Y. K. (2001). Statistical Modeling with Spline Functions Methodology and Theory. |
[21] | Wang, Y. (2011). Smoothing splines: methods and applications. CRC Press. |
[22] | Likhachev, D. V. (2017). Selecting the right number of knots for B-spline parameterization of the dielectric functions in spectroscopic ellipsometry data analysis. Thin Solid Films, 636, 519-526. |
APA Style
Njoroge, D., Mwalili, S., Wanjoya, A. (2024). Fitting COVID-19 Incidences in Kenya Using Fractional Polynomials and Linear Splines. International Journal of Statistical Distributions and Applications, 10(4), 78-88. https://doi.org/10.11648/j.ijsd.20241004.11
ACS Style
Njoroge, D.; Mwalili, S.; Wanjoya, A. Fitting COVID-19 Incidences in Kenya Using Fractional Polynomials and Linear Splines. Int. J. Stat. Distrib. Appl. 2024, 10(4), 78-88. doi: 10.11648/j.ijsd.20241004.11
@article{10.11648/j.ijsd.20241004.11, author = {Damaris Njoroge and Samuel Mwalili and Anthony Wanjoya}, title = {Fitting COVID-19 Incidences in Kenya Using Fractional Polynomials and Linear Splines}, journal = {International Journal of Statistical Distributions and Applications}, volume = {10}, number = {4}, pages = {78-88}, doi = {10.11648/j.ijsd.20241004.11}, url = {https://doi.org/10.11648/j.ijsd.20241004.11}, eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijsd.20241004.11}, abstract = {The study provides an in-depth analysis of COVID-19 infections in Kenya, aiming to model the non-linear trajectory of daily cases. The research explores two statistical techniques: fractional polynomials and linear splines, to fit the growth of infection rates over time. COVID-19, which first appeared in Kenya in March 2020, exhibited fluctuating trends in daily infections. The study utilizes infection data collected from March 13, 2020, to June 6, 2021. Descriptive statistics and exploratory data analysis revealed significant variability in daily cases, with the infection trajectory characterized by multiple waves. Fractional polynomial models, known for their flexibility in fitting non-linear relationships, were evaluated at varying degrees to identify the best model for COVID-19 incidence trends. The analysis showed that a second-degree fractional polynomial with powers (1, 2) provided the most accurate fit for the data. The closed test algorithm was applied to confirm the model's suitability. Additionally, linear spline models were employed, partitioning the data into segments and fitting linear splines at each knot point. The model with 19 knots demonstrated superior performance based on Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC), outperforming the fractional polynomial model. The comparison of the two methods concluded that linear splines provided a more precise fit for the infection data, capturing the complex nature of COVID-19's spread in Kenya. The study's findings offer critical insights into the infection dynamics and can aid policymakers in resource allocation and mitigation planning during pandemics. The study recommends further analysis by incorporating more covariates and extending the models to other countries for a comparative understanding of pandemic management strategies.}, year = {2024} }
TY - JOUR T1 - Fitting COVID-19 Incidences in Kenya Using Fractional Polynomials and Linear Splines AU - Damaris Njoroge AU - Samuel Mwalili AU - Anthony Wanjoya Y1 - 2024/11/18 PY - 2024 N1 - https://doi.org/10.11648/j.ijsd.20241004.11 DO - 10.11648/j.ijsd.20241004.11 T2 - International Journal of Statistical Distributions and Applications JF - International Journal of Statistical Distributions and Applications JO - International Journal of Statistical Distributions and Applications SP - 78 EP - 88 PB - Science Publishing Group SN - 2472-3509 UR - https://doi.org/10.11648/j.ijsd.20241004.11 AB - The study provides an in-depth analysis of COVID-19 infections in Kenya, aiming to model the non-linear trajectory of daily cases. The research explores two statistical techniques: fractional polynomials and linear splines, to fit the growth of infection rates over time. COVID-19, which first appeared in Kenya in March 2020, exhibited fluctuating trends in daily infections. The study utilizes infection data collected from March 13, 2020, to June 6, 2021. Descriptive statistics and exploratory data analysis revealed significant variability in daily cases, with the infection trajectory characterized by multiple waves. Fractional polynomial models, known for their flexibility in fitting non-linear relationships, were evaluated at varying degrees to identify the best model for COVID-19 incidence trends. The analysis showed that a second-degree fractional polynomial with powers (1, 2) provided the most accurate fit for the data. The closed test algorithm was applied to confirm the model's suitability. Additionally, linear spline models were employed, partitioning the data into segments and fitting linear splines at each knot point. The model with 19 knots demonstrated superior performance based on Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC), outperforming the fractional polynomial model. The comparison of the two methods concluded that linear splines provided a more precise fit for the infection data, capturing the complex nature of COVID-19's spread in Kenya. The study's findings offer critical insights into the infection dynamics and can aid policymakers in resource allocation and mitigation planning during pandemics. The study recommends further analysis by incorporating more covariates and extending the models to other countries for a comparative understanding of pandemic management strategies. VL - 10 IS - 4 ER -