SciELO - Scientific Electronic Library Online

 
vol.38 issue3Effectiveness of constraint-induced movement therapy on motor function and daily living independence in people with stroke compared to other interventions: a systematic reviewSociodemographic and clinical characterization of patients with primary eosinophilic gastrointestinal pathology in a third level hospital in Colombia, 2019-2022 author indexsubject indexarticles search
Home Pagealphabetic serial listing  

Services on Demand

Journal

Article

Indicators

Related links

  • On index processCited by Google
  • Have no similar articlesSimilars in SciELO
  • On index processSimilars in Google

Share


Medicas UIS

Print version ISSN 0121-0319On-line version ISSN 1794-5240

Medicas UIS vol.38 no.3 Bucaramanga Sep./Dec. 2025  Epub Oct 01, 2025

https://doi.org/10.18273/revmed.v38n3-2025007 

Artículo original

Evaluación puntuación CALL, COVID-GRAM e ISARIC para riesgo de ingreso a cuidados intensivos y mortalidad en pacientes COVID-19 hospitalizados: estudio observacional multicéntrico, Bogotá, Colombia, 2020

Evaluation of CALL, COVID-GRAM, and ISARIC scores for intensive care unit admission and mortality risk in hospitalized COVID-19 patients: a multicenter observational study, in Bogotá, Colombia, 2020

John Jaime Sprockel-Díaz1  2  3  * 
http://orcid.org/0000-0002-7021-6769

Sebastián Tabares-Rodríguez2 
http://orcid.org/0000-0001-5470-4079

María Paula Carrillo-Ayerbe2 
http://orcid.org/0000-0002-7349-7105

Iván Santiago Acuña-Cortés2 
http://orcid.org/0000-0001-5762-3434

Ruddy Paola Montoya-Rumpf2 
http://orcid.org/0000-0002-1559-2470

Eliana Angarita-González2 
http://orcid.org/0000-0002-4697-9854

Victoria Elena Coral-Zuñiga2 
http://orcid.org/0000-0002-6786-7771

Hellen Cárdenas-Rodríguez3 
http://orcid.org/0000-0001-8678-1191

Edna Carolina Araque-Parra2  4 
http://orcid.org/0000-0003-3444-9722

Jhon EdisonParra-Mancipe3 
http://orcid.org/0000-0002-0362-6223

1Internal Medicine Department, San José Hospital. Bogotá, Colombia.

2Fundación Universitaria de Ciencias de la Salud (Health Sciences University Foundation). Bogotá, Colombia.

3Department of intensive Care, El Tunal Hospital - South Integrated Health Services Subnetwork. Bogotá, Colombia.

4Department of Internal Medicine, San José University Children's Hospital. Bogotá, Colombia.


Abstract:

Introduction:

Accurate risk assessment is essential for sound decision-making in patients hospitalized with COVID-19, and several prediction rules have been developed to estimate their risk of death or admission to intensive care.

Objective:

To validate the CALL, COVID-GRAM, and ISARIC-4C scores for predicting intensive care unit admission or death among patients hospitalized with COVID-19 in a cohort from three Bogotá hospitals during 2020.

Methods:

A prospective multicenter analytical observational study that included adults hospitalized for COVID-19 between April and November 2020 in three fourth-level hospitals in Bogotá. The scores from the three scales were calculated for each participant. Receiver-operating-characteristic curves were plotted and areas under the curve with 95 % confidence intervals were estimated; calibration was assessed with the Brier statistic and LOESS smoothing, and clinical utility was examined using decision-curve analysis.

Results:

Of 793 patients, 358 had data to calculate the three prediction rules. 186 (26.2 %) were admitted to intensive care and 120 (16,9 %) died. The areas under the curve for intensive care admission were: COVID-GRAM 0,67 (95 % CI: 0,61-0,73), CALL score 0,54 (95 % CI: 0,48-0,60), ISARIC-4C 0,61 (95 % CI: 0,55-0,67). For mortality they were: COVID-GRAM 0,76 (95 % CI: 0,70-0,81), CALL score 0,65 (95 % CI: 0,58-0,73), ISARIC-4C 0,75 (95 % CI: 0,6-0,8).

Conclusions:

the clinical prediction rules evaluated showed poor to moderate performance for intensive care admission and good performance for mortality. The ISARIC-4C and COVID-GRAM could be considered to evaluate the risk of death.

Keywords: COVID-19; SARS-CoV-2; mortality; prognosis; clinical prediction rule; critical care

Resumen:

Introducción:

La evaluación del riesgo es necesaria para la correcta toma de decisiones en pacientes hospitalizados por COVID-19. Existen reglas de predicción para estimar el riesgo de muerte o ingreso a cuidados intensivos de estos pacientes.

Objetivo:

Validar el puntaje CALL, COVID-GRAM y ISARIC 4C para la admisión a cuidados intensivos o el fallecimiento en pacientes hospitalizados por COVID-19 en una cohorte de tres hospitales de Bogotá durante 2020.

Métodos:

Estudio observacional analítico prospectivo multicéntrico que incluyó adultos hospitalizados por COVID-19, entre abril y noviembre de 2020, en tres hospitales de cuarto nivel en Bogotá. Para cada participante se calcularon las tres escalas. Se estimaron curvas ROC con sus áreas bajo la curva e IC 95 %, se evaluó la calibración mediante estadística de Brier y LOESS, y la utilidad clínica mediante curvas de decisión.

Resultados:

De 793 pacientes, 358 tenían datos para calcular las tres escalas; 186 (26,2 %) fueron admitidos en cuidados intensivos y 120 (16,9 %) fallecieron. Las áreas bajo la curva para admisión a cuidados intensivos fueron: COVID-GRAM 0,67 (IC 95 %: 0,61-0,73), CALL 0,54 (IC 95 %: 0,48-0,60), ISARIC-4C 0,61 (IC 95 %: 0,55-0,67). Para la mortalidad fueron: COVID-GRAM 0,76 (IC 95 %: 0,70-0,81), puntaje CALL 0,65 (IC 95 %: 0,58-0,73), ISARIC-4C 0,75 (IC 95 %: 0,6-0,8).

Conclusiones:

las reglas de predicción mostraron un rendimiento pobre a moderado para la admisión en unidad de cuidados intensivos y un buen rendimiento para la mortalidad. El ISARIC-4C y COVID-GRAM podrían considerarse para evaluar el riesgo de muerte.

Palabras clave: COVID-19; SARS-CoV-2; mortalidad; pronóstico; reglas de predicción clínica; cuidados críticos

Introduction

In December 2019, the Wuhan Municipal Health Commission (China) reported a cluster of pneumonia cases, later identified as being caused by a novel coronavirus initially named 2019 n-CoV and then SARS-CoV-21; progressively, it reached high levels of intercontinental propagation until March 11 of 2020, when the World Health Organization categorized it as a pandemic2. In 2020, Colombia experienced 12 784 confirmed COVID-19 deaths, with a significant excess mortality of 66 % above historical averages and 72 % of these deaths occurring in hospitals3. As a reflection of the most critical phases of the pandemic, by October 2020, 57,7 % of the intensive care unit (ICU) beds in the country were occupied. By early 2021, Colombia had doubled its capacity, increasing from 5539 ICU beds in 2019 to 11 9054. In Bogotá, the situation was even more critical: by January 2021, ICU bed occupancy for COVID-19 patients was around 93 %5. The scientific community has focused its efforts on the knowledge of the natural evolution of the disease and the recognition of risk factors that can predict adverse clinical outcomes such as admission to the ICU and/or death.

The clinical manifestations and severity of the resulting disease, known as COVID-19 (Coronavirus disease), exhibit wide heterogeneity6, leading to challenges in forecasting its outcomes. These manifestations can range from mild symptoms to severe pneumonia, which can rapidly progress to acute respiratory failure, septic shock, and multiple organ dysfunctions that could result in death. This variability accentuates the complexity and unpredictability of the disease's impact on individuals. It has been postulated that robust models capable of predicting the prognosis of COVID-19 are urgently needed to support decisions about protection, hospital admission, treatment, and population-level interventions7. A systematic review conducted shortly after the pandemic was declared identified 31 predictive models for diagnosis and prognosis8, an update in February 2021 already counted 232 models among which the Jehi group's model for diagnosis and the 4C Mortality Score for prognosis are considered promising9.

Ji et al. developed the CALL score, a clinical prediction rule for the risk of progression in 208 patients from two hospitals in Beijing (China)10. The resulting nomogram included age, comorbidities, lymphocytes, and lactate dehydrogenase, and has become one of the most widely utilized and validated to date. Liang et al. developed and validated the COVID-GRAM on 1590 patients from 575 hospitals in 31 provinces in China11, the model incorporates variables such as: X-ray abnormalities, age, hemoptysis, dyspnea, unconsciousness, number of comorbidities, cancer history, neutrophil to lymphocyte ratio, lactate dehydrogenase, and direct bilirubin, each weighted differently in the formula. Subsequently, two separate models were developed by the ISARIC-4C investigators group: the ISARIC-4C Deterioration model12 and the ISARIC-4C Mortality Score13 on 73 948 and 35 463 patients respectively from 260 hospitals in England, Scotland, and Wales. It predicts COVID-19 severity based on age, sex, comorbidities, respiratory rate, oxygen saturation, Glasgow Coma Scale, urea/BUN, and C-reactive protein levels. The supplementary material presents the nomograms and formulas of the different prediction rules.

SARS-CoV-2 virus infection has a natural evolution different from other previously known types of pneumonia, with a high capacity for contagion and high morbidity and mortality; it is for this reason that clinical prediction rules capable of predicting unfavorable outcomes are needed. Currently, there is limited information regarding the performance of these predictive tools in our context, which makes the validation of the CALL score, COVID-GRAM, and ISARIC-4C models a priority task, especially given the risk of resurgence of the disease. The aim of this study is to validate the clinical prediction rules specifically designed for COVID-19 prognosis: CALL score, COVID- GRAM, and ISARIC-4C for the prediction of clinical worsening given by admission to intensive care or death in hospitalized patients in a cohort of 3 hospitals in Bogotá, Colombia, during 2020.

Materials and methods

Type of study: a prospective observational analytical multicenter study was carried out.

Population: inclusion criteria were: 1) adult individuals hospitalized for COVID-19 confirmed by a Real-Time Polymerase Chain Reaction (RT-PCR) in nasal swab during the period between april 15 and november 30, 2020, in three fourth-level care hospitals in Bogotá, Colombia: El Tunal Hospital, San José Hospital, and University Children's Hospital San Jose. 2) Patients with direct admission to intensive care, those referred after 72 hours of stay in another institution, pregnant women, and those with any condition that seriously affected their short-term survival, such as advanced neoplasms, poorly controlled hepatopathies, renal disease, or patients who had declined advanced resuscitation measures, were excluded from the study.

Screening: after identifying COVID-19 compatible cases from the institutional censuses, the process involved weekly reviews of the hospitalization rosters from the internal medicine services within the designated COVID-19 areas. Potential study participants were identified during these reviews. Subsequently, the inclusion criteria for each identified case were meticulously verified to ensure eligibility.

Information collection: the information for each patient was recorded in the study format only after confirming discharge, transfer to the ICU, or death. Was completed a virtual data collection form, capturing variables recommended by the International Severe Acute Respiratory and Emerging Infections Consortium (ISARIC) of the World Health Organization, based on clinical observations and laboratory data from the patient's admission13. Additionally, patients were followed up after their discharge to determine the outcome regarding transition to intensive care or death, ensuring a comprehensive understanding of the patient journey and the disease's progression.

Variables: the study analyzed a comprehensive set of demographics, clinical, and laboratory variables to understand their impact on COVID-19 outcomes. The key demographic and clinical variables included the number and percentage of female patients, the average age in years with Standard Deviation (SD), and the number and percentage of patients with obesity. Additionally, the presence of comorbidities was documented, including hypertension, diabetes, chronic heart disease (excluding hypertension), chronic kidney disease, smoking, chronic lung disease, chronic neurological disease, and cirrhosis. The duration of illness before hospital admission was also recorded, with the mean number of days and SD.

Laboratory variables were meticulously documented to provide a detailed clinical picture. These included the mean white blood cell count in x103 cells per liL with SD, lymphocyte count in x103 cells per liL with SD, and the number and percentage of patients with lymphocytes smaller than 1,2 x103 cells per |lL. The mean platelet counts in x103 cells per liL with SD, lactate levels in mmol/L with SD, and creatinine levels in mg/dL with SD were also recorded. Furthermore, liver enzyme levels, such as aspartate aminotransferase and alanine aminotransferase, were measured, along with the number and percentage of patients with prolonged clotting time. Inflammatory markers such as high-sensitivity C-reactive protein, ferritin, D-dimer, and lactate dehydrogenase were noted, along with the number and percentage of patients with positive high-sensitivity cardiac troponin I results. Clinical prediction scores, including the CALL score, COVID-GRAM, and ISARIC-4C, were also included to assess their association with ICU admission and mortality.

Measures to control potential biases: to mitigate potential biases, several measures were implemented throughout the study. First, strict inclusion and exclusion criteria were applied to ensure the homogeneity of the patient population, focusing on adults hospitalized with confirmed COVID-19 via RT-PCR tests. Weekly reviews of hospitalization rosters and meticulous verification of inclusion criteria helped identify eligible participants accurately. Additionally, was used a virtual data collection form to capture variables recommended by ISARIC of the World Health Organization, based on clinical observations and laboratory data from the patient's admission. Follow-ups were conducted after patient discharge to determine outcomes regarding transition to intensive care or death, ensuring comprehensive data collection and minimizing information bias. Despite these efforts, some potential biases could not be completely avoided, such as the inherent severity of cases in fourth-level hospitals and the impact of other conditions on patient outcomes.

Statistical analysis: the description was made taking the qualitative variables reported with absolute frequencies, percentages and the quantitative variables with measures of central tendency and dispersion according to the distribution of the variables. The score for each patient was calculated for each of the study scales (CALL score, COVID-GRAM, ISARIC-4C in its versions for deterioration and mortality). Based on this score, the respective receiver operating characteristic (ROC) curves were constructed for each prediction rule, taking into account the two outcomes evaluated: transfer to the ICU and in hospital death. The area under the ROC curve (AUC) with its respective 95 % confidence interval (CI) was calculated. The AUC ranges classify as follows: AUC = 0,5 indicates no better discrimination than chance, 0,7 < AUC < 0,8 indicates acceptable discrimination, 0,8 < AUC < 0,9 indicates good discrimination, and AUC >0,9 indicates excellent discrimination. The optimal cut off value was calculated using the Youden technique, assuming normal distribution of the variables, from which contingency tables were constructed and the operative characteristics (sensitivity, specificity, accuracy, positive and negative predictive values, and positive and negative likelihood ratios) were calculated for the outcomes described.

Calibration of each of the prediction rules was performed by plotting the scores obtained (x-axis) against the proportion of observed events (y-axis); regarding the prediction of ICU admission or death using LOESS (locally estimated scatterplot smoothing) curves, then was calculated the Brier Score. In order to incorporate clinical decision reasoning into the model evaluation, was used the decision curve analysis framework, where predictive models are compared with common strategies to treat all or none of the patients; the net benefit for each strategy was calculated by subtracting the proportion of false positives from true positives weighted by the relative harm of a false positive and false negative result. Analyses were performed in the statistical program R version 4.0.2 (R Foundation, Vienna, Austria) using the packages "pROC", "ROCit", and "rmda".

Sampling: a consecutive sampling (non-probabilistic) method was used, including patients in the order of their admission who met the inclusion criteria during the study period. Although no formal sample size calculation was performed prior to the start of the study, a post hoc estimation based on an expected AUC of 0,75, an event rate of 26 %, a significance level of 5 %, and a statistical power of 80 % suggests that approximately 514 patients were required.

Measures to control potential biases: given the importance of maintaining the integrity and accuracy of the analysis, it was deemed inappropriate to perform data imputation for the missing values.

Ethical considerations: the study was approved by the ethics and research committees of each of the institutions (act number 138 of the Health Services Unit from El Tunal Hospital, 0498-2020 of the San José Hospital and SDM-026-20 of the University Children's Hospital San José) and informed consent was not required. Funding was obtained from the Health Sciences University Foundation from the call for research promotion number DI-I-0631-20.

Results

Between april 15 and november 30 a total of 1150 patients were screened, of whom 793 met the inclusion criteria and were enrolled in the study.

However, only 358 patients had complete data available to calculate all three clinical prediction scores evaluated. 98 patients came from El Tunal Hospital, 142 from San José Hospital, and 118 from University Children's Hospital San José. Clinical worsening indicating transfer to the ICU primarily due to complications related to COVID-19 occurred in 186 (60 %) patients and 128 (35,7 %) patients died.

Table 1 describes the general characteristics of the population, 136 (38,0 %) were women, the average age was 58,9 years (SD 15,7) and the duration of symptoms before admission was 8 days (SD 6,3). The most frequent comorbidities were: arterial hypertension in 138 (38,5 %), obesity in 71 of the 213 patients (33,3 %), and smoking in 83 patients (23,2 %); other comorbidities less frequently identified were diabetes in 67 (18,7 %), chronic lung disease in 49 (13,7 %), chronic heart disease in 37 (10,3 %), chronic kidney disease in 18 (5 %), chronic neurological disease in 26 (7,3 %), and cirrhosis in 1 (0,3 %). The mean lymphocyte count was 1060 cells/LiL (SD 800), with 246 (68,7 %) cases below 1200; creatinine was 1.1 mg/dL (SD 1,4), C Reactive Protein 68,7 mg/dL (SD 92,4), ferritin 1091 ng/mL (SD 1124), D-dimer 1252 m g/ml (SD 2515), lactate dehydrogenase 520 U/L (SD 316); troponin I was positive in 69 of 327 patients in whom it was measured (21,1 %).

Table 1 General characteristics of the population. 

Characteristics Total Study Population (n = 358) Survivors (n = 230) No Survivors (n = 128)
Female sex, n (%) 136 (38,0) 93 (40,4) 43 (33,6)
Age (years), average (SD) 58,9 (15,7) 57,4 (16,1) 61,5 (14,5)
Obesity, n/no. patients with data (%) 71/213 (33,3) 40/126 (31,7) 31/87 (35,6)
Comorbidities, n (%)
Hypertension 138 (38,5) 90 (39,1) 48 (37,5)
Diabetes 67 (18,7) 42 (18,2) 25 (19,5)
Chronic Heart Disease (except Hypertension) 37 (10,3) 23 (10) 14 (10,9)
Chronic Kidney Disease 18 (5) 7 (3) 11 (8,6)
Smoking 83 (23,2) 51 (22,1) 32 (25)
Chronic Lung Disease 49 (13,7) 32 (13,9) 17 (13,2)
Chronic Neurological Disease 26 (7,3) 15 (6,5) 11 (8,6)
Cirrhosis 1 (0,3) 0 (0) 1 (0,8)
Duration of illness before admission to hospital (days), mean (SD) 8 (6,3) 7,6 (4,1) 8,8 (8,9)
Laboratories
White blood cell count (×10³ cell per μL), mean (SD) 9,5 (5,8) 8,8 (6,2) 10,7 (5,1)
Lymphocyte count (×10³ cells per μL), mean (SD) 1,1 (0,8) 1,2 (0,9) 0,9 (0,5)
Lymphocytes smaller than 1,2 ×10³ cells per μL, n (%) 246 (68,7) 147 (63,9) 99 (77,3)
Platelet count (×10³ cells per μL), mean (SD) 230 (87) 231 (91) 227 (78)
Lactate (mmol/L), mean (SD) 1,6 (0,8) 1,48 (0,7) 1,7 (0,8)
Creatinine (mg/dL), mean (SD) 1,1 (1,4) 0,95 (1,0) 1,4 (1,9)
Aspartate aminotransfera se (U/L), average (SD) 66 (147,7) 53,3 (33,9) 87,7 (238,6)
Alanine aminotransfera se (U/L), mean (SD) 58,7 (76,7) 51,2 (38,8) 72,1 (116,8)
Prolonged clotting time more than 5 seconds, nr/no. patients with test (%) 69/233 (29,6) 39/134 (29,1) 30/127 (24,1)
High- sensitivity C- reactive protein (mg/L), mean (SD) 68,7 (92,4) 68,2 (89,5) 69,4 (97,5)
Ferritin (ng/mL), mean (SD) 1091 (1123) 931 (891) 1388 (1417)
D-dimer (μg/mL), mean (SD) 1252 (2515) 983 (1454) 1732 (3683)
Lactate dehydrogenase (U/L), mean (SD) 520 (316) 460 (238) 627 (401)
Positive high- sensitivity cardiac troponin I, n/no. patients with test (%) 69/327 (21,1) 37/207 (17,8) 32/120 (26,6)

*SD: Standard Deviation.

Source: the authors.

Figure 1 and Table 2 present the ROC curves of the different clinical prediction rules for the outcomes evaluated. For the prediction of admission to the ICU, the performance was moderate to poor, obtaining an AUC of 0,68 (95 % CI: 0,62-0,73) for the COVID-GRAM, 0,62 (95 % CI: 0,55-0,67) for the ISARIC-4C and 0,55 (95 % CI: 0,48-0,60) for the CALL score. Regarding the prediction of death, an improvement is shown, obtaining an AUC of 0,76 (95 % CI: 0,71-0,82) for the COVID-GRAM, 0,76 (95 % CI: 0,70-0,82) for the ISARIC-4C and 0,66 (95 % CI: 0,59-0,73) for the CALL score.

*AUC: Area Under the ROC Curve. *FPR: False Positive Rate Source: authors.

Figure 1 ROC curves with the diagnostic performance of the different clinical prediction rules. a) Intensive care admission. b) Death. 

Table 2 Results of the areas under the ROC curve and Brier Score for the different clinical prediction rules in patients hospitalized by COVID-19. 

DEATH ADMISSION TO THE ICU
Prediction Rule AUC CI 95 % Brier Score AUC CI 95 % Brier Score
COVID- GRAM 0,7637 0,7078-0,8195 0,175 0,6757 0,6175-0,7339 0,302
CALL Score 0,6596 0,5888-0,7304 0,483 0,5462 0,4843-0,6081 0,497
ISARIC-4C 0,7572 0,6977-0,8166 0,179 0,6152 0,5531-0,6772 0,352

*AUC: Area Under the ROC Curve. *CI: Confidence Interval.

Source: authors.

Table 3 shows the results of the operational characteristics of each of the clinical prediction rules evaluated according to the optimal cut-off values obtained by the Youden technique. In all cases, very poor results are shown for the prediction of ICU admission, while for death the COVID-GRAM and ISARIC-4C obtain high specificities and negative predictive values (96,9 % and around 84 % respectively) at the expense of very low sensitivities (14,3 % and 12,7 %).

Table 3 Operational characteristics of the clinical prediction rules for each pre-specified cut-off point. 

Prediction rule CALL Score COVID-GRAM ISARIC-4C
cut-off value 9 and 8 cut-off value 228 and 211 cut-off value 615 and 15
Admission to ICU Death Admission to ICU Death Admission to ICU Death
True positives 56 47 9 9 45 8
True negatives 136 138 238 286 187 286
False positives 106 157 4 9 55 9
False negatives 60 16 107 54 71 55
Accuracy (%) 53,6 51,7 69,0 82,4 64,8 82,1
Sensitivity (%) 48,3 74,6 7,8 14,3 38,8 12,7
Specificity (%) 56,2 46,8 98,3 96,9 77,3 96,9
Positive predictive value (%) 34,6 23,0 69,2 50,0 45,0 47,1
Negative predictive value (%) 69,4 89,6 69,0 84,1 72,5 83,9
LR+ 1,102 1,402 4,694 4,683 1,707 4,162
LR- 0,920 0,543 0,938 0,884 0,792 0,900

*ICU: Intensive Care Unit. *LR: Likelihood Ratio

Source: authors.

The calibration analysis of the different prediction rules shows how the CALL score obtained a poor result for mortality demonstrated by a high Brier Score (0,48) and the lack of linearity in the LOESS curve while it is adequate for the COVID-GRAM and ISARIC-4C rules (Figure 2 and Table 2). The three rules evaluated showed poor calibration for the prediction of ICU admission, with high Brier Score scores (Table 2). These results are in accordance with Table 4, which shows the percentages of the outcomes found in each of the rules evaluated compared to the expected result, categorized according to the risk groups set out in the original documents; the result would only indicate a good calibration for ISARIC-4C in the assessment of mortality.

*ICU: Intensive Care Unit. Source: authors.

Figure 2 Calibration using LOESS curves of the various clinical prediction rules for predicting admission to intensive care or death.  

Table 4 Correlation between the outcomes obtained with the clinical prediction scales CALL score, COVID-GRAM, and ISARIC-4C, for intensive care admission and mortality in our population, compared to the original studies. 

Clinical prediction scale CALL score COVID-GRAM ISARIC-4C
Outcom e ICU admission ICU admission Death
Risk Class Observed Expected Risk Group Observed Expected Risk Group Observed Expected
A 12/53 (22,6 %) <10 % Low 0/0 (0 %) 0,7 % Low (0-3) 0/10 (0 %) 1,7 %
B 48/143 (33,6 %) 10-40 % Mild 41/184 (22,3 %) 7,3 % Mild (4-8) 7/134 (5,2 %) 9,1 %
C 56/162 (34,6 %) >50 % High 75/174 (43,1 %) 59,3 % High (9-14) 40/173 (23,1 %) 34,9 %
Very High (>14) 16/41 (39,0 %) 66,2 %

The ISARIC-4C for clinical deterioration did not establish risk groups. *ICU: Intensive Care.

Source: authors.

Source: authors.

Figure 3 Decision curves for each of the clinical prediction rules evaluated. a) Intensive care admission. b) Death. 

Discussion

In our study, the evaluated outcomes, including clinical worsening leading to ICU transfer due to COVID-19 complications in 186 (52,0 %) patients and death in 128 (35,8 %) patients, align with those reported in other databases14. These findings underscore the critical importance of predicting which patients are at risk of progression to severe disease states. However, our results do not provide definitive guidance supporting the routine use of any of the evaluated prediction rules in our clinical settings. Furthermore, we encountered a significant gap in the literature, as we were unable to locate studies from our region that validate these prediction models in our local context.

The CALL score was probably one of the first rules developed specifically for the identification of clinical worsening in patients with COVID-19, reporting an AUC of 0,91 in the cohort in which it was developed10. It has been validated in several cohorts in which, despite an early report with an AUC of 0,85 among 252 patients15. Other studies show rather poor results, such as one that included 210 Italian patients with an AUC of 0,6216, another with 411 English patients with 0,5617, and what is perhaps the largest validation cohort with 1363 patients from Sâo Paulo and Barcelona with an AUC of 0,5218; while a Dutch cohort of 403 patients with an AUC of 0,7019. This poor performance is in line with that reported in this study with an AUC of 0,55.

Although it was not developed to predict mortality, three previously mentioned studies evaluated the CALL score in this sense, finding AUC of 0,77 in Italy16, 0,77 among 1363 patients from Spain and Brasil18 and 0,76 among 403 patients in The Netherlands19. This is in line with the moderate performance obtained in this study with AUC of 0,66.

The COVID-GRAM prediction rule obtained in the derivation cohort an AUC of 0,88 for clinical worsening11, the various validations performed have documented a wide variability with AUC of 0,5218 in patients from Sâo Paulo and Barcelona, 0,88 among 523 Spanish patients20, 0,72 in another Spanish cohort of 306 patients21, 0,68 among 481 Turkish patients22, and 0,70 among 14 343 French patients23. The result of the present cohort barely reached an AUC of 0,62.

As with the CALL score, the performance of the COVID-GRAM for in-hospital death was evaluated in three different studies documenting an AUC of 0,78 among 210 elderlies in Italy24, 0,77 among 1363 Brazilian and Spanish18, and 0,70 among 481 Turkish patients22 results quite close to the 0,76 achieved in the present cohort.

The ISARIC-4C scale for death in its derivation and initial validation study showed AUC of 0,75 and 0,77 respectively13, it has validation studies with large cohorts of patients, achieving quite consistent results of AUC to the one reached in this study of 0,76 with reports of 0,79 among 6802 American patients25, 0,78 among 14 343 French patients23 and 0,78 among 76 588 patients in the United Kingdom26.

Ultimately, the ISARIC-4C rule for clinical deterioration obtained an AUC of 0,77 in the development cohort with the largest number of patients at 73 94813. It has undergone several validations in which an AUC of 0,80 was found among 481 Turkish patients21, 0,76 among 76 588 UK patients26, 0,77 in the Dutch cohort of 403 patients18 and 0,76 in 410 Singaporean patients27. These good results contrast with the moderate 0,62 obtained in this analysis.

Thus, the results could be grouped as moderate to poor for the prediction of clinical worsening by the three rules evaluated, with poor calibration and poor net benefit; this situation improves substantially for the prediction of death, except in the case of the CALL score. Among the limitations recognized in this study, the first is that the populations included are from fourth-level hospitals, which may induce a selection bias in patients with greater baseline severity and cannot be applied to second or first-level entities, nor to patients with COVID-19 in an outpatient setting. An additional limitation is that the estimated sample size was not reached, as more than half of the eligible cases lacked sufficient data to calculate all three risk scores, this may have reduced the statistical power of the comparisons and limited the generalizability of the findings. As a limitation or potential bias, the outcome of the patients (death or ICU admission) could be due to a reason other than COVID-19, even though exclusion criteria were applied for conditions that affect short-term survival. In this study, it is recognized as a strength the multicenter character, the large number of outcomes to make comparisons, and the calibration and net benefit analysis of the different scales.

More external validation studies are needed to ensure the generalization of its use.

Conclusions

The clinical prediction rules evaluated showed poor to moderate performance for ICU admission and good performance for death in a Colombian population hospitalized by COVID-19. Specifically, the CALL score exhibited the weakest performance, while the ISARIC-4C and COVID- GRAM showed potential utility in assessing the risk of death. Despite these findings, the results do not unequivocally support the routine application of any evaluated prediction rules in our clinical settings, echoing the broader discussion which highlights the vital need for accurate risk stratification in managing COVID-19 progression.

Acknowledgements

We thank the relatives of the participants for their support. We also want to thank all the health and non-health care personnel involved in the care of COVID-19 patients

References

1. Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet. 2020;395(10223):497-506. [ Links ]

2. World Health Organization [Internet]. Geneve: WHO. WHO Director-General's opening remarks at the media briefing on COVID-19; 2020 Mar 11 [citado 2021 May 28]. Disponible en: Disponible en: https://www.who.int/director-general/speeches/detail/who-director-general-s-opening-remarks-at-the-media-briefing-on-covid-19-11-march-2020Links ]

3. Departamento Administrativo Nacional de Estadística. Boletín Técnico defunciones por COVID-19. 2 de marzo de 2020pr al 9 de mayo de 2021pr. Bogotá: DANE; 2021. [ Links ]

4. Ministerio de Salud y Protección Social [Internet]. Bogotá: Minsalud. Colombia tiene 44,35 de disponibilidad de camas UCI. Boletín de Prensa No 7 de 2022; 2022 Ene 06 [citado 2025 Jun 05]. Disponible en: Disponible en: https://www.minsalud.gov.co/Paginas/Colombia-tiene-44,35-de-disponibilidad-de-camas-UCI.aspxLinks ]

5. Reinoso G. Lo que debe saber del toque de queda nocturno que inicia hoy en Bogotá. 18 de enero de 2021 [consultado el 09 de junio de 2025]. En: El Tiempo [Internet]. Disponible en: Disponible en: https://www.eltiempo.com/bogota/bogota-toque-de-queda-y-cuarentena-general-hoy-19-de-enero-561340Links ]

6. Alhazzani W, Meller MH, Arabi YM, Loeb M, Gong MN, Fan E, et al. Surviving sepsis campaign: guidelines on the management of critically ill adults with coronavirus disease 2019 (COVID-19). Crit Care Med. 2020;48(6):e440-469. [ Links ]

7. Sperrin M, McMillan B. Prediction models for covid-19 outcomes. BMJ. 2020; 371:m3777. [ Links ]

8. Wynants L, Van Calster B, Collins GS, Riley RD, Heinze G, Schuit E, et al. Prediction models for diagnosis and prognosis of covid-19 infection: systematic review and critical appraisal. BMJ. 2020;369:m1328. [ Links ]

9. Update to living systematic review on prediction models for diagnosis and prognosis of covid-19. BMJ. 2021;372:n236. [ Links ]

10. Ji D, Zhang D, Xu J, Chen Z, Yang T, Zhao P, et al. Prediction for progression risk in patients with COVID-1 pneumonia: the CALL Score. Clin Infect Dis. 2020;71(6):1393-1399. [ Links ]

11. Liang W, Liang H, Ou L, Chen B, Chen A, Li C, et al. Development and validation of a clinical risk score to predict the occurrence of critical illness in hospitalized patients with COVID-19. JAMA Intern Med. 2020;180(8):1081-1089. [ Links ]

12. Gupta RK, Harrison EM, Ho A, Docherty AB, Knight SR, van Smeden M, et al. Development and validation of the ISARIC 4C deterioration model for adults hospitalised with COVID-19: a prospective cohort study. Lancet Respir Med. 2021;9(4):349-359. [ Links ]

13. Knight SR, Ho A, Pius R, Buchan I, Carson G, Drake TM, et al. Risk stratification of patients admitted to hospital with COVID-19 using the ISARIC WHO Clinical Characterisation Protocol: development and validation of the 4C mortality score. BMJ. 2020;370:m3339. [ Links ]

14. Petrilli CM, Jones SA, Yang J, Rajagopalan H, O'Donnell L, Chernyak Y, et al. Factors associated with hospital admission and critical illness among 5279 people with coronavirus disease 2019 in New York City: prospective cohort study. BMJ. 2020;369:m1966. [ Links ]

15. Kamran SM, Mirza ZE, Moeed HA, Naseem A, Hussain M, Fazal ISr, et al. CALL score and RAS score as predictive models for coronavirus disease 2019. Cureus. 2020;12(11):e11368. [ Links ]

16. Grifoni E, Valoriani A, Cei F, Vannucchi V, Moroni F, Pelagatti L, et al. The CALL Score for Predicting Outcomes in Patients With COVID-19. Clin Infect Dis. 2021;72(1):182-183. [ Links ]

17. Gupta RK, Marks M, Samuels THA, Luintel A, Rampling T, Chowdhury H, et al. Systematic evaluation and external validation of 22 prognostic models among hospitalised adults with COVID-19: an observational cohort study. Eur Respir J. 2020;56(6):2003498. [ Links ]

18. Neto FL, Marino LO, Torres A, Cilloniz C, Meirelles Marchini JF, Garcia de Alencar JC, et al. Community-acquired pneumonia severity assessment tools in patients hospitalized with COVID-19: a validation and clinical applicability study. Clin Microbiol Infect. 2021;27(7):1037.e1-1037.e8. [ Links ]

19. van Dam PMEL, Zelis N, van Kuijk SMJ, Linkens AEMJH, Brüggemann RAG, Spaetgens B, et al. Performance of prediction models for short-term outcome in COVID-19 patients in the emergency department: a retrospective study. Ann Med. 2021;53(1):402-409. [ Links ]

20. Armiñanzas C, Arnaiz de Las Revillas F, Gutiérrez Cuadra M, Arnaiz A, Fernández Sampedro M, González-Rico C, et al. Usefulness of the COVID-GRAM and CURB-65 scores for predicting severity in patients with COVID-19. Int J Infect Dis. 2021;108:282-288. [ Links ]

21. Moreno-Pérez Ó, Andrés M, León-Ramírez JM, Sánchez-Payá J, Boix V, Gil J, et al. The COVID-GRAM Tool for Patients Hospitalized With COVID-19 in Europe. JAMA Intern Med. 2021;l8l(7):l000-l00l. [ Links ]

22. Doganay F, Ak R. Performance of the CURB-65, ISARIC-4C and COVID-GRAM scores in terms of severity for COVID-19 patients. Int J Clin Pract. 2021;75(l0):el4759. 2S. [ Links ]

23. Lombardi Y, Azoyan L, Szychowiak P, Bellamine A, Lemaitre G, Bernaux M, et al. External validation of prognostic scores for COVID-19: a multicenter cohort study of patients hospitalized in Greater Paris University Hospitals. Intensive Care Med. 2021;47(l2):l426-1439. [ Links ]

24. Covino M, De Matteis G, Burzo ML, Russo A, Forte E, Carnicelli A, et al. Predicting In-Hospital Mortality in COVID-19 Older Patients with Specifically Developed Scores. J Am Geriatr Soc. 2021;69(1):37-43. [ Links ]

25. Gordon AJ, Govindarajan P, Bennett CL, Matheson L, Kohn MA, Camargo C, et al. External validation of the 4C Mortality Score for hospitalised patients with COVID-19 in the RECOVER network. BMJ Open. 2022;12(4):e054700. [ Links ]

26. Knight SR, Gupta RK, Ho A, Pius R, Buchan I, Carson G, et al. Prospective validation of the 4C prognostic models for adults hospitalised with COVID-19 using the ISARIC WHO Clinical Characterisation Protocol. Thorax. 2022;77(6):606-615. [ Links ]

27. Ong SWX, Sutjipto S, Lee PH, Dugan C, Khoo BY, Ren D, et al. Validation of ISARIC 4C mortality and deterioration scores in a mixed vaccination status cohort of hospitalized coronavirus disease 2019 (COVID-19) patients in Singapore. Clin Infect Dis. 2022; 75(1):e874-e877. [ Links ]

¿Cómo citar este artículo? Sprockel-Díaz JJ, Tabares-Rodríguez S, Carrillo-Ayerbe MP, Acuña-Cortes IS, Montoya- Rumpf RP, Angarita-González E, Coral-Zúñiga VE, Cárdenas-Rodríguez H, Araque-Parra EC, Parra JE. Evaluation of CALL, COVID-GRAM, and ISARIC scores for intensive care unit admission and mortality risk in hospitalized COVID- 19 patients: multicenter observational study, Bogotá, Colombia, 2020. Med. UIS. 2025; 38(3). 61-71. DOI: https://doi.org/10.18273/revmed.v38n3-2025007

Funding Funding was obtained from the Health Sciences University Foundation from the call for research promotion number DI-I-0631-20

Received: February 18, 2024; Accepted: August 09, 2025

* Correo de correspondencia: jjsprockel@fucsalud.edu.co

Competing interests

The authors declare that they have no competing interests.

Creative Commons License This is an open-access article distributed under the terms of the Creative Commons Attribution License