Background Interstitial inflammation and peritubular capillaritis are observed in many diseases on native and transplant kidney biopsies. A precise and automated evaluation of these histological criteria could help stratify patients’ kidney prognoses and facilitate therapeutic management.
Methods We used a convolutional neural network to evaluate those criteria on kidney biopsies. A total of 423 kidney samples from various diseases were included; 83 kidney samples were used for the neural network training, 106 for comparing manual annotations on limited areas to automated predictions, and 234 to compare automated and visual gradings.
Results The precision, recall and F-score for leukocyte detection were, respectively, 81%, 71% and 76%. Regarding peritubular capillaries detection the precision, recall and F-score were, respectively, 82%, 83% and 82%. There was a strong correlation between the predicted and observed grading of total inflammation, as for the grading of capillaritis (r = 0.89 and r = 0.82, respectively, all P < .0001). The areas under the receiver operating characteristics curves for the prediction of pathologists’ Banff total inflammation (ti) and peritubular capillaritis (ptc) scores were respectively all above 0.94 and 0.86. The kappa coefficients between the visual and the neural networks’ scores were respectively 0.74, 0.78 and 0.68 for ti ≥1, ti ≥2 and ti ≥3, and 0.62, 0.64 and 0.79 for ptc ≥1, ptc ≥2 and ptc ≥3. In a subgroup of patients with immunoglobulin A nephropathy, the inflammation severity was highly correlated to kidney function at biopsy on univariate and multivariate analyses.
Conclusion We developed a tool using deep learning that scores the total inflammation and capillaritis, demonstrating the potential of artificial intelligence in kidney pathology.