Outlier analysis for clinical discovery: An augmented intelligence framework and a systematic review

Outlier analysis diagram

Outlier analysis for accelerating clinical discovery: An augmented intelligence framework and a systematic review

Introduction

Clinical research often hinges on identifying unusual or outlier cases, which can lead to groundbreaking discoveries. Traditionally, these outliers are identified through manual methods like case reports, which are time-consuming and limited by human observation. However, a recent study published in PLOS Digital Health proposes a novel approach using an augmented intelligence framework for outlier analysis. This method systematically scans large datasets to identify unique cases, offering a more efficient and comprehensive way to accelerate clinical discovery.

Methods

The study employed two primary outlier analysis techniques: the extreme misclassification contextual outlier method and the isolation forest point outlier method. The extreme misclassification approach used a random forest predictive model to identify misclassified data points, with a confidence level exceeding 90%. On the other hand, the isolation forest method identified outliers based on path length z-scores in the dataset. These methods were applied to two large datasets: the folic acid clinical trial (FACT) and the Ottawa and Kingston birth cohort (OaK). Clinical experts then reviewed the identified outliers to determine their potential as novel discoveries.

Outlier analysis diagram

Discussion

The study found that the extreme misclassification method was more effective in identifying potential novel discoveries compared to the isolation forest method. Specifically, in the FACT dataset, 76.9% of the identified outliers were considered potential novelties, whereas in the OaK dataset, 32.7% were deemed novel. This highlights the importance of using advanced machine-learning techniques to enhance the process of clinical discovery. The researchers suggest that this method could be generalized across various medical disciplines, potentially transforming how we approach clinical research.

Conclusion

The study concludes that augmented intelligence, through systematic outlier analysis, offers a feasible and effective way to accelerate clinical discoveries. By identifying unique cases that might be overlooked by traditional methods, this approach has the potential to uncover novel insights and improve patient outcomes. The researchers advocate for broader adoption of this framework in clinical research to enhance the efficiency and impact of medical discoveries.