Lakera releases robustness testing suite for radiology AI teams.

Lakera
3 min readJun 2, 2023

--

This article was originally posted on our company website. Lakera’s developer platform enables ML teams to ship fail-safe computer vision models.

What’s in this article?

  • Why should I care about robustness testing?
  • Robustness testing as a proxy for model generalization.
  • Where to go from here?

Before medical imaging technologies are put into hospitals, it is important to evaluate their predictive performance and obtain regulatory approval. This requires appropriate testing methodologies that far exceed how testing is done in other industries — an undertaking that can quickly become overwhelming.

This is why we’re proud to announce the release of our robustness testing suite for radiology imaging. In addition to all existing MLTest features, this release makes it even easier for medical imaging teams to validate their computer vision models, put together optimal datasets, and deploy their AI systems quickly and safely.

💡 Lakera’s computer vision safety testing is used by leading medical imaging teams. You can get started in minutes here.

Why should I care about robustness testing?

While of course delivering potentially life-saving information, the field of radiology is often plagued by a variety of artifacts, which can stem from incorrect handling of the image to defects in the film, to even the patients’ movement or clothing. These artifacts can negatively impact the accuracy of a diagnostic exam and lead to incorrect treatments and patient harm.

Mitigating these artifacts is therefore vital for ensuring the quality and reliability of X-ray imaging results, allowing for an accurate diagnosis and appropriate patient care. Failure to adequately address these issues can also result in additional testing and exposure to unnecessary radiation.

Transformations with double exposure, grid lines, static electricity, lighting changes, pixel dropout, rotations

Our latest release enables teams to take their medical machine-learning testing capabilities to a new level. In addition to all of MLTest’s existing features, this test suite includes robustness tests specifically relevant for radiology applications, such as:

  • Double exposure: When the receptor is exposed twice and two images appear superimposed over one another.
  • Grid lines: Grids are placed between the patient and the X-ray to reduce scatter, but can be present in the image.
  • Static electricity: Which can occur from flexing of the film or low humidity.
  • Image quality differences: Often caused when data comes from multiple scanning devices.
  • Dead pixels: Meaning certain regions of the image will be purely black.
  • Variations in focus: Often caused by the presence of foreign objects.
  • Variations in lighting conditions (e.g. brightness and contrast): Often caused by differences in the patients, processing, or scanning devices.
  • Geometric changes: Different patient locations and rotations.

💡 Read our latest Nature journal article on testing medical imaging systems here.

Robustness testing as a proxy for model generalization.

The fundamental question when testing ML models is then how to select the model with the best generalization properties. The gold standard is picking the model with the highest validation accuracy. But as we’ve written in one of our recent articles, this approach is seriously flawed. Reaching a great validation accuracy doesn’t necessarily imply that we’re any closer to having a production-ready model.

MLTest’s automatically synthesizes additional data to test the robustness of your model beyond the training distribution. It exercises your model on variations of the data that are likely to appear in the real world. And this is exactly what generalization means! Robustness tests allow you to go way beyond validation set accuracy and are a great predictor of model performance in the wild.

So if you want to get a better grasp on which of your models generalize, adding robustness tests with MLTest can give you much more confidence around model performance — prior to deployment.

Where to go from here?

Our latest release enables teams to take their medical machine learning testing capabilities to a new level. With MLTest, you can now easily test whether your algorithms are robust to radiological artifacts and variations. It lets you stress test your computer vision models to gain confidence in their robustness properties prior to clinical validation. Does your lung infection model still work in cases of double exposure? Can something as simple as grid lines dramatically change your models’ performance?

If these questions are on your mind, then head on over to MLTest to learn more about how to get started within minutes, or get in touch with us at dan@lakera.ai.

--

--

No responses yet