Deep Learning-Enhanced TopoStats for the Automated Quantification of DNA and Complex Biomolecular Structures
Deep Learning-Enhanced TopoStats for the Automated Quantification of DNA and Complex Biomolecular Structures
Whittle, S.; Firth, T. A.; Gamill, M. C.; Wiggins, L.; Shephard, N.; Allwood, T.; Catley, T. E.; Pyne, A. L. B.
AbstractAtomic force microscopy (AFM) enables nanometre-scale, label-free imaging of biomolecules and surfaces under near-native conditions, yet quantitative analysis of AFM data remains limited compared to other bioimaging modalities. This limitation largely arises from the absence of open, automated tools capable of addressing AFM-specific artefacts, data formats, and topographical outputs. Here, we present the latest version of TopoStats, an open-source Python package for automated and quantitative AFM image analysis, developed as a deep-learning enabled advancement of our original TopoStats software to support more complex samples and richer molecular characterisation. The pipeline integrates all key processing stages, including image flattening and noise correction, object detection and segmentation, morphometric feature extraction, and strand tracing with topological classification. Designed for accessibility and reproducibility, TopoStats adheres to the FAIR for Research Software (FAIR4RS) principles and provides configurable workflows adaptable to diverse biological samples. Combining high-resolution AFM and our analysis pipeline allows the quantification of subtle structural changes within a heterogeneous sample set, revealing properties not accessible with other structural biology techniques. We demonstrate the effectiveness of our pipeline to differentiate between plasmids with both different topology and sequence, by extracting meaningful quantitative descriptors that distinguish the samples with statistical significance. Collectively, these developments establish TopoStats as a versatile framework for high-throughput, quantitative AFM analysis, advancing AFM from a fundamentally qualitative visualisation technique toward a quantitative analytical tool.