Sourcepredict: Prediction of metagenomic sample sources using dimension reduction followed by machine learning classification

Abstract

SourcePredict is a Python package, distributed through Conda, that classifies and predicts the origin of metagenomic samples given a reference dataset of samples with known origins, a problem also known as source tracking.
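
To make the idea of "dimension reduction followed by machine learning classification" concrete, here is a minimal, standard-library-only sketch. It is not SourcePredict's implementation or API: PCA (by power iteration) and k-nearest neighbours are stand-ins chosen for illustration, and the function names (`fit_pca`, `transform`, `predict_source`), the toy taxon counts, and the source labels "soil" and "gut" are all hypothetical.

```python
import math
import random
from collections import Counter


def fit_pca(rows, n_components=2, iters=200, seed=0):
    """Return (means, components) via power iteration with deflation."""
    rng = random.Random(seed)
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    # Unnormalised covariance (scatter) matrix of the centered reference data.
    cov = [[sum(r[i] * r[j] for r in centered) for j in range(d)]
           for i in range(d)]
    components = []
    for _ in range(n_components):
        v = [rng.gauss(0, 1) for _ in range(d)]
        for _ in range(iters):
            w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm == 0:
                break
            v = [x / norm for x in w]
        components.append(v)
        # Deflate: remove the variance explained by the axis just found.
        cv = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
        lam = sum(v[i] * cv[i] for i in range(d))
        cov = [[cov[i][j] - lam * v[i] * v[j] for j in range(d)]
               for i in range(d)]
    return means, components


def transform(row, means, components):
    """Project one sample onto the fitted low-dimensional axes."""
    return [sum((row[j] - means[j]) * c[j] for j in range(len(row)))
            for c in components]


def predict_source(reference, labels, sink, k=3):
    """Embed reference and sink samples, then vote among the k nearest."""
    means, components = fit_pca(reference)
    embedded = [transform(r, means, components) for r in reference]
    query = transform(sink, means, components)
    nearest = sorted(zip(embedded, labels),
                     key=lambda pair: math.dist(pair[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


# Hypothetical reference dataset: taxon counts per sample, with known origins.
reference = [
    [90, 5, 5, 0], [85, 10, 3, 2], [88, 7, 4, 1],    # labelled "soil"
    [5, 80, 10, 5], [3, 85, 8, 4], [6, 78, 12, 4],   # labelled "gut"
]
labels = ["soil", "soil", "soil", "gut", "gut", "gut"]

# A sample of unknown origin (a "sink") to be assigned to a source.
predicted = predict_source(reference, labels, [4, 82, 9, 5])
```

The sketch fits the embedding on the reference samples only and projects the unknown sample into that same space before voting, so the prediction depends only on the labelled reference data.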

Publication
Journal of Open Source Software