This is the database of predicted proteins used in the study published in doi.org/10.1101/2020.03.26.009324