This is the database of predicted proteins containing SEED, MMETSP, and RefSeq used in publications of metagenomic and metatranscriptomic datasets (doi.org/10.1038/s41564-019-0630-3, doi.org/10.1038/s41564-017-0047-9, doi.org/10.1101/2020.03)