May 2, 2021 at 7:32 pm Exciting! This begs a few questions, actually: 1) Why not re-train the hmm suite on a larger corpus than the small set of seed alignments? 2) Why not just take (a representative subset of) the new Pfam-N alignments and build more HMMs out of them (naming each the same thing that ProtENN mapped them to)? 3) For sequences that still don’t have great matches to existing cluster