ProtFlash: A lightweight protein language model
Updated Mar 2, 2024 - Python
Transmembrane proteins predicted through Language Model embeddings
LLM-powered classification of phage protein functions to identify strong lytic candidates against Klebsiella, using transfer learning and biological embeddings.
This work aims to identify the most distant proteins and the most diverse subsets of proteins in large protein databases in a scalable and efficient way, using a dataset of protein embeddings from SwissProt, data mining techniques, and metaheuristics.
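As a rough illustration of what selecting a diverse subset from embeddings can look like, here is a minimal sketch of greedy farthest-point (max-min) selection. This is an assumption-laden stand-in, not the repository's method: the description names only "data mining techniques and metaheuristics", so the function name `greedy_diverse_subset`, the Euclidean distance, and the toy 2-D vectors are all hypothetical choices made for this example.

```python
import math


def greedy_diverse_subset(embeddings, k, start=0):
    """Return k indices chosen by greedy farthest-point (max-min) selection.

    Hypothetical helper for illustration. Assumes the points are distinct;
    with duplicate points an index could be picked twice.
    """
    if not 0 < k <= len(embeddings):
        raise ValueError("k must be between 1 and len(embeddings)")
    selected = [start]
    # min_dist[i] is the distance from point i to its nearest selected point.
    min_dist = [math.dist(e, embeddings[start]) for e in embeddings]
    while len(selected) < k:
        # Pick the point that is farthest from everything selected so far.
        nxt = max(range(len(embeddings)), key=min_dist.__getitem__)
        selected.append(nxt)
        # Update nearest-selected distances with the newly added point.
        for i, e in enumerate(embeddings):
            d = math.dist(e, embeddings[nxt])
            if d < min_dist[i]:
                min_dist[i] = d
    return selected


# Toy 2-D vectors standing in for real protein embeddings (made-up data).
toy = [(0.0, 0.0), (0.1, 0.0), (10.0, 0.0), (0.0, 10.0), (10.0, 10.0)]
subset = greedy_diverse_subset(toy, 3)
```

Each round costs one pass over the data, so selecting k items from n embeddings takes O(n·k) distance computations. That keeps a greedy pass feasible on large databases, though it only approximates the most diverse subset, which is presumably where metaheuristics come in.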
Repository containing bio_embeddings resources