TY - GEN
T1 - ProteinEngine
T2 - 22nd International Artificial Intelligence in Medicine Conference, AIME 2024
AU - Shen, Yiqing
AU - Lv, Outongyi
AU - Zhu, Houying
AU - Wang, Yu Guang
PY - 2024
Y1 - 2024
N2 - Large language models (LLMs) have garnered considerable attention for their proficiency in tackling intricate tasks, particularly leveraging their capacities for zero-shot and in-context learning. However, their utility has been predominantly restricted to general tasks due to an absence of domain-specific knowledge. This constraint becomes particularly pertinent in the realm of protein engineering, where specialized expertise is required for tasks such as protein function prediction, protein evolution analysis, and protein design, with a level of specialization that existing LLMs cannot furnish. In response to this challenge, we introduce ProteinEngine, a human-centered platform aimed at amplifying the capabilities of LLMs in protein engineering by seamlessly integrating a comprehensive range of relevant tools, packages, and software via API calls. Uniquely, ProteinEngine assigns three distinct roles to LLMs, facilitating efficient task delegation, specialized task resolution, and effective communication of results. This design fosters high extensibility and promotes the smooth incorporation of new algorithms, models, and features for future development. Extensive user studies, involving participants from both the AI and protein engineering communities across academia and industry, consistently validate the superiority of ProteinEngine in augmenting the reliability and precision of deep learning in protein engineering tasks. Consequently, our findings highlight the potential of ProteinEngine to bride the disconnected tools for future research in the protein engineering domain.
AB - Large language models (LLMs) have garnered considerable attention for their proficiency in tackling intricate tasks, particularly leveraging their capacities for zero-shot and in-context learning. However, their utility has been predominantly restricted to general tasks due to an absence of domain-specific knowledge. This constraint becomes particularly pertinent in the realm of protein engineering, where specialized expertise is required for tasks such as protein function prediction, protein evolution analysis, and protein design, with a level of specialization that existing LLMs cannot furnish. In response to this challenge, we introduce ProteinEngine, a human-centered platform aimed at amplifying the capabilities of LLMs in protein engineering by seamlessly integrating a comprehensive range of relevant tools, packages, and software via API calls. Uniquely, ProteinEngine assigns three distinct roles to LLMs, facilitating efficient task delegation, specialized task resolution, and effective communication of results. This design fosters high extensibility and promotes the smooth incorporation of new algorithms, models, and features for future development. Extensive user studies, involving participants from both the AI and protein engineering communities across academia and industry, consistently validate the superiority of ProteinEngine in augmenting the reliability and precision of deep learning in protein engineering tasks. Consequently, our findings highlight the potential of ProteinEngine to bride the disconnected tools for future research in the protein engineering domain.
KW - Deep Learning
KW - Large Language Model
KW - Protein Design
KW - AI for Protein Design
UR - https://doi.org/10.1007/978-3-031-66538-7
UR - http://www.scopus.com/inward/record.url?scp=85200738310&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-66538-7_37
DO - 10.1007/978-3-031-66538-7_37
M3 - Conference proceeding contribution
SN - 9783031665349
T3 - Lecture Notes in Artificial Intelligence
SP - 373
EP - 383
BT - Artificial intelligence in medicine
A2 - Finkelstein, Joseph
A2 - Moskovitch, Robert
A2 - Parimbelli, Enea
PB - Springer
CY - Cham
Y2 - 9 July 2024 through 12 July 2024
ER -