Skip to main navigation Skip to search Skip to main content

AskBeacon—performing genomic data exchange and analytics with natural language

Anuradha Wickramarachchi, Shakila Tonni, Sonali Majumdar, Sarvnaz Karimi, Sulev Kõks, Brendan Hosking, Jordi Rambla, Natalie A. Twine, Yatish Jain, Denis C. Bauer*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Motivation: Enabling clinicians and researchers to directly interact with global genomic data resources by removing technological barriers is vital for medical genomics. AskBeacon enables large language models (LLMs) to be applied to securely shared cohorts via the Global Alliance for Genomics and Health Beacon protocol. By simply “asking” Beacon, actionable insights can be gained, analyzed, and made publication-ready. 

Results: In the Parkinson's Progression Markers Initiative (PPMI), we use natural language to ask whether the sex-differences observed in Parkinson's disease are due to X-linked or autosomal markers. AskBeacon returns a publication-ready visualization showing that for PPMI the autosomal marker occurred 1.4 times more often in males with Parkinson’s disease than females, compared to no differences for the X-linked marker. We evaluate commercial and open-weight LLM models, as well as different architectures to identify the best strategy for translating research questions to Beacon queries. AskBeacon implements extensive safety guardrails to ensure that genomic data is not exposed to the LLM directly, and that generated code for data extraction, analysis and visualization process is sanitized and hallucination resistant, so data cannot be leaked or falsified. 

Availability and implementation: AskBeacon is available at https://github.com/aehrc/AskBeacon.

Original languageEnglish
Article numberbtaf079
Pages (from-to)1-4
Number of pages4
JournalBioinformatics
Volume41
Issue number3
Early online date22 Feb 2025
DOIs
Publication statusPublished - Mar 2025

Bibliographical note

Copyright the Author(s) 2025. Version archived for private and non-commercial use with the permission of the author/s and according to publisher conditions. For further rights please contact the publisher.

Fingerprint

Dive into the research topics of 'AskBeacon—performing genomic data exchange and analytics with natural language'. Together they form a unique fingerprint.

Cite this