We made three pangenome graphs 🧬 public, one for the Japanese, one for the Saudi population, and a merged graph (JaSaPaGe). Useful for 🖥️ bioinformatics on either population, or to evaluate how pangenome graphs behave when two different populations are included. jasapage.bio2vec.net/view for PanGene
Posts by Robert Hoehndorf
Good question; as far as I know, there is no relation between the manuscripts; they are different efforts. The focus is a little different in each manuscript, e.g., in our manuscript we combine a Saudi pangenome graph with a Japanese graph to evaluate the effects of combining different populations.
ProtBoost: protein function prediction with Py-Boost and Graph Neural Networks -- CAFA5 top2 solution
The second-place solution in CAFA5 has now been published.
Paper: arxiv.org/abs/2412.045...
GitHub Repo: github.com/btbpanda/CAF...
Kaggle Writeup: www.kaggle.com/competitions...
Proud to share our new paper! A complete genome from Saudi Arabia (KSA001), freely available to all. Complex work - not just sequencing & assembly challenges, but also navigating IRB approval to ensure ethical data sharing & open science principles.
nature.com/articles/s41...
Great work --- would love to be added, will share updates on computational protein function prediction and some complex disease work.
I am excited to share that our paper on creating a very large structural causal model (SCM) for diseases has been published. We generate an SCM containing most common diseases and validate it with #UKBiobank data, for better polygenic scores and for finding pleiotropic variants. academic.oup.com/bioinformati...