Advertisement · 728 × 90

Posts by BioHackrXiv

Figure 1: Mid-week reporting poster used during BioHackathon Europe 2025 to communicate early design decisions, illustrate EDAM branches, and motivate the use of the Model Context Protocol to mitigate LLM hallucinations during ontology-driven tool annotation.

Figure 1: Mid-week reporting poster used during BioHackathon Europe 2025 to communicate early design decisions, illustrate EDAM branches, and motivate the use of the Model Context Protocol to mitigate LLM hallucinations during ontology-driven tool annotation.

"Improving package annotation in metabolomics andproteomics via robust, ontology-driven LLM integration" https://doi.org/10.37044/osf.io/x5v6b_v1

"At BioHackathon Europe 2025, our team explored how Large Language Models (LLMs) can assist this process through […]

[Original post on fediscience.org]

6 days ago 3 0 1 1
Original post on fediscience.org

"GA4GH VRS for the Semantic Web" https://doi.org/10.37044/osf.io/qy4td_v1

"We implemented an RDF Schema in ShEx, using OWL ontologies in the OBO Foundry such asthe NCI Thesaurus (NCIT), the Semanticscience Integrated Ontology (SIO), and the GenomeVariation Ontology (GVO). We based our modelling […]

1 week ago 1 0 0 0
Original post on fediscience.org

"Minimal information standardization of phenomic experimental data in animals" https://doi.org/10.37044/osf.io/ncrkm_v1

"The current landscape of animal phenomics is characterized by a substantial lack of stan-dardization, hindering data reuse, reproducibility, and interoperability across […]

1 week ago 0 0 0 0
Original post on fediscience.org

"Evolving FAIR Image Analysis in Galaxy for Cross-domain and AI-ready Applications" https://doi.org/10.37044/osf.io/tsxby_v1

"he project addressed three major challenges:1. Improving semantic annotation for image analysis resources.2. Introducing content-based reproducibility validation […]

2 weeks ago 0 0 0 0
Original post on fediscience.org

@rupdecat there have been two reports from previous #SWAT4HCLS biohackathons:

"BioHackSWAT4HCLS25 report: Towards AI-Ready Datasets for the Life Sciences" https://doi.org/10.37044/osf.io/xqe8h_v1

and

"Publishing FAIR datasets from Bioimaging repositories" Publishing FAIR datasets from […]

4 weeks ago 0 2 0 0
Preview
SWAT4HCLS Biohackathon 2026 Preprints for BioHackathons

good luck to the participants of the SWAT4HCLS Biohackathon 2026 in the coming week! https://index.biohackrxiv.org/tag/SWAT4HCLS26

Use the hashtag to post your progress and have it show up on the BioHackRxiv #fediwall: https://guide.biohackrxiv.org/fediwall/

#SWAT4HCLS26

4 weeks ago 2 5 0 1
Figure 2.10-1: Distribution of species and proteins across different taxa (Human, Virus, Prokaryote, Eukaryote) in the ProteinGym benchmark.

It shows each of the four in separate colorful boxplots. Not easy to explain, and plz read the article for the details.

Figure 2.10-1: Distribution of species and proteins across different taxa (Human, Virus, Prokaryote, Eukaryote) in the ProteinGym benchmark. It shows each of the four in separate colorful boxplots. Not easy to explain, and plz read the article for the details.

"Towards Federated Learning Across Biobanks: Prototype Software from the 2026 Carnegie Mellon University–NVIDIA Hackathon" https://doi.org/10.37044/osf.io/5psfj_v1

"The Carnegie Mellon University-NVIDIA Federated Learning Hackathon for Biomedical […]

[Original post on fediscience.org]

4 weeks ago 0 1 0 0
Original post on fediscience.org

"Tools to develop constraint-based models in R: adapting existing toolboxes" https://doi.org/10.37044/osf.io/ey4c5_v1

"In this project, we proposed the (re)development of an R based framework for developing and simulating constraint-based models. We proposed to expand the Sybil library for […]

1 month ago 0 0 0 0

good luck to the hackers at #SnakemakeHackathon2026 !

1 month ago 0 1 0 0
Figure 3:Principal component analysis (PCA) of bio.tools entries with a GitHub repository basedon the numbers of contributors, forks (and network count), commits, pulls, releases, open issues,subscribers, watchers, stargazers and the average time to close issues. The colors represent thebio.toolsmaturitylevel, i.e.,Emerging,MatureorLegacy.

Figure 3:Principal component analysis (PCA) of bio.tools entries with a GitHub repository basedon the numbers of contributors, forks (and network count), commits, pulls, releases, open issues,subscribers, watchers, stargazers and the average time to close issues. The colors represent thebio.toolsmaturitylevel, i.e.,Emerging,MatureorLegacy.

"Bidirectional bridge: GitHub ⇄ bio.tools" https://doi.org/10.37044/osf.io/8ktd6_v1

"Here, we describe the tooling for a bidirectional bridge between the software developmentplatform GitHub and the ELIXIR bio.tools registry of life sciences software tools […]

[Original post on fediscience.org]

1 month ago 1 0 0 0
Advertisement
Figure 1: Curated crosswalks between MoDALIA and Schema.org metadata models for training materials.

The figure shows schema.org on the left side, SSSOM icon below an bidirectional arrow in the middle in a screenshot of a MoDALIA predicate definition on the right.

Figure 1: Curated crosswalks between MoDALIA and Schema.org metadata models for training materials. The figure shows schema.org on the left side, SSSOM icon below an bidirectional arrow in the middle in a screenshot of a MoDALIA predicate definition on the right.

"BH25DE report: On the path to machine-actionabletraining materials" https://doi.org/10.37044/osf.io/un6cd_v1

"We demonstrated contentfederationvia themTeSS-Xplatform, enabling cross-instanceexchange and preparing for future integration with the EOSC […]

[Original post on fediscience.org]

2 months ago 0 1 0 0
Part of Figure 1:Photo of project poster for the mid-week presentation. The photo shows a A0 sheet notes from the meeting, including country flags, some key words, and the output of brainstorming.

Part of Figure 1:Photo of project poster for the mid-week presentation. The photo shows a A0 sheet notes from the meeting, including country flags, some key words, and the output of brainstorming.

"BioHackEU25 report: METRICS - Monitoring of KeyPerformance Indicators for ELIXIR Services" https://doi.org/10.37044/osf.io/2jgk4_v1

"As part of the BioHackathon Europe 2025, we report on the activities of the METRICS project, which addresses the need for […]

[Original post on fediscience.org]

2 months ago 0 1 0 0
Screenshot of the linked BioHackrXiv preprint, showing the top half of a preprint PDF page, showing the BioHackrXiv logo from the template, a table at the top listing a Arabidopsis thaliana pathway, and below that part of Figure 1 showing a "[p]athway diagram for for Caffeine synthesis inCoffea arabica.  This diagram is already published in WikiPathways at https://www.wikipathways.org/pathways/WP5586.html".

Screenshot of the linked BioHackrXiv preprint, showing the top half of a preprint PDF page, showing the BioHackrXiv logo from the template, a table at the top listing a Arabidopsis thaliana pathway, and below that part of Figure 1 showing a "[p]athway diagram for for Caffeine synthesis inCoffea arabica. This diagram is already published in WikiPathways at https://www.wikipathways.org/pathways/WP5586.html".

"QPX: Pathway analysis environment" https://doi.org/10.37044/osf.io/m37f2_v1

"Building on our work at DBCLS BioHackathon 2023 (BH23), where we introduced QPX andpromoted pathway modeling with WikiPathways (Pico et al., 2008) using PathVisio (Kutmon etal […]

[Original post on fediscience.org]

3 months ago 0 1 0 0
Original post on fediscience.org

2025 has come to an end. This year, we published 45 preprints resulting from 16 different biohackathons. Sometimes reports come in late, as we will see. A quick list of #biohackathon meetings with preprints in 2025:

- DBCLS BioHackathon 2023 #BH23JP: https://index.biohackrxiv.org/tag/BH23JP
- […]

3 months ago 0 4 0 0
Original post on fediscience.org

2025 has come to an end. This year, we published 45 preprints resulting from 16 different biohackathons. Sometimes reports come in late, as we will see. A quick list of #biohackathon meetings with preprints in 2025:

- DBCLS BioHackathon 2023 #BH23JP: https://index.biohackrxiv.org/tag/BH23JP
- […]

3 months ago 0 4 0 0
Original post on fediscience.org

"BioHackEU25 Report Project 16: MiCoReCa (Microbiome Community Resource Catalogue) - Towards Centralized Curation And Integration Of Microbiome Bioinformatics Resources" https://doi.org/10.37044/osf.io/jfpsx_v1

"To address this critical gap, the ELIXIR Microbiome Community proposes the […]

3 months ago 2 1 0 0
Original post on fediscience.org

"Enhancement of the Interoperability of Trait Data on Genetic Resources between Japan and France" https://doi.org/10.37044/osf.io/hw2fj_v1

"This paper presents the current status of trait data standardizationbetween the two organizations and outlines a direction for standardization. Trait data […]

3 months ago 2 0 0 0
Original post on fediscience.org

"Increasing FAIRness in agrosystem sciences and plantphenomics" https://doi.org/10.37044/osf.io/cy65w_v1

"As part of the de.NBI BioHackathon 2023, we here report about our progress on increasingFAIR-compliance in agrosystem sciences and plant phenomics. Through the collaborative effortsof the […]

4 months ago 0 0 0 0
Original post on fediscience.org

"Decoding Complex Genotype-Phenotype Interactions byDiscretizing the Genome" https://doi.org/10.37044/osf.io/xhkc3_v1

"ere, we introduce a new methodology for genotype-phenotype mapping based ongenomic hashes, unique representations of local genomic background. Each hash correspondsto a […]

4 months ago 1 0 0 0
Advertisement
Original post on fediscience.org

"MCP server tools with RDF shapes" https://doi.org/10.37044/osf.io/8qeh5_v1

"In this paper, we present the work we have done during the Japan Biohackathon 2025 about implementing MCP servers supported by RDF data shapes to improve natural language interactions with large RDF datasets using […]

4 months ago 2 3 0 1
Original post on fediscience.org

"BioHackEU25 report: Towards a Robust ValidationService for Data and Metadata in ARC RO-Crates" https://doi.org/10.37044/osf.io/zah28_v1

"For the metadata, validation will ensure structural and semantic compliance to the base RO-Crate specification and the ARC family of RO-Crate profiles […]

4 months ago 0 1 0 0
Original post on fediscience.org

"Mining the potential of knowledge graphs for metadata on training" https://doi.org/10.37044/osf.io/gv2ac_v1

"A dedicated pipeline parses RDF/Turtle dumps, deduplicates entries, and builds rich indexes (keyword, provider, location, date, topic) that power a Model Context Protocol (MCP) server […]

4 months ago 0 3 0 0
Original post on fediscience.org

"A Blueprint for Open Science: How Transatlantic Teams Built and Deployed Knowledge Graphs to Enable Biological (AI) Models" https://doi.org/10.37044/osf.io/g4rk2_v1

"these projects and pipelines showcase methods for constructing KGs from existing biomedical datasets andexemplify the practical […]

4 months ago 0 4 0 0
Original post on fediscience.org

"BioHackEU25 report: Scop3PTM Next - Interactive visualization of PTM data across sequence, structure and interactions" https://doi.org/10.37044/osf.io/xvrud_v1

"Scop3PTM Next was developed during BioHackathon Europe 2025 to address the need for integrated visualization of protein-centric data […]

4 months ago 0 1 0 0
Preview
SWAT4HCLS Biohackathon 2026

Call for Biohackathon topics: "SWAT4HCLS Biohackathon 2026, Amsterdan, The Netherlands" https://www.swat4ls.org/swat4hcls-biohackathon-2026/

#biohackathon #swat4hcls26 #swat4hcls

5 months ago 0 0 0 1
Original post on fediscience.org

"DBCLS BioHackathon 2025 report on the WikiBlitz" https://doi.org/10.37044/osf.io/7s6da_v1

"As part of the DBCLS BioHackathon 2025, we organized a WikiBlitz to improve biodiversity knowledge by integrating iNaturalist, GBIF, Wikidata, and Wikipedia. Participants identified local flora and fauna […]

5 months ago 0 1 0 0
Advertisement
Original post on fediscience.org

"on2vec: Ontology Embeddings with Graph Neural Networks and Sentence Transformers" https://doi.org/10.37044/osf.io/4f763_v1

"Ontologies provide structured vocabularies and relationships essential for organizing biological knowledge, yet their symbolic nature limits integration with modern […]

5 months ago 2 1 0 0
Original post on fediscience.org

"AI in Practice: Insights from a Community Survey of Biohackathon Participants" https://doi.org/10.37044/osf.io/pza7v_v1

"Findings reveal that most participants are frequent AI users, with tools like ChatGPT, Gemini, and Claude widely adopted, with ChatGPT as number one response. AI is […]

5 months ago 1 1 0 0
Original post on fediscience.org

"Translating and Formalizing the MIRAGE Guidelines to a Prototype MIRAGE Ontology and DCAT3 Extension Vocabulary for Glycomics Data Management" https://doi.org/10.37044/osf.io/wj8bz_v1

"We present the first comprehensive semantic formalization of MIRAGE guidelines through an integrated RDF […]

5 months ago 1 2 0 0
Original post on fediscience.org

"DBCLS BioHackathon 2025 report: Creation and Publication Analytical Workflow of Creators' Interests" https://doi.org/10.37044/osf.io/qd5sz_v1

"At the DBCLS BioHackathon 2025, we converted metatranscriptomic analytical shell scripts into Common Workflow Language (CWL) containerized with Docker […]

5 months ago 1 1 0 0