Join us at #WBF2026 in Davos (14–19 June) for our workshop “Biodiversity Evidence”!
We’ll explore tools & workflows to navigate biodiversity literature and launch the foundations of a Community of Practice
📝 Register: tinyurl.com/WBF-CON20
🔎 Details: meetingorganizer.copernicus.org/WBF2026/sess...
Posts by SIB Text Mining group
Nice momentum for TransBERT : 4 → 54 downloads in a month !
It's encouraging to see that machine-translated corpora can support high-quality domain-specific models for low-resource languages. Looking forward to the next steps. 🗺️
huggingface.co/jknafou/TransBERT-bio-fr
📣 The call for tutorials & workshops at #ECCB2026 is now open! Share your tools, methods or expertise with the community.
🗓️ Deadline: 5 January 2026
👉 Submit: tinyurl.com/tw-eccb26
💡 ECCB will take place on 31 Aug–4 Sept in Geneva, gathering 1,000+ scientists from academia, industry, & healthcare.
Revisiting on last month’s #SciDataCon2025 session co-chaired by Julien Gobeill and Wolmar Nyberg Åkerström on transforming supplementary data into FAIR datasets 🔍🔓🔗♻️
Such an important challenge for the community.
#FAIRdata #OpenScience
Two weeks ago, Julien Knafou presented his work at #EMNLP2025 in Suzhou.
TransBERT is a framework for pre-training LM using synthetically translated text, and TransCorpus is our toolkit for creating large-scale translated corpora.
Try it here
huggingface.co/jknafou/Tran...
github.com/jknafou/Tran...
The SIB TM group will attend the ELIXIR Data Platform at the Co-Located Event in Marseille (17–20 Nov 2025)! We look forward to discussions on curated data, FAIR practices, AI-ready datasets, and cross-platform collaboration in biomedical and biodiversity research.
elixir-europe.org/events/elixi...
Hello world! The SIB Text Mining Group just landed on Bluesky!
We process clinical, scientific and other types of documents to help biomedical / biodiversity experts make sense of complex information.
Expect smart search tools, data science, AI… and plenty of text wizardry 🧙♂️✨📝 Stay tuned!