Advertisement Β· 728 Γ— 90

Posts by Marnix Medema

Post image Post image

Congratulations πŸ‘ πŸŽ‰ to Felicia Wolters to compete two studies at the same time! πŸ™Œ

They describe a FAIR compliant plant metabolomics dataset and a systematic profiling thereof!

Thanks to all coauthors!!

#ProudPI in #CompMetabolomics
#metabolomics

See for details:

www.linkedin.com/posts/felici...

15 hours ago 9 1 0 1
PARAS: High-Accuracy Machine Learning of Substrate Specificities in Nonribosomal Peptide Synthetases Nonribosomal peptides are diverse natural products with important applications in medicine and agriculture. Bacterial and fungal genomes contain thousands of nonribosomal peptide biosynthetic gene clusters (BGCs) of unknown function, providing a promising resource for peptide discovery. Core structural features of such peptides can be inferred by predicting the substrate(s) of adenylation (A) domains in nonribosomal peptide synthetases (NRPSs). However, existing approaches to A domain prediction rely on limited data sets and often struggle with domains selecting large substrates and domains from underrepresented taxa. Here, we systematically curate and computationally analyze 3653 A domains and present two high-accuracy specificity predictors, PARAS and PARASECT. A type of A domain with unusually high l-tryptophan specificity was identified through the application of PARAS. Cloning and expression of the biosynthetic gene cluster encoding the NRPS showed that it directs the biosynthesis of tryptopeptin-related metabolites in Streptomyces species. Together, these technologies will accelerate the characterization of novel NRPSs and their metabolic products. PARAS and PARASECT are available at https://paras.bioinformatics.nl.

The paper can be read here: pubs.acs.org/doi/10.1021/...

1 week ago 1 0 0 0
PARAS: High-Accuracy Machine Learning of Substrate Specificities in Nonribosomal Peptide Synthetases Nonribosomal peptides are diverse natural products with important applications in medicine and agriculture. Bacterial and fungal genomes contain thousands of nonribosomal peptide biosynthetic gene clusters (BGCs) of unknown function, providing a promising resource for peptide discovery. Core structural features of such peptides can be inferred by predicting the substrate(s) of adenylation (A) domains in nonribosomal peptide synthetases (NRPSs). However, existing approaches to A domain prediction rely on limited data sets and often struggle with domains selecting large substrates and domains from underrepresented taxa. Here, we systematically curate and computationally analyze 3653 A domains and present two high-accuracy specificity predictors, PARAS and PARASECT. A type of A domain with unusually high l-tryptophan specificity was identified through the application of PARAS. Cloning and expression of the biosynthetic gene cluster encoding the NRPS showed that it directs the biosynthesis of tryptopeptin-related metabolites in Streptomyces species. Together, these technologies will accelerate the characterization of novel NRPSs and their metabolic products. PARAS and PARASECT are available at https://paras.bioinformatics.nl.

Link to the paper: pubs.acs.org/doi/10.1021/...

1 week ago 0 0 0 0
PARAS

PARAS (and its sister algorithm PARASECT) are available at paras.bioinformatics.nl and via antiSMASH.

1 week ago 0 0 1 0

And above all kudos to #BarbaraTerlouw for her perseverance during multiple years of hard work, and to #ChuanHuang, #DavidMeijer, @jcedielbecerra.bsky.social and #RuolinHe for major contributions!

1 week ago 2 0 1 0

Thanks to great collaborations with #GregChallis, @wildtypemc.bsky.social , @gillesvanwezel.bsky.social #SerinaRobinson #MattJenner #FabrizioAlberti and many others.

1 week ago 2 0 1 0
Post image Post image

To showcase its practical use, PARAS was employed in collaboration with the Challis group for the experimental characterization of the tryptopeptin biosynthetic gene cluster from Streptomyces strains. 5/n

1 week ago 0 0 1 0
Post image Post image

Benchmarks on independent hold-out data showed that PARAS outperformed previous algorithms. Comparison of a wide range of both older and recent algorithms indicates that the volume of training data, not model choice, is the key determining factor of accuracy in these data regimes. 4/n

1 week ago 0 0 1 0
Post image

Computational analysis of the data also highlighted how different clades of independently evolved A domains can bind the side chains of large substrates using
architecturally distinct pockets, a crucial insight to improve prediction. 3/n

1 week ago 1 0 1 0
Post image Post image

Key to the improvements in accuracy where increases in volume and accuracy of the training data, partly fueled by large-scale community annotations (MIBiG) and followed by painstaking further manual curation to correct errors. For certain taxa, like fungi, training data increased >5-fold. 2/n

1 week ago 0 0 1 0
Advertisement
Post image

Now out in @acs.org JACS Au, the manuscript by #BarbaraTerlouw et al. describing PARAS, a high-accuracy machine-learning algorithm to predict substrate specificities of nonribosomal peptide synthetase (NRPS) adenylation domains, key for estimating natural product structures from BGC sequence. 1/n

1 week ago 27 7 3 1

Check out this blog from @catarinacarolina.bsky.social on the story behind the paper: communities.springernature.com/posts/making... A story of overcoming challenges, careful engineering and collaborative teamwork.

1 month ago 7 3 0 0
Preview
Microbial secondary metabolites: advancements to accelerate discovery towards application Nature Reviews Microbiology - In this Review, Dinglasan and colleagues explore innovations that facilitate rapid microbial secondary metabolite discovery, focusing on recent techniques for the...

Our new review article "Microbial secondary metabolites: advancements to accelerate discovery towards application" in Nature Reviews Microbiology is now published!

rdcu.be/d6BHX

1 year ago 47 20 0 0
Transporter genes in biosynthetic gene clusters predict metabolite characteristics and siderophore activity An international, peer-reviewed genome sciences journal featuring outstanding original research that offers novel insights into the biology of all organisms

thanks @acritschristoph.bsky.social ! And of course we were fortunate that we could build upon some of your excellent earlier work: genome.cshlp.org/content/31/2...

1 month ago 1 0 0 0

Kudos to @zachreitz.bsky.social for the great work and thanks to funders and collaborators! @erc.europa.eu

1 month ago 0 0 0 0
Post image

Last but not least, it allowed Bita Pourmohsenin from @nadineziemert.bsky.social 's group to reconstruct the evolutionary history of NRP metallophores, which supports that some chelating groups may predate the Great Oxygenation Event.

1 month ago 1 0 1 0
Post image

Several metallophores were then experimentally validated in the lab of Alison Butler at UCSB.

1 month ago 0 0 1 0
Post image

The new method allowed Zach to perform a systematic analysis of metallophore biosynthetic diversity across bacterial genomes, identifying BGCs encoding diverse combinations of chelating moieties.

1 month ago 0 0 1 0
Post image

The approach has a very high precision, and only occasionally fails when these subclusters are encoded elsewhere in the genome. Combining it with metallophore transporter detection further boosts performance.

1 month ago 1 0 1 0
Advertisement
Post image

In this work, @zachreitz.bsky.social developed a systematic approach to predict which BGCs encode production of metallophores, based on carefully tuned rule-based detection of chelating group biosynthetic subclusters.

1 month ago 1 0 1 0
Preview
Automated genome mining predicts structural diversity and taxonomic distribution of peptide metallophores across bacteria Automated detection of metallophore biosynthesis reveals that metal-chelating non-ribosomal peptides are widespread, chemically diverse, and deeply rooted in bacterial evolution.

Are you interested in how to predict functions of natural product biosynthetic gene clusters (BGCs) and their products? And/or do you love metallophores and would love to identify their producers in microbiomes? Check out @zachreitz.bsky.social 's new paper!
elifesciences.org/articles/109... 1/n

1 month ago 27 12 4 1
Assistant/Associate/Full Professor Computational Biology Assistant/Associate/Full Professor Computational Biology

Sounds like a great opportunity for a professor position in computational biology in the Netherlands careers.universiteitleiden.nl/job/Assistan...

1 month ago 13 13 0 0

All in all, thanks to the engineering efforts of Arjan Draisma, @catarinacarolina.bsky.social, Nico Louwen, Satria Kautsar, @jorgenavarro.bsky.social and the rest of the author team, and shout-out to the
@jgi.doe.gov
and
@nigelmouncey.bsky.social .bsky.social
for a wonderful collaboration.

1 month ago 4 0 0 0
Post image

At the same time, for mid-size datasets (most frequent use case), BiG-SCAPE v2 approaches the speed of BiG-SLiCE. For larger datasets, BiG-SCAPE has become more resource-efficient, which now allowed us to analyse >250k BGCs (the antiSMASH database) with it in just a few days on a compute server.

1 month ago 3 0 1 0
Post image

Benchmarking on several different types of ground truth datasets shows the increases in accuracy, with BiG-SLiCE v2 often matching BiG-SCAPE v1 accuracy, and BiG-SCAPE v2 combining its speed increase with a small but notable accuracy gain as well.

1 month ago 4 0 1 0
Post image

BiG-SCAPE was completely rewritten from scratch over the past few years, and now features new alignment modes, a more prominent query mode (which allows you to search genomes from a single BGC query) and more scalable interactive visualizations and data selection/filtering.

1 month ago 5 0 1 0
Post image

Now out in @natcomms.nature.com :
versions 2.0 of both BiG-SCAPE and BiG-SLiCE! With significant speed and accuracy increases, as well as new interactive functionalities.
Read the full paper here #openaccess:
www.nature.com/articles/s41...

1 month ago 39 20 1 2

Make sure to join us in the MIBiG Annotathons! The MITE database (mite.bioinformatics.nl) will join the efforts! If you are interested in tailoring enzymes/maturases, make sure to join us!

2 months ago 9 3 0 0

Come join us again in a next round of this massive online open science community effort! πŸ’ͺ
Sign up using the link in the thread.
It’s great fun, and really helps the scientific community. What more can you ask? πŸ™‚

2 months ago 13 8 0 0
Advertisement
Post image

The MIBiG 5.0 Annotathon is coming soon, and registration is now open!

🧬 Does your research involve biosynthetic gene clusters? Do you love natural product biosynthesis? Do you have an interest in rare & exotic enzymes? We can use your help & expertise.

Register here πŸ‘‰ forms.gle/C1cWcLHtrjT2...

2 months ago 26 21 1 5