Advertisement Β· 728 Γ— 90

Posts by

AFESM Clusters Foldseek clustered 820M AlphaFold DB + ESMatlas structures

45 novel protein folds in the updated AFESM (AFDB + ESMatlas) manuscript:
β€’ 12 high-confidence folds in AFESM
β€’ 33 by ColabFold-repredicting 2.3M low-quality domains
We show AFDB captures most domains already and ESMfold struggles with novelty
🌏 afesm.foldseek.com
πŸ“„ biorxiv.org/content/10.1...

2 weeks ago 20 9 1 0
Preview
Mirdita Lab - Laboratory for Computational Biology & Molecular Machine Learning Mirdita Lab builds scalable bioinformatics methods.

My time in @martinsteinegger.bsky.social's group is ending, but I’m staying in Korea to build a lab at Sungkyunkwan University School of Medicine. If you or someone you know is interested in molecular machine learning and open-source bioinformatics, please reach out. I am hiring!
mirdita.org

3 months ago 104 55 7 1
Video

End-to-end protein design in the browser through evedesign. Generate and interactively explore designs in 2D/3D and export them as codon-optimized DNA. The underlying open source framework (released soon) is build to easily add new methods, more on that soon.
🌐 evedesign.bio

5 months ago 93 29 2 1

Protein Structure Informed Bacteriophage Genome Annotation with Phold www.biorxiv.org/content/10.1101/2025.08....

8 months ago 13 10 0 0
Video

Folddisco webserver result view update:
- Added description texts for AFDB
- Integrated TaxoView taxonomy visualization & filter by @sunjaelee.bsky.social
- Inter-residue distance clustering by DBSCAN to explore motif diversity.
🌐 search.foldseek.com/folddisco
πŸ“„ www.biorxiv.org/content/10.1...

8 months ago 36 14 0 0
Preview
Metagenomic-scale analysis of the predicted protein structure universe Protein structure prediction breakthroughs, notably AlphaFold2 and ESMfold, have led to an unprecedented influx of computationally derived structures. The AlphaFold Protein Structure Database now prov...

Today at 2 PM at 3DSIG #ISMBECCB2025, @nbordin.bsky.social presents our joint work on metagenomic-scale clustering and novel domain discovery in predicted structures!
πŸ“„ www.biorxiv.org/content/10.1...

Also check out poster:
B-50 lolalign Sensitive structural alignments by Lasse
B-123 BFVD by Rachel

8 months ago 36 8 2 0
Preview
Planetary microbiome structure and generalist-driven gene flow across disparate habitats Microbes are ubiquitous on Earth, forming microbiomes that sustain macroscopic life and biogeochemical cycles. Microbial dispersion, driven by natural processes and human activities, interconnects mic...

Our new preprint is out!
www.biorxiv.org/content/10.1...
In this study, we present the largest systematic analysis of microbiome structure and function, integrating 85K uniformly processed metagenomes from diverse habitats worldwide.
@podlesny.bsky.social @jonas-bio.bsky.social @borklab.bsky.social

9 months ago 28 18 1 4
Post image Post image

Today at 5pm, @eunbelivable.bsky.social will present her work on the Big Fantastic Viral Database (BFVD) at #ISMB2025 in BOSC. She also has a poster B-123 (tomorrow, 22nd), so please drop by to have ta chat and grab some stickers!
πŸ“„ academic.oup.com/nar/article/...

9 months ago 35 9 1 1

Thank you for the nice comments. Most figures are made with β€œsoft” or β€œflat” lighting with gainsboro color for the protein :)

9 months ago 1 0 0 0

I’m excited to share our #Folddisco preprint! πŸš€ We introduce a novel pairwise-geometric feature set and an optimized index structure to enable scalable structural motif search. Dive into our case studies and key results here: www.biorxiv.org/content/10.1...

9 months ago 8 0 0 0
Advertisement
Post image

Folddisco accurately detects discontinuous motifs like zinc fingers and segment-based motifs, previously requiring separate tools. Additionally, we built a SCOPe benchmark by sampling conserved residues from families and measuring the recall up to the first false positive. 3/9

9 months ago 6 1 1 0
Post image

Folddisco builds indexes faster and smaller than previous tools: indexing AFDB50 (53M structures) takes only ~24h vs. ~20 days (extrapolated) for pyScoMotif. Querying a zinc-finger motif across AFDB50 takes just ~13s, up to 48x faster than pyScoMotif. 4/9

9 months ago 3 1 1 0
Post image

Folddisco can annotate proteins: querying a canonical zinc-finger uncovers an uncharacterized oyster protein and metagenomic proteins. It also detects partial catalytic metal sites in E. coli peptide deformylase. All of these hits would be missed by Foldseek or sequence aligners. 5/9

9 months ago 12 1 2 0
Post image

Folddisco can distinguish functional states. We searched GPCR activation motifs (CWxP, NPxxY, DRY), clearly separating active/inactive states. A search in the AFDB shows ~53% active, closely mirroring experimental PDB 54%, suggesting AlphaFold might follow its training conformation distribution. 6/9

9 months ago 5 1 1 0
Post image

Folddisco can be applied for PPI interface searches. When querying an interface between antibody chains (gray/black), it successfully identifies matching interfaces within monomeric antibody fragments (cyan), showcasing its potential to detect novel interaction partners and interfaces. 7/9

9 months ago 5 1 1 0
Video

We provide a user-friendly Folddisco webserver, enabling instant structural motif searches in PDB, AFDB-Proteomes, AFDB50 (available later today), and ESMatlas (ESM30). Explore it here: search.foldseek.com/folddisco 8/9

9 months ago 4 1 1 0

Structural motif search across the protein-universe with Folddisco www.biorxiv.org/content/10.1101/2025.07....

9 months ago 25 13 0 0

We've updated our AFESM website to now include biome filtering, allowing exploration of protein structures adapted to specific environments.
🌐 afesm.foldseek.com
Read more about the work in the skeetorial
πŸ¦‹ bsky.app/profile/mart...
or our preprint
πŸ“„ www.biorxiv.org/content/10.1...

11 months ago 60 22 2 0
Advertisement
Post image

We identified 11,941 novel multi-domain combinations. We found membrane-associated domains (e.g., TonB dependent receptor, highlighting domain recombination rather than new folds as a driver of structural innovation. 5/n

11 months ago 9 1 1 0
Post image

ESMatlas uses MGnify environmental labels. Leveraging this, we computed the lowest common biomes per structural cluster, revealing protein adaptations unique to specific environments, especially extreme ones like hyperthermal, hypersaline, and glaciers. 3/n

11 months ago 5 1 1 0
Post image

AFESM: a metagenomic guide through the protein structure universe! We clustered 821M structures (AFDB&ESMatlas) into 5.12M groups; revealing biome-specific groups, only 1 new fold even after AlphaFold2 re-prediction & many novel domain combos. 🧡
🌐 afesm.foldseek.com
πŸ“„ www.biorxiv.org/content/10.1...

11 months ago 141 70 4 4

It's a big collaborative effort by @jingiyeo.bsky.social @yewonhan.bsky.social @nbordin.bsky.social, Andy Lau, Shaun M. Kandathil, @hbkgenomics.bsky.social, Eli Levy Karin, @milot.bsky.social David T. Jones and Christine Orengo.
Visit our #RECOMB2025 poster (719) & talk (1 pm at B145 on April 29).

11 months ago 10 2 0 0

Check out Folddisco poster at #RECOMB2025!

11 months ago 1 0 1 0
Preview
Discovery of highly active kynureninases for cancer immunotherapy through protein language model Abstract. Tailor-made enzymes empower a wide range of versatile applications, although searching for the desirable enzymes often requires high throughput s

SNU Profs Woon Ju Song & Martin Steinegger (Biology) developed the AI-based SeekRank algorithm to discover enzymes for cancer immunotherapy. doi.org/10.1093/nar/...

1 year ago 7 6 0 0