Advertisement · 728 × 90

Posts by Lorenzo Pantolini

@lorenzopantolini.bsky.social and I are headed to @iclr-conf.bsky.social at Rio soon, with talks about this work at @gembioworkshop.bsky.social and LMRL workshops. Reach out to chat about representation learning for de novo protein design! 🫖

1 week ago 9 2 0 0

I'm excited to share *Stoic*, a method for fast and accurate protein complex stoichiometry prediction directly from sequence. Preprint: www.biorxiv.org/content/10.6... 🧵👇(1/10)

1 month ago 40 15 2 4
Stoic:  Fast and accurate protein stoichiometry prediction (preprint header with authors and affiliations)

Stoic: Fast and accurate protein stoichiometry prediction (preprint header with authors and affiliations)

Meet Stoic from @daniil-litvinov.bsky.social and @ninjani.bsky.social: embeddings to predict stoichiometry of protein complexes from sequence fast and accurately 🧬🧩💻🤩

www.biorxiv.org/content/10.6...

1 month ago 30 10 2 1
The Critical Assessment of Structure Prediction (CASP) experiment is calling for prediction targets: Immune Complexes, Organic Ligand-Protein Complexes, Nucleic Acids and Complexes, Conformational Ensembles, Difficult Protein Structures and Complexes. 
Rule of Thumb: If AlphaFold3 can generate a high-quality model, it is likely not a CASP-grade challenge. If it struggles, we want it.

The Critical Assessment of Structure Prediction (CASP) experiment is calling for prediction targets: Immune Complexes, Organic Ligand-Protein Complexes, Nucleic Acids and Complexes, Conformational Ensembles, Difficult Protein Structures and Complexes. Rule of Thumb: If AlphaFold3 can generate a high-quality model, it is likely not a CASP-grade challenge. If it struggles, we want it.

Is #AI hitting a plateau in structure prediction? Help us find out at CASP17! 🧪🧬

Calling for Targets: Immune Complexes, protein - ligand complexes, RNA/DNA, conformational ensembles, membrane proteins, viral origins, and large complexes.

The Rule of Thumb: If AF3 can’t model it, we want it.

1 month ago 48 35 2 3

Remote homology and protein design: two sides of the same coin. Instead of finding remote homologs, we used TEA to design completely de novo proteins, folding into desired TEA sequences.

I always love working with Jay, and “speed-running” this proof of concept was no exception.

2 months ago 9 3 0 0

🚀 New paper in @natmethods.nature.com!
We present OpenStructure's powerful scoring capabilities, used to assess predictionsin CAMEO and CASP.
Read the full study here:
🔗 doi.org/10.1038/s415...
#StructuralBiology #Bioinformatics #OpenStructure #CASP #CAMEO #ProteinStructure

3 months ago 5 4 1 0

I’m presenting this work at the EMBO Computational Structural Biology Workshop in Heidelberg #EMBOComp3D this week, and @workshopmlsb.bsky.social in Copenhagen over the weekend. Let’s connect!

4 months ago 3 0 0 0

Huge thanks to my co-authors: @lauraengist.bsky.social, @ievapudz.bsky.social @martinsteinegger.bsky.social, @torstenschwede.bsky.social, especially to @ninjani.bsky.social! Couldn't have done this without the whole team, including the Swiss-Model development team and the rest of the Schwede group.

4 months ago 1 0 1 0
Preview
GitHub - PickyBinders/tea: The Embedded Alphabet (TEA) The Embedded Alphabet (TEA). Contribute to PickyBinders/tea development by creating an account on GitHub.

Try it out yourself! github.com/PickyBinders/tea. A web-service for search is coming soon at alphabet.scicore.unibas.ch.

4 months ago 0 0 1 0
Advertisement

Ultimately, TEA brings deep learning representation to protein sequence bioinformatics algorithms, such as profiles, phylogenetic trees, motif finding, multiple sequence alignments, and more, all while maintaining the speed and low resource consumption of amino acid sequences. (6/n)

4 months ago 1 0 1 0
Post image

We used TEA to connect >1.5 million singletons in AFDB Clusters, proteins which slipped past structure-based clustering approaches due to disordered or repetitive regions or simply because of low confidence structure predictions. (5/n)

4 months ago 0 0 1 0
Post image

TEA sequences come with a built-in confidence metric in the form of Shannon entropy, which we saw correlates with pLDDT, and can be used to filter out uncertain predictions. (4/n)

4 months ago 1 0 1 0
Post image

Running MMseqs2 with TEA gives extremely fast and highly sensitive results, similar to structural searches, even on unseen folds! Check out our ablations to see how we ended up with the final architecture. (3/n)

4 months ago 0 0 1 0
Post image

By using a contrastive objective, we trained an alphabet enriched with structural information, without the need for the actual structure. This approach ensures that remote homologs expressed with TEA maintain high sequence identity. (2/n)

4 months ago 0 0 1 0
Preview
Rewriting protein alphabets with language models Detecting remote homology with speed and sensitivity is crucial for tasks like function annotation and structure prediction. We introduce a novel approach using contrastive learning to convert protein...

Fresh from bioRxiv our latest work introducing The Embedded Alphabet (TEA), a powerful new representation for protein sequences obtained by discretising ESM2 embeddings into 20 characters.

Pre-print: www.biorxiv.org/content/10.1...

🧵👇(1/n)

4 months ago 28 12 1 3
Preview
Torsten Schwede appointed as new President of the SNSF Research Council from 2025 Swiss National Science Foundation (SNSF)

It is a great privilege and honor to be elected as new President of the SNSF Research Council. I’m very much looking forward to serving the Swiss scientific community in this new role in the coming years.

snf.ch/en/xgVKfkp88...

2 years ago 8 4 0 0