Do you have any thoughts on ipsae_min/max when using Boltz? This figure was interesting because for Boltz, ipsae_max does better. I think min makes more intuitive sense, but I'm not sure what to make of the results here.
I would love to use AF3 ipsae_min, but the AF3 daily limits make it impractical.
[Plot: increasing number of papers on de novo protein design over time]
We’re getting more and more papers on de novo protein design, but I still don’t really know what it means
What’s your definition of de novo design?
We've updated the Raygun preprint with additional validations (more fluorescence assays, biotin ligase reengineering, EGF optimization, etc.). Take a look!
Here's the preprint: www.biorxiv.org/content/10.1...
Congratulations to @youngsuko9.bsky.social on their top-10 placement in Adaptyv Bio's EGFR-binder competition!
It's an elegant pipeline: using our protein-design method Raygun to expand and modify the template (EGF), then filtering with ProTrek.
foundry.adaptyvbio.com/competition
1/
This is really horrific. These single-cell reference atlases are widely used as-is to train all kinds of models! This is one of the reasons I've been constantly harping about uniform reprocessing & extremely careful QC of large atlases. 1/
@jeremyparkeryang.bsky.social
Don’t let the v*rtual cell mafia see this
So OGT can act as a proxy for thermostability. I feel like there’s a lot of potential in trying to find viable proxies for other important properties. But finding these proxies seems non-trivial.
I prefer mean-pooling because it saves a lot of disk space and felt faster to train. But I realized I mean-pool partly because I noticed it was the status quo, without really asking why.
So I've just been trying to figure out when and how using the [seq, dim]-shaped embeddings can be good.
Does that still apply if you don't flatten the representation from the start? Like in this paper, where the authors use [dim, seq len] embeddings as inputs to a model with 1D convolutions and attention mechanisms, and argue it can extract more information than mean-pooling.
academic.oup.com/bioinformati...
The paper mentions that embeddings need to be compressed for most downstream tasks.
If you aren’t concerned about the computational requirements, would you expect using the uncompressed embeddings as model input to be better than using compressed ones?
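The tradeoff this thread is circling can be sketched in a few lines of numpy (toy shapes and random values; the dimensions are illustrative, not from any specific model in the posts). Mean-pooling collapses the per-residue [seq, dim] matrix to a fixed-size [dim] vector, which is cheap to store and easy to feed to a simple head, but it discards all positional information that a 1D conv or attention model over the full matrix could still use:

```python
import numpy as np

# Toy per-residue embedding, shape [seq_len, dim] (e.g. from a protein
# language model; 128 residues x 320 dims here are arbitrary choices)
seq_len, dim = 128, 320
per_residue = np.random.default_rng(0).normal(size=(seq_len, dim))

# Mean-pooling: average over the sequence axis -> fixed-size [dim] vector.
# Storage drops by a factor of seq_len; positional info is discarded.
mean_pooled = per_residue.mean(axis=0)

# Channels-first view [dim, seq_len], the layout a 1D conv layer would
# consume, keeping position-specific information intact.
conv_input = per_residue.T

print(per_residue.shape)  # (128, 320)
print(mean_pooled.shape)  # (320,)
print(conv_input.shape)   # (320, 128)
```

Nothing here settles which input is better for a given task; it just makes concrete what the compression step throws away.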
Super excited to share our review on genomic deep learning models for non-coding variant effect prediction, with Ayesha Bajwa and Nilah Ioannidis. We’d like this review to be a useful resource, and welcome any feedback, comments, or questions! 1/4
arxiv.org/abs/2411.11158
Hello 🦋 #protein / #microbio / #BioML community! We are excited to release Gaia🌎, a context-aware protein search tool, extending protein search and discovery capabilities beyond sequence and structure to include *genomic context*. Search your favorite protein sequences on gaia.tatta.bio