Advertisement · 728 × 90

Posts by Yana Bromberg

Post image Post image

The slide you submitted | The slide you’re presenting
#tcteac #PragueSights

10 months ago 57 11 1 2
Video

Yana Bromberg at the Woodstock Night Science conference #TCTEAC @yanabromberg.bsky.social

10 months ago 10 2 0 0
Protein RNS scores differ from those of randomized sequences and, also, across protein subsets (structure vs disorder vs design)

Protein RNS scores differ from those of randomized sequences and, also, across protein subsets (structure vs disorder vs design)

While our focus is on protein embeddings, the same intuition applies to any domain using latent representations, from medical imaging to physics.
We are looking forward to hearing how your models improve with RNS!

11 months ago 0 0 0 0

Main findings:
• Variant effect prediction accuracy jumped from ~60% to ~90% for low vs high-reliability embeddings.
• Hundreds to thousands of human proteins, per model, may be poorly captured.
• Our score flags low-reliability embeddings, guiding better model training and downstream fine-tuning.

11 months ago 0 0 1 0
Preview
Quantifying uncertainty in Protein Representations Across Models and Task Embeddings, derived by language models, are widely used as numeric proxies for human language sentences and structured data. In the realm of biomolecules, embeddings serve as efficient sequence and/or...

I’m super excited to share our work (with Prabakaran Ramakrishnan) on scoring embedding reliability. We propose the RNS (random neighbors) score that improves the next steps in model use, e.g. variant effect prediction, structure modeling, function annotation, etc.
www.biorxiv.org/content/10.1...

11 months ago 4 0 1 1

Are you using AI models on protein or DNA sequences? Did you maybe forget that their embeddings come with no obvious measure of reliability?
Imagine how sequence analysis would suffer if source sequences were sometimes subtly randomized. This is exactly what happens if we use low-quality embeddings.

11 months ago 3 0 1 0
Photo of a slide titled "Hot Tub Hypotheses" - featuring a photo of the conference hot tub and our Ten Simple Rules for Drawing Scientific Comics paper

Photo of a slide titled "Hot Tub Hypotheses" - featuring a photo of the conference hot tub and our Ten Simple Rules for Drawing Scientific Comics paper

Shout out to Hot Tub concocted papers at #psb2025! Our Ten Simple Rules for Scientific Comics arose from a hot tub hypothesis with @yanabromberg.bsky.social - "Hey, I bet we could write a paper on scientific comics!" @pacsym.bio

Do you have any 'hot tub' papers?
journals.plos.org/ploscompbiol...

1 year ago 4 2 0 0

Love all of it! The big question is are you on this side of the pond yet?

1 year ago 1 0 1 0
Preview
A synthetic protein-level neural network in mammalian cells Artificial neural networks provide a powerful paradigm for nonbiological information processing. To understand whether similar principles could enable computation within living cells, we combined de n...

This and 'mirror' cells... and we are done
www.science.org/doi/10.1126/...

1 year ago 0 0 0 0
Advertisement
Post image

Association of polygenic scores for neuropsychiatric traits with self-reported professions based on analysis of 420k individuals from UK Biobank and Million Veteran Program. Look at the 'arts & design' category. Artistic talent comes at a cost--a piece of your mind :)

1 year ago 118 40 7 16
Preview
Deciphering enzymatic potential in metagenomic reads through DNA language model The microbial world plays a fundamental role in shaping Earth's biosphere, steering global processes such as carbon and nitrogen cycling, soil rejuvenation, and ecological fortification. An overwhelmi...

Excited to share our latest work in #metagenome analysis. We built REMME/REBEAN to analyze and label sequencing #reads.
End result? We can label activity of #microbiome #proteins with minimal homology to #enzymes we've seen before.

Great work by R. Prabakaran!
www.biorxiv.org/content/10.1...

1 year ago 3 0 0 0