There was concern it would not be helpful for identification purposes for the few people who do not already know me
Posts by Jacob Schreiber
The Programmable Genomics Lab at @umasschan.bsky.social
just officially launched our site!
We have a "simple" goal: to develop synthetic regulatory elements that target every cell type in every tissue, and control payload dosage/duration
Check it out!
programmable-genomics.github.io
Our preprint "Predictive design of tissue-specific mammalian enhancers that function in vivo in the mouse embryo" is on bioRxiv: www.biorxiv.org/content/10.6... . Amazing collaboration by @shenzhichen1999.bsky.social, Vincent Loubiere (@impvienna.bsky.social,@viennabiocenter.bsky.social),... (1/2)
There's a ton of really cool stuff in the paper. Take a skim if you have time!
In collaboration with @alex-stark.bsky.social , we designed cell type-specific enhancers and experimentally verified them using STARR-seq. Strikingly, we found that Ledidi can even design enhancers with stronger regulatory activity than the strongest endogenous enhancers.
We designed regulatory DNA in dozens of settings and found that we could achieve our target objective with surprisingly few edits (biology alert). Because we focus on edits, our designs are less likely to overfit to the model or inadvertently alter properties that are not well captured by the model.
Ledidi is a DNA design method that designs *edits* to a template while explicitly minimizing the number of needed edits. This allows you to build upon informative starting material, which our genomes are full of, and simply edit in the final touches, similar to an Instagram filter or video touch-up.
After a huge amount of work w/ @alex-stark.bsky.social's group, a new version of our Ledidi preprint is now out!
In an era of AI-designed proteins, the next leap will be controlling when, where, and how much of these proteins are expressed in living cells.
www.biorxiv.org/content/10.1...
The @impvienna.bsky.social is a unique place where talented researchers are doing amazing science. I'm glad I had an opportunity to spend some time there, and am looking forward to the next excuse to visit Vienna!
That sounds too biologically important for me to be involved.
My first @umasschan.bsky.social/@impvienna.bsky.social affiliated paper is up!
tomtom-lite is a re-implementation of tomtom targeting the ML age of genomics. Fast annotations ("what is this motif?") and simple large-scale discovery of motifs.
Check it out!
academic.oup.com/bioinformati...
Why is it a problem to translate C++ to changng standards when you can just use Agenic AI?
Tune in for a great MIA talk on ML for regulatory genomics by @jmschreiber91.bsky.social and Gregory Andrews now! 🧪
broad.io/mia
(Will also be available online on our YouTube playlist later: www.youtube.com/watch?v=sMTO...)
I'm happy to share that our gReLU package is now published in Nature Methods!
www.nature.com/articles/s41...
they told the flight attendants to sit down for the second half of the flight (an hour) because it was entirely turbulence
had to land in a nor'eastern storm in boston and i'm ready to move back to the west coast forever
Now that I'm settled in at @umasschan.bsky.social, I'm hiring at all levels: grad students, post-docs, and software engineers/bioinformaticians!
The goal of my lab is to understand the regulatory role of every nucleotide in our genomes and how this changes across every cell in our bodies.
I figure I can spend the time until I get tenure on answering the question, and then the time after tenure arguing about what "regulatory" means
If you're interested, please reach out with your CV and which topics you'd be interested in working on!
- Genomics Software Ecosystem: A major obstacle to our goal is the lack of simple+scalable software that everyone can use. Come build this with me. Training a lightweight deep learning model and using it for design/interpretability/VE prediction should be no more challenging than mapping reads.
- Foundation Models: As someone involved in ML, I am legally required to be working on this topic.
We have an array of ML-based projects for going after this, focusing on the following topics:
- DNA Design ( 🧬 ) We have shown that Ledidi (www.biorxiv.org/content/10.1...) can precisely design DNA, and now it's time to push the boundaries in several directions w/ some very cool collaborations.
Now that I'm settled in at @umasschan.bsky.social, I'm hiring at all levels: grad students, post-docs, and software engineers/bioinformaticians!
The goal of my lab is to understand the regulatory role of every nucleotide in our genomes and how this changes across every cell in our bodies.
It was suggested that the audience may not appreciate/understand :(
is it a good idea to wear a "join, or die!" hat to a big talk in europe? please say yes
the greatest productivity hack is having a grant deadline. there's so much other stuff you can do when you're supposed to be working on a grant.
I was delighted to have the unexpected opportunity to give a keynote at MLCB 2025 in NYC last week. I used it to explain how I view deep learning models in genomics not as "uninterpretable black boxes" but as indispensable tools for understanding genomics + designing the next gen of synthetic DNA.
for some reason i thought being a professor would involve more mentoring and research and less filling out disclosures concerning whether plants and seeds were used in my computational study
stocking up the new apartment with essentials
In the genomics community, we have focused pretty heavily on achieving state-of-the-art predictive performance.
While undoubtedly important, how we *use* these models after training is potentially even more important.
tangermeme v1.0.0 is out now. Hope you find it useful!