Happy to share our new preprint on non-coding genetic variation in the human brain and Parkinson's disease. Great team effort with @alexanrna.bsky.social, @juliedeman.bsky.social, Koen Theunis, and all co-authors, supervised by @steinaerts.bsky.social and @jdemeul.bsky.social.
Thread below:
Posts by Stein Aerts
Very proud of this and so cool that enhancer-level models can predict the effect of genetic variation. There is so much personal variation in terms of gene regulation in the human brain, it is fantastic to uncover this thanks to technology (whole-genome sequencing and single-cell multiomics) and AI
In addition to the bioRxiv this is also pilot for a new interactive preprint developed by @curvenote.com w/ support from @hhmi-science.bsky.social including directly embedded Jupyter notebooks for fig reproduction, data, models, prediction tracks, code, etc
shendure.curve.space/articles/evo...
New preprint @cxqiu.bsky.social @jshendure.bsky.social ! Can we learn regulatory grammars of human cell types — by training on mouse development and transferring across 241 mammalian genomes? Introducing STEAM & a whole-organism scATAC-seq atlas from E10 to birth.
www.biorxiv.org/content/10.6...
Latest from Shendure & Qiu labs (@cxqiu.bsky.social)
)! We combined a new 4M cell mouse whole embryo scATAC-seq atlas (E10-P0), millions of 'evolutionarily coherent' orthologs from 241 mammalian genomes (Zoonomia), and the CREsted CNN framework (@steinaerts.bsky.social).
We launched a new Group Leader vacancy in our Center for AI & Computational Biology - VIB.AI @vibai.bsky.social - with a Professorship at Ghent University. Join us with your most creative AI+Biology research plan! Apply before 31st May vib.ai/en/group-lea...
CREsted is finally out! You can find the article, together with a summarizing Research Briefing, in thread. 🦎
CREsted: an efficient and user-friendly toolbox for analysis, modeling and design of cell-type-specific enhancers.
www.nature.com/articles/s41...
The @steinaerts.bsky.social lab published CREsted, an end-to-end modeling framework to
🧬 Train sequence-based enhancer models on large sc datasets
🔍 Decode enhancer logic with nucleotide-level interpretability
⚙️ Design synthetic enhancers with cell-type specificity
https://tinyurl.com/ypurmrw5
Full house today for the Methusalem BioMedAI kickoff!
The labs of @steinaerts.bsky.social, @joanampereira.bsky.social, @ppjgoncalves.bsky.social & Maarten De Vos came together to launch this long-term research program on explainable and generative AI for biomedical discovery.
Let's go!
The @steinaerts.bsky.social lab is looking for a postdoctoral researcher to develop next-generation sequence-to-function models for glioblastoma, one of the most aggressive brain cancers.
More info & how to apply 👉 https://vib.ai/en/opportunities#/job-description/130090
Last summer I spent 4 months working at the @alleninstitute.org as a Visiting Scientist. Recently we released some preprints about the work we collaborated on, where from new multiome atlases of CNS regions we tried to decipher underlying enhancer logic with CREsted (among many other things). (1/n)
Introducing IZIKAI. ✨
My son, Juul Aerts, is on vocals and piano, and the band just dropped their debut single, "Spark."
🎧 Listen to "Spark" here: open.spotify.com/track/7D8KxZ...
📸 Follow their journey: www.instagram.com/izikai__/
#IZIKAI #ProudDad
Outstanding @science.org study on the evolution of gene regulation shaping
#cerebellum development 🧪🧠🧬
@ioansarr.bsky.social @marisepp.bsky.social @tyamadat.bsky.social @steinaerts.bsky.social @kaessmannlab.bsky.social
www.science.org/doi/10.1126/...
Big congrats to the entire Kaessmann lab for this spectacular achievement and beautiful insights. It was a great honour to contribute to this study and to host Ioannis in our lab, an absolutely brilliant scientist. Evolution of genomic enhancers controlling neuronal cell types is just too cool..
Paper alert! 💻 How many cells do you need to train reliable deep learning models in regulatory genomics? We asked how data quality, sequencing depth, and dataset size affect training of sequence-to-function models from scATAC-seq. Out now www.nature.com/articles/s41...
(details below)
Hydrop-v2 is now published ! Allows generating cheap scATAC-seq training data for enhancer modeling with CREsted. Make sure to check out the 600K cell atlas of the last 4 hours of Drosophila embryo development. Fun to use bioML for technology benchmarking :)
🚀 Proudly introducing the VIB-KU Leuven Center For Neuroscience, a merger of the two former VIB research centers VIB-KU Leuven Center for Brain & Disease Research and Neuro-Electronics Research Flanders (NERF)! Our new motto: Bold Science, Real Impact.
www.youtube.com/watch?v=uhaq...
New preprint from the lab and wonderful work by Seppe de Winter:
System-wide extraction of cis-regulatory rules from sequence-to-function models in human neural development
www.biorxiv.org/content/10.6...
tSNE dimensionality reduction of facial mesenchyme TF-MINDI seqlets colored based on TF-family. The coordinator instances are circled and an arrow drawn to a PCA of those coordinator instances colored based on coordinator motif score. This shows that TF-MINDI captures multiple coordinator affinities. For each affinity bin a TF binding motif logo is shown.
To test the sufficiency of the TF-MINDI extracted enhancer code rules we turn to synthetic enhancer design in facial mesenchyme cells. A homeobox-ebox dimer motif (Coordinator) has been shown to be instrumental for this cell type. TF-MINDI identified Coordinator instances at varying affinities.
A large tSNE dimensionality reduction showing PBMC TF-MINDI seqlets colored based on TF-family. This is surrounded by four smaller tSNE dimensionality reductons colored based on TF-ChIP-seq Z-score. Showing specific enrichment of TFs in TF binding sites annotated to the family of that TF. Bottom right shows ROC curve, comparing TF-MINDi based prediction of ChIP-seq signal with motif enrichment based prediction (cisTarget). This shows that TF-MINDI is more accurate.
We validate the TF-MINDI instances using ChIP-seq data in PBMC. Showing that TF-MINDI is more accurate compared to traditional motif enrichment analysis tools.
TF-MINDI is out! A new method to learn cis-regulatory codes through rich embeddings of TF binding sites. TF-MINDI decomposes motif neighbourhoods, and works downstream of any sequence-to-function deep learning model. We deeply study the enhancer code in human neural development, check out the thread
Check out the preprint: doi.org/10.64898/202... and the TF-MINDI package: github.com/aertslab/TF-MINDI. With @lukasmahieu.bsky.social ’s help this has become an amazing and user-friendly package, please give it a try and provide feedback.
Figure showing four panels. Top left: TF-MNDI logo (pink background and yellow letters), showing the text: "Transcription Factor Motif Instance Neighborhood Decomposition and Interpretation". Top right: TF-MINDI workflow. 1. seqlets are called (showing nucleotide level contribution scores and seqlets as blocks of nucleotides with high contribution). 2. Seqlets are embedded (showing, for each seqlet, a representation of a vector as a heatmap) and 3 seqlets are clustered and annotated (showing a schematic representation of a dimensionality reduction with seqlets colored based on TF-families as well as TF binding motif logos). Bottom left, tSNE dimensionality reduction of organoid seqlets colored based on TF family. Bottom right, similar tSNE dimensionality reduction for embryo seqlets.
To obtain high dimensional embeddings of S2F identified motifs, annotate TFBS across cell-type specific peaks and model TFBS co-occurrences we developed a new python package named TF-MINDI. Resulting in > 400k annotated TFBS instances across the genome (each dot in the tSNE below is one instance).
We are thrilled to share our new pre-print: “System-wide extraction of cis-regulatory rules from sequence-to-function models in human neural development”. S2F-deeplearning models can accurately encode enhancers, yet decoding these models into human-interpretable rules remains a major challenge.
TF-MINDI is out! A new method to learn cis-regulatory codes through rich embeddings of TF binding sites. TF-MINDI decomposes motif neighbourhoods, and works downstream of any sequence-to-function deep learning model. We deeply study the enhancer code in human neural development, check out the thread
This is the happy face of four researchers embarking on a cool scientific collaboration backed by 7-years of structural financing!
Congrats @steinaerts.bsky.social, @joanampereira.bsky.social, @ppjgoncalves.bsky.social, and Maarten De Vos on your Methusalem grant.
https://tinyurl.com/nvcardzy
Open Senior Bioinformatician position at
@sangerinstitute.bsky.social
Tree of Life, to work on the Biodiversity Cell Atlas initiative with @marakat.bsky.social and me.
📅 Apply by January 18
🔗 sanger.wd103.myworkdayjobs.com/en-US/Wellco...
Please share with anyone who might be interested!
Looking to start your lab in generative biology / AI?
Come join us at the @sangerinstitute.bsky.social
Sanger is core-funded so you can generate data at scale to train the next generation of models and understanding. Design/Engineering/Chemistry/Proteins/Pathways!
pls RT
tinyurl.com/GenGenFaculty