Advertisement · 728 × 90

Posts by Josh Weinstock

High Performance GWAS Plotting And Annotation More about what it does (maybe more than one line). Continuation lines should be indented.

Here's our R package for interacting with WGS derived GWAS summary statistics with many rare variants (from e.g. UKB or AofUs). It uses duckdb underneath so it's fast. Includes some helpful tie ins to Open Targets / Encode Screen / Ensembl APIs for annotation. weinstocklab.github.io/gwasplot/ind...

1 month ago 18 12 0 0
Post image

STATGEN 2026 will be in Atlanta @emoryrollins.bsky.social - see more info here: statgen26.emory.edu

4 months ago 1 0 0 0
Preview
Specificity, length and luck drive gene rankings in association studies - Nature Genetic association tests prioritize candidate genes based on different criteria.

How do GWAS and rare variant burden tests rank gene signals?

In new work @nature.com with @hakha.bsky.social, @jkpritch.bsky.social, and our wonderful coauthors we find that the key factors are what we call Specificity, Length, and Luck!

🧬🧪🧵

www.nature.com/articles/s41...

5 months ago 172 74 5 11

Shout to extraordinary collaborators and co-authors on this, Karen Conneely, Cameron Russell, Mitchell Machiela, Marios Arvanitis, Janghee Woo.

For more info, I'm presenting this work on Thursday at #ASHG at 1:45pm, Room 210C

6 months ago 1 0 0 0
Preview
Explore clonal hematopoiesis associations with proteins and disease in UK Biobank.

To make results easier to parse, we also release a summary statistics "portal" to view our results, including lots of QC metrics: somatic.emory.edu

We also release lots of code for this, which was all done on the DNA Nexus RAP in a cost effective way.

6 months ago 0 0 1 0
Post image

The IGH and IGL mutations in particular had really large effect sizes, so we then did a GWAS of a combined IGH + IGL phenotype, which resulted in a single hit: GRAMD1B. Remarkably, this is well characterized risk locus for CLL, suggesting that we are converging on CLL relevant biology.

6 months ago 2 0 1 0
Post image

We then quantified the variance explained on a liability scale of CH to 30 common aging-related diseases. CH explains far more 'liability' scale variance for hematologic malignancies than other classes (as one would expect), though chronic kidney disease appears high here as well.

6 months ago 0 0 1 0
Post image

New hits are generally indeed "weaker" than most classic CH drivers, suggesting that these are just underneath the "tip of the iceberg". We also replicate the telomere attrition mechanism reported here www.nature.com/articles/s41..., finding that the phenomenon is broader than previously observed

6 months ago 0 0 1 0
Post image Post image

The strongest hits are indeed strongly enriched for canonical CH genes, but we also find non-coding mutations at FGF1, UGT2B7, DGKB, the TERT promoter, chr17 centromere, and immunoglobulin loci:

6 months ago 1 0 1 0
Preview
Genome-wide characterization of clonal hematopoiesis reveals extensive non-coding putative driver mutations As humans age, we acquire somatic mutations in our blood, leading to clonal hematopoiesis (CH). Despite the prevalence of clonal hematopoiesis (CH) in aged individuals, recent searches for selective s...

What happens if you search the genomes of ~490K adults for variants that associate with age at blood draw? We actually did this crazy idea, using the UKB WGS (from peripheral blood).

Turns out this 'discovers' classic clonal hematopoiesis drivers and more 🧵👇

www.medrxiv.org/content/10.1...

#ASHG

6 months ago 4 0 1 0
Advertisement
Preview
Strober Lab The Strober lab is a computational group at Boston Children's Hospital (a Harvard Medical School affiliated hospital) focused on developing statistical and machine learning tools applied to human gene...

Exciting updates!!
(1) I just opened my lab at Boston Children’s Hospital (Harvard-affiliated)
(2) I’m hiring a postdoc focused on integrating GWAS and functional genomic data. Reach out if you’re interested or connect at ASHG next week!
(3) Learn more at stroberlab.com

6 months ago 31 10 0 1

Excited for a major milestone in our efforts to map enhancers and interpret variants in the human genome:

The E2G Portal! e2g.stanford.edu

This collates our predictions of enhancer-gene regulatory interactions across >1,600 cell types and tissues.

Uses cases 👇

1/

7 months ago 84 37 2 1
Preview
Beneath the surface of the sum When genetic interactions matter and when they don't

I wrote about gene-gene interactions (epistasis) and the implications for heritability, trait definitions, natural selection, and therapeutic interventions. Biology is clearly full of causal interactions, so why don't we see them in the data? A 🧵:

7 months ago 145 47 1 6

You can find PRSFNN code here: github.com/weinstockj/PRS . It takes in GWAS summary statistics + LD reference panel + annotations, and we compute the posterior using variational inference to make it fast.

A pleasure to build this with @aprilkim.bsky.social and @alexisbattle.bsky.social

9 months ago 2 0 0 0

We observed similar non-linear effects with AlphaMissense predictions, where low impact coding variants were prioritized, but highly pathogenic variants were not prioritized (presumably because these are so rare in real GWAS).

9 months ago 2 0 1 0

More compellingly, it also learns non-linear effects wrt to chromatin accessibility - variants in cCREs present in 10-50 cell types were more highly prioritized than variants in cCRE that are present in numerous (> 50) cell types, suggesting a preference for cell/tissue specificity.

9 months ago 2 0 1 0

In our annotation curation, we included lots of scATAC, cCREs from ENCODE, conservation from Zoonomia, pathogenicity from AlphaMissense, among others. Generally - PRSFNN "learns" that low-frequency SNPs in accessible chromatin are likely to have larger effect sizes (maybe not that surprising).

9 months ago 2 0 1 0

We connected the SNP annotations to the parameters of the prior distribution on the weights in a novel way with a neural network, so we're calling it Polygenic Risk Scores with Functional Neural Network (PRSFNN). We were excited to see that PRSFNN does well in benchmarks (at least in our hands).

9 months ago 1 0 1 0
Preview
Polygenic prediction of phenotypes with a neural empirical Bayes approach Polygenic risk scores (PRS) estimate the expected value of a phenotype based on individual genotypes. Although statistical approaches for calculating PRS have advanced considerably in recent years, fe...

Really excited to share our new PRS method, developed with @aprilkim.bsky.social and @alexisbattle.bsky.social ! Our approach is to use a lot of recently developed functional annotations to better estimate the weights of the SNPs.
www.medrxiv.org/content/10.1...

9 months ago 26 10 1 1
Advertisement

Congratulations!

10 months ago 2 0 0 0
Integration of transcriptomics and long-read genomics prioritizes structural variants in rare disease An international, peer-reviewed genome sciences journal featuring outstanding original research that offers novel insights into the biology of all organisms

Happy to share our work characterizing functional rare SVs in rare diseases with long-read genome sequencing and transcriptomic outlier data: genome.cshlp.org/content/earl...

1 year ago 10 7 1 1

Sharing some of our lecture slides on statistical genetics! 🧬

Co-taught with Mike Epstein, Dave Cutler, Karen Conneely, Jingjing Yang, Jian Hu.

Mendelian randomization: weinstocklab.org/lecture_slid...

Biobank scale GWAS methods: weinstocklab.org/lecture_slid...

Hope they're helpful!

1 year ago 3 0 0 0
Home | Mostafavi Lab

We have multiple postdoc positions available in my group at NYU. Join us if you're interested in complex trait genetics and biology. More information about the lab on our website: mostafavilab.org

1 year ago 19 15 0 0

Specificity, length, and luck: How genes are prioritized by rare and common variant association studies www.biorxiv.org/content/10.1101/2024.12....

1 year ago 46 24 0 1
Preview
Cell type and dynamic state govern genetic regulation of gene expression in heterogeneous differentiating cultures Popp et al. generate dozens of cell types from 53 human iPSC lines in order to characterize the dynamic genetic regulation of gene expression across early stages of cellular differentiation. Accessing...

Excited to see our study on genetic regulation in heterogeneous differentiating cultures out in final form!

www.cell.com/cell-genomics/fulltext/S2666-979X(24)00330-6

1 year ago 19 6 1 1

I'm developing a pipeline to call CHIP mutations in UK Biobank using the DNA Nexus RAP that is fast/cheap/reproducible. Initial results are promising; calls looks reasonable and cost to do this across all of UKB is likely < 500$.

Feel free to DM if of interest.

1 year ago 4 1 0 0

Great talk from Zeyun Lu on integrating cis-eQTLs with perturb-seq to increase discovery in Mendelian randomization #ASHG2023 !

2 years ago 14 2 0 1