Advertisement · 728 × 90

Posts by Kaitlin Samocha

gnomAD The Genome Aggregation Database (gnomAD) is a resource developed by an international coalition of investigators, with the goal of aggregating and harmonizing both exome and genome sequencing data from...

If you have more questions, you can check out our FAQ (gnomad.broadinstitute.org/help) or the Forum (discuss.gnomad.broadinstitute.org).

3 weeks ago 0 0 0 0

We are always working to make the resource better and greatly appreciate the feedback we've received from the community!

3 weeks ago 0 0 1 0

A huge thank you to the tireless gnomAD production and browser teams, who made this possible. Special thanks to Katherine Chao, Julia Goodrich, Phil Darnowsky, Ruchit Panchal, Kristen Laricchia, and Mike Wilson.

More details + extra features can be found in the blog post.

3 weeks ago 1 1 1 0
gnomAD v4.1.1 | gnomAD browser The Genome Aggregation Database (gnomAD) is a resource developed by an international coalition of investigators, with the goal of aggregating and harmonizing both exome and genome sequencing data from...

We are excited to share our gnomAD v4.1.1 release
gnomad.broadinstitute.org/news/2026-03...

Major changes:
* Constraint scores on X and Y
* Improved coverage correction
* LOFTEE fix
* Guidance on constraint cut-offs
* New quality flag for low coverage/mappability genes

@gnomad-project.bsky.social

3 weeks ago 10 2 1 1

To nominate disease genes, we introduce two discovery scores: ΔPEPPER flags genes where biological features predicts clinical impact beyond whats published, and DisPo highlights genes under strong constraint with limited literature. Together they prioritize hundreds of candidate genes for follow-up

3 weeks ago 0 1 1 0
Post image

We train a new model trained on biomedical literature (PEPPER_XGB). Mix LOEUF and PEPPER to make an OMELET, which outperforms either each individual model in identifying disease genes

3 weeks ago 1 2 2 0
Precision-recall curves showing LOEUF-MIS outperforming other metrics

Precision-recall curves showing LOEUF-MIS outperforming other metrics

We introduce LOEUF-MIS, combining pLoF and top 1% predicted deleterious missense constraint. This captures not just LoF but also gain-of-function and dominant-negative signals.

3 weeks ago 1 1 1 0
Figure 1a-b, growth of variation with sample size

Figure 1a-b, growth of variation with sample size

gnomAD v4's 5x sample increase benefits both common and rare disease: more common variants observed across ancestries improve diagnostic filtering, while more rare variants strengthen constraint metrics for disease gene detection

3 weeks ago 1 1 1 0
Preview
Integrating 730,947 exome sequences with clinical literature improves gene discovery Accurate estimates of allele frequencies aid in genetic discovery, including rare disease diagnosis, common disease investigations, and population genetics. Here, we present the Genome Aggregation Dat...

Excited to share our new preprint on gnomAD v4! We present the full analysis of 730,947 exomes — new constraint metrics, improved LoF annotation (LOFTEE-2), LLM-based literature curation, and a unified framework for gene discovery and rare disease diagnosis. www.medrxiv.org/content/10.6...

3 weeks ago 36 17 2 1

If you are submitting an NIH grant in February, you will be required to use SciENcv to prepare you biosketch.

IT IS MUCH WORSE THAN YOU CAN POSSIBLY IMAGINE.

Set aside *at least* 4 hours just to transfer an existing an biosketch into SciENcv.

3 months ago 179 93 27 28
Advertisement

I guess we all decided to work on the biosketch at the same time?

3 months ago 0 0 1 0

Ugh. This is the part I didn’t finish yet. The interface kept crashing and it was a nightmare to find things.

I have a nicely curated MyBibliography elsewhere but it seems like it didn’t link to that one.

3 months ago 0 0 0 0

Good to know. 🥲

3 months ago 0 0 0 0

Is there an official "this is what it should look like" version floating around? I was working on it today and the preview looks weird, but maybe that is just the way it is.

3 months ago 0 0 2 0
Post image

Why do some individuals defy their polygenic score?

In the largest study of its kind (402k UKB individuals; 7 continuous traits + 3 diseases), we asked: If your phenotype deviates from common-variant polygenic score prediction, what's driving that difference?

www.medrxiv.org/content/10.6...

3 months ago 48 26 2 3

Huge congratulations on your next step! Sounds like an amazing opportunity. :)

4 months ago 2 0 1 0
Post image

Massive single-cell study by Kanai et al (www.medrxiv.org/content/10.1...):
- Once statistical power is high, constrained genes have more (though weaker) eQTLs.
- Chromatin-QTLs near constrained genes have "normal" effect sizes, colocalize more with disease, but exhibit attenuated peak-gene effects.

4 months ago 34 9 1 0
Preview
Genotype-level quality control substantially reduces error rates in population-scale whole-genome sequencing Population-scale whole-genome sequencing data will contain many individual-level genotype errors, even after allele-level quality control (QC). We establish the need for genotype-level QC using UK Bio...

New paper on everyone’s favourite topic, QC!
We show why you should do genotype-level QC on your WGS data

www.biorxiv.org/content/10.1...

Very real quotes about this paper -
“The most exciting, mind-blowing paper of the year!”
“On a par with Fisher 1918”
“I read it every night. Just so beautiful”

5 months ago 46 19 2 3
Preview
Exploring penetrance of clinically relevant variants in over 800,000 humans from the Genome Aggregation Database - Nature Communications Here the authors provide an explanation for 95% of examined predicted loss of function variants found in disease-associated haploinsufficient genes in the Genome Aggregation Database (gnomAD),…

New study of 800K+ genomes from gnomAD reveals most “pathogenic” variants in healthy people aren’t truly disease-tolerant. They are explained by annotation errors, mosaicism, or compensatory variants. 🧬
A big step for precision medicine!
www.nature.com/articles/s41...

5 months ago 23 15 0 1

I haven't found it printed but there is a PDF link here: www.ashg.org/wp-content/u...

6 months ago 1 1 0 0
Advertisement
Preview
Improved allele frequencies in gnomAD through local ancestry inference - Nature Communications This study incorporates local ancestry into the Genome Aggregation Database (gnomAD) to improve allele frequency estimates for admixed populations, enhancing variant interpretation and enabling more accurate and equitable genomic research and clinical care.

📃 We’re excited to share our latest work, now published in Nature Communications — a major update to the Genome Aggregation Database (gnomAD) that improves allele frequency resolution for two gnomAD-defined genetic ancestry groups using local ancestry inference (LAI).

6 months ago 29 8 1 1
Preview
Sperm sequencing reveals extensive positive selection in the male germline - Nature A combination of whole-genome NanoSeq with deep whole-exome and targeted NanoSeq is used to accurately characterize mutation rates and genes under positive selection in sperm cells.

Now published! Our paper on:
(1) Accurate sequencing of sperm at scale
(2) Positive selection of spermatogenesis driver mutations across the exome
(3) Offspring disease risks from male reproductive aging
[1/n]
www.nature.com/articles/s41...

6 months ago 86 50 3 2
Image of an old building in Oxford with the heading 'postdoc opportunities' and the text 'computational approaches to improve rare disease diagnosis and treatment' and 'Big Data Institute, University of Oxford'

Image of an old building in Oxford with the heading 'postdoc opportunities' and the text 'computational approaches to improve rare disease diagnosis and treatment' and 'Big Data Institute, University of Oxford'

📣 We are recruiting! Please share!!

Are you a bioinformatician / computational scientist who wants to apply your skills to understanding regulatory biology and improving rare disease diagnosis and treatment? 🧠 💻 🧬 🩺

We have two roles available 👇

🧵 1/4

8 months ago 43 43 1 3

Isn't genetics cool???

Within only 145 nucleotides(!) of a non-coding RNA (RNU4-2) - different variants in distinct regions / structures cause three distinct disorders!!! (all discovered within the last 18 months)

🤯🤓🧬❤️

8 months ago 62 13 1 2
Post image

🗣️ Quote of #ESHG2025 (so far)

"Who licks bone !?!" 🦴
- Johannes Krause

Anyone have that on your bingo card?

Well apparently archeologists do, to distinguish bone from stones and it causes problems in DNA sequencing. 🤔

10 months ago 28 10 1 0

We are just wrapping up day 1 at #ESGH2025 in beautiful Milan. For those who want some extra fun while listening to the great science, you can play bingo.👇

I know multiple of these have already occurred.

10 months ago 5 0 1 0

Buongiorno Milano! Ready for a great day 1 of #eshg2025?
Packed program of excellent science 8.30am-8.00pm - plus networking event till 9.30pm to meet many friends, colleagues and collaborators! …andiamo @eshg.bsky.social @eshgyoung.bsky.social

10 months ago 15 6 0 0
Preview
Scalable automated reanalysis of genomic data in research and clinical rare disease cohorts Reanalysis of genomic data in rare disease is highly effective in increasing diagnostic yields but remains limited by manual approaches. Automation and optimization for high specificity will be necess...

🤗 Hugely excited to share our work on automating iterative reanalysis in #raredisease, preprint out: www.medrxiv.org/content/10.1... 🤖🧬

github.com/populationge...

A superb collaboration with @dgmacarthur.bsky.social @cassimons.bsky.social @heidirehm.bsky.social @ksamocha.bsky.social and many more!

10 months ago 24 13 1 5
Post image

Human Developmental Cell Atlas (HDCA) expression data is now displayed. Expression is displayed in 12 sections of a 6-7 post-conception week human embryo, alongside a sagittal view which displays the region of the embryo represented by each section @mhaniffa.bsky.social

11 months ago 17 7 0 1
Preview
My son Charlie — and the breakthrough that changed our lives James Coney and his wife, Sarah, struggled not knowing why their 12-year-old was born with a severe learning disability. In their darkest moments, they blamed themselves. Then, out of the blue, came a...

A few weeks ago, I had an incredibly emotional call with James Coney, a writer for the Sunday Times whose son Charlie was in the @genomicsengland.bsky.social 100k project and was recently diagnosed with ReNU syndrome. This beautiful article tells their story ❤️ www.thetimes.com/article/0bcc...

1 year ago 110 40 5 6
Advertisement