Advertisement · 728 × 90

Posts by Elizabeth Atkinson

Preview
Tractor Workflow: A Scalable Nextflow Framework for Local Ancestry-Aware Genome-Wide Association Studies AbstractMotivation. The routine exclusion of admixed individuals from traditional Genome-Wide Association Studies (GWAS) due to concerns about spurious ass

@nirav-shah.bsky.social's Tractor Workflow paper is out early access in Bioinformatics today! Check out his thread quoted here for a full bluetorial on the contents, and see the link below for the final version: academic.oup.com/bioinformati...

1 month ago 1 0 0 0

Thanks to all of our SMaHT colleagues and especially to @sedlazeck.bsky.social who led the hackathon which spawned the prototype of this pipeline!

4 months ago 0 0 0 0

MosaicSim offers a realistic, scalable approach for assessing detection limits, with immediate applications to large sequencing efforts including those within the SMaHT Network, which was the springboard for this work.

4 months ago 0 0 1 0

A key (surprising) result was that ultra-high coverage (300×–450×) yields diminishing returns for mosaic variant detection. In many settings, 150× coverage performs comparably or better, highlighting opportunities for cost-effective study design.

4 months ago 2 0 1 0

Using MosaicSim, we benchmarked DRAGEN and found strong VAF- and depth-dependent performance limits. Sensitivity decreases sharply at low VAF, especially in complex genomic regions.

4 months ago 0 0 1 0

Detecting mosaic variants is challenging due to low VAFs and real sequencing noise. MosaicSim layers user-defined variants directly onto empirical WGS data, preserving true read-level properties while providing a controlled ground-truth set for benchmarking.

4 months ago 0 0 1 0
Preview
MosaicSim: A Novel Mosaic Variant Simulator Reveals Diminishing Returns of Ultra-High Coverage for Mosaic Variant Detection Genetic mutations within select cells of a tissue, termed mosaic variants (MV), are being increasingly recognized for their role in human disease. This growing interest underscores the need for specia...

We are pleased to share our new preprint introducing MosaicSim, a framework for generating realistic mosaic variants! Mosaic variants - mutations present in only a subset of cells - are crucial for development, disease, and cancer, but are notoriously hard to call.
www.biorxiv.org/content/10.6...

4 months ago 3 0 1 1
Post image

A fun lab outing to the zoo ahead of conference season! 🦒

6 months ago 2 0 0 0

So since we only include >0.1% MAF variants in this article we can't address ultrarare, but check out Supp Fig 3; when comparing ancestry-specific AFs many variants deviate from the 1:1 line. We plotted this on the log₁₀(AF) scale to help magnify the low-frequency range.

6 months ago 0 0 0 0

To limit the noise from ultra-rare alleles we only looked at variants ≥0.1% MAF. Totally appreciate that's still quite low frequency, but even with that filter, we still saw the noted ancestry-specific frequency differences.

6 months ago 0 0 0 0
Advertisement

Great point; we thought about that too! Pragati stratified by whether variants were monomorphic or not to capture at least that aspect, but you’re right that the impact depends on where a variant sits on the SFS. Rare ones can show big fold-changes but small absolute shifts.

6 months ago 0 0 0 0
Post image

Texas Children's/Baylor College of Medicine Researchers Create Groundbreaking Tool to Improve Accuracy of #GeneticTesting @egatkinson.bsky.social @bcmgenetics.bsky.social @bcmhouston.bsky.social #TCHResearchNews #TexasChildrens @natcomms.nature.com tinyurl.com/jj6kyrrv

6 months ago 6 1 0 0

Thrilled to share our new @natcomms.nature.com paper on local ancestry informed allele frequencies in gnomAD, which are live now on the browser! Check out my stellar PhD student @pragskore.bsky.social’s Bluetorial on how this brings finer detail to variant interpretation 🧬🖥️

6 months ago 14 4 1 0
Preview
Pan-UK Biobank genome-wide association analyses enhance discovery and resolution of ancestry-enriched effects - Nature Genetics Genome-wide analyses for 7,266 traits leveraging data from several genetic ancestry groups in UK Biobank identify new associations and enhance resources for interpreting risk variants across diverse p...

A project many years in the process, we’re pleased to present our work on multi-ancestry meta-analysis across a boatload of traits in the UK Biobank: www.nature.com/articles/s41...

7 months ago 65 26 1 0

Delighted to amplify my talented PhD student’s work! Check it out for a great way to streamline and harmonize Tractor analyses.

7 months ago 6 2 0 0

Thanks for the interest! The tutorial code is available to download as supplemental information of the paper, and has been deposited as a community workspace in the All of Us Researcher Workbench.

8 months ago 2 0 0 0

In summary, we present a replicable training model that empowers early-career researchers - including and especially those new to computational genomics - to responsibly leverage large-scale biobank data into their research programs and teaching.

8 months ago 0 0 1 0

From years 1–3, training outcomes reported by scholars to stem directly from this training included:
📊 17 conference presentations
🔬 Multiple funded research grants
🎓 Numerous genomics modules added in undergrad courses
🤝 Sustained collaborations across institutions

8 months ago 0 0 1 0

During the summit, scholars used real short-read WGS data to:
• Prepare phenotypes & covariates
• Run GWAS via Hail
• Visualize results with PCA, Manhattan & QQ plots
• Manage compute costs
All in ~4 hours with no prior coding required.

8 months ago 0 0 1 0

Our training was part of the All of Us Biomedical Researcher Scholars Program through @bcmgenetics.bsky.social focused on mentoring early-stage faculty in genomic data science. The curriculum launches with an intensive Faculty Summit, where scholars get hands-on experience working with genomic data.

8 months ago 0 0 1 0
Advertisement

Access to big genomic data is growing, but parallel access to skills needed to use it hasn’t kept up.
We created an accessible, cloud-based genomic analysis training bootcamp using real All of Us data, Jupyter notebooks, and the Hail framework to lower the barrier for early-career researchers.

8 months ago 0 0 1 0

🚨 New perspective piece in @ajhgnews.bsky.social! 🚨
We developed a hands-on training resource for large-scale genomic data analysis in the All of Us Researcher Workbench, now published here:

8 months ago 13 8 1 0

Tractor-Mix builds on Tractor’s strengths to detect ancestry-enriched signals while adding power and robust false-positive control for relatedness via a GRM. By modeling both admixture and relatedness, it overcomes key GWAS barriers and enables more accurate, representative genomic discovery.

10 months ago 2 0 0 0

Tractor-Mix uses ancestry-specific genotypes as predictors, outputting ancestry-specific effect sizes and P values. We benchmark our new tool in simulations and apply it to multiple admixed cohorts (including UKBiobank and Mexico City Prospective Study), uncovering signals missed by standard GWAS.

10 months ago 2 0 1 0

In this work, we introduce Tractor-Mix, a new GWAS method that extends Tractor to handle related admixed samples. It combines a mixed model framework (like GMMAT) with local ancestry-aware genotypes (like Tractor) in a 2 d.o.f. test.

10 months ago 2 0 1 0

As biobanks and global cohorts grow, so does the inclusion of admixed individuals with close or cryptic relatedness. This introduces the statistical challenge of two interwoven sources of stratification: admixture and relatedness, which are rarely handled together.

10 months ago 2 0 1 0

We previously developed Tractor, a local ancestry-aware GWAS method that’s been widely used to uncover ancestry-enriched signals and refine genetic architecture in admixed populations. But Tractor (being a GLM) only works on unrelated samples, limiting its use in many real-world datasets.

10 months ago 2 0 1 0
Post image

We're excited to introduce Tractor-Mix, our new method for GWAS in admixed cohorts with relatedness, led by the fantastic @doubletaotan.bsky.social! Read the full preprint here: www.medrxiv.org/content/10.1...
Thanks to all our amazing collaborators who helped make this work possible!

10 months ago 12 3 1 0
Human Genetics | Genomic Scientist Fellows | UCLA Medical School The Emerging Genomic Scientist Fellows Program is a cornerstone of justice, equity, diversity, and inclusion initiatives in the Department of Human Genetics.

Check out my stellar PhD student, Pragati's talk on our work generating local ancestry informed frequency estimates in gnomAD as part of the prestigious Emerging Genomic Scientist Symposium next week! Congrats on being selected for this amazing event!

1 year ago 5 1 0 0
Advertisement
Post image

I'm delighted to be part of this symposium, put on by University of Pennsylvania Perelman School of Medicine, and led by @bpasaniuc.bsky.social and @sarahtishkoff.bsky.social. See you in a few weeks! upenn.co1.qualtrics.com/jfe/form/SV_...

1 year ago 8 4 0 2