Advertisement ยท 728 ร— 90

Posts by LIBD rstats club

Preview
LIBD rstats club Continuous rstats learning

@libdrstats.bsky.social videos and blog posts continue to be a great resource for me as I delve into snRNAseq analyses, and for lots of other bioinformatics-related topics! Learn more here: research.libd.org/rstatsclub/

3 weeks ago 4 3 0 0
[2026-02-27] Strategy to drop doublets during snRNAseq QC steps
[2026-02-27] Strategy to drop doublets during snRNAseq QC steps YouTube video by Leonardo Collado Torres

Check out my first @libdrstats.bsky.social presentation on one strategy our team @lcolladotor.bsky.social at @lieberinstitute.bsky.social is taking to deal with doublets during quality control for single nucleus RNA-sequencing data:
www.youtube.com/watch?v=SMA-...

3 weeks ago 6 4 1 0
Video

Talk submission video for Posit Conf 2026 #RStats

ยซOrganizing your data science team to be efficient, adaptive to new developments, and resilient to turnover rateยป

URLs:
1. lcolladotor.github.io/bioc_team_ds/
2. research.libd.org/rstatsclub/
3. github.com/LieberInstit...

#OpenSource #Bioconductor

1 month ago 10 4 1 0
Preview
RDM Weekly - Issue 031 A weekly roundup of Research Data Management resources.

Issue 31 of RDM Weekly is out! ๐Ÿ“ฌ

โžก๏ธ OS resources in Comm Sciences @csdisseminate.bsky.social
โžก๏ธ RDA 25th Plenary Meeting Programme @researchdataall.bsky.social
โžก๏ธ Ask for R Help by Creating a Small Reproducible Example @libdrstats.bsky.social
and more!

rdmweekly.substack.com/p/rdm-weekly...

2 months ago 13 6 0 0
@lieberinstitute.bsky.social @jhubiostat.bsky.social

@lieberinstitute.bsky.social @jhubiostat.bsky.social

lcolladotor.github.io/rnaseq_LCG-U... is looking ready for next week!

We'll have the most guest presenters ever this year with @meli-mq99.bsky.social @gabrielramirezv.bsky.social @mariagtac.bsky.social & @daiannaglez.bsky.social ๐Ÿ™Œ๐Ÿฝ

See you soon at LCG-UNAM! ๐Ÿ‡ฒ๐Ÿ‡ฝ

#RStats @bioconductor.bsky.social

2 months ago 8 3 0 1

In a recent journal club, I covered the SOFA paper by @wkhuber.bsky.social and colleagues. It's especially cool how you can guide the factorization approach with known covariates.

Check out their preprint (doi.org/10.1101/2024...) and our discussion (youtu.be/WKs-g4lXzZs)!

3 months ago 7 3 0 0

A colleague Geo @geopertea.bsky.social found out a cool trick to get Positron working with full support at our computing cluster JHPCE (remote file explorer, Positron assistant seeing the remote R session). Game changer for my work at LIBD! Check out his demo video (youtu.be/CnmaJG8VPp8) #positron

2 months ago 7 3 0 0
Advertisement

noted that DuckDB won't perform worse than dplyr at any point โœ…, and it's pretty easy to use ๐Ÿ‘, so he plans to keep using duckplyr more often ๐Ÿš€.

He wrapped up by saying the benchmarks were interesting ๐Ÿ“ˆ, and despite the mixed results, he still expects to be using duckplyr a lot going forward! ๐Ÿ’ช๐Ÿ˜Š

2 months ago 0 0 0 0

9 hours โฐ๐Ÿ˜ซ, and he was trying to optimize it.

Surprisingly, this particular use case didn't benefit much from DuckDB ๐Ÿฆ†, leading to his main takeaway: performance is highly dependent on what you're doing ๐ŸŽฏ. However, he

2 months ago 0 0 1 0
Video

@nick-eagles.bsky.social concluded that scratch space location matters significantly for performance ๐Ÿ’พโšก. He explained his motivation: he had a real use case with hundreds of millions of rows ๐Ÿ“Š that originally took

Full length video: youtu.be/ikumz_QLZiE
#RStats #dplyr #duckplyr #rpolars #data.table

2 months ago 1 1 1 0

He noted that most benchmarks focus on speed โšก, but emphasized that memory efficiency ๐Ÿง  is another important variable to consider - particularly highlighting that data.table tends to be more memory efficient than dplyr ๐Ÿ’พโœจ.

#RStats #dplyr #duckplyr #rpolars #data.table

2 months ago 0 0 0 0
Video

@nick-eagles.bsky.social introduced his presentation on benchmarking #dplyr alternatives ๐Ÿ“Š๐Ÿ’ป. He planned to review public benchmarks of various data manipulation packages ๐Ÿ“ฆ as well as his own custom benchmark results ๐Ÿ”ฌ

Full length ๐ŸŽฅ: youtu.be/ikumz_QLZiE
#RStats #dplyr #duckplyr #rpolars #data.table

2 months ago 1 1 1 0

actually be more rigorous ๐Ÿ”โœ…, as he was trying to argue with the AI model ๐Ÿค–๐Ÿ’ฌ.

He wrapped up by saying "that's it" ๐ŸŽฌ, and then @nick-eagles.bsky.social enthusiastically responded that this is super useful ๐ŸŒŸ and they're probably going to switch over to using Positron today! ๐Ÿš€๐Ÿ˜„

#RStats @posit.co

2 months ago 0 0 0 0
Video

@geopertea.bsky.social discussed how the AI provides semantic understanding of code ๐Ÿง ๐Ÿ’ป, showing how objects are generated from different parts of the codebase ๐Ÿ”—. He acknowledged that you can do this manually too ๐Ÿ‘จโ€๐Ÿ’ป, and sometimes manual inspection might ...

#RStats #PositronIDE @posit.co

2 months ago 1 1 1 0

He pointed out - why not use AI to do that for you instead? ๐Ÿค–๐Ÿ’ก The AI quickly gave him the context of where it happens, functioning like a smart search tool โšก๐ŸŽฏ!

#RStats

2 months ago 0 0 0 0

He described another helpful scenario: tracing how a "log counts assay" object was generated ๐Ÿ”ฌ. Normally, you'd search for the filename ๐Ÿ”Ž, then have to manually eyeball the context ๐Ÿ‘€ to see if that object was being read or written in a particular file ๐Ÿ“„.

#RStats

2 months ago 0 0 1 0
Advertisement
Video

@geopertea.bsky.social showed his chat history ๐Ÿ“œ and explained a useful use case: he asked the AI to look into a repository ๐Ÿ” and tell him what methods they were using for calculations ๐Ÿงฎ. Basically, he had the AI read the code and summarize it for him ๐Ÿ“โœจ.

#RStats #Positron #PositronAI

2 months ago 1 1 1 0

Once everyone agrees on the plan โœ…, the idea is to switch to agent mode ๐Ÿ”„ so it has access to actually modify the code according to that plan ๐Ÿ“โœจ. That's the intended workflow! ๐ŸŽฏ

#RStats #Positron

2 months ago 0 0 0 0

He explained the workflow: when you start modifying files or code ๐Ÿ’ป, it should switch to a different agent - either a specific agent or the main agent โšก. The planning agent is good for creating plans through discussion ๐Ÿ’ฌ between the LLM in the chat and the user ๐Ÿ‘ฅ.

#RStats #Positron

2 months ago 0 0 1 0
Video

@geopertea.bsky.social explained how you can add your own custom agents ๐Ÿค– to #Positron & showed the interactive way to configure them ๐Ÿ› ๏ธ. He demonstrated how the "plan" agent is defined, noting that it doesn't use any file mod tools ๐Ÿ“โŒ - it's specifically designed for planning purposes ๐Ÿ“‹

#RStats

2 months ago 1 1 1 0

authentication code ๐Ÿ”‘ which he pasted into the authorization page ๐ŸŒ. He had to authorize the application to access his GitHub account โœ…, going through two-factor authentication on his phone ๐Ÿ“ฑ. Once approved, he returned to Positron which confirmed the GitHub account was successfully connected! ๐ŸŽ‰โœจ

2 months ago 0 0 0 0
Video

@geopertea.bsky.social demonstrated setting up #Positron IDE ๐Ÿ’ป by first detecting local interpreters and updating extensions ๐Ÿ”ง. He showed how to login to @github.com to activate GitHub Copilot connectivity ๐Ÿค– for the Positron assistant. During the new installation, the app provides an...

#RStats

2 months ago 1 1 1 0
Video

I explained why including context is crucial ๐Ÿ“š! I pointed out that someone could just paste code like "GRanges from 1 to 1000 plus..." but then I would be confused ๐Ÿ˜• asking "where does this function come from? What package is this from?" ๐Ÿ“ฆโ“

@jennybryan.bsky.social #RStats #reprex

2 months ago 4 2 1 0
Video

I demonstrated running a #reprex in a fresh #RStats session ๐Ÿ’ป & opened it in my browser ๐ŸŒ. It showed the ๐Ÿ“ฆ loading with all its messages, then the function itself. I pointed out that I had made a syntax error: I missed the colon before the plus sign โž•โŒ, which caused an error

@jennybryan.bsky.social

2 months ago 4 2 1 0
Advertisement
Video

I explained how the reprex workflow helps with collaboration ๐Ÿค! I described a scenario where I ask a question about why the mean is 2.5 and not 3, and Nina wants to help by running the same code ๐Ÿ’ป. She can easily copy-paste the #reprex output and it actually runs โœ…!

@jennybryan.bsky.social #RStats

2 months ago 3 2 1 0
Video

I demonstrated how to use the #reprex package ๐Ÿ“ฆ! I loaded it and explained the workflow: I copied lines of code from my console (pretending Leo has a question โ“), then ran the reprex() function without any arguments ๐Ÿ’ป.

@jennybryan.bsky.social #RStats

2 months ago 3 2 1 0
Video

I explained how a common problem makes it harder for people to get help ๐Ÿ˜ค. People often think they've provided everything needed - posting their code online ๐Ÿ’ป, showing how to recreate objects, sharing all the code they ran, and even including the output ๐Ÿ“Š.

@jennybryan.bsky.social #RStats #reprex

2 months ago 3 2 1 0
Video

I explained the common problem people encounter when asking for coding help ๐Ÿค”. I showed how someone might copy code they ran, paste it, ask a question. I was setting up to explain what's wrong with this approach โŒ of sharing code for troubleshooting purposes๐Ÿ’ป #RStats #reprex @jennybryan.bsky.social

2 months ago 3 2 0 0
[2020-04-03] How to ask for R help by creating a small reproducible example (edited on 2026-01-27)
[2020-04-03] How to ask for R help by creating a small reproducible example (edited on 2026-01-27) YouTube video by Leonardo Collado Torres

I re-edited this 2020 video using Adobe Premier Pro's features for removing filler words, pauses, and text-based editing. In total it went down from 20:53 min to 12:41. I hope that you get to (re-)enjoy it ^_^.

@jennybryan.bsky.social

#RStats #reprex #reproducibility #JHPCE

youtu.be/LfUFxHFhXTE

2 months ago 4 3 0 0

Lots has evolved since these R Stats sessions @libdrstats.bsky.social, but we touch on the hot topic of integrating LLMs into data science, mostly with Github Copilot, in a couple sessions--

Introductory look: youtu.be/xuCQZ5_5KB0
Reusable prompts: youtu.be/mlMl81xRI_M

#RStats #LLMs

3 months ago 9 3 0 0