scDesign3 - although this tool doesn't have a web interface, it explores additional scenarios, such as other data modalities or even multimodal experiments
(github.com/SONGDONGYUAN...)
#bioinformatics
#scRNASeq
Posts by André Boler Barros, PhD
SCOPIT - another tool with a web interface, it is focused on providing feedback on your experimental design but, now focusing on identification of cell types (alexdavisscs.shinyapps.io/scs_power_mu...)
scPower - Tool with a Web interface, that uses previous datasets to build a framework where you can vary the number of cells, samples - and even include budget value - to assess the impact on DEG results (scpower.helmholtz-muenchen.de/scpower/)
When searching for power analysis (or just want to check the ideal number of cells) for scRNASeq studies, here are some resources that may be relevant:
#bioinformatics
"We need to create flexible, free paths to hiring (...). Imagine a model where capability, not how many hours students sit in class, is what matters; where demonstrated skills earn them credit and where employers recognize those credits as evidence of readiness to enter an apprenticeship program."
"[commit 1 percent of profits for workers re-training] isn’t charity. (...) Helping retrain workers is common sense, and such a small ask that these companies would barely feel it, while the public benefits could be enormous."
www.nytimes.com/2025/12/27/o...
Simple yet very interesting challenge! Keep them coming #bioinfo #R #datascience
By selecting keywords that should be either in the gene set name or description, and using the R package 'msigdbr' (igordot.github.io/msigdbr/), voilá!
github.com/andrebolerba...
This folder contains a .qmd file and respective html report, with an excel file containing the final info dataset.
The first challenge came up in a conversation about a project: how can I check gene sets across different databases (e.g., GO terms, KEGG Pathways, ...) involved in specific processes, like iron processing or lipid metabolism?
Today, I inaugurate a new Github repository: github.com/andrebolerba...
I created this to store code that, although short and simple, may be useful for addressing specific needs and challenges.
Congratulations Maria! Really interested to see you defend and to watch your future career in science!!
A special thank you to the PhD community, the facilities, and the community at IGC, now @gimmfoundation.bsky.social , for providing the right context for this thesis to see the light of day.
Thank you to everyone who supported me through this journey, from my supervisor @lferreiramoita.bsky.social , to my lab members - especially Henrique Colaço, @katiajesus.bsky.social and Elsa Seixas and, to the jury members - Vitor Cabral, Sarela García Santamarina, Nelson Frazao and Vera Martins.
I am very proud to announce that yesterday I have successfully defended my PhD.
It was a long journey, filled with difficulties but with a dose of rewards - scientific, professional but most of all, personal. But, it definitely made me a better scientist, and a better person.
"(...) the rise of generative AI in bioinformatics has not diminished my role, but redefined it. It has challenged me to become a better scientist. For good or ill, AI seems to be here to stay. I urge you to embrace the technology — not to replace your expertise, but to amplify it.
#bioinfo
Can't advise this lab more. If you'd like to work in a curious-driven and nurturing environment, with a high focus on robust data analysis, don't even think twice!
#bioinfo #datascience
Last week, I was fortunate enough to watch a talk from @tkorem.bsky.social , where he presented different things, from addressing inter- study variability on microbiome projects to the use of novel approaches on metagenomics alignment and processing. Interesting and very relevant!
#bioinfo
"GlucoStats demonstrates high efficiency in processing large-scale medical datasets in minimal time. Its modular design enables easy customization and extension, making it adaptable to diverse research and clinical needs"
bmcbioinformatics.biomedcentral.com/articles/10....
#datascience #biostats
"Delphi-2M predicts the rates of more than 1,000 diseases (...), with accuracy comparable to that of existing single-disease models. Delphi-2M (...) also enables sampling of synthetic future health trajectories"
www.nature.com/articles/s41...
Amazing repository with several references and resources for scRNASeq analysis
github.com/crazyhottomm...
#bioinfo #singlecell
"Even if work is done by or with the help of experts (...), it is crucial that researchers understand how a method works, that they can assess data quality, and that they fundamentally understand what types of conclusions can and cannot be drawn from their data"
www.nature.com/articles/s41...
Spreadsheets represent an everyday tool for most wet-lab scientists. So, why not use them at their highest potential, efficiently and ready for open science?
This paper provides some recommendations for the use of spreadsheets:
www.nature.com/articles/d41...
#bioinfo #stats
FlyBase, a Drosophila database, will lose a third of its team in early October because the Harvard grant that covered the employees’ salaries was canceled. Scientists warn that losing FlyBase could devastate fly research.
By @claudia-lopez.bsky.social
www.thetransmitter.org/community/ha...
5/n
The following paper provides an excellent overview of sequence-to-graph mapping algorithms—covering their principles, trade-offs, and performance benchmarks. It’s a great starting point if you’re interested on the topic
genomebiology.biomedcentral.com/articles/10....
#bioinformatics #bioinfo
4/n
From equitable precision medicine to large-scale population studies and evolutionary research, pangenomics graphs and graph mapping excel in capturing diversity—especially in highly variable or repetitive genome regions where linear mapping fails.
3/n
Unlike linear references, which favour the “reference” allele (known as reference bias), the integration of multiple haplotypes in pangenomic graphs and respective mapping reduces bias, improves alignment quality, and makes analyses fairer across diverse populations.
2/n
Pangenome graphs represent variants as alternative paths, so reads can follow the path matching their true sequence.
These graphs can represent SNPs, indels, structural variants, and complex rearrangements, all in one model—boosting variant calling accuracy for common and rare alleles.
1/n
Because, in bioinformatics, sharing is caring, let me share something I have recently started exploring - graph mapping and pangenome graphs.
A pangenome graph encodes a reference genome built from many genomes in one structure, thus trying to encapsulate the known genetic variability.
12/12
"It is an ethical imperative that research be designed and analysed to avoid wasting investment of animals, research dollars and effort".
#biostatistics #stats
11/n
Ultimately, understanding the C&N concepts, design experiments, and analytical approaches accounting for that are fundamental for rigorous and reproducible preclinical research.