André Boler Barros, PhD (@asbarros) Bsky

GitHub - SONGDONGYUAN1994/scDesign3: scDesign3 generates realistic in silico data for multimodal single-cell and spatial omics scDesign3 generates realistic in silico data for multimodal single-cell and spatial omics - SONGDONGYUAN1994/scDesign3

scDesign3 - although this tool doesn't have a web interface, it explores additional scenarios, such as other data modalities or even multimodal experiments
(github.com/SONGDONGYUAN...)

#bioinformatics
#scRNASeq

2 months ago 0 0 0 0

SCOPIT, v1.1.4

SCOPIT - another tool with a web interface, it is focused on providing feedback on your experimental design but, now focusing on identification of cell types (alexdavisscs.shinyapps.io/scs_power_mu...)

2 months ago 0 0 1 0

scPower - Tool with a Web interface, that uses previous datasets to build a framework where you can vary the number of cells, samples - and even include budget value - to assess the impact on DEG results (scpower.helmholtz-muenchen.de/scpower/)

2 months ago 0 0 1 0

When searching for power analysis (or just want to check the ideal number of cells) for scRNASeq studies, here are some resources that may be relevant:

#bioinformatics

2 months ago 0 0 1 0

"We need to create flexible, free paths to hiring (...). Imagine a model where capability, not how many hours students sit in class, is what matters; where demonstrated skills earn them credit and where employers recognize those credits as evidence of readiness to enter an apprenticeship program."

3 months ago 0 0 0 0

Opinion | A 1 Percent Solution to the Looming A.I. Job Apocalypse

"[commit 1 percent of profits for workers re-training] isn’t charity. (...) Helping retrain workers is common sense, and such a small ask that these companies would barely feel it, while the public benefits could be enormous."

www.nytimes.com/2025/12/27/o...

3 months ago 2 0 1 0

Simple yet very interesting challenge! Keep them coming #bioinfo #R #datascience

4 months ago 1 0 0 0

BitsandPieces/PathwayDatabase_Mining at main · andrebolerbarros/BitsandPieces This repository stores codes & notebooks done in response to small & simple challenges - andrebolerbarros/BitsandPieces

By selecting keywords that should be either in the gene set name or description, and using the R package 'msigdbr' (igordot.github.io/msigdbr/), voilá!

github.com/andrebolerba...

This folder contains a .qmd file and respective html report, with an excel file containing the final info dataset.

4 months ago 0 0 1 0

The first challenge came up in a conversation about a project: how can I check gene sets across different databases (e.g., GO terms, KEGG Pathways, ...) involved in specific processes, like iron processing or lipid metabolism?

4 months ago 0 0 1 0

GitHub - andrebolerbarros/BitsandPieces: This repository stores codes & notebooks done in response to small & simple challenges This repository stores codes & notebooks done in response to small & simple challenges - andrebolerbarros/BitsandPieces

Today, I inaugurate a new Github repository: github.com/andrebolerba...

I created this to store code that, although short and simple, may be useful for addressing specific needs and challenges.

4 months ago 0 0 1 0

Congratulations Maria! Really interested to see you defend and to watch your future career in science!!

5 months ago 0 0 0 0

A special thank you to the PhD community, the facilities, and the community at IGC, now @gimmfoundation.bsky.social , for providing the right context for this thesis to see the light of day.

5 months ago 2 0 0 0

Thank you to everyone who supported me through this journey, from my supervisor @lferreiramoita.bsky.social , to my lab members - especially Henrique Colaço, @katiajesus.bsky.social and Elsa Seixas and, to the jury members - Vitor Cabral, Sarela García Santamarina, Nelson Frazao and Vera Martins.

5 months ago 2 0 1 0

I am very proud to announce that yesterday I have successfully defended my PhD.

It was a long journey, filled with difficulties but with a dose of rewards - scientific, professional but most of all, personal. But, it definitely made me a better scientist, and a better person.

5 months ago 3 1 1 0

‘Am I redundant?’: how AI changed my career in bioinformatics A run-in with some artefact-laden AI-generated analyses convinced Lei Zhu that machine learning wasn’t making his role irrelevant, but more important than ever.

"(...) the rise of generative AI in bioinformatics has not diminished my role, but redefined it. It has challenged me to become a better scientist. For good or ill, AI seems to be here to stay. I urge you to embrace the technology — not to replace your expertise, but to amplify it.

#bioinfo

5 months ago 0 0 0 0

Can't advise this lab more. If you'd like to work in a curious-driven and nurturing environment, with a high focus on robust data analysis, don't even think twice!

#bioinfo #datascience

6 months ago 2 0 0 0

Last week, I was fortunate enough to watch a talk from @tkorem.bsky.social , where he presented different things, from addressing inter- study variability on microbiome projects to the use of novel approaches on metagenomics alignment and processing. Interesting and very relevant!

#bioinfo

6 months ago 3 0 0 0

Glucostats: an efficient Python library for glucose time series feature extraction and visual analysis - BMC Bioinformatics Background The advancement of technology and continuous glucose monitoring (CGM) systems has introduced several computational and technical challenges for clinicians and researchers. The growing volume of CGM data necessitates the development of efficient computational tools capable of handling and processing this information effectively. This paper introduces GlucoStats, an open-source and multi-processing Python library designed for efficient computation and visualization of a comprehensive set of glucose metrics derived from CGM. It simplifies the traditionally time-consuming and error-prone process of manual CGM metrics calculation, making it a valuable tool for both clinical and research applications. Results Its modular design ensures easy integration into predefined workflows, while its user-friendly interface and extensive documentation make it accessible to a broad audience, including clinicians and researchers. GlucoStats offers several key features: (i) window-based time series analysis, enabling time series division into smaller ‘windows’ for detailed temporal analysis, particularly beneficial for CGM data; (ii) advanced visualization tools, providing intuitive, high-quality visualizations that facilitate pattern recognition, trend analysis, and anomaly detection in CGM data; (iii) parallelization, leveraging parallel computing to efficiently handle large CGM datasets by distributing computations across multiple processors; and (iv) scikit-learn compatibility, adhering to the standardized interface of scikit-learn to allow an easy integration into machine learning pipelines for end-to-end analysis. Conclusions GlucoStats demonstrates high efficiency in processing large-scale medical datasets in minimal time. Its modular design enables easy customization and extension, making it adaptable to diverse research and clinical needs. By offering precise CGM data analysis and user-friendly visualization tools, it serves both technical researchers and non-technical users, such as physicians and patients, with practical and research-driven applications.

"GlucoStats demonstrates high efficiency in processing large-scale medical datasets in minimal time. Its modular design enables easy customization and extension, making it adaptable to diverse research and clinical needs"

bmcbioinformatics.biomedcentral.com/articles/10....

#datascience #biostats

6 months ago 1 0 0 0

Learning the natural history of human disease with generative transformers - Nature Delphi-2M forecasts a person’s future health, covering more than 1,000 diseases, provides insights into co-morbidity dynamics and generates synthetic data for the training of AI models that have never...

"Delphi-2M predicts the rates of more than 1,000 diseases (...), with accuracy comparable to that of existing single-disease models. Delphi-2M (...) also enables sampling of synthetic future health trajectories"

www.nature.com/articles/s41...

6 months ago 1 0 0 0

Amazing repository with several references and resources for scRNASeq analysis

github.com/crazyhottomm...

#bioinfo #singlecell

7 months ago 0 0 0 0

Push-button science - Nature Methods Technological advances change not only what we can learn as scientists, but also how science is conducted. Here we explore how automation and outsourcing are affecting the act of doing science.

"Even if work is done by or with the help of experts (...), it is crucial that researchers understand how a method works, that they can assess data quality, and that they fundamentally understand what types of conclusions can and cannot be drawn from their data"

www.nature.com/articles/s41...

7 months ago 0 0 0 0

Six questions to ask before jumping into a spreadsheet Spreadsheet software can be frustrating, but adopting some helpful habits can improve its effectiveness.

Spreadsheets represent an everyday tool for most wet-lab scientists. So, why not use them at their highest potential, efficiently and ready for open science?

This paper provides some recommendations for the use of spreadsheets:

www.nature.com/articles/d41...

#bioinfo #stats

8 months ago 2 0 0 0

Harvard University lays off fly database team The layoffs jeopardize this resource, which has served more than 4,000 labs for about three decades.

FlyBase, a Drosophila database, will lose a third of its team in early October because the Harvard grant that covered the employees’ salaries was canceled. Scientists warn that losing FlyBase could devastate fly research.

By @claudia-lopez.bsky.social

www.thetransmitter.org/community/ha...

8 months ago 123 129 3 12

A survey of sequence-to-graph mapping algorithms in the pangenome era - Genome Biology A pangenome can reveal the genetic diversity across different individuals simultaneously. It offers a more comprehensive reference for genome analysis compared to a single linear genome that may intro...

5/n

The following paper provides an excellent overview of sequence-to-graph mapping algorithms—covering their principles, trade-offs, and performance benchmarks. It’s a great starting point if you’re interested on the topic

genomebiology.biomedcentral.com/articles/10....

#bioinformatics #bioinfo

8 months ago 1 0 0 0

4/n

From equitable precision medicine to large-scale population studies and evolutionary research, pangenomics graphs and graph mapping excel in capturing diversity—especially in highly variable or repetitive genome regions where linear mapping fails.

8 months ago 0 0 1 0

3/n

Unlike linear references, which favour the “reference” allele (known as reference bias), the integration of multiple haplotypes in pangenomic graphs and respective mapping reduces bias, improves alignment quality, and makes analyses fairer across diverse populations.

8 months ago 1 0 1 0

2/n
Pangenome graphs represent variants as alternative paths, so reads can follow the path matching their true sequence.
These graphs can represent SNPs, indels, structural variants, and complex rearrangements, all in one model—boosting variant calling accuracy for common and rare alleles.

8 months ago 0 0 1 0

1/n

Because, in bioinformatics, sharing is caring, let me share something I have recently started exploring - graph mapping and pangenome graphs.

A pangenome graph encodes a reference genome built from many genomes in one structure, thus trying to encapsulate the known genetic variability.

8 months ago 1 0 1 0

12/12

"It is an ethical imperative that research be designed and analysed to avoid wasting investment of animals, research dollars and effort".

#biostatistics #stats

9 months ago 1 0 0 0

11/n

Ultimately, understanding the C&N concepts, design experiments, and analytical approaches accounting for that are fundamental for rigorous and reproducible preclinical research.

9 months ago 1 0 1 0

Posts by André Boler Barros, PhD