Posts by Andrew Roger
This will be really interesting!
New OpenFold3 preview out! (OF3p2)
It closes the gap to AlphaFold3 for most modalities.
Most critically, we're releasing everything, including training sets & configs, making OF3p2 the only current AF3-based model that is functionally trainable & reproducible from scratch🧵1/9
Never mind... in NCBI it says ~105 Mb in 1,400 scaffolds.
What is the Trichoplax genome size estimate?
New preprint out! Using ~75k environmental OTUs + 77 fossil calibrations, we reconstructed a Proterozoic timeline of eukaryote evolution. Our results show crown eukaryotes were already diversifying >1.6 Ga, long before the first undisputed fossils (~1.05 Ga).
🔗 DOI: www.biorxiv.org/content/10.6...
illustration of various aspects of the biology of the microbe discussed in the article
New #ISEPpapers! The genome sequence of the gastric gland parasite, #Cryptosporidium proliferans: Monika Wiśniewska et al. www.sciencedirect.com/science/arti...
#Protists #Parasites #Microbes #Genomics
a simplified evolutionary tree of eukaryotes with pictures of various microbes
New #ISEPpapers #preprint by @deemteam.bsky.social: Re-evaluating the eukaryotic Tree of Life with independent phylogenomic data www.biorxiv.org/content/10.6...
#Protists #Microbes #Evolution #Eukaryotes #TreeOfLife #Phylogeny #Phylogenomics #Bioinformatics #Algae
We stumbled upon an unusual gene called 'rqua' in the genome of some freshwater sponges. In microbes, this gene confers the ability to make a ubiquinone analog - rhodoquinone - which can help the respiratory chain run without oxygen.
From Pisani et al: CAT-GTR is one of the most flexible models in the phylogenomic arsenal.
Phylogenomic mixture models outperform homogeneous and partitioned models
academic.oup.com/mbe/advance-...
I thought maybe @nicolasgaltier.bsky.social was asking about whether Canadian French accent when speaking French has similarities to Canadian English accent when speaking English? Is that right Nicolas?
My guess is "no", but I'm not an expert on the various French accents in Canada (Québécois, Acadien, Franco-Albertan etc.). @christianlandry.bsky.social what are your thoughts on this?
Yes!
Welcome. The Canadian phonetics explanation was an unnecessary addition.
Differently. "Bias" -> "Buy Ass". "Bayes" -> "Baes". For Canadians "Bayes" = "B-ehs".
Very useful article.
MBE Call for Papers on the Major Transitions of Life
MBE is excited to launch our newest Call for Papers on the Major Transitions of Life, covering all aspects of phylogenomic research.
🔗 academic.oup.com/mbe/pages/call-for-papers-on-the-major-transitions-of-life
Guest Editors:
Davide Pisani
Anja Spang
#evobio #molbio #phylogenetics
This looks interesting!
Anecdotally, I remember explaining parsimony methods to a statistician colleague in the early 2000s...and they just looked bewildered: parsimony didn't seem like a sensible statistical approach to estimation, so they wondered why on Earth someone would propose it.
I suppose all of what I said above depends on what might be considered a "relevant difference". I've heard the cladist arguments about this ad nauseam...
I'd guess that statisticians would not see them as guilty until proven innocent in the phylogenetic setting. Rather they'd think that, unless there is a relevant difference between the phylogenetic setting and more classical statistical problems, these methods should perform well 'off the shelf'.
For example, I think that there would be an a priori expectation that, based on their properties in more classical statistical settings, both maximum likelihood and Bayesian approaches to phylogenetic estimation would perform well if the models adequately described real data.
Although I get the context of this quote (and agree with it in that context), if we just set aside the whole cladistics/phenetics/probabilistic debates in the early days, I'm not sure we'd think this way about modern phylogenetic approaches.
If you want to know more about the connection between AIC and cross-validation, check the following paper out. Warning though -- some of the math is a bit tough going (for me too, even though I'm a co-author): academic.oup.com/mbe/article/...
AIC is an approximation to the expected predictive log-likelihood and is therefore getting at the same thing as cross-validation. In practice, if a model has the best AIC and the best BIC out of competing models (including simpler ones), then usually you don't have to worry about overfitting.
In practice if you are still worried about overfitting, you can always try splitting datasets into training and test sets to see how well your model fitted on the training data predicts the test data. Cross validation approaches are really helpful in this regard.
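To make the AIC-vs-predictive-fit connection concrete, here is a minimal numpy sketch on hypothetical toy data (a quadratic trend with Gaussian noise; all names and settings are made up for illustration, not from any real analysis). It fits polynomials of increasing degree to a training set, then compares each model's AIC with its log-likelihood on a held-out test set: the badly underfitting degree-1 model loses on both criteria, illustrating that AIC and held-out predictive fit are getting at the same thing.

```python
import numpy as np

rng = np.random.default_rng(42)

# hypothetical toy data: a quadratic trend with Gaussian noise
def simulate(n):
    x = rng.uniform(-3, 3, n)
    y = 1 + 2 * x - 1.5 * x**2 + rng.normal(0, 1, n)
    return x, y

x_tr, y_tr = simulate(40)    # training set
x_te, y_te = simulate(200)   # held-out test set

def gauss_loglik(resid, sigma2):
    # Gaussian log-likelihood of residuals given an error variance
    return -0.5 * np.sum(np.log(2 * np.pi * sigma2) + resid**2 / sigma2)

aic, test_ll = {}, {}
for deg in (1, 2, 5):
    coef = np.polyfit(x_tr, y_tr, deg)        # ML fit of the mean curve
    resid_tr = y_tr - np.polyval(coef, x_tr)
    sigma2 = np.mean(resid_tr**2)             # ML estimate of the error variance
    k = deg + 2                               # (deg + 1) coefficients + variance
    aic[deg] = 2 * k - 2 * gauss_loglik(resid_tr, sigma2)
    # predictive fit: log-likelihood of held-out data under the trained model
    test_ll[deg] = gauss_loglik(y_te - np.polyval(coef, x_te), sigma2)
    print(f"degree {deg}: AIC {aic[deg]:8.1f}   test log-lik {test_ll[deg]:9.1f}")
```

The true model (degree 2) should beat the underfitted degree-1 model on both AIC and test log-likelihood; with small training sets the overfitted degree-5 model also tends to lose on both, though by a smaller margin.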
The above situation is quite similar in spirit to conducting penalized likelihood analysis to control overfitting. So I think you can roughly think of the summation constraint for weights of mixture models as functioning as a penalty for overfitting.
For example, to optimize mixture model weights with a quasi-Newton approach like L-BFGS, you can actually just constrain the weights to be > 0 and instead optimize the likelihood with a penalty of n*(Sum_i w_i - 1). This ends up guaranteeing that in the optimal solution the weights sum to 1.
So the model 'self-compacts' in a sense if there isn't much evidence for particular mixture components. Parameters in components with weights at zero don't factor into the parameter count and don't contribute to model flexibility. This sounds mysterious, but it comes from the summation constraint.
This ends up turning the optimization of the mixture into something akin to a penalized likelihood setting. If the mixture is too rich, then often one or several of the components will end up with weights of 0 (or close to the lower bound on weights in the software implementation you are using).
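Here is a minimal scipy sketch of that penalty trick, with a toy two-component Gaussian mixture standing in for a phylogenetic site mixture (all data and component densities here are invented for illustration). The objective is Sum_s log(Sum_i w_i f_i(x_s)) - n*(Sum_i w_i - 1), the weights are bounded below by a small positive value but never explicitly constrained to sum to 1, and L-BFGS-B is left to do the rest:

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import norm

rng = np.random.default_rng(0)

# toy data: a 70/30 mixture of two fixed Gaussian components
x = np.concatenate([rng.normal(0, 1, 700), rng.normal(4, 1, 300)])
n = len(x)

# fixed per-site component densities f_i(x_s): K components x n sites
F = np.vstack([norm.pdf(x, 0, 1), norm.pdf(x, 4, 1)])

def neg_penalized_loglik(w):
    # penalized objective: sum_s log(sum_i w_i f_i(x_s)) - n*(sum_i w_i - 1)
    site_lik = w @ F
    return -(np.sum(np.log(site_lik)) - n * (np.sum(w) - 1.0))

# only a positivity bound on each weight; no explicit sum-to-1 constraint
res = minimize(neg_penalized_loglik, x0=np.array([0.5, 0.5]),
               method="L-BFGS-B", bounds=[(1e-8, None)] * 2)

print(res.x, res.x.sum())
```

At the optimum the weights come out near (0.7, 0.3) and sum to 1 on their own: setting the gradient to zero gives Sum_s f_i(x_s)/p_s = n for every component with positive weight, and multiplying by w_i and summing over components forces Sum_i w_i = 1. With a too-rich mixture, superfluous components get pushed to the 1e-8 lower bound, which is the "self-compacting" behaviour described above.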