Viktor Zaverkin (@viktorzaverkin) Bsky

Uncertainty-biased molecular dynamics for learning uniformly accurate interatomic potentials - npj Computational Materials npj Computational Materials - Uncertainty-biased molecular dynamics for learning uniformly accurate interatomic potentials

For more details about the work on learning uniformly accurate interatomic potentials from scratch I'll present in B1.36:

📄 Paper: www.nature.com/articles/s41...
💻 Code: github.com/nec-research...

7 months ago 0 0 0 0

Or stop by poster B1.36 (Thu, Aug 28)!

#PsiK2025 #AI4Science

7 months ago 0 0 1 0

I’ll be at the Psi-k conference next week!

Let’s chat about ML potentials, uncertainty quantification (ensemble-free, gradient-based: Laplace approx., NTKs, batch selection, …), uncertainty-biased MD, message-passing architectures, particle-mesh long-range methods, etc.

7 months ago 0 0 1 0

Performance of universal machine-learned potentials with explicit long-range interactions in biomolecular simulations Universal machine-learned potentials promise transferable accuracy across compositional and vibrational degrees of freedom, yet their application to biomolecular simulations remains underexplored. Thi...

🧵 TL;DR:

✅ Benchmark metrics improve with model size and electrostatics
❌ These gains don't always translate to improved simulation outcomes
⚠️ Training data & evaluation practices remain key bottlenecks

📄Preprint: arxiv.org/abs/2508.10841
💻Code: github.com/nec-research...

-/

8 months ago 0 0 0 0

With @matheusfferraz.bsky.social, @falesiani.bsky.social, @mniepert.bsky.social

15/

8 months ago 0 0 1 0

Moreover, simulation results are sensitive to training data composition.

E.g., water density predictions depend on whether NaCl-water clusters were in the training set: compare ICTP-LR(M) vs. ICTP-LR(M)*.

Legend: Solid green - ICTP-LR(M); dashed green - ICTP-LR(M)*.

14/

8 months ago 0 0 1 0

Without DFT-level simulations or other baselines, it is difficult to assess to what extent universal ML potentials improve on classical FFs in realistic biomolecular settings.

While their qualitative advantages are often evident, quantitative validation remains challenging.

13/

8 months ago 0 0 1 0

These results highlight the limitations of current evaluation practices.

12/

8 months ago 0 0 1 0

In Trp-cage, simulations with explicit long-range electrostatics exhibit greater conformational variability.

However, the origin of these effects remains unclear without DFT-level simulations.

11/

8 months ago 0 0 1 0

For Crambin, no significant differences are observed for the vibrational spectrum.

10/

8 months ago 0 0 1 0

For Ala3, larger models better reproduce experimental J-couplings.

9/

8 months ago 0 0 1 0

For water and NaCl-water mixtures:

- Larger models don't consistently outperform smaller ones
- Increasing model size doesn't yield systematic convergence
- Explicit electrostatics shifts density predictions from overestimation to underestimation, without consistent gains.

8/

8 months ago 0 0 1 0

BUT: These improvements do not consistently translate into more accurate physical observables in simulations.

Densities, radial distribution functions, and conformational ensembles show inconsistent trends with model size and long-range electrostatics.

7/

8 months ago 0 0 1 0

As expected, benchmark metrics (e.g., energy & force RMSEs) systematically improve with increasing model size and the inclusion of explicit long-range interactions.

6/

8 months ago 0 0 1 0

We use DIMOS for our simulations:

📄Preprint: arxiv.org/abs/2503.20541
💻Code: github.com/nec-research...

5/

8 months ago 0 0 1 0

We assess the impact of model size, dataset composition, and explicit long-range electrostatics across:

📊 Benchmark datasets
💧 Pure liquid water
🧂 NaCl-water mixtures
🧬 Small peptides (blocked and cationic Ala3)
🧪 Small proteins (Trp-cage, Crambin)

4/

8 months ago 0 0 1 0

DFT-level simulations and other high-quality baselines are unavailable or infeasible for biomolecular systems.

A more reliable evaluation should consider how model expressivity (model size, explicit long-range interactions) affects prediction errors and simulation results.

3/

8 months ago 0 0 1 0

Performance of universal machine-learned potentials with explicit long-range interactions in biomolecular simulations Universal machine-learned potentials promise transferable accuracy across compositional and vibrational degrees of freedom, yet their application to biomolecular simulations remains underexplored. Thi...

📄 Preprint: arxiv.org/abs/2508.10841
💻 Code: github.com/nec-research...

2/

8 months ago 0 0 1 0

🚨 New preprint: How well do universal ML potentials perform in biomolecular simulations under realistic conditions?

There's growing excitement around ML potentials trained on large datasets.
But do they deliver in simulations of biomolecular systems?

It’s not so clear. 🧵

1/

8 months ago 5 1 1 0

Gaussian Moments as Physically Inspired Molecular Descriptors for Accurate and Scalable Machine Learning Potentials Machine learning techniques allow a direct mapping of atomic positions and nuclear charges to the potential energy surface with almost ab initio accuracy and the computational efficiency of empirical ...

📎Paper: pubs.acs.org/doi/abs/10.1... (but check also pubs.acs.org/doi/abs/10.1...)
💻Code(s): gitlab.com/zaverkin_v/g... (TensorFlow), github.com/nec-research... (PyTorch), and github.com/apax-hub/apax (JAX)

9 months ago 0 0 0 0

Many thanks to everyone who has read, cited, or built on it. I hope it continues to be helpful!

9 months ago 0 0 1 0

We proposed using full tensor contractions to construct many-body features, thereby avoiding expensive sums over triplets, quadruplets, and so on. I am thrilled to see that similar ideas are now an integral part of state-of-the-art architectures, such as MACE, CACE, and so on. 💪

9 months ago 0 0 1 0

📈My first PhD paper just reached 100 citations, which is a small but very special milestone for me!

Our paper introduces Gaussian moments as molecular descriptors and uses them to build ML potentials with an impressive balance between accuracy and computational efficiency.

9 months ago 3 0 1 0

Chemical Sciences M.Sc. for prospective students | Study program | University of Stuttgart Information for prospective students: Chemical Sciences at the University of Stuttgart: application, admission, requirements.

🚀 Apply Now: International Master's Chemical Sciences! 🌍

The application portal for the English-conducted M.Sc. Chemical Sciences at @unistuttgart.bsky.social are officially open! 🔬✨

👉 Visit our program website for further details: www.uni-stuttgart.de/en/study/stu...

1 year ago 3 3 1 0

I'll present our paper in the afternoon poster session at 4:30pm - 7:30 pm in East Exhibit Hall A-C, poster 3304!

1 year ago 3 2 0 0

Neural surrogates can accelerate PDE solving but need expensive ground-truth training data. Can we reduce the training data size with active learning (AL)? In our NeurIPS D3S3 poster, we introduce AL4PDE, an extensible AL benchmark for autoregressive neural PDE solvers. 🧵

1 year ago 12 3 1 2

Join us today at #NeurIPS2024 for our poster presentation:

Higher-Rank Irreducible Cartesian Tensors for Equivariant Message Passing

🗓️ When: Wed, Dec 11, 11 a.m. – 2 p.m. PST
📍 Where: East Exhibit Hall A-C, Poster #4107

#MachineLearning #InteratomicPotentials #Equivariance #GraphNeuralNetworks

1 year ago 5 1 0 0

@falesiani.bsky.social

1 year ago 1 0 0 0

@takashimal.bsky.social

1 year ago 0 0 1 0

Good question! There is a certain connection between Cartesian and geometric products through those operations: geometric product for an n-dimensional vector could be seen as an "outer" product (up to n-rank tensors) + a "contraction". However, I don't know of any systematic study of the relation.

1 year ago 0 0 0 0

Posts by Viktor Zaverkin