Today, we are sharing our newest report, "Licensing Best Practices for Sharing Scientific Data", which extends the open data practices originally designed for climate data in our 2023 report "Recommended Best Practices for the Better Sharing of Climate Data" to other disciplines.
buff.ly/mkmMe2S
Posts by Bryan Gee
BREAKING: In response to huge cuts in Trump's budget request, NSF is shuttering its SBE directorate. Staff will be transferred to other parts of the agency and "grants that align with Administration priorities" will be maintained.
That & more w/ @maxkozlov.bsky.social & @edwrdchen.bsky.social
Government-Wide Prohibition on Publishing and Subscription Fees. The Budget ends the diversion of research dollars to high priced publishers across the Government. The Budget prohibits the use of Federal funds for expensive subscriptions to academic journals and prohibitively high publishing costs unless required by Federal statute or approved in advance by a Federal agency. Research funded by taxpayers should be publicly accessible; yet many publications charge the Government to both publish and to access the same research study. There are numerous low-cost outlets to make federally-funded research publicly available
Lots to unpack in the FY27 budget proposal but an interesting one under DOE (p. 17): "The Budget prohibits the use of Federal funds for expensive subscriptions to academic journals and prohibitively high publishing costs unless required by Federal statute or approved in advance by a Federal agency."
This is very cool! Curious whether you have insights into copyright law for 3D data in Europe - as Creative Commons says itself, CC licenses (CC0 as a waiver being the exception) are not appropriate for most research data, and Blackburn et al. (which you cite) says the same for 3D data in the U.S.
Same, we had to get uni legal involved to do takedowns and now I'm looking through all of our Zoom attendance data trying to pick out who the mole(s) was
Will probably be back next week or the week after with more deets on where those authors are/were based at the time of publication (country-level) once I come up with schemes for how to count/measure/weight that.
Reminder that you can find the code + data here: github.com/bryanmgee/ne...
[END]
The last thing I have data for, which is maybe of greatest interest to some, is WHO is publishing these papers. Still figuring out how to graph this, but the top hitters for Nature & Science only:
-Zhe-Xi Luo: 16
-Jin Meng: 15
-Xu Xing: 14
-Min Zhu: 11
-Qiang Ji: 9
(data on GitHub)
Chloropleth map showing the relatively frequency of new holotypes published in Nature or Science, with China being a giant hotspot. Australia, Canada, the U.S., and some other western European countries also show up a fair bit.
Likewise, Palaeontology sees more holotypes from the UK, western Europe, and other parts of the Commonwealth (Canada, Australia). But China remains dominant in being the source of new species.
Binary map showing whether at least one holotype published in PNAS/Current Bio was found in a given country. Most of the Americas and Europe are well-represented, while Africa is largely unrepresented.
Chloropleth map showing the relatively frequency of new holotypes published in Nature or Science, with China being a giant hotspot and almost all other countries being nearly indiscernible from zero. The United States and Myanmar are also discernible.
PNAS/Current Bio sees much higher representation from the U.S. (probably because PNAS is a U.S.-skewed publication). If you squint hard enough, Myanmar is also higher up there - Current Bio has published a lot of Cretaceous amber material (though not recently).
Binary map showing whether at least one holotype published in Nature/Science was found in a given country. Most of the Americas and Europe are well-represented, while Africa is less well represented.
Chloropleth map showing the relatively frequency of new holotypes published in Nature or Science, with China being a giant hotspot and almost all other countries being nearly indiscernible from zero.
The maps for where holotypes are from are also interesting. Nature and Science show the same trend in being dominated by new species from China.
Bar graph comparing frequency of clades for which new species were described in Palaeontology since 2010. Invertebrates of various types (molluscs, insects, ostracods, etc.) dominate this journal, followed by different types of fishes and mystery animals outside of Metazoa.
Bar graph comparing frequency of geologic time bins from which new species were described in Palaeontology between 2010 and 2026. The Cretaceous is overwhelmingly the most common, followed by the Jurassic, Cambrian, and Devonian.
Palaeontology (@thepalass.bsky.social) is the best I have atm for 'typical' paleo in which naming of inverts outpaces that of verts - not uncommon to see single papers naming 5+ new species of inverts, which almost never happens for verts. Binned inverts out more for @daveyfwright.bsky.social
Bar graph comparing frequency of clades for which new species were described in PNAS or Current Biology between 2000 (PNAS) or 2006 (CB) and 2026. Insects, misc. arthropods, and plants are far more common in than Nature/Science.
Bar graph comparing frequency of geologic time bins from which new species were described in PNAS or Current Biology between 2000 (PNAS) or 2006 (CB) and 2026. The Cretaceous is overwhelmingly the most common, followed by the Eocene, Jurassic, and Triassic.
Quite interesting to compare relative ranks among clades to which new species belong between Nature/Science and what could be considered the 'next tier down' in journals like PNAS and Current Bio.
ππ
π±π
π¦π
ππ
Bar graph comparing frequency of clades for which new species were described in Nature or Science between 1995 and 2026. Primates, non-therian mammaliaforms, theropod dinosaurs, assorted fish, and amphibians are the most common.
Bar graph comparing frequency of geologic time bins from which new species were described in Nature or Science between 1995 and 2026. The Cretaceous and Jurassic are the most common, followed by the Cambrian and Eocene.
I have cleaned up the graphs a bit after some internet rando told me my five-minute rush-job matplotlib was ugly π’
Clade- and time-based charts now have standardized colour schemes and the clades are binned out more. The following data are Nature/Science (1995-present, including this week's newπ)
π Back for #FossilFriday with more data on where/what new species get published!
Updated data and scripts: github.com/bryanmgee/ne...
Nature/Science: β¬οΈin coverage through 1995
PNAS: π, new species through 2000
Current Bio: π, new species through 2006
Palaeontology: π, new species through 2010
π§΅π
Ah Palaeontology is a good one to check! Definitely a shift away from descriptive work in my experience (less recent descriptions to cite), will add to list
Yep, the next step, which i mostly have scripted just not pushed on GitHub, is to pull information on authors and institutions from a bibliographic API - there are definitely some heavy hitters who are on 5+ articles in this dataset
The molluscs were modestly represented IIRC, I might split them out in the next iteration - it was a hassle trying to identify all the conditional labels to make some that would make sense and not be too vague
Anyway, this is a fun per project I'm just noodling on in my spare time, but if anyone else is interested and wants to contribute, the GitHub is there for forking or you can DM me!
github.com/bryanmgee/ne...
[END]
Global map using pink to indicate whether at least one holotype of a new species published in JVP was found in a country. The global north and western countries are well-represented, with broader coverage in some areas but still underrepresentation in Africa
Global chloropleth map showing that the U.S. is overwhelmingly where holotypes published in JVP come from, although Argentina and China show up well.
Chloropleth map more readable as distribution shifts more to U.S. (JVP is a American-society-based pub).
Bar plot of new species named from fossils and published in JVP between 2020 and 2026 based on manually categorized group. Placental mammals and actinopterygian fish are at the top, followed by therapsids, miscellaneous fish, and birds.
Bar plot of new species named from fossils and published in JVP between 2020 and 2026 based on the time bin. The Cretaceous is the most common, but several Cenozic bins are also near the top, like the Miocene and the Eocene
You're more likely to see new fish π and pseudosuchians π in JVP and wayyy more likely to see Cenozoic species.
Bar plot of new species named from fossils and published in JVP between 2020 and 2026 based on whether the species is an dinosaur (about 1 in 9 are).
Bar plot of new species named from fossils and published in JVP between 2020 and 2026 based on whether the species is a mammaliaforms (about 1 in 3 are).
For comparison, I also got data from JVP (I wanted J Paleo but their date filter is borked), though it's only through 2020 because there are so many more papers. The mammal "advantage" (really just skew in this case) persists.
Global map using pink to indicate whether at least one holotype of a new species published in Nature or Science was found in a country. The global north and western countries are well-represented
Global chloropleth map showing that China is overwhelmingly where holotypes published in Nature and Science come from
And the vast majority of new species in fancy journals are from China; the skew is so strong it completely blows out the chloropleth map.
Bar plot of new species named from fossils and published in Nature or Science between 2010 and 2026 based on manually categorized group. Placental mammals and non-therian mammaliaforms are at the top, followed by invertebrates and then non-avian saurischian dinosaurs and then birds.
Bar plot of new species named from fossils and published in Nature or Science between 2010 and 2026 based on the time bin. The three Mesozoic bins are the most common, followed by the Cambrian and the Silurian.
Here it is in another form. These groupings are mine, are highly asymmetrical, will probably be tweaked, and were done to try and make this more comprehensible. The Mesozoic is also the most common time bin.
Bar plot of new species named from fossils and published in Nature or Science between 2010 and 2026 based on whether the species is an dinosaur (about 1 in 6 are).
Bar plot of new species named from fossils and published in Nature or Science between 2010 and 2026 based on whether the species is a mammaliaform (more than 1 in 3 are).
People may assume dinosaurs benefit the most from being charismatic, but it's actually new mammaliaforms that have the highest representation.
Bar plot of new species named from fossils and published in Nature or Science between 2010 and 2026 based on whether the species is a vertebrate (about 4 in 5 are).
Bar plot of new species named from fossils and published in Nature or Science between 2010 and 2026 based on whether the species is a tetrapod (about 3 in 5 are).
Predictably, there's a skew towards vertebrates and tetrapods specifically.
Bar plot of new species named from fossils and published in Nature or Science between 2010 and 2026 based on whether the species is an animal (almost all are).
Bad news if you don't work on animal fossil specifically. It does happen once in a blue moon though!
www.nature.com/articles/nat...
Bar plot of new species named from fossils and published in Nature or Science between 2010 and 2026. Nature has a lot more.
Firstly, Nature publishes way more of these naming papers than Science.
This is just a pet project, so I won't belabor the methods. It's all in the README on GitHub with my code and data:
github.com/bryanmgee/ne...
In short, I searched 'sp. nov. fossil' in Nature and Science's websites, 2010 through 2026 inclusive, and pulled data. Then the script magic happens β¨οΈ
Last week, amidst the hoopla over a new Speen, @fishfetisher.bsky.social suggested a review of naming papers in fancy journals in response to a post by @daveyfwright.bsky.social - I got bored after work and now I have (some) data!
π§΅π
#FossilFriday
#CharismaticTaxaAreOverrated
Very sad to hear about the passing of Hans Sues. I was always impressed with Hans' breadth across Tetrapoda and his enthusiasm, but he was just a really nice guy above anything else. He was very patient and kind to me as a nervous grad student when handling my first MS, which I have never forgotten.