
Posts by David Steinberg


🧬 Database Center for Life Science #BioHackathon2025 in Mie, Japan!

We tested Camber's Nova, the Science AI, on analyzing survey results, combining DBCLS datasets, and enabling natural language queries. Huge thanks to DBCLS!

#BioHackathon #Bioinformatics #LifeSciences #AI #Camber #DBCLS

6 months ago 3 2 0 0

Want to make better use of ontologies? Check out on2vec created at DBCLS #biohackathon 2025...

It turns ontologies into embeddings you can actually use in ML models. Perfect for biomedical data, knowledge graphs, etc. osf.io/preprints/bi... github.com/david4096/on...

5 months ago 2 0 0 0

Catch you at the next one I hope ;)

6 months ago 0 0 0 0

Hello from @ga4gh.org Plenary in Uppsala!

6 months ago 8 1 1 0
Cloud-Based BRCA Exchange Variant Analysis Environment Using GA4GH Standards in Camber By integrating BRCA Exchange variant data with GA4GH standards, this GA4GH Implementation Forum (GIF) project creates open, platform-agnostic workflows and tools that can be used by anyone for scalabl...

Announcing GIF Project: Cloud-based BRCA Exchange variant analysis environment using GA4GH standards in Camber. The project aims to adapt and extend community-driven standards to support interoperable workflows, variant annotation, and metadata description. Learn more: www.ga4gh.org/what-we-do/g...

8 months ago 2 2 0 0

Nice to meet you too!

1 year ago 1 0 0 0

Collected together @ga4gh.org Bluesky accounts here, lmk if you want to be added! go.bsky.app/8BDDMqM

1 year ago 2 1 0 0

Calling all @ga4gh.org Connect 2025 attendees online and in-person, let's connect here on bluesky! #ga4ghconnect2025 #ga4gh #bioinformatics #genomics

1 year ago 6 1 1 0
Capt. Grace Hopper on Future Possibilities: Data, Hardware, Software, and People (Part One, 1982). YouTube video by National Security Agency

Grace Hopper could really get people laughing about information sciences and the struggles of working under strict hierarchies www.youtube.com/watch?v=si9i...

1 year ago 2 0 0 0
Summary of "Improvising to cellular playgrounds in Realtalk", Aug 2023. YouTube video by Dynamicland

If you haven't caught up with the amazing new demos from @dynamicland.org now is your chance www.youtube.com/watch?v=Osn3...

1 year ago 3 0 0 0

At the @mlcommons.org Croissant community meeting with, you guessed it

1 year ago 1 0 0 0
Improvising cellular playgrounds in Realtalk

The photo we saw reminded me immediately of some of the goals of @dynamicland.org as seen here dynamicland.org/2023/Improvi...

1 year ago 0 0 0 0
GitHub - dbcls/dive: Data Integration Visual Exploration (DIVE)

Another important direction is making immersive visual experiences that make data models accessible in a visual and humane way. I hope to experience this in person at a museum github.com/dbcls/dive

1 year ago 0 0 1 0

Toshiaki Katayama, original author of the wildly popular KEGG database, rounding out the keynotes at @swat4hcls.bsky.social by showing us the past, present, and future of linked data in the life sciences. Lots of excitement for the possibilities of #graphgenome!!

1 year ago 2 2 1 0

Nice to see this one making the rounds @dockstore.org @ucscgenomics.bsky.social

1 year ago 2 1 0 0

Starter pack for #swat4hcls2025 conference go.bsky.app/PiZd2qR 🗣️ @swat4hcls.bsky.social

1 year ago 0 0 0 0
GitHub - MaastrichtU-IDS/UM_KEN4256_KnowledgeGraphs: Resources for the KG course at IDS, Maastricht University

Slide from a course at @maastrichtu.bsky.social that’s up on GitHub github.com/MaastrichtU-...

1 year ago 0 0 0 0

Embedding knowledge graphs in order to compare ontologies using learned features from Shervin Mehryar’s keynote

1 year ago 0 0 1 0

From Prof Anna Fensel’s keynote, a roundup of some of the connections between AI and semantic

1 year ago 0 0 1 0

One of the common themes of the conversations at #swat4hcls so far is that knowledge graphs are proving to be critical for the reliability and interpretability of AI, and of LLMs in particular

1 year ago 1 0 1 0

Excited to attend #SWAT4HCLS in Barcelona next week, representing @cambercloud.bsky.social! 🎉

At the hackathon, we’ll explore #CroissantML for seamless dataset & model access via @hf.co and @kaggle.com 🤓

1 year ago 3 0 0 0

Check out our first preprint from #biohackathon Fukushima 2024 and expect more on this work 🤓 files.osf.io/v1/resources...

1 year ago 0 0 0 0

We found some low-hanging fruit for improvement and tested out bringing a bio dataset into Croissant. We think that steadily increasing the use of ontologies and controlled vocabularies will be crucial for data harmonization and the new era of multimodal models!

1 year ago 1 1 1 0
GitHub - david4096/croissant-rdf: Tools for working with RDF from Croissant JSON-LD resources

We made a simple tool for converting CroissantML to #RDF so it could be analyzed using #SPARQL, and looked for differences in how it is used on Kaggle and Hugging Face github.com/david4096/cr...

1 year ago 1 0 1 0
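
To give a feel for the kind of comparison described in the post above, here is a minimal sketch. It is not the croissant-rdf tool and it does not use RDF or SPARQL: it swaps in plain Python sets to compare which metadata properties two JSON-LD-style documents use. Both documents, the `used_properties` helper, and the platform names are hypothetical and exist only for illustration.

```python
def used_properties(node, found=None):
    """Recursively collect every non-keyword key (no leading '@') in a nested structure."""
    if found is None:
        found = set()
    if isinstance(node, dict):
        for key, value in node.items():
            if not key.startswith("@"):
                found.add(key)
            used_properties(value, found)
    elif isinstance(node, list):
        for item in node:
            used_properties(item, found)
    return found


# Hypothetical, heavily simplified metadata documents (not real platform exports).
platform_a = {
    "@type": "sc:Dataset",
    "name": "toy_a",
    "license": "https://creativecommons.org/licenses/by/4.0/",
    "recordSet": [
        {
            "@type": "cr:RecordSet",
            "name": "rows",
            "field": [{"@type": "cr:Field", "name": "x", "dataType": "sc:Text"}],
        }
    ],
}
platform_b = {
    "@type": "sc:Dataset",
    "name": "toy_b",
    "description": "Toy dataset used only for illustration.",
    "distribution": [
        {"@type": "cr:FileObject", "contentUrl": "https://example.org/b.csv"}
    ],
}

props_a = used_properties(platform_a)
props_b = used_properties(platform_b)
only_a = props_a - props_b  # properties used only in the first document
only_b = props_b - props_a  # properties used only in the second document

print("only in A:", sorted(only_a))
print("only in B:", sorted(only_b))
```

With real exports, the same question is better asked over RDF with a SPARQL query, as the post describes, since that resolves prefixes and contexts instead of comparing raw key names.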

It works by providing a controlled vocabulary for high-level dataset metadata as well as specific metadata for columnar data. That might seem like a small thing, but it is a huge step forward for bringing tools to data

1 year ago 0 0 1 0
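
As a rough illustration of the two layers of metadata mentioned in the post above (dataset-level and column-level), here is a minimal sketch. The dictionary below is a hypothetical, simplified Croissant-style document, not a validated one: the dataset name, URL, and columns are made up, the `@context` is omitted, and `describe_columns` is an illustrative helper rather than part of any Croissant library.

```python
# Hypothetical, simplified Croissant-style metadata (assumption: not validated
# against the spec; "@context" omitted for brevity).
dataset = {
    "@type": "sc:Dataset",
    # Dataset-level metadata
    "name": "example_gene_expression",
    "description": "Toy expression table used only for illustration.",
    "license": "https://creativecommons.org/licenses/by/4.0/",
    "distribution": [
        {
            "@type": "cr:FileObject",
            "@id": "expression.csv",
            "contentUrl": "https://example.org/expression.csv",
            "encodingFormat": "text/csv",
        }
    ],
    # Column-level metadata
    "recordSet": [
        {
            "@type": "cr:RecordSet",
            "name": "samples",
            "field": [
                {
                    "@type": "cr:Field",
                    "name": "gene_id",
                    "dataType": "sc:Text",
                    "source": {
                        "fileObject": {"@id": "expression.csv"},
                        "extract": {"column": "gene_id"},
                    },
                },
                {
                    "@type": "cr:Field",
                    "name": "tpm",
                    "dataType": "sc:Float",
                    "source": {
                        "fileObject": {"@id": "expression.csv"},
                        "extract": {"column": "tpm"},
                    },
                },
            ],
        }
    ],
}


def describe_columns(metadata):
    """Return (record set, field name, data type, source column) for every field."""
    rows = []
    for record_set in metadata.get("recordSet", []):
        for field in record_set.get("field", []):
            column = field.get("source", {}).get("extract", {}).get("column")
            rows.append(
                (record_set["name"], field["name"], field.get("dataType"), column)
            )
    return rows


for row in describe_columns(dataset):
    print(row)
```

Because each column carries a name, a data type, and a pointer back to its source file, a generic loader can work out how to read the table without dataset-specific parsing code.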

@hf.co, @kaggle.com, OpenML, Dataverse and others are all implementing all or part of the CroissantML spec, which interoperates with tooling like TensorFlow so you can load datasets directly into your AI training code

1 year ago 1 0 1 0

Biology datasets tend to be messy, require domain knowledge to parse, and are not immediately usable for training AI models. That’s part of why I became interested in @mlcommons.org CroissantML as a way to bring ML tools to biology data. We’re presenting a poster on this effort at #swat4hcls next week!

1 year ago 2 0 1 0

This is a great opportunity to contribute —

1 year ago 2 0 0 0

@anthropic.com marked bioinformaticians as Office & Administrative for their job category 🧐 www.anthropic.com/news/the-ant...

1 year ago 0 0 0 0

gestures in @worrydream.com

1 year ago 1 0 0 0