Advertisement ยท 728 ร— 90

Posts by Josh Persi

Can anyone recommend some data science podcasts on really any aspect of the biz? My go-tos are the R podcast and the test set, but curious what others are listening to!

#rstats #pydata

1 month ago 2 0 1 0

Looks like some awesome talks. Can't wait to dig in!

1 month ago 1 0 0 0
Post image

Looks nifty! Is there a specific setting to enable this?

I have the latest version of Positron, the Positron notebook enabled, but I don't see the data explorer:

1 month ago 0 0 0 0

Looks really slick! What plotting library are you using for the line-plots? I haven't seen any plots in R 'draw' the lines like that!

1 month ago 1 0 1 0
Post image

New feature in the Positron Notebook Editor: Inline data-explorer view. Whenever you print a pandas or polars dataframe you get an inline interactive view powered by Positron's data explorer. This allows you to view and sort massive dataframes.

1 month ago 30 6 2 1

Thanks for sharing - I appreciate the benefits you write about (e.g. searching across multiple projects). It still feels weird that, as far as I know, I'd have to change the working directory at the outset of each new R session to whichever project I'm working on.

1 month ago 1 0 1 0
Post image

Positron users - do you set up multiple (all?) projects in a single workspace or open up each project into its own workspace. I'm thinking the latter is more common but I'm curious to know if others use the multiple-projects-in-one-workspace approach.

#positron #rstats

1 month ago 3 0 2 0

Working on Shiny apps for the passed few weeks, boy am I glad the browser() function exists. It makes it so much easier to walk through reactives as they're being evaluated. Check it out if you've ever felt stuck working on a massive Shiny app!

#rstats #rshiny

1 month ago 9 1 0 1
Advertisement

If you're deep-learning-curious, come read the free online book Deep Learning with Python with us at the @dslc.io
bsky.app/profile/josh...

2 months ago 2 0 0 0
But what is a neural network? | Deep learning chapter 1
But what is a neural network? | Deep learning chapter 1 YouTube video by 3Blue1Brown

I'm new to the world of neural networks and am 8 years late to this video, but this video really helped me build intuition for what happens under the hood. Thanks @3blue1brown.com!

youtu.be/aircAruvnKk

#rstats #pydata #machinelearning #deeplearning

2 months ago 7 1 1 0
Preview
dplyr 1.2.0 dplyr 1.2.0 fills in some important gaps in dplyr's API: we've added a new complement to `filter()` focused on dropping rows, and we've expanded the `case_when()` family with three new recoding and re...

dplyr 1.2.0 is out now and we are SO excited!

- `filter_out()` for dropping rows

- `recode_values()`, `replace_values()`, and `replace_when()` that join `case_when()` as a complete family of recoding/replacing tools

These are huge quality of life wins for #rstats!

tidyverse.org/blog/2026/02...

2 months ago 465 133 12 12
Preview
Deep Learning with Python, Third Edition Deep Learning with Python is written for anyone who wishes to explore deep learning from scratch. This new edition adds comprehensive coverage of generative AI and modern deep learning frameworks. It ...

Come learn about deep learning with us! The Data Science Learning Community just started reading the free third edition of Deep Learning with Python. Meetings are Tuesday at 3 pm CST.

DSLC: dslc.io
Book: deeplearningwithpython.io

#rstats #pydata

2 months ago 7 3 0 1

Glad you think it's cool! I'm happy to share!

I referenced the sentiment analysis chapter of 'Text Mining with R' (www.tidytextmining.com/sentiment.ht...) while writing the analysis.

My (somewhat slapdash) code is available on GitHub:
github.com/joshpersi/ti...

3 months ago 2 0 1 0
Post image

For this week's #tidytuesday I looked at the sentiment of classic Christmas novels and found the most 'joyous' ones on the basis of a sentiment analysis. If you want extra joy next holiday season, Angel Unaware may be a good read!

#dataviz #rstats #ggplot2

3 months ago 9 0 2 0

Thanks for pointing that out! I'm still learning all the terms but I believe you're right. Whenever I wrote 'reference table' it would have been more correct to write 'dimension table'.

3 months ago 0 0 0 0

If you want to learn about DuckDB too, come join our 'DuckDB in Action' book club at the Data Science Learning Community (it's free and we're friendly!)

dslc.io

3 months ago 4 0 0 0
A Canadian trade database built with DuckDB โ€“ Josh Persi

Hey folks! I built a Canadian trade database to practice what I'm learning about DuckDB. The database includes all Canadian imports and exports from 1988 to present as per Statistics Canada. Here's a blog post I wrote with more details:

joshpersi.github.io/blog/posts/0...

#rstats #duckdb

3 months ago 33 6 4 0
Advertisement
Post image

I'm taking my very first crack at OJS for this week's #tidytuesday! It's a super basic plot but getting going with OJS was a challenge, so I'm calling it a win!

#DataViz #RStats #ggplot2

3 months ago 10 0 2 0
A choropleth map of US states, coloured by the number of roundabouts per state. The state with the most roundabouts if Florida.

A choropleth map of US states, coloured by the number of roundabouts per state. The state with the most roundabouts if Florida.

Who knew Florida had so many roundabouts? For this week's #tidytuesday I struggled to match each interval's colour intensity with the midpoint of the interval. I couldn't figure it out in code and ended up pulling from coolors.co/gradient-pal...

#DataViz #RStats #ggplot2 #tidytuesday

4 months ago 7 0 0 0
Post image

I took this week's #TidyTuesday as an opportunity to try and brush up on modelling and learn tidymodels.org. I have no great modelling insights to share, but in making this graph, I did learn you can set line-ends to be round in geom_line(). Who knew?!

#DataViz #RStats #ggplot2

4 months ago 12 0 1 0
testthat hex

testthat hex

testthat 3.3.0 is here! Major updates for #RStats testing include failure messages for easier debugging, 6 new specific expectations, and the SlowReporter to find your bottlenecks.

Heads Up: R 4.1+ is now required, and old mocking functions are removed.

Read the post: tidyverse.org/blog/2025/11...

4 months ago 25 6 0 0
Preview
Free "DuckDB in Action" Book Download a free PDF of the DuckDB in Action book published by Manning. The book is rich with technical guidance on data analytics and data engineering with DuckDB and MotherDuck.

The Data Science Learning Community is starting a new book club cohort for DuckDB in Action this Wednesday. Come read and learn with us! All are welcome!

DSLC: dslc.io
DuckDB in Action: motherduck.com/duckdb-book-...

#RStats #python #DataScience #OpenSource

4 months ago 13 0 0 0

Thank you!

4 months ago 1 0 0 0
Controlling source and author attributions in your {ggplot2} graphics โ€“ Josh Persi

A very long time in the making but I'm eager to share I've made a Quarto blog to talk about all the data science stuff I'm learning! First post is about adding data source and author attributions in ggplot2! Feel free to stop by!

joshpersi.github.io/blog

#RStats #Quarto #DataScience #OpenSource

4 months ago 13 1 1 0

Thanks Libby! You were actually one of the reasons I finally got the gumption to join in!

A few Data Science Hangouts back you said something along the lines of 'you don't need to wait to be an expert to start working in the open' and it really clicked with me!

5 months ago 1 0 1 0
A line-chart showing the number of deaths to tuberculosis on the y-axis and year, from 2000 to 2023, on the x-axis, with each line being a separate country. Most countries have had low tuberculosis mortality throughout this timespan, but two countries in particular, India and South Africa, have seen significant declines.

A line-chart showing the number of deaths to tuberculosis on the y-axis and year, from 2000 to 2023, on the x-axis, with each line being a separate country. Most countries have had low tuberculosis mortality throughout this timespan, but two countries in particular, India and South Africa, have seen significant declines.

This is my first time joining in on the #TidyTuesday fun. Looking forward to seeing what everyone else is doing with this week's data!

#DataViz #RStats #ggplot2

5 months ago 17 0 2 1
Advertisement
A map of Canada showing rivers as lines

A map of Canada showing rivers as lines

10 minutes is all we were allowed for day 13 of the #30DayMapChallenge. This took me about 15 minutes, so I'm a little over, but this was a fun way to work under a time constraint and avoid endless tweaking!

#DataViz #RStats #ggplot2

5 months ago 8 0 1 0

Thank you! I was able to find it and will keep it mind for the future!

Here's the link: www.metrolinx.com/en/about-us/...

5 months ago 0 0 0 0
Post image

Day 11 is all about minimalism so I made a map with four lines! Perhaps too minimal, but I also couldn't find geographic data on the stations themselves, so maybe it was meant to be ๐Ÿคท

#30DayMapChallenge #DataViz #RStats #ggplot2

5 months ago 7 0 2 0

Happy to! My code is here:

github.com/joshpersi/30...

Data is pulled from the Climate Data Store: cds.climate.copernicus.eu/user-guide

5 months ago 1 0 1 0