Finished the dlt workshop from @datatalks.bsky.social ✅
Built a custom API → dlt → DuckDB pipeline, with page-by-page pagination + validation.
#dlthub #dezoomcamp #zoomcamp #dataengineering
Module 1 complete: Containerization & Infrastructure as Code ✅ @datatalks.bsky.social
• #Docker Compose orchestrating #PostgreSQL + pgAdmin
• #Python ETL processing 46K NYC taxi records
• #SQL analytics on trip patterns and zones
• #Terraform provisioning GCP resources
#dataengineering #zoomcamp
#MLOps #Zoomcamp - Module 6 on Best Practices! 🚀
@datatalks.bsky.social
Learned:
✅ Unit & integration testing with pytest
✅ Mocking cloud services with LocalStack
✅ Code quality with pre-commit hooks
✅ Workflow automation with Makefiles
✅ CI/CD with GitHub Actions
🔥 Completed my MLOps homework by building a regression model to predict taxi ride durations! Excited to continue learning about MLOps, model optimization, and deployment. 🚖 #MLOpsJourney #AI #DataScience #MachineLearning #ZoomCamp #DataTalksClub
🚕💡 The results are in! My regression model predicts NY Yellow Taxi ride durations based on trip data. MLOps knowledge is growing as I move towards model deployment. Looking forward to the next steps! #DataScience #MachineLearning #NYC #MLOps #ZoomCamp #DataTalksClub
📊🤖 Finished building my first linear regression model to predict NY Taxi ride durations using the January-February 2023 dataset. Next step: tuning the model and deploying with Docker. Let's do this! #MLModel #DataScience #AI #ZoomCamp #MLOps #DataTalksClub
🚖🔍 Exploring the NY Yellow Taxi dataset from Jan-Feb 2023! Diving into ride data to better understand patterns and prepare for a linear regression model to predict ride durations. Exciting journey ahead! #MachineLearning #MLOps #ZoomCamp #DataTalksClub