Building reproducible ML/AI pipelines.
This project is about building a reproducible ML pipeline with MLflow and Weights & Biases. A property company has a model that estimate…
Apr 20

How to create and deploy big data jobs on AWS EMR (Apache Spark)
A growing music streaming startup has expanded both its user base and song library, and now plans to set up the data in its data warehouse…
Apr 20

Data Ingestion (Batch/Real-time) tools series: Data Build Tool (DBT)
DBT is a data workflow tool designed mainly for transformation; it helps you transform data inside a data store. DBT is not the kind of…
Oct 27, 2023

Published in Geek Culture
Starting Your Data Science Project With Metaflow? The MNIST Use-Case.
Data scientists' priority is simply picking the right features and building and deploying their models; they do not like to be…
May 2, 2021

Published in Analytics Vidhya
Technology behind Streaming-as-a-service
The demand for streaming services has risen exponentially; companies like Netflix, Spotify, Twitter, and Uber want to offer real-time services…
Jan 24, 2021

Translate Business problems to Data science problems.
Data science has come a long way in recent times; its processes are now used in analysis, analytics, machine learning…
Aug 7, 2020

Distributed Computing
For Data Science, Machine learning, Data Engineering folks.
Jul 26, 2020

Data Privacy & Security
With the growing use of web and mobile applications, data is everywhere and vulnerable, and that includes personal data. There have…
Jun 29, 2020

Published in Analytics Vidhya
Containers, Kubernetes & Continuous Delivery/Integration in AI
Hello, my Data Science/Machine Learning folks
Jun 22, 2020