Data Science with Raghav

Everything related to Data and AI

Skip to content
Menu
  • Home
  • Data Science
  • Data Engineering
  • AI
  • NLP
  • Productivity
  • General
  • About
  • Contact
  • Privacy Policy

Category: Data Engineering

How to get started with Databricks on Azure for free?
Data Engineering

How to get started with Databricks on Azure for free?

Posted on March 21, 2023May 18, 2023 by Raghav

What is Databricks Databricks is an American Software company founded by the creators of Apache Spark. It provides a web based platform of the same…

What is Redis and When to use it?
Data Engineering

What is Redis and When to use it?

Posted on July 19, 2022July 19, 2022 by Raghav

Introduction Redis is an open source in-memory data structure store. It is a short for Remote Dictionary Server(REDIS). It is used as a distributed in-memory…

How to get historical weather data (min temp, max temp and precipitation) directly from NOAA (National Oceanic and Atmospheric Agency) using Python – Part 3 (extracting temperature/precip data from netcdf files)
Data Engineering

How to get historical weather data (min temp, max temp and precipitation) directly from NOAA (National Oceanic and Atmospheric Agency) using Python – Part 3 (extracting temperature/precip data from netcdf files)

Posted on April 29, 2022April 29, 2022 by Raghav

In the last post I described how to create a GIS shape file for the region of interest for which you need to extract the…

How to get historical weather data (min temp, max temp and precipitation) directly from NOAA (National Oceanic and Atmospheric Agency) using Python – Part 2 (Creating Shape Files)
Data Engineering

How to get historical weather data (min temp, max temp and precipitation) directly from NOAA (National Oceanic and Atmospheric Agency) using Python – Part 2 (Creating Shape Files)

Posted on April 18, 2022April 29, 2022 by Raghav

Creating Shape files In the first part we discussed how to download the netcdf files containing the weather data from NOAA’s website using Python. In…

How to get historical weather data (min temp, max temp and precipitation) directly from NOAA (National Oceanic and Atmospheric Agency) using Python – Part 1 (Downloading NETCDF files)
Data Engineering

How to get historical weather data (min temp, max temp and precipitation) directly from NOAA (National Oceanic and Atmospheric Agency) using Python – Part 1 (Downloading NETCDF files)

Posted on April 7, 2022April 29, 2022 by Raghav

NOAA is a US government agency that forecasts weather and monitors oceanic and atmospheric conditions. It is one of the biggest weather agencies in the…

Understanding Transformations vs Actions and Narrow vs Wide Dependencies in Apache Spark
Data Engineering

Understanding Transformations vs Actions and Narrow vs Wide Dependencies in Apache Spark

Posted on March 8, 2022March 8, 2022 by Raghav

This article covers two of the most important concepts related to execution of code in Apache Spark. It is crucial for your understanding of Spark…

Getting started with PySpark and running your first application
Data Engineering

Getting started with PySpark and running your first application

Posted on March 3, 2022March 3, 2022 by Raghav

Introduction to Spark Spark is a unified engine for distributed data processing. It supports both on premise and cloud installation. Applications written in Spark can…

How to install Apache Superset locally on Windows with database drivers for MS SQL server, Dremio, MySQL and PYODBC.
Data Engineering

How to install Apache Superset locally on Windows with database drivers for MS SQL server, Dremio, MySQL and PYODBC.

Posted on December 23, 2021December 27, 2021 by Raghav

Introduction What is Superset Superset is a Data Visualization tool which is cloud-native, highly available and scalable as it works very well with containers. You…

How to read a fixed width file using Pandas in Python
Data Engineering

How to read a fixed width file using Pandas in Python

Posted on November 30, 2021November 30, 2021 by Raghav

Introduction Fixed width files, which do not have any column delimiters are common in financial industry especially with ETL extracting data from mainframe systems. In…

How to scrape product prices data from Amazon and store it in a pandas DataFrame using Python
Data Engineering

How to scrape product prices data from Amazon and store it in a pandas DataFrame using Python

Posted on January 19, 2021January 19, 2021 by Raghav

Scraping product prices data from Amazon is very helpful if you are building a price comparison utility or if you want to be alerted when…

Posts pagination

Page 1 Page 2 Next Page
© Copyright 2025 – Data Science with Raghav
Wisteria Theme by WPFriendship ⋅ Powered by WordPress
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.OkNoPrivacy policy