• Explore
  • Sites
  • Posts
  • Twitter
  • FAQs
  • Terms
  • Discord
  • Support
  • Privacy
Abhijit Singh
🙂
Abhijit Singh

Data Analytics and Warehousing for eBev's Online Brewery

Introduction

eBev, an online brewery, was facing challenges in managing its operations and sales effectively. They had a vast amount of data but lacked the necessary tools and expertise to turn it into actionable insights. I was brought on board to help eBev leverage data analytics and warehousing to optimize their operations and sales.

Overview

  • The client wanted to create a centralized repository for all organizational data.

  • The goal was to make the data accessible to everyone and create an organized data source.

  • The client wanted continuous data to refresh the repository.

  • The main objective was to convert raw data into actionable insights for better decision-making.

  • The client wanted to build a data lake-house architecture on the cloud to achieve this.

Requirements

  1. Ingest data from different sources.

  2. Store data securely in Azure Data Lake Storage.

  3. Perform data transformations using Azure Databricks.

  4. Utilize Apache Airflow for orchestrating and scheduling the entire workflow.

  5. Ensure scalability and fault tolerance.

Architecture

eBev's Architecture diagram

Description

  • Data Sources: Multiple data sources feed into the system, and data is ingested into Azure Data Lake Storage (ADLS).

  • Data Processing: Azure Databricks is used for batch processing and data transformations.

  • Workflow Orchestration: Apache Airflow orchestrates the entire workflow, scheduling tasks and triggering data processing jobs.

  • Data Analysis: Processed data in ADLS can be queried and analyzed using Power BI for advanced analytics.

Send Abhijit Singh a reply about this page
Back to profile