Skip to content
View caesarmario's full-sized avatar
🚀
Fortune Favors the Bold!
🚀
Fortune Favors the Bold!

Block or report caesarmario

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
caesarmario/README.md

Mario Caesar

Data Engineer • Mentor • Speaker

I build reliable data platforms, practical ETL/ELT pipelines, and analytics foundations that help teams move faster with more trust in their data.

Website LinkedIn Medium Kaggle Email

About

I'm a Data Engineer from Indonesia with experience across Agoda, Allo Bank Indonesia, Kredivo Group, and Kredit Pintar. I enjoy building dependable pipelines, warehouse layers, and data workflows that are practical, observable, and easy for teams to trust.

My core stack includes Python, SQL, Airflow, dbt, BigQuery, PySpark, Kafka, and Microsoft Fabric.

Selected Work

  • Re-architected ingestion and ETL pipelines to reduce runtimes from hours to minutes
  • Built reusable, config-driven data workflows for faster onboarding and cleaner maintenance
  • Improved data quality, lineage, and observability across analytics and reporting systems
  • Designed privacy-aware data solutions for masking, governance, and secure delivery

Beyond Work

I also mentor and teach aspiring data professionals, contribute to technical learning programs, and occasionally speak about practical data engineering and analytics.

Find Me


Fortune favors the bold.

Pinned Loading

  1. data-slices data-slices Public

    Welcome to "Data Slices", where imagination and information converge, inviting you to see the world through a new lens of data-driven artistry.

    Jupyter Notebook 11

  2. Team-Mayo-UN-Youth-Hackathon-2022-Submission Team-Mayo-UN-Youth-Hackathon-2022-Submission Public

    This repository is the Mayo team's submission of answers for the UN Youth Hackathon 2022. This repository includes Jupyter notebooks, presentation files, and raw & cleaned datasets. The topic is th…

    Jupyter Notebook

  3. big-mart-sales-preprocessing-SAS-studio big-mart-sales-preprocessing-SAS-studio Public

    Data preprocessing, feature engineering, and EDA for "Big Mart Sales" data set using SAS Studio. The dataset is taken from Kaggle (https://www.kaggle.com/mrmorj/big-mart-sales).

    SAS 5 1

  4. database-bookstore-case-study database-bookstore-case-study Public

    A case study about designing simple database system for a bookstore. This repository contains ERD design and SQL codes for design of bookstore database system. The main purpose of this system is to…

    17 3

  5. etl-credit-card-dataset-using-pentaho etl-credit-card-dataset-using-pentaho Public

    This repository contains ETL file from Pentaho Data Integration. The ETL process cleaned applicant with empty values/data and dirty data. The dataset is taken from https://www.kaggle.com/rikdifos/c…

    7 5

  6. heart-disease-prediction-with-logistic-regression-SAS-studio heart-disease-prediction-with-logistic-regression-SAS-studio Public

    Heart disease prediction with logistic regression using SAS Studio. The dataset is taken from UCI Machine Learning about heart disease.

    SAS 12 1