Skip to content
View VijayRodrigues's full-sized avatar

Block or report VijayRodrigues

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
VijayRodrigues/README.md

πŸ‘‹ Hi, I'm Vijay Ashley Rodrigues

πŸ§‘β€πŸ’» About Me

I'm a Data Engineer with 7+ years of experience building data platforms, ETL/ELT pipelines, migration frameworks, and analytics solutions across APAC, ANZ, and North America within the insurance industry.

Throughout my career, I've worked on large-scale data migration, cloud modernization, analytics enablement, and data quality initiatives, with a focus on building reliable and scalable solutions that help organizations make better use of their data.

My primary expertise lies in Azure Databricks, Apache Spark, PySpark, SQL, Snowflake, Airflow, and dbt, and I enjoy solving problems related to data processing, platform engineering, data quality, and performance optimization.

Lately, I've been exploring how Generative AI can complement data engineering, particularly in areas such as metadata intelligence, analytics acceleration, developer productivity, and intelligent data tooling.


πŸ”§ Core Technologies

Data Engineering: Apache Spark, PySpark, Databricks, Delta Lake, dbt

Cloud & Data Platforms: Azure Databricks, Azure Data Factory, Azure Data Lake Storage, Azure Synapse Analytics, Snowflake

Programming: Python, SQL, Spark SQL

Orchestration: Apache Airflow, Databricks Workflows

Data Quality & Governance: Data Modeling, Data Quality, Data Migration, Data Validation, Reconciliation Frameworks


πŸ“Œ Featured Projects

⭐ GenAI DBT Model Analyzer

AI-powered assistant for understanding dbt models, dependencies, lineage, and transformation logic.

⭐ Real-Time POS Kafka Pipeline

Real-time streaming pipeline demonstrating event-driven data processing patterns.

⭐ Data Compare Dashboard

Data reconciliation and validation framework for identifying mismatches across datasets and systems.

⭐ Real-Time E-Commerce Pipeline

End-to-end data engineering project covering ingestion, transformation, and analytics workflows.

🚧 AI Engineering Playground

Experiments and prototypes exploring practical applications of Generative AI in data engineering.


πŸ“œ Certifications

  • Databricks Certified Data Engineer Associate
  • dbt Fundamentals
  • Apache Airflow Fundamentals
  • DAG Authoring for Apache Airflow
  • Snowflake Hands-On Essentials: Data Engineering
  • Postgraduate Program in Artificial Intelligence & Machine Learning (Texas McCombs School of Business)

🌍 Beyond Work

πŸ’° Currency collector with notes from 180+ countries

πŸ”­ Space and astronomy enthusiast

πŸ“š Continuous learner exploring data engineering, AI, and cloud technologies


πŸ“« Connect

πŸ’Ό LinkedIn: https://www.linkedin.com/in/vijayrodrigues

🌐 Portfolio: https://www.vijayrodrigues.com

πŸ’» HackerRank: https://www.hackerrank.com/profile/vijayrodrigues18

Pinned Loading

  1. VijayRodrigues VijayRodrigues Public

    Config files for my GitHub profile.

  2. GenAI_DBT_model_Analyzer GenAI_DBT_model_Analyzer Public

    Python 1