  • https://geekvide.com/
  • Vadodara, Gujarat, India

Chaitanya-0310/README.md

Email LinkedIn GitHub


👨‍💻 About Me

AI & Data Engineer with hands-on experience building agentic AI systems, RAG pipelines, and scalable data platforms.
Currently pursuing a Master of Applied Computing at the University of Windsor (Jun 2025).

I specialize in bridging robust data engineering with modern Generative AI to deliver reliable, production-ready systems.


🔍 Core Expertise

  • 🤖 Agentic AI & LLM Orchestration (LangGraph, LangChain, RAG)
  • 📊 Distributed Data Engineering (Spark, Airflow, dbt, Kafka)
  • ☁️ Cloud Platforms (Azure, AWS, GCP)
  • 💾 Modern Analytics & Warehousing (Snowflake, PostgreSQL, BigQuery)
  • 🧱 MLOps & Vector Databases (pgvector, Pinecone, MLflow)

🛠️ Tech Stack

🧾 Languages

Python SQL Bash R


🤖 AI / ML

LangChain HuggingFace PyTorch TensorFlow OpenAI


📊 Data Engineering

Apache Spark Apache Airflow Kafka dbt Snowflake


☁️ Cloud & Infra

Azure AWS GCP


🗄️ Databases & DevOps

PostgreSQL MongoDB Cassandra Docker Kubernetes Terraform GitHub Actions

🚀 Featured Projects

🧠 Multi-Agent Marketing Campaign Orchestrator (Agentic AI)

  • Multi-agent workflow using LangGraph
  • Planner, Writer & Reviewer agents
  • RAG + ChromaDB for contextual generation
  • Streamlit UI

🔗 https://github.com/Chaitanya-0310/Multi_Model_AI_Agent


📈 StockRAG – AI-Driven Financial Intelligence

  • End-to-end RAG pipeline
  • FastAPI for real-time market data
  • pgvector-enabled PostgreSQL
  • Context-aware LLM responses

🔗 https://github.com/Chaitanya-0310/StockRAG-AI-Assistant


📥 Reddit Data Engineering Pipeline

  • Reddit API → Spark → Analytics
  • Schema validation & deduplication
  • Production-ready ETL design

🔗 https://github.com/Chaitanya-0310/RedditDataEngineerProject


💼 Professional Experience

Data Engineer – Independent Consultant (Mar 2025 – Dec 2025)

  • Built metadata-driven ingestion framework on Azure Data Factory
  • Reduced onboarding time by 70%
  • Optimized PySpark transformations using Z-Order & Liquid Clustering
  • Reduced incremental load time by 35% and lowered cloud compute cost
  • Built dbt star-schema models for trusted analytics

Junior Data Engineer – Kraftbase (Jan 2023 – Nov 2023)

  • Modernized 12+ SSIS jobs → Airflow DAGs
  • Built modular Python ingestion framework
  • Improved query performance by 40%
  • Implemented SCD2 pipelines
  • Resolved 25+ data quality issues

Data Analyst Intern – Kintu Designs (Jul 2022 – Dec 2022)

  • Automated feature extraction pipelines
  • Built PowerBI & Streamlit dashboards
  • Reduced ad-hoc reporting by 30%

📜 Certifications

  • Databricks Lakehouse Fundamentals
  • Databricks Generative AI
  • Snowflake Hands-On Essentials
  • MongoDB AI-Powered Search & RAG
  • Apache Airflow Fundamentals

📊 GitHub Stats

GitHub Stats

Top Languages


🎯 Currently Seeking

🚀 AI Engineer / Data Engineer roles (2025–2026)


⭐ If you find my work useful, consider starring my repositories!

Pinned

  1. Multi_Model_AI_Agent Public

    Python 1

  2. StockRAG-AI-Assistant Public

    TypeScript 1

  3. Build-RAG-for-Generating-Response-on-PDF Public

    Jupyter Notebook 1

  4. YellowTaxi_BigDataProject Public

    Built a big-data quality-checking project using HDFS, Spark, and Hive queries on the Cloudera platform. Used the Yellow Taxi dataset in Parquet format for the quality checks. Loaded the data into HDF…

    RobotFramework 1

  5. RedditDataEngineerProject Public

    Python