Building intelligent data systems that scale, from raw bytes to actionable insight.
I'm a Data Engineer & AI Automation Specialist with an MS in Data Science from the University at Buffalo. I specialize in:
- End-to-end ETL/ELT pipelines: from ingestion through transformation to serving
- Scalable data architectures: Lakehouse patterns with Delta Lake & Apache Iceberg
- LLM-powered applications: multi-agent systems with LangChain, OpenAI, and FastAPI
- Cloud-native engineering: AWS, GCP, Snowflake, and BigQuery at production scale
2024-2025 | MS in Data Science, University at Buffalo
          | Focus: ML, Big Data Systems, NLP, Cloud Architecture
          |
2023-2024 | Data Engineering & AI Automation Projects
          | ETL pipelines · Lakehouse architectures · LLM apps
          |
2022-2023 | Data Analytics & Pipeline Development
          | Spark · Airflow · Kafka · dbt · AWS
          |
2020-2022 | Software & Data Engineering Foundations
          | Python · SQL · Cloud fundamentals · ML basics
Open to full-time Data Engineering / ML Engineering roles; available immediately.
harshal = {
    "pronouns": "he/him",
    "currently": "Building LLM-powered data pipelines & multi-agent systems",
    "learning": ["Apache Iceberg", "LLM fine-tuning", "Rust for data tools"],
    "hobbies": [
        "Exploring new data tools",
        "Technical blogging on Medium",
        "Coffee-fuelled late-night debugging",
        "F1",
    ],
    "fun_fact": "I automate tasks so I have more time to automate more tasks",
    "reach_me_at": "harshal.sanjivpatil2000@gmail.com",
}

