Data Platform & Warehousing
- BigQuery
- Dataform
- dbt
- Data Modeling
Hi, I'm
Data Engineer · GCP · Terraform
Data Engineer with 2+ years of experience building scalable GCP data infrastructure: BigQuery data warehousing, streaming pipelines, multi-environment Terraform IaC, and full-stack data tools.
I’m a data engineer based in Dhaka, Bangladesh. I build the pipelines, data models, and cloud infrastructure that quietly keep a business running, and most of what I know I picked up by actually shipping things and watching what broke.
I studied biomedical engineering at BUET (Bangladesh University of Engineering & Technology), working on medical imaging and a thesis on lung nodule detection with deep learning. Somewhere along the way I realized I cared more about the systems that move and shape data than the specific domain it described, and I went looking for work that let me build those systems end to end. I found that at G-Star, where I’ve spent the last two years working on a production GCP data platform: BigQuery models in Dataform, streaming pipelines on Pub/Sub and Dataflow, multi-environment Terraform.
G-Star · Dhaka, Bangladesh
End-to-end event ingestion: Pub/Sub topics → Apache Beam / Dataflow → Hive-partitioned GCS → BigQuery sink. Fully Terraform-provisioned.
Designed and delivered an event-driven ingestion pipeline from scratch as part of a new order-management platform integration.
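As a rough illustration of the Hive-partitioned GCS layout mentioned above (not the production code), the sketch below builds a `key=value` style prefix from an event timestamp. The bucket name, the topic name, and the `dt` / `hour` partition keys are assumptions made for the example.

```python
from datetime import datetime, timezone


def hive_partition_path(bucket: str, topic: str, event_time: datetime) -> str:
    """Build a Hive-style (key=value) GCS prefix for an event.

    The partition keys `dt` and `hour` are illustrative; the real layout may differ.
    """
    # Normalise to UTC so the same event always lands in the same partition.
    ts = event_time.astimezone(timezone.utc)
    return f"gs://{bucket}/{topic}/dt={ts:%Y-%m-%d}/hour={ts:%H}/"


# Hypothetical bucket and topic names.
path = hive_partition_path(
    "example-events",
    "orders",
    datetime(2024, 3, 5, 14, 30, tzinfo=timezone.utc),
)
print(path)
```

Keeping partition values in the object path is what lets an external table or load job treat `dt` and `hour` as columns and skip partitions a query does not need.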
WRITE_APPEND and clustering on high-cardinality keys.
Schema-validated CSV / Excel / JSON uploads via FastAPI, routed to date-partitioned GCS paths backing BigQuery external tables. Deployed behind Cloud IAP with managed SSL inside a custom VPC.
A browser-based upload tool that lets business users land data into the warehouse without pipeline disruption or public exposure.
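The validate-then-route idea can be sketched with the standard library alone. This is a simplified stand-in rather than the FastAPI service itself: the column schema, the dataset name, and the `dt=` path segment are hypothetical.

```python
import csv
import io
from datetime import date

# Hypothetical schema: column name -> type the value must parse as.
REQUIRED_COLUMNS = {"sku": str, "quantity": int, "unit_price": float}


def validate_csv(text: str) -> list[str]:
    """Return a list of validation errors; an empty list means the file is valid."""
    reader = csv.DictReader(io.StringIO(text))
    missing = set(REQUIRED_COLUMNS) - set(reader.fieldnames or [])
    if missing:
        return [f"missing columns: {sorted(missing)}"]
    errors = []
    # Data rows start on line 2, after the header.
    for line_no, row in enumerate(reader, start=2):
        for col, cast in REQUIRED_COLUMNS.items():
            try:
                cast(row[col])
            except (TypeError, ValueError):
                errors.append(f"line {line_no}: {col}={row[col]!r} is not {cast.__name__}")
    return errors


def landing_path(bucket: str, dataset: str, filename: str, day: date) -> str:
    """Date-partitioned GCS path an accepted upload would be written to."""
    return f"gs://{bucket}/{dataset}/dt={day.isoformat()}/{filename}"


good = validate_csv("sku,quantity,unit_price\nA1,3,9.99\n")
bad = validate_csv("sku,quantity,unit_price\nA1,x,9.99\n")
target = landing_path("example-uploads", "sales", "week1.csv", date(2024, 1, 2))
```

Rejecting a file before it reaches the bucket keeps malformed rows out of the external table, so a bad upload never surfaces as a failed query downstream.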
Nightly Python / GraphQL job that extracts Tableau Cloud metadata (dashboards, sheets, full lineage) and lands it in GCS. Surfaces 300+ downstream dashboard dependencies for impact analysis.
Before this pipeline, “what breaks if we drop this table?” was a manual half-day question.
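Once lineage is stored as dashboard-to-object pairs, the impact question becomes a lookup. A minimal sketch, with made-up dashboard and table names and plain tuples standing in for the extracted metadata:

```python
from collections import defaultdict


def dashboards_by_object(pairs):
    """Index (downstream_dashboard, upstream_object) pairs by upstream object."""
    index = defaultdict(set)
    for dashboard, upstream in pairs:
        index[upstream].add(dashboard)
    return index


# Illustrative lineage rows; real rows would come from the nightly extract.
pairs = [
    ("Sales Overview", "analytics.orders"),
    ("Sales Overview", "analytics.customers"),
    ("Returns Monitor", "analytics.orders"),
]

index = dashboards_by_object(pairs)
impacted = sorted(index["analytics.orders"])
print(impacted)
```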
downstream_dashboard ↔ upstream_object pair) alongside a snapshot timestamp for change tracking.
End-to-end medallion lakehouse on a single laptop: Kafka in KRaft mode at 50 RPS, MinIO for object storage, DuckDB for query, Airflow for orchestration, Superset for BI. Zero cloud dependency.
A production-grade, end-to-end streaming data platform on self-hosted open-source infrastructure, processing synthetic e-commerce events through a Medallion architecture with zero cloud dependency.
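For readers unfamiliar with the medallion pattern, here is a toy sketch of the bronze → silver → gold flow in plain Python. It is only meant to show the layering; the event fields (`event_id`, `category`, `amount`) are assumptions, and in-memory lists stand in for the project's Kafka, MinIO, and DuckDB layers.

```python
import json
from collections import defaultdict


def to_silver(bronze_lines):
    """Bronze -> silver: parse raw JSON, drop malformed rows, deduplicate on event_id."""
    seen, silver = set(), []
    for line in bronze_lines:
        try:
            event = json.loads(line)
            amount = float(event["amount"])
            event_id = event["event_id"]
        except (json.JSONDecodeError, KeyError, TypeError, ValueError):
            continue  # malformed input stays in bronze only
        if event_id in seen:
            continue  # duplicate delivery
        seen.add(event_id)
        silver.append({
            "event_id": event_id,
            "category": event.get("category", "unknown"),
            "amount": amount,
        })
    return silver


def to_gold(silver):
    """Silver -> gold: aggregate revenue per category for reporting."""
    totals = defaultdict(float)
    for event in silver:
        totals[event["category"]] += event["amount"]
    return dict(totals)


bronze = [
    '{"event_id": 1, "category": "books", "amount": 10}',
    '{"event_id": 1, "category": "books", "amount": 10}',
    '{"event_id": 2, "category": "books", "amount": 5.5}',
    "not json",
    '{"event_id": 3, "category": "toys", "amount": 2.5}',
]
silver = to_silver(bronze)
gold = to_gold(silver)
```

Bronze keeps everything as received, so the silver and gold layers can be rebuilt from it if the cleaning rules change.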
Bangladesh University of Engineering and Technology (BUET) · Dhaka, Bangladesh
Ananda Mohan College · Mymensingh, Bangladesh
Mymensingh Zilla School · Mymensingh, Bangladesh
mHealth Lab, Department of Biomedical Engineering, BUET