Data engineer · 5 yrs on AWS, Spark, Kafka. Open to senior data engineering roles, remote or hybrid. Portfolio in pinned repo ↓
- Frisco, TX
- in/sathvik-0d138
-
Joined
May 10, 2026
Popular repositories Loading
-
-
fraud-streaming
fraud-streaming PublicSub-250ms card-auth fraud scoring on a synthetic 5K eps stream. Kafka → Spark Streaming → Redis features → XGBoost (ONNX) → Iceberg + FastAPI.
Python
-
claims-lakehouse
claims-lakehouse PublicSynthea + EDI 837 → Kafka → PySpark Iceberg → dbt-DuckDB → Great Expectations → Airflow. HIPAA-flavored claims lakehouse.
Python
-
retail-cdc
retail-cdc PublicPostgres → Debezium → Kafka → Spark/Iceberg → dbt-DuckDB → reverse-ETL (custom Hightouch-style worker). End-to-end CDC with exactly-once via outbox + content-hash.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

