Aware Corporation (Aug 2025 - Present)
Data Engineer
-
Data Catalog Management
I managed the enterprise Data Catalog, built on DataHub. My responsibilities included automating metadata ingestion from Azure SQL and MSSQL sources, and designing and implementing access control policies that govern user permissions, keeping data secure while making it easy for the team to discover.
- Implemented a centralized Data Catalog using DataHub hosted on Azure Kubernetes Service (AKS) to improve data governance and discoverability.
- Orchestrated an end-to-end CI/CD pipeline by creating custom Dockerfiles, managing container images in Azure Container Registry (ACR), and automating deployments with Jenkins.
- Engineered metadata ingestion pipelines from hybrid sources (Azure SQL and MSSQL) to capture technical schemas, descriptions, ownership, and end-to-end data lineage.
- Integrated DataHub’s GraphQL API with a custom frontend application, giving end users direct access to metadata and governance insights.