Work History
9+ years of engineering impact across media, e-commerce, pharma, and technology — from Bangalore to Berlin.
First Senior Data Engineer in the German division of Bauer Xcel Media (part of Bauer Media Group), powering 12 popular German digital publishing brands with 2B+ yearly pageviews.
- Transformed a fragmented data landscape into a scalable GCP-based platform, reducing complexity, cost and data volume by 60%+.
- Built the Marketing Scorecard Project — automating reporting across all brands, optimising €12M+ marketing spend and managing 32 Billion paid impressions with 800M+ paid pageviews over 3+ years.
- Built unified, scalable data models integrating GA4, GSC, JW Player, CMS, and affiliate data across 12 German brands enabling insights for marketing, SEO, content, traffic, and revenue.
- Delivered real-time GA4 reporting (DE & UK) and a global revenue targets framework for standardised performance tracking.
- Engineered a 700k+ DE & UK Content AI assets dataset by unifying, embedding and vectorising content across 12 brands, 3 CMS systems, 2 countries and multiple content asset types.
- Developed a multilingual (50+ languages) RAG-based AI Tool — Multilingual Semantic Vector Search + OpenAI + Hugging Face Embedding Models — supporting 9+ sophisticated content AI business use cases, hosted on Kubernetes.
- Pioneered modern workflows using GCP, GitHub, Terraform, Airflow DAGs (Astronomer + Cloud Composer), Pub/Sub, Cloud Functions, Cloud Run.
- Delivered projects for Ströer advertising & Nielsen AGF Traffic data.
- Building multiple AI prototypes: OCR Trend recommendation, Web Parsing, Brand-based Recommendations, Content Topic Ideas using AI Agents, OpenAI and vectors, Traffic Trend Wordcloud.
Sole Data Engineer — owned the full Data Engineering technology stack for a CPC/CPO-based e-commerce and retail media business, working directly with C-levels and cross-functional teams.
- Owned the complete Data Engineering stack: GCP, Python, BigQuery, Cloud Storage, Cloud Functions, Cloud Scheduler, IAM, Kubernetes, Gitlab, DBT Cloud, CSCart, PostgreSQL, Supermetrics.
- Worked closely with C-levels, Marketing Performance Managers, BI Analysts, Backend, Product, Sales, and Key Account teams to build high-quality data pipelines into the DWH.
- Deep exposure to clicks and CPC/CPO-based e-commerce and retail media models, Google Merchant Center, Channel Pilot.
- Automated and adjusted Google Ads campaigns and Product Groups through Python-generated product outbound feed files.
- Worked extensively with Google Analytics and Ingenious tracking data.
- Joined as BI Engineer and owned the full Data Engineering stack including PostgreSQL DWH hosted in Azure, building data interfaces for business stakeholders, Data Analysts, Data Scientists & Marketing Analysts.
- Managed all data integration activities using Fivetran as ELT tool — Google Analytics, Shopify, Amazon, Klaviyo, CSV/TXT, API, JSON/XML.
- Used GCP (Cloud Functions, Cloud Repositories, BigQuery), Python, and SQL to collect raw data from non-standard sources and transform via DBT into consumable BI reporting layers.
- Applied Statistical data analysis, data preprocessing and Predictive Modelling using Machine Learning algorithms to identify factors affecting Cocoa prices.
- Developed Statistical, Machine Learning & Deep Learning Forecasting models.
- Worked in a Team Lead role supporting Cisco's Tidal automation product — used by companies for business process automation.
- Resolved Cisco Tidal-related customer tickets and incidents; raised software bugs and collaborated with the Software Development team for resolution.
Developer in the Sales & Marketing Business Intelligence (BI) and Data Integration team — delivered end-to-end BI systems, DWH and data models/pipelines for a Global CRM rollout.
- Coded and implemented end-to-end BI systems, Data Warehouse (DWH), data models and pipelines for Global CRM rollout across multiple pharmaceutical clients.
- Identified and resolved a critical bug causing Sales Orders to be lost during daylight saving time transitions.
- Parallelised a critical data pipeline generating 96 reports for 6 countries — reduced pipeline runtime by 60–70%.
- Served as critical developer resource for Support team on Level 1/2 high-priority incidents.
- Built data interfaces: users, customer master data, products, metrics, alignment data, multichannel cycle plans, delivery automation via REST API & SOAP Web Services (XML/JSON), Shell scripting and Job Schedulers.
- Awarded Accenture Delivery Excellence Award.