Hi,
I'm Rachana Mahapatra

I'm an MS BA & AI grad student at The University of Texas at Dallas | Expertise in Business Analytics, Data Engineering, Data Science, and Machine Learning.


About Me

Hey there! I'm Rachana, a Data Engineer and a Master's student in Business Analytics & Artificial Intelligence at The University of Texas at Dallas. With over 5 years of experience as a Business Analyst and Data Engineer across industries including travel, healthcare, and finance, I specialize in building robust data pipelines, optimizing ETL processes, and applying Machine Learning and AI to turn data into actionable insights. In my current role as a Data Engineer, I thrive on solving complex data challenges and building solutions that drive meaningful results. My work is driven by a passion for innovation and a deep curiosity about how data can improve decision-making.

Beyond my professional life, I’m an ambivert who finds joy in both deep conversations and creative pursuits. I love painting, writing, and seeking out new experiences. My adventurous spirit has led me to skydiving, parasailing, deep water walking, and rafting, always pushing me to explore beyond the ordinary. If you’re intrigued by data, creativity, or a good adventure story, let’s connect and see where our paths might cross!

Experience

May 2024 - Present

Data Engineer, Aesthetic Record LLC

Dallas, Texas

  • Engineered end-to-end ETL pipelines using AWS Lambda (Python), Glue (Spark), and Airflow to automate EMR migrations, reducing onboarding time by 25% and accelerating go-live for 120+ medical practices monthly.
  • Architected a centralized data lake (S3) and Redshift-based columnar data warehouse to manage 10TB+ of healthcare data; standardized KPIs across 40+ reports using Athena, implemented RBAC, and integrated real-time, self-service dashboards via PHP/Next.js using Athena APIs and Git workflows.
  • Built an event-driven, ML-ready data layer using Kafka, RDS triggers, and DynamoDB streams to capture appointment activity, session events, and auth logs, powering inference pipelines and LLM-driven IVR agents with real-time, context-aware data.
  • Built a behavioral modeling pipeline to predict churn across web and tablet EMR apps; analyzed feature usage and session data via AWS RDS and Athena, identifying at-risk accounts with 85% precision and reducing manual retention efforts by 40%.

April 2022 - June 2023

Business Intelligence Engineer, Aviation World

Hyderabad, India

  • Exposed inefficiencies in static pricing models and built a dynamic pricing platform; ingested IATA, GDS, and Travelport API data into dimensional BigQuery schemas and unified views for investigative analysis.
  • Architected ML-ready feature pipelines using Cloud Dataflow (Beam) and Python, orchestrated via Cloud Composer, enriching 30M+ bookings; validated with PyTest and used for predictive pricing and offer-response modeling.
  • Developed real-time Tableau dashboards on BigQuery datasets to monitor A/B tests and surface pricing insights; improved targeting strategies and contributed to 15% revenue lift and 17% CSAT improvement.

January 2021 - April 2022

Data Analytics Engineer, Tata Consultancy Services

Bhubaneswar, India

  • Built and managed ETL pipelines using Informatica PowerCenter and SSIS to integrate async data feeds from four source systems into 40+ fact/dimension tables, supporting regulatory and risk analytics at scale (200M+ records/month).
  • Led orchestration migration from legacy Autosys to Azure Data Factory, improving observability, chaining logic, and fault recovery across multi-stage data workflows.
  • Enforced PII masking and secure access control within Informatica and SQL workflows, supporting compliance and enabling secure model training on masked SSNs and sensitive attributes.

January 2019 - January 2021

Data Analytics Engineer, Titan Aviation

Hyderabad, India

  • Performed exploratory analysis on high-frequency engine telemetry and sensor streams using SQL and Python to detect degradation patterns and inform diagnostics.
  • Partnered with R&D to model raw test-bench data into analytical structures, accelerating validation cycles and enabling faster iterations on turbine upgrades.
  • Benchmarked engine performance across aircraft classes using custom Python scripts, supporting competitive positioning for commercial and defense bids with data-driven insights.

Education

August 2023 - May 2025

Master of Science in Business Analytics & Artificial Intelligence - The University of Texas at Dallas

Relevant Coursework: Big Data Engineering, Applied Machine Learning, Deep Learning

July 2016 - September 2020

Bachelor of Technology in Aeronautical Engineering - Jawaharlal Nehru Technological University, India

Relevant Coursework: Statistics, Quantum Mechanics, C++, Data Structures, Computational Fluid Dynamics, Numerical Methods

Certifications

AWS Certified Data Engineer – Associate

  • Amazon Web Services (AWS)
  • July 2024

AWS Certified Solutions Architect – Associate (SAA-C03)

  • Amazon Web Services (AWS)
  • November 2023

Azure Fundamentals (AZ-900)

  • Microsoft
  • June 2023

Applied Machine Learning

  • Naveen Jindal School of Management, UTD
  • August 2024

Tech Stack

Programming Languages

  • Python
  • R
  • SQL
  • C++
  • PySpark
  • Shell Scripting
  • JavaScript

Data Engineering

  • Apache Spark
  • Apache Kafka
  • Apache Airflow
  • Hadoop
  • Informatica
  • Automation
  • PySpark
  • Docker
  • Kubernetes
  • Git
  • Jenkins (CI/CD)

Databases & Data Warehousing

  • Snowflake
  • Teradata
  • MySQL
  • SQL Server
  • Oracle
  • MongoDB
  • DynamoDB
  • Redshift
  • RDS
  • PostgreSQL
  • Elasticsearch

Cloud Platforms

  • AWS (S3, Lambda, DynamoDB, SageMaker, Glue, Athena, Redshift)
  • Azure (Data Factory, Synapse Analytics)
  • GCP (BigQuery, Cloud Functions, Cloud Pub/Sub)

Data Science & Machine Learning

  • TensorFlow
  • PyTorch
  • Scikit-learn
  • ARIMA
  • Prophet
  • AutoML
  • MLlib (Spark)

Data Visualization

  • Power BI
  • Tableau
  • QuickSight
  • Matplotlib
  • Seaborn
  • Gephi

Tools & Utilities

  • Git
  • Jenkins
  • Docker
  • Kubernetes
  • Jira
  • PowerShell

Projects

Diabetes Risk Prediction With PCOS

HeatEx FanGuard: Predictive Maintenance System

E-commerce Data Streamliner

RetailMind-Data-Framework

DatawHiz Hackathon - Safety Excellence Group

Conagra Hackathon

Publications

Numerical Simulation and Analysis of Open Cavity Flow
