✅ Available for Work · UK Based

Data Analyst. SQL · Python · Power BI.
£97K recovered. 89% reporting time cut.

I work across e-commerce, healthcare and operations analytics, connecting SQL, Python and Power BI to commercial and operational outcomes that stakeholders can act on. Based in Ipswich, open to roles across the UK.

View Projects Download CV
£97K
Revenue Recovered
89%
Reporting Time Saved
17%
CLV Increase
2+
Years Experience

About

Data Analyst with 2+ years across e-commerce, healthcare and customer operations.

Profile

I work across the full analytics pipeline — cleaning and shaping data, building models and dashboards, and translating findings into decisions that non-technical stakeholders can act on. My experience spans e-commerce customer analytics at Nautica Solutions, operations and patient safety analysis at Axtria, and performance analytics at Teleperformance.

Since completing my MSc in 2025, I have built a project portfolio covering e-commerce, healthcare and geospatial analytics.

Ipswich, UK Remote Friendly Two-week notice

Education

MSc Data Science
University of Essex (2023–2025)
Ipswich, United Kingdom
BSc Statistics
University of Mumbai (2018–2021)
Mumbai, India

Where I Work Best

Open to Data Analyst and BI Analyst roles across the UK, including remote-friendly and hybrid teams. I do my best work in roles where the analysis is expected to connect to a business decision — not just land in a report.

  • E-commerce, retail and customer analytics
  • Healthcare and operations analytics
  • BI and reporting automation roles
  • Teams using SQL, Python and Power BI

Right to work in the UK · Two-week notice period

Skills

Tools, languages and techniques I use in practice.

Data & Programming

SQL (CTEs, window functions, joins) · Python (pandas, NumPy, scikit-learn) · Excel (pivot tables, VLOOKUP) · Google Analytics · Power Query · Data cleaning and joining across sources

CTEsWindow FunctionsJoinsAggregations

Business Intelligence & Visualisation

Power BI (DAX, Power Query, data modelling) · Interactive KPI dashboards for marketing and operations · Drill-through and summary views for leadership · Data storytelling for non-technical stakeholders

DAXDashboardsReportsDrill-throughs

Analytics

Customer lifetime value (CLV) · Churn prediction · Behavioural cohort analysis · Decile-based revenue concentration · Regression and correlation analysis · Hypothesis testing · KPI design and monitoring · Stakeholder communication

RFMSegmentationPredictiveInferential

Featured Projects

DA/BI-relevant work with a clear business question, method and outcome.

E-commerce Customer Value Segmentation (RFM & Deciles)

Top 10% of customers generate 61.5% of revenue — invisible without a structured value model.

An online retailer had marketing spend spread too widely through blanket discounts with no view of which customers drove revenue. Built RFM segments (Champions, Loyal, At Risk, Dormant) and ranked all customers into 10 monetary deciles from raw invoice data. Applied business-safe data cleaning with full audit trail. Produced decision-ready CSV outputs for CRM and campaign use.

PythonPandasRFMDecilesCustomer value

Hospital Operations Analytics: Bed Occupancy, Responsiveness & Patient Safety

Staff responsiveness (r = −0.791) was the strongest driver of falls — but only 35% of months hit the target.

Hospital leaders needed to understand how occupancy, staffing and patient safety outcomes connected. Analysed 60 months of KPI data in Python using correlation analysis and linear regression. Modelled staffing scenarios and built a 12-slide stakeholder presentation with specific recommendations, with modelled scenarios suggesting potential fall rate reduction.

PythonHealthcare analyticsPower BIRegression

Colchester Crime Analysis 2023: Hotspot-Driven Resource Planning

6,800+ incidents broken into two distinct hotspots, each requiring a different policing strategy.

Applied DBSCAN geospatial clustering to 6,800+ crime incidents, converting lat/long to projected CRS for accurate distance-based analysis. Compared crime composition and seasonality across hotspots. Nightlife core (3,535 incidents): violent offences. Retail zone (947 incidents): shoplifting. Built an interactive R Shiny dashboard and static HTML report for non-technical stakeholders.

RDBSCANGeospatialShiny dashboard

Work Experience

2+ years across e-commerce, healthcare and customer operations analytics.

Junior Data Analyst

Nautica Solutions Ltd – Brentwood, UK
Jul 2024 – Feb 2025

  • Recovered £97K lapsed revenue and reduced churn 5% by segmenting 50K+ transactions with Python/SQL RFM scoring, enabling the marketing team to target at-risk VIPs for win-back campaigns.
  • Increased 12-month CLV 17% by boosting second-purchase rate from 22% to 29% through behavioural cohort analysis and targeted retention offers.
  • Reduced weekly reporting time 89% (11 hours to 70 minutes) by building an automated RFM scoring pipeline in Python, enabling self-serve CRM analytics.
  • Improved e-commerce net profit margin 3% by cutting discount spend 17% without a drop in conversion, using decile-based revenue concentration analysis.

Performance Analyst

Teleperformance – Mumbai, India
Dec 2022 – Aug 2023

  • Increased First Contact Resolution 13% by running root-cause analysis on detractor feedback, stabilising CSAT scores for Airbnb Superhost vertical.
  • Reduced Time to Proficiency 25% by designing a knowledge base and delivering SME-led workflow training for 20+ team members.
  • Achieved 91% positive feedback rating against Airbnb's global satisfaction standards.

Analytics Intern – Data Analyst

Axtria – Noida, India
Jan 2022 – Oct 2022

  • Identified a staffing optimisation opportunity via 36-month hospital operations SQL analysis, linking bed occupancy, staff responsiveness and patient safety outcomes.
  • Quantified operational relationships (occupancy vs responsiveness −0.62; occupancy vs falls +0.58) using correlation and regression, translating findings into specific recommendations.
  • Built Power BI KPI dashboards with custom DAX for occupancy, response time and fall rate monitoring to test staffing scenarios with nursing leadership.

Advanced Analytics & Modelling

Machine learning, statistical modelling and SQL analysis projects demonstrating end-to-end pipeline work.

Heart Disease Prediction using Machine Learning

Random Forest: 96.7% accuracy, 97.9% recall, ROC AUC 0.996 — tuned via GridSearchCV.

Traditional risk scores struggle with non-linear clinical interactions. Cleaned a 968-record medical dataset, encoded and scaled features, then trained and compared five classifiers (Logistic Regression, Random Forest, SVM, Decision Tree, Gaussian Naive Bayes). SVC F1-score improved from 89% to 96% post-tuning. Performance caveats around retained duplicate records are documented in the project notes.

Pythonscikit-learnClassificationGridSearchCVROC AUC

Machine Learning for House Price Prediction: A Comparative Study

XGBoost outperformed all linear baselines — R² = 0.84 on the California housing dataset.

Compared statistical and ML models for predicting house prices. Applied IQR outlier detection, log transformation, one-hot encoding and GridSearchCV tuning across Linear Regression, Ridge, KNN, Random Forest and XGBoost. Evaluated using R², RMSE, residual plots and Q-Q plots.

PythonXGBoostscikit-learnRegressionModel comparison

Covid-19 Patient Survival Analysis (SQL)

Mortality rates varied significantly by age, comorbidity and ICU admission source — surfaced entirely through SQL.

Hospitals needed a structured view of which patient characteristics drove Covid-19 mortality. Wrote advanced SQL queries across a hospital patient dataset to calculate mortality rates stratified by age, gender, ethnicity and comorbid conditions (diabetes, cirrhosis). Investigated ICU admission types, BMI distributions and physiological metrics across survivors and non-survivors. Output provides a foundation for clinical risk stratification.

SQLHealthcareRisk stratificationPostgreSQL

CV & Certifications

Professional credentials and downloadable CV.

Certifications

  • Career Essentials in Data Analysis — Microsoft & LinkedIn (2024)
  • Microsoft Azure AI Fundamentals — Microsoft (2024)
  • Power BI for Business Intelligence — LinkedIn Learning (2024)
  • Career Essentials in Generative AI — Microsoft & LinkedIn (2024)
  • Data Science Job Simulation — British Airways / Forage (2023)

Tools & Platforms

BI & Reporting: Power BI (DAX, Power Query, data modelling), Excel (pivot tables, VLOOKUP)
Programming: Python (pandas, NumPy, scikit-learn), SQL (PostgreSQL/MySQL), R
Version Control & Workflow: GitHub, Jupyter Notebook, VS Code
Other: Google Analytics, CleverTap

Download My CV

One-page CV covering all experience, metrics, tools and education. Formatted for UK recruiters.

Download CV (PDF)

Open to Data Analyst and BI Analyst roles across the UK.

Want someone who connects SQL, Python and Power BI to clear commercial outcomes?

I am open to Data Analyst and BI Analyst roles across the UK. Available now, two-week notice.

Get in Touch

Get in Touch

If you are hiring for Data Analyst or BI Analyst roles in the UK and want someone who can link SQL, Python and BI to clear commercial or operational outcomes, I would be happy to talk.

Contact Details

Email
shahbaz.b.sharif@gmail.com

LinkedIn
linkedin.com/in/shahbaz-sharif

GitHub
github.com/shahbazsharif1

Location
Based in Ipswich. Open to roles across the UK.

Send a Message

The quickest way to reach me is directly by email. Click below and your email client will open with my address pre-filled.

Email Me Directly

shahbaz.b.sharif@gmail.com

What I am Looking For

  • Data Analyst or BI Analyst roles
  • Remote or hybrid positions across the UK
  • Industries: e-commerce, retail, healthcare, SaaS
  • Work that connects analysis to revenue, cost or service outcomes
  • Right to work in the UK · Two-week notice