Skip to the content.

Home | Portfolio | GitHub | LinkedIn | Medium | Stack Overflow | Terms | E-mail

Data and Market Research Specialist

profile

“I always strive to be better at market research than any statistican and better at statistics than any market researcher.”

My work includes:

Samples

Analysing Sales Data: SQL and Data Visualization

boxplot-sales-3

The majority of companies keep their important data stored in a database. That said, how often do these companies take the time to analyse such data for meaningful insights? When used efficiently, interrogating a database with SQL and generating data visualizations from those insights can greatly enhance understanding of the data.

Forecasting Hotel Revenue: Predicting ADR Fluctuations with ARIMA

hotel-adr-7

Average daily rates represent the average rate per day paid by a staying customer at a hotel. This is an important metric for a hotel, as it represents the overall profitability of each customer. In this example, let us see how average daily rates can be forecasted using an ARIMA model.

Handling Imbalanced Classification Data: Predicting Hotel Cancellations

svm-2

When attempting to build a classification algorithm, one must often contend with the issue of an unbalanced dataset. An unbalanced dataset is one where there is an unequal sample size between classes, which induces significant bias into the predictions of the classifier in question. Let us see how a Support Vector machine can be used to classify hotel booking customers in terms of cancellation risk.

Hotel Analytics: Calculating Revenue, RevPAR and GOPPAR Using SQL

revpar_by_quarter_2

The hotel industry relies on several unique KPIs to gauge performance. One of the most important of these is RevPAR - which stands for revenue per available room.

In addition, hoteliers can also use GOPPAR (or Gross Operating Profit per Available Room) to calculate room profitability. Often, hotel businesses can find it challenging to keep up with these metrics and identify key revenue and profitability trends over time. However, using SQL and data visualization together can be quite an effective way of analysing these metrics.

heatmap-ltv

When a business sells to customers, the reality is that a certain percentage of customers will cease buying from that business over a given period. This is what is known as churn.

The total revenue that a business can expect from a customer before they churn is known as customer lifetime value. Revenue remaining constant, the longer a customer keeps buying from a company - the higher their customer lifetime revenue (or LTV) will be.

Technical Skills

Cloud: AWS, Azure

Languages: Python, R, SQL

Machine learning libraries: InterpretML, PyMC3, scikit-learn, statsmodels, TensorFlow

Platforms and relevant tools: PyCharm, Jupyter Notebook, pgAdmin4, RStudio, Git, Docker, Linux

Visualization libraries: ggplot2, matplotlib, seaborn

Training Courses

Time Series Forecasting with Bayesian Modeling. LiveProject series produced for Manning Publications.

TensorFlow 2.0 Essentials: What’s New. Video seminar produced for O’Reilly Media.

Business Analytics with R — Statistics and Machine Learning. Video series produced for O’Reilly Media.

Talks