July 29, 2025

Methods, R Programming, Statistics & Probability

Ratio of risk ratios in R

I ran into a problem where I had two risk ratios and wanted to test whether they were statistically different from each other. I couldn’t find an R package for this, but I found a paper by Altman and Bland that goes over the step-by-step process. I wrote a tutorial on how to perform this method using R, which is available on my RPubs page (link).

Reference:

Altman DG, Bland JM. Interaction revisited: the difference between two estimates. BMJ. 2003 Jan 25;326(7382):219. doi: 10.1136/bmj.326.7382.219. PMID: 12543843; PMCID: PMC1125071.
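
As a rough sketch of the Altman and Bland approach in base R: work on the log scale, recover each standard error from its confidence interval, and test the difference of the two log risk ratios. The function name `ratio_of_rr()` and all input numbers are hypothetical and only illustrate the arithmetic. The calculation also assumes the two estimates come from independent groups.

```r
# Compare two risk ratios on the log scale (after Altman & Bland, 2003).
# All input numbers below are hypothetical.
ratio_of_rr <- function(rr1, lcl1, ucl1, rr2, lcl2, ucl2, level = 0.95) {
  z_crit <- qnorm(1 - (1 - level) / 2)

  # Recover the standard error of each log(RR) from its confidence interval
  se1 <- (log(ucl1) - log(lcl1)) / (2 * z_crit)
  se2 <- (log(ucl2) - log(lcl2)) / (2 * z_crit)

  # Difference of the log estimates and its standard error
  d    <- log(rr1) - log(rr2)
  se_d <- sqrt(se1^2 + se2^2)

  # z test and confidence interval, back-transformed to the ratio scale
  z <- d / se_d
  p <- 2 * pnorm(-abs(z))
  c(rrr     = exp(d),
    lower   = exp(d - z_crit * se_d),
    upper   = exp(d + z_crit * se_d),
    z       = z,
    p_value = p)
}

res <- ratio_of_rr(rr1 = 0.60, lcl1 = 0.40, ucl1 = 0.90,
                   rr2 = 0.90, lcl2 = 0.70, ucl2 = 1.16)
round(res, 3)
```

The `rrr` element is the ratio of the two risk ratios; a confidence interval that excludes 1 suggests the two estimates differ.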

Mark Bounthavong

May 26, 2025

R Programming, Statistics & Probability

Transform data from wide to long format using R

Often, when we input data into a spreadsheet, we use the wide format, where each repeated measurement of a variable gets its own column. But when we perform longitudinal analyses, we need to transform the data to the long format, where each row holds one measurement for one subject.

Sometimes I forget how to do this in R, so I decided to write a tutorial to remind myself.

The tutorial uses the pivot_longer() function to transform data from the wide to long format in preparation for longitudinal data analysis. It is located on my RPubs page.
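
Here is a minimal sketch of the idea using `tidyr::pivot_longer()`. The data frame and its column names (`id`, `sbp_v1` to `sbp_v3`) are made up for illustration and are not taken from the tutorial.

```r
library(tidyr)

# Hypothetical wide data: one row per subject, one column per visit
wide <- data.frame(
  id     = 1:3,
  sbp_v1 = c(120, 135, 128),
  sbp_v2 = c(118, 130, 126),
  sbp_v3 = c(115, 129, 124)
)

# Stack the visit columns so that each row is one subject-visit
long <- pivot_longer(
  wide,
  cols         = starts_with("sbp_v"),
  names_to     = "visit",
  names_prefix = "sbp_v",   # drop the prefix so only the visit number remains
  values_to    = "sbp"
)

# The visit number is read in as text; convert it for modeling
long$visit <- as.integer(long$visit)
long
```

Three subjects with three visits each become nine rows, which is the layout most longitudinal modeling functions expect.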

Mark Bounthavong

February 26, 2025

Econometrics, Epidemiology, MEPS, Methods, R Programming, Statistics & Probability

Propensity score matching in R

I wrote an introductory tutorial on how to perform propensity score matching using R, which has been posted on my RPubs site (link).

Propensity score matching is a statistical approach to balancing the observed covariates between groups. In observational studies, this method can help mitigate confounding and allow us to make causal interpretations. However, there are a lot of approaches and nuances. This introductory tutorial presents the basics of propensity score methods and how we can use these in our conventional analyses.
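
For a feel of the workflow, here is a minimal sketch using the MatchIt package and the `lalonde` example data that ships with it. The choice of package, data, and covariates is an assumption made for illustration and may differ from what the tutorial uses.

```r
library(MatchIt)

# Example data shipped with MatchIt (an assumption, not the tutorial's data)
data("lalonde", package = "MatchIt")

# 1:1 nearest-neighbor matching on a propensity score
# estimated with logistic regression
m_out <- matchit(treat ~ age + educ + married + nodegree + re74 + re75,
                 data = lalonde, method = "nearest", distance = "glm")

# Check covariate balance before and after matching
summary(m_out)

# Extract the matched sample for the outcome analysis
matched <- match.data(m_out)
table(matched$treat)
```

The balance summary is the key output: matching is only useful if the standardized mean differences shrink after matching.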

Mark Bounthavong

December 25, 2024

Econometrics, Epidemiology, Methods, R Programming, Statistics & Probability

Prepost analysis with continuous data using R - Part 1

I wrote a tutorial on how to perform a simple prepost analysis using R, which is available on my RPubs page. It covers how to compare two differences (change in value before and after an intervention) using independent t-test and linear regression approaches. However, it doesn’t cover how to address correlation between two dependent values. Part 2 of prepost analysis will cover those issues.
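
The two approaches can be sketched in base R on simulated data. Everything below (sample size, effect sizes, variable names) is hypothetical. With a single binary group variable, the equal-variance t-test and the regression on the change score give the same p-value.

```r
set.seed(123)

# Simulated (hypothetical) data: 50 subjects per group, measured pre and post
n     <- 50
group <- rep(c(0, 1), each = n)   # 0 = control, 1 = intervention
pre   <- rnorm(2 * n, mean = 100, sd = 10)
post  <- pre - 2 - 5 * group + rnorm(2 * n, sd = 5)
dat   <- data.frame(group, pre, post, change = post - pre)

# Approach 1: independent t-test on the change scores (equal variances)
tt <- t.test(change ~ group, data = dat, var.equal = TRUE)

# Approach 2: linear regression of the change score on group
fit <- lm(change ~ group, data = dat)

tt$p.value
summary(fit)$coefficients["group", ]
```

The regression coefficient for `group` is the difference in mean change between the two groups, and the regression form makes it easy to add covariates later.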

Mark Bounthavong

September 28, 2024

R Programming

Tips and Tricks (Guide) with R and RStudio

I wrote a collection of tips and tricks (guide) for R and RStudio (link). This is a work in progress, and I plan to update this in the future.

Mark Bounthavong

July 28, 2024

Econometrics, R Programming, Statistics & Probability

Staggered difference-in-differences using R

I was interested in learning how to apply the Callaway & Sant'Anna staggered difference-in-differences framework to my work. After reading several papers and watching the video by Sant'Anna, I wrote a short tutorial on how to apply this framework to simulated data. The tutorial is located on my RPubs site.

This method uses the R “did” package, which is based on the paper by Callaway & Sant’Anna.
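
A minimal sketch of the `did` workflow is below. It uses the `mpdta` example data that ships with the package rather than simulated data, so it is a stand-in for the tutorial and not a copy of it.

```r
library(did)

# County-level teen employment data shipped with the did package
data(mpdta)

# Group-time average treatment effects, ATT(g, t)
gt <- att_gt(yname  = "lemp",         # outcome: log teen employment
             tname  = "year",         # time period
             idname = "countyreal",   # unit identifier
             gname  = "first.treat",  # first treated period (0 = never treated)
             data   = mpdta)

# Aggregate the group-time effects into an event-study (dynamic) summary
es <- aggte(gt, type = "dynamic")
summary(es)
```

Each ATT(g, t) compares a cohort first treated in period g with units not treated, and `aggte()` then averages these by time since treatment.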

Mark Bounthavong

June 23, 2024

Epidemiology, Econometrics, Methods, R Programming, Statistics & Probability

Mediation analysis using R

It’s not uncommon to see covariates in a regression model that should not be there. For example, measurements that occur after the treatment assignment are sometimes included in a regression model as baseline covariates. Instead, one should consider a mediation analysis.

I wrote a tutorial on how to perform mediation analysis using R on my RPubs site (link).

I know that I make this mistake at times. This tutorial helped me to carefully consider which covariates to include in a regression model and which ones to consider for mediation analysis.
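
To make the idea concrete, here is a sketch of the classic product-of-coefficients decomposition with linear models in base R. The data are simulated and the variable names are hypothetical. This is one simple way to frame mediation and is not necessarily the method used in the tutorial. It also gives no standard error for the indirect effect, which usually calls for bootstrapping.

```r
set.seed(42)

# Simulated (hypothetical) data: treatment -> mediator -> outcome
n     <- 500
treat <- rbinom(n, 1, 0.5)
med   <- 0.5 * treat + rnorm(n)              # measured after treatment
y     <- 0.3 * treat + 0.7 * med + rnorm(n)

total_fit    <- lm(y ~ treat)         # total effect of treatment
mediator_fit <- lm(med ~ treat)       # path a: treatment -> mediator
outcome_fit  <- lm(y ~ treat + med)   # direct effect and path b

a        <- coef(mediator_fit)[["treat"]]
b        <- coef(outcome_fit)[["med"]]
direct   <- coef(outcome_fit)[["treat"]]
indirect <- a * b
total    <- coef(total_fit)[["treat"]]

c(total = total, direct = direct, indirect = indirect)
```

Adjusting for `med` as if it were a baseline covariate would report only the direct effect and hide the part of the treatment effect that runs through the mediator.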

Mark Bounthavong

November 24, 2023

Econometrics, MEPS, Methods, R Programming, Statistics & Probability

MEPS tutorial on interrupted time series analysis in R

I wrote a short tutorial on how to perform an interrupted time series analysis (ITSA) in R. I had a challenging time working on this because I wasn’t familiar with all the nuances of the ITSA. More importantly, I wasn’t able to leverage my Stata skills to do this in R. I’m used to the Stata margins command, which is great for creating contrasts. R has its own version of the margins command, but it lacks some of Stata’s features such as pwcompare, which I use a lot in Stata. However, I found a workaround with linear splines, and I have uploaded this to my RPubs site (link). I hope you find this useful. I also saved my R Markdown code on my GitHub site (link).
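
One common way to set up a linear spline for an ITSA is a segmented regression with a "time since interruption" term. The sketch below uses a simulated monthly series with hypothetical names and values, ignores autocorrelation for simplicity, and may not match the exact workaround in the tutorial.

```r
set.seed(2023)

# Simulated (hypothetical) monthly series: 24 pre and 24 post periods
time  <- 1:48
post  <- as.integer(time > 24)   # 1 after the interruption
since <- pmax(0, time - 24)      # linear spline term: time since interruption
y     <- 10 + 0.2 * time + 5 * post + 0.3 * since + rnorm(48, sd = 0.5)

# Segmented regression: baseline trend, level change, and slope change
fit <- lm(y ~ time + post + since)
coef(fit)

# Post-interruption slope = baseline slope + slope change
post_slope <- coef(fit)[["time"]] + coef(fit)[["since"]]
post_slope
```

Here `post` captures the immediate level change and `since` captures the change in slope, so the contrasts of interest come straight from the coefficients without a pairwise comparison command.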

Mark Bounthavong

November 19, 2023

Econometrics, MEPS, R Programming, Statistics & Probability, Epidemiology

MEPS tutorials on linkage files and trend analysis

I created two MEPS tutorials recently. One is on the use of condition-event linkage files to capture disease-specific costs. I used migraine as a motivating example. In this tutorial, I go through the steps to identify migraine-related costs associated with office-based visits and inpatient night stays. In the second tutorial, I review how to perform simple trend analysis with linear regression models. I pooled MEPS data from 2016 to 2021 and applied the appropriate primary sampling units and strata from the pooled file.

The first tutorial is located on my RPubs page (MEPS Tutorial 4 - Using condition-event link (CLNK) file: A case study with migraine). The R Markdown code to create the tutorial is located in my GitHub repository (link).

The second tutorial is also located on my RPubs page (MEPS Tutorial 5 - Simple Trend Analysis with Linear Models). The R Markdown code to create the tutorial is located in my GitHub repository (link).
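
The trend analysis can be sketched with the `survey` package on a toy data set. The data below are simulated, and the column names (`psu`, `stratum`, `weight`, `cost`) are hypothetical placeholders rather than actual MEPS variable names. A real analysis would use the PSU, strata, and weight variables from the pooled MEPS files.

```r
library(survey)

set.seed(1)

# Toy stand-in for a pooled MEPS file: 10 strata, 2 PSUs each, 20 people per PSU
toy <- expand.grid(person = 1:20, psu = 1:2, stratum = 1:10)
toy$year   <- sample(2016:2021, nrow(toy), replace = TRUE)
toy$weight <- runif(nrow(toy), 500, 5000)
toy$cost   <- 1000 + 50 * (toy$year - 2016) + rnorm(nrow(toy), sd = 200)

# Declare the complex design: PSUs nested within strata, with person weights
des <- svydesign(ids = ~psu, strata = ~stratum, weights = ~weight,
                 data = toy, nest = TRUE)

# Survey-weighted mean cost by year, then a linear trend across years
svyby(~cost, ~year, des, svymean)
trend <- svyglm(cost ~ year, design = des)
summary(trend)
```

The coefficient on `year` estimates the average annual change in cost, with standard errors that account for the survey design.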

Mark Bounthavong

September 30, 2023

R Programming, Statistics & Probability

Exact matching using R - MatchIt package

Recently, I was asked to help create a matching algorithm for a retrospective cohort study. The request was to perform an exact match on a single variable using a 2 to 1 ratio (unexposed to exposed). Normally, I would use a propensity score matching (PSM) approach, but the data did not have enough variables for each unique subject. With PSM, I tend to build a logit (or probit) model using variables that would be theoretically associated with the treatment assignment. However, this approach requires enough observable variables to construct these PSM models. For this request, there were only a few variables for each subject; the only variables available were the unique identifier, site, and a continuous variable.

This problem led to a tutorial on how to perform an exact match using the MatchIt package in R, which can be viewed on my RPubs page.

In this tutorial, you will learn how to perform an exact match with a single variable using a hypothetical dataset with 30 subjects.
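
As a rough sketch of what a 2 to 1 match with an exact constraint can look like in MatchIt, the code below assumes that site is the variable matched exactly. That is an assumption, not something taken from the tutorial. The data are simulated, and the continuous variable `x` is used only to choose among candidates within the same site.

```r
library(MatchIt)

set.seed(30)

# Hypothetical dataset: 30 subjects, 3 sites (3 exposed, 7 unexposed per site)
dat <- data.frame(
  id      = 1:30,
  site    = factor(rep(c("A", "B", "C"), each = 10)),
  exposed = rep(c(1, 1, 1, 0, 0, 0, 0, 0, 0, 0), times = 3),
  x       = rnorm(30, mean = 50, sd = 10)
)

# Match 2 unexposed to each exposed subject, requiring the same site
m_out <- matchit(exposed ~ x, data = dat, method = "nearest",
                 exact = ~ site, ratio = 2)

# Matched sample: 9 exposed and 18 unexposed subjects
matched <- match.data(m_out)
table(matched$site, matched$exposed)
```

The `subclass` column in the matched data identifies each matched set, so every set should contain one exposed and two unexposed subjects from the same site.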
