July 29, 2025

Methods, R Programming, Statistics & Probability

Ratio of risk ratios in R

I ran into a problem where I had two risk ratios and wanted to test whether they differed statistically. I couldn’t find an R package for this, but I found a paper by Altman and Bland that goes over the process step by step. I wrote a tutorial on how to perform this method using R, which is available on my RPubs page (link).
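As a rough illustration of the calculation Altman and Bland describe, the sketch below compares two risk ratios on the log scale. The function name `ratio_of_rr` and the input values are hypothetical. The code assumes that the two estimates are independent and that both are reported with 95% confidence intervals. It is not the code from the tutorial.

```r
# Sketch of the Altman & Bland (2003) comparison of two independent ratios.
# ratio_of_rr() and the example numbers below are hypothetical.
ratio_of_rr <- function(rr1, lo1, hi1, rr2, lo2, hi2) {
  z_crit <- qnorm(0.975)  # about 1.96 for 95% limits

  # Recover each standard error from the CI width on the log scale
  se1 <- (log(hi1) - log(lo1)) / (2 * z_crit)
  se2 <- (log(hi2) - log(lo2)) / (2 * z_crit)

  # Difference of the log risk ratios and its standard error
  d    <- log(rr1) - log(rr2)
  se_d <- sqrt(se1^2 + se2^2)
  z    <- d / se_d

  c(
    rrr     = exp(d),                  # ratio of risk ratios
    lower   = exp(d - z_crit * se_d),  # 95% CI, lower limit
    upper   = exp(d + z_crit * se_d),  # 95% CI, upper limit
    z       = z,
    p_value = 2 * pnorm(-abs(z))       # two-sided p-value
  )
}

res <- ratio_of_rr(rr1 = 0.67, lo1 = 0.46, hi1 = 0.98,
                   rr2 = 0.88, lo2 = 0.71, hi2 = 1.08)
print(round(res, 3))
```

Working on the log scale is what makes the difference between the two estimates approximately normal; exponentiating at the end returns the result as a ratio.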

Reference:

Altman DG, Bland JM. Interaction revisited: the difference between two estimates. BMJ. 2003 Jan 25;326(7382):219. doi: 10.1136/bmj.326.7382.219. PMID: 12543843; PMCID: PMC1125071.

Mark Bounthavong

May 26, 2025

R Programming, Statistics & Probability

Transform data from wide to long format using R

Often, when we enter data into a spreadsheet, we use the wide format, where repeated measurements of a variable are spread across columns. But when we perform longitudinal analyses, we need to transform the data into the long format.

Sometimes, I forget how to do this in R, so I wrote a tutorial on using the pivot_longer() function to transform data from the wide to the long format in preparation for longitudinal data analysis. The tutorial is located on my RPubs page.
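A minimal sketch of the wide-to-long transformation is shown here. The data frame and its column names (`id`, `sbp_1` to `sbp_3`) are made up for illustration and are not taken from the tutorial.

```r
library(tidyr)

# Hypothetical wide data: one row per patient, one column per visit
wide <- data.frame(
  id    = 1:3,
  sbp_1 = c(120, 135, 128),
  sbp_2 = c(118, 131, 126),
  sbp_3 = c(115, 129, 127)
)

# Stack the visit columns into one row per patient per visit
long <- pivot_longer(
  wide,
  cols         = starts_with("sbp_"),  # columns to stack
  names_to     = "visit",              # new column holding the visit label
  names_prefix = "sbp_",               # strip the prefix, leaving "1", "2", "3"
  values_to    = "sbp"                 # new column holding the measurements
)

# Store the visit number as an integer for use as a time variable
long$visit <- as.integer(long$visit)
print(long)
```

The long data have three rows per patient, which is the layout that most longitudinal modeling functions expect.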

Mark Bounthavong

April 30, 2025

R Programming, Statistics & Probability

Generate data using the simstudy package in R

There are times when you need a dataset to test code or a formula, but suitable data are hard to find or are not publicly available. To get around this problem, we can generate our own data. R provides several tools for us to accomplish this.

I wrote a short guide on how to generate data using the simstudy package in R. You can read how to do this on my RPubs site (link).
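To give a flavor of the workflow, here is a small sketch that defines a few variables and then generates a dataset from the definitions. The variable names and parameter values are invented for illustration and do not come from the guide.

```r
library(simstudy)

set.seed(123)  # make the simulated data reproducible

# Build a definition table one variable at a time (hypothetical variables)
def <- defData(varname = "age", dist = "normal", formula = 50, variance = 100)
def <- defData(def, varname = "treat", dist = "binary", formula = 0.5)
def <- defData(def, varname = "y", dist = "normal",
               formula = "10 + 2 * treat + 0.1 * age", variance = 4)

# Generate 200 observations from the definitions
dd <- genData(200, def)
head(dd)
```

Because the outcome `y` is defined as a function of `treat` and `age`, the true coefficients are known in advance, which is handy when checking whether a model recovers them.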

Mark Bounthavong

February 26, 2025

Econometrics, Epidemiology, MEPS, Methods, R Programming, Statistics & Probability

Propensity score matching in R

I wrote an introductory tutorial on how to perform propensity score matching using R, which has been posted on my RPubs site (link).

Propensity score matching is a statistical approach to balancing the observed covariates between groups. In observational studies, this method can mitigate confounding by observed covariates and, under the right assumptions, allow us to make causal interpretations. However, there are a lot of approaches and nuances. This introductory tutorial presents the basics of propensity score methods and how we can use these in our conventional analyses.
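For readers who want to see what the basic workflow can look like, below is a minimal sketch using the MatchIt package and its bundled `lalonde` data. Both the package choice and the covariates are assumptions made for illustration; the tutorial may use different data and settings.

```r
library(MatchIt)

# Example data shipped with MatchIt (an assumption; not the tutorial's data)
data("lalonde", package = "MatchIt")

# 1:1 nearest-neighbor matching on a logistic-regression propensity score
m_out <- matchit(
  treat ~ age + educ + married + nodegree + re74 + re75,
  data     = lalonde,
  method   = "nearest",
  distance = "glm"
)

# Covariate balance before and after matching
summary(m_out)

# Extract the matched sample for the outcome analysis
matched <- match.data(m_out)
```

Checking balance with `summary()` before moving on to the outcome model is the key step, since matching only helps if the observed covariates end up similar between groups.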

Mark Bounthavong

January 30, 2025

Data visualization, Econometrics, MEPS, Stata programming, Statistics & Probability

Stata - marginsplot & mplotoffset commands for plotting average marginal effects

In Stata, users have a lot of flexibility with creating plots, particularly after the margins command has been executed. Once a regression command has been run, users can estimate the average marginal effect of a factor with respect to another variable using the margins command in Stata. Once the average marginal effect has been estimated, users can plot this using the marginsplot or mplotoffset commands. These are powerful tools that allow us to visualize the average marginal effects, particularly when we have interaction terms.

I posted a tutorial on my RPubs site that reviews some basic features of the marginsplot and mplotoffset commands and provides some practical examples of customization.

Mark Bounthavong

December 25, 2024

Econometrics, Epidemiology, Methods, R Programming, Statistics & Probability

Prepost analysis with continuous data using R - Part 1

I wrote a tutorial on how to perform a simple prepost analysis using R, which is available on my RPubs page. It covers how to compare two differences (the change in value before and after an intervention) using independent t-test and linear regression approaches. However, it doesn’t cover how to address the correlation between two dependent values. Part 2 of prepost analysis will cover those issues.
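The two approaches can be sketched on simulated data as follows. The variable names and effect sizes are hypothetical, and the equal-variance t-test is chosen here so that its result lines up with the regression.

```r
set.seed(42)

# Hypothetical prepost data: 50 controls (group = 0) and 50 treated (group = 1)
n     <- 100
group <- rep(c(0, 1), each = n / 2)
pre   <- rnorm(n, mean = 50, sd = 10)
post  <- pre + 2 + 5 * group + rnorm(n, sd = 5)
dat   <- data.frame(group, pre, post, change = post - pre)

# Approach 1: independent t-test comparing the change scores between groups
tt <- t.test(change ~ group, data = dat, var.equal = TRUE)

# Approach 2: linear regression of the change score on group
fit <- lm(change ~ group, data = dat)

print(tt)
print(summary(fit)$coefficients)
```

With equal variances assumed, the coefficient on `group` equals the difference in mean change between the groups, and its p-value matches the one from the t-test.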

Mark Bounthavong

November 30, 2024

Econometrics, Methods, Statistics & Probability

Some cool websites on study design and biostatistics

This month (November 2024), I wanted to take a break from writing tutorials and articles. Instead, I wanted to update myself on (and share) some very helpful online resources.

A colleague of mine introduced me to a website called Datamethods. It’s mainly a discussion forum, but it has some useful resources. This particular post contains references that are very useful for anyone who is interested in study design and biostatistics (link). It is a collection of papers and articles that address common myths and practices regarding the application of biostatistics in study designs.

Another great website is Scott Cunningham’s Mixtape Sessions. He has a book about causal inference called Causal Inference: The Mixtape, and he holds regular workshops. I attended his Causal Inference Part 2 workshop, and it was amazing. We learned about the basic difference-in-differences methods (coding in R and Stata), and the innovations surrounding these methods (e.g., Callaway & Sant’Anna’s staggered difference-in-differences approach). Scott also provides historical perspectives on these methods, which are as insightful as they are entertaining. Moreover, he conducts interviews with prominent econometricians, which he posts on his YouTube channel.

Hopefully, these sites are as useful for you as they have been for me.

Mark Bounthavong

October 28, 2024

Methods, Stata programming, Statistics & Probability

Linear spline (piecewise) models in Stata

I wrote a tutorial on how to construct linear spline (also known as piecewise) models using Stata, which has been uploaded to my RPubs site.

Previously, I developed a tutorial on using the linear spline method for interrupted time series analysis with Stata. However, I did not properly go over the mkspline command.

In this tutorial, I review the mkspline command and the marginal option to generate coefficients that could be interpreted as the slope within each segment or the change in slope between segments, respectively.
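The difference between the two interpretations can be illustrated outside of Stata. The sketch below is an R analogue written with base functions, not the mkspline command itself, and it uses noise-free hypothetical data with a single knot so that the coefficients are easy to read.

```r
# Hypothetical noise-free data: slope 2 before the knot at x = 5, slope 5 after
knot <- 5
x <- 0:10
y <- 1 + 2 * pmin(x, knot) + 5 * pmax(x - knot, 0)

# Coding analogous to the default: each coefficient is a segment's own slope
s1 <- pmin(x, knot)
s2 <- pmax(x - knot, 0)
fit_slopes <- lm(y ~ s1 + s2)

# Coding analogous to the marginal option: the second coefficient is the
# change in slope at the knot relative to the previous segment
m1 <- x
m2 <- pmax(x - knot, 0)
fit_marginal <- lm(y ~ m1 + m2)

print(coef(fit_slopes))    # slopes within each segment
print(coef(fit_marginal))  # first slope, then the change in slope
```

Both models give identical fitted values; only the parameterization differs, so the choice depends on whether the question is about the slope in each segment or about how much the slope changed.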

Mark Bounthavong

July 28, 2024

Econometrics, R Programming, Statistics & Probability

Staggered difference-in-differences using R

I was interested in learning how to apply the Callaway & Sant'Anna staggered difference-in-differences framework to my work. After reading several papers and watching the video by Sant'Anna, I wrote a short tutorial on how to apply this framework to simulated data. The tutorial is located on my RPubs site.

This method uses the R “did” package, which is based on the paper by Callaway & Sant’Anna.
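A minimal sketch of the workflow with the did package is shown here. It uses the `mpdta` example dataset that ships with the package rather than the simulated data from the tutorial, so the variable names below are specific to that dataset.

```r
library(did)

# Example county-level panel bundled with the did package
data(mpdta)

# Group-time average treatment effects, using never-treated units as controls
gt <- att_gt(
  yname         = "lemp",         # outcome
  tname         = "year",         # time period
  idname        = "countyreal",   # unit identifier
  gname         = "first.treat",  # first treated period (0 = never treated)
  data          = mpdta,
  control_group = "nevertreated"
)

# Aggregate the group-time effects into an event-study summary
es <- aggte(gt, type = "dynamic")
summary(es)
```

The group-time effects are the building blocks of the framework; `aggte()` combines them into summaries such as the event-study estimates used here.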

Mark Bounthavong

June 23, 2024

Epidemiology, Econometrics, Methods, R Programming, Statistics & Probability

Mediation analysis using R

It’s not uncommon to see covariates in a regression model that should not be there. For example, measurements that occur after the treatment assignment are included in a regression model as baseline covariates. Rather, one should consider a mediation analysis.

I wrote a tutorial on how to perform mediation analysis using R on my RPubs site (link).

I know that I make this mistake at times. This tutorial helped me to carefully consider which covariates to include in a regression model and which ones to consider for mediation analysis.
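To make the idea concrete, here is a simple product-of-coefficients sketch using linear models on simulated data. The variable names and effect sizes are hypothetical. The sketch assumes no treatment-mediator interaction and no unmeasured confounding, and it is not necessarily the approach used in the tutorial.

```r
set.seed(2024)

# Hypothetical data: treatment affects the outcome directly and via a mediator
n        <- 500
treat    <- rbinom(n, 1, 0.5)
mediator <- 0.5 * treat + rnorm(n)
outcome  <- 0.3 * treat + 0.7 * mediator + rnorm(n)

fit_total <- lm(outcome ~ treat)             # total effect of treatment
fit_m     <- lm(mediator ~ treat)            # treatment -> mediator (path a)
fit_y     <- lm(outcome ~ treat + mediator)  # direct effect and path b

a        <- coef(fit_m)[["treat"]]
b        <- coef(fit_y)[["mediator"]]
direct   <- coef(fit_y)[["treat"]]
indirect <- a * b                            # effect transmitted via mediator
total    <- coef(fit_total)[["treat"]]

print(c(total = total, direct = direct, indirect = indirect))
```

With ordinary least squares, the total effect equals the direct effect plus the indirect effect. Adjusting for the post-treatment variable as if it were a baseline covariate would report only the direct effect.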
