May 30, 2026

MEPS Tutorial 8: Estimating slopes from a regression model using R

May 30, 2026

In a previous tutorial, I reviewed how we can perform trend analysis using R on survey-weighted estimates. However, I neglected to discuss how to estimate the average slope across time. Rather, I focused on estimate the predicted values at each year.

In this tutorial, I show how you can use the margins command in R to estimate the survey-weighted average total healthcare expenditures across years for males and females. You can read the tutorial on my RPubs page (link).

Mark Bounthavong

March 31, 2026

Econometrics, MEPS, R Programming, Statistics & Probability

Two-Part Model with Bootstrap using R

Mark Bounthavong

March 31, 2026

Econometrics, MEPS, R Programming, Statistics & Probability

In this article, I wanted to expand on a previous post that describes using a two-part model to model cost (or total expenditure) as an outcome with data from the Agency for Healthcare Research and Quality (AHRQ) Medical Expenditure Panel Survey (MEPS). In the previous article, I used the twopartm package, which is great at leveraging the two-part model approach. However, it does not appear to handle data from complex survey designs like MEPS.

The best way to handle complex survey design data with weights using the two-part model approach is to perform the estimations for each part separately and then combine them.

With a little help from some AI chatbots, I was able to construct a viable code that not only estimates and combines both parts of the two-part model, but also allows me to bootstrap the results to generate 95% confidence intervals (CI).

The complete article on how to construct a two-part model with bootstrap using R is available on my RPubs site (link)

Mark Bounthavong

January 30, 2026

Literary Cafe, MEPS, R Programming

Literary Cafe series: Patterns and costs of GLP1-RA (Part 1) - Getting data from MEPS

Mark Bounthavong

January 30, 2026

Literary Cafe, MEPS, R Programming

In this Literary Cafe series, I attempt to reproduce the findings from Wu and colleagues’ paper, “Patterns and costs associated with glucagon-like peptide-1 receptor agonist use in US adults with type 2 diabetes“ (link).

In this first part (with subsequent parts to follow), I demonstrate how we can use the same publicly available data from the Agency of Healthcare Research and Quality (AHRQ) Medical Expenditure Panel Survey (MEPS) to reproduce the sample used by Wu and colleagues in their study (link).

I published this article in my RPubs page (link).

Mark Bounthavong

December 31, 2025

Epidemiology, Literary Cafe, Methods, R Programming, Statistics & Probability

Literary Cafe series: Policy analysis (Part 2) - Interrupted Times Series Analysis with publicly available data

Mark Bounthavong

December 31, 2025

Epidemiology, Literary Cafe, Methods, R Programming, Statistics & Probability

I’m back with some Literary Cafe series updates.

I have regularly informal discussions with my students about interesting papers in the biomedical sciences. Recently, we discussed a great paper by Jurecka and colleagues on the impact of a state-wide law to change the definition of fentanyl possession on opioid-related overdose death rates.

Jurecka and colleagues used publicly available data to perform their research, and I wanted to show my students how this was done using CDC WONDER data. Hence, I started this Literary Care series to document these exercises for others to learn from.

Last month, I wrote an article on how to get data from the CDC WONDER site, which you can read here. I considered this Part 1 (Getting the data).

This is the second part of a two-part series that illustrates how to use publicly available data to replicate the findings from a published study. In Part 2, I use the data from Part 1 to analyze the impact of the statwide fentanyl possession law on opioid-related overdose death rates using an interrupted time series analysis. I posted this on my RPubs site (link) along with part 1 (link).

Mark Bounthavong

September 15, 2025

R Programming

Loading data into the R environment

Mark Bounthavong

September 15, 2025

R Programming

I wrote a short exercise on how to load *.csv and *.xlsx files into the R environment, which I posted on my RPubs site (link)

Mark Bounthavong

August 30, 2025

R Programming, Statistics & Probability

R - Tips and Tricks (Guide) - Part 2

Mark Bounthavong

August 30, 2025

R Programming, Statistics & Probability

I wrote a second R guide to help students navigate and use R and RStudio in their biostatistics course. I focused on creating vectors, matrices, and dataframes.

The guide can be found on my RPubs site.

Mark Bounthavong

July 29, 2025

Methods, R Programming, Statistics & Probability

Ratio of risk ratios in R

Mark Bounthavong

July 29, 2025

Methods, R Programming, Statistics & Probability

I ran into a problem where I had two risk ratios, but I wanted to evaluate the statistical difference between them. I couldn’t find an R package, but I found a paper by Altman and Bland that go over the step-by-step process. I wrote a tutorial on how to perform this method using R, which is available on my RPubs page (link).

Reference:

Altman DG, Bland JM. Interaction revisited: the difference between two estimates. BMJ. 2003 Jan 25;326(7382):219. doi: 10.1136/bmj.326.7382.219. PMID: 12543843; PMCID: PMC1125071.

Mark Bounthavong

May 26, 2025

R Programming, Statistics & Probability

Transform data from wide to long format using R

Mark Bounthavong

May 26, 2025

R Programming, Statistics & Probability

Often, when we input data into a spreadsheet, we use the wide format where the sequence of variables are ordered according to the columns. But when we perform longitudinal analyses, we need to transform this to the long format.

Sometimes, I forget how to do this in R, so I decided to write a tutorial to remind myself how to do this.

Therefore, I wrote a tutorial on using the pivot_longer() function to transform data from the wide to long format in preparation for longitudinal data analysis. The tutorial is located on my RPubs page.

Mark Bounthavong

April 30, 2025

R Programming, Statistics & Probability

Generate data using the simstudy package in R

Mark Bounthavong

April 30, 2025

R Programming, Statistics & Probability

There are times when you are looking for a dataset to test a code or formula, but they are hard to find or are not publicly available. To get around this problem, we can generate our own data. R provides several tools for us to accomplish this.

I wrote a short guide on how to generate data using the simstudy package in R. You can read how to do this on my Rpub site (link).

Mark Bounthavong

March 30, 2025

Epidemiology, Methods, R Programming

Medication adherence estimations using R - Part 1

Mark Bounthavong

March 30, 2025

Epidemiology, Methods, R Programming

I created a tutorial on how to use the AdhereR package in R to estimate the medication adherence rate for a sample of individuals with prescription claims data. I posted the tutorial on my RPubs page (link).

The two most common medication adherence meaures are the Medication Possession Ratio (MPR) and the Proportion of Days Covered (PDC). This tutorial reviews how to estimate these medication adherence rates using AdhereR in R.

MEPS Tutorial 8: Estimating slopes from a regression model using R

Two-Part Model with Bootstrap using R

Literary Cafe series: Patterns and costs of GLP1-RA (Part 1) - Getting data from MEPS

Loading data into the R environment

R - Tips and Tricks (Guide) - Part 2

Ratio of risk ratios in R

Transform data from wide to long format using R

Generate data using the simstudy package in R

Medication adherence estimations using R - Part 1

Categories

Use the search tool to find a specific blog

Previous blogs