Running and visualizing regression models
March 5, 2024
brms + other packages

An analytical example
A test for disease \(D\) is 95% accurate: it returns a positive result for 95% of people who have the disease, and a false positive for 5% of people who don't. \(D\) occurs in 0.1% of the population. What's \(P(D|+)\)?
Bayes’ rule
\[ P(\pi \mid \text{data}) = \frac{P(\text{data} \mid \pi)\,P(\pi)}{P(\text{data})} \]
Defining our values
\[\begin{align*} P(D) &= \frac{1}{1000} \\ P(\neg D) &= 1 - P(D) = \frac{999}{1000} \\ P(+|D) &= 0.95 \quad P(+|\neg D) = 0.05\\ \end{align*}\]
Calculating our posterior probability
\[\begin{align*} P(D|+) &= \frac{P(+|D) \times P(D)}{{P(+)}} \\ {P(+)} &= P(+|D) \times P(D) + P(+|\neg D) \times P(\neg D) \\ &= 0.95 \times 0.001 + 0.05 \times 0.999 = 0.0509 \\ P(D|+) &= \frac{0.95 \times 0.001}{0.0509} \\ &\boxed{\approx 1.87\%} \end{align*}\]
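As a quick sanity check, we can script this calculation in R (posterior_prob is a hypothetical helper defined here, not part of the original materials):

# Hypothetical helper: Bayes' rule for a binary test
posterior_prob = function(prior, sens, fpr) {
  evidence = sens * prior + fpr * (1 - prior)  # P(+), by total probability
  sens * prior / evidence                      # P(D|+)
}
posterior_prob(prior = 0.001, sens = 0.95, fpr = 0.05)  # ~0.0187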
The stronger our priors, the harder it is to change our conclusions given the data. Let's see this visually.
Ideally, our methods should incorporate what we already know about a given phenomenon (i.e., what the literature has already shown).
A simple algorithm
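To build intuition for where posterior draws come from, here is a minimal Metropolis sampler in R. This is a sketch for illustration only (the data, prior, and proposal width are assumptions made here); Stan and brms use the more sophisticated NUTS sampler, as the model output below reports.

set.seed(1)
y = rnorm(50, mean = 2, sd = 1)            # simulated data
log_post = function(mu) {
  sum(dnorm(y, mu, 1, log = TRUE)) +       # log likelihood (sd assumed known)
    dnorm(mu, 0, 10, log = TRUE)           # log prior: Normal(0, 10)
}
n_draws = 5000
draws = numeric(n_draws)
mu = 0                                     # starting value
for (i in seq_len(n_draws)) {
  proposal = rnorm(1, mu, 0.5)             # propose a nearby value
  # Accept with probability min(1, posterior ratio):
  if (log(runif(1)) < log_post(proposal) - log_post(mu)) mu = proposal
  draws[i] = mu
}
mean(draws)                                # posterior mean, close to mean(y)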
brms (Bürkner, 2018) allows us to use the familiar model syntax in R.

A simple model using Stan directly:
data {
  int<lower=0> N;        // number of data points
  vector[N] x;           // predictor variable
  vector[N] y;           // outcome variable
}
parameters {
  real alpha;            // intercept
  real beta;             // slope
  real<lower=0> sigma;   // standard deviation of the errors
}
model {
  // Priors
  alpha ~ normal(0, 10);  // Gaussian prior for intercept
  beta ~ normal(0, 10);   // Gaussian prior for slope
  sigma ~ normal(0, 1);   // half-Gaussian prior for sigma (constrained positive)
  // Likelihood
  y ~ normal(alpha + beta * x, sigma);  // likelihood for linear regression
}
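To run this Stan program directly from R, we would use an interface such as rstan. A minimal sketch (the file name and the x, y objects are placeholders):

library(rstan)
fit_stan = stan(file = "linear_model.stan",              # the program above, saved to disk
                data = list(N = length(x), x = x, y = y),
                chains = 4, iter = 2000)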
tidyverse + some specific packages to help us run and visualize Bayesian models

The danish dataset (Balling & Baayen, 2008):

| Subject | Word | Affix | LogRT | RT |
|---|---|---|---|---|
| 2s20 | vE6ltede | ede | 6.538169 | 691.02 |
| 2s16 | ruinere | ere | 6.664702 | 784.23 |
| 2s16 | dyrkede | ede | 6.370569 | 584.39 |
| 2s01 | vandrende | ende | 6.566841 | 711.12 |
| 2s18 | lyttede | ede | 6.485597 | 655.63 |
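A sketch of how a dan object like this could be built, assuming the danish dataset that ships with languageR (the original preprocessing is not shown):

library(languageR)
library(dplyr)
dan = danish |>
  select(Subject, Word, Affix, LogRT) |>
  mutate(RT = exp(LogRT))  # back-transform log RTs to ms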
We can use lm() to run our model here (the reference level is the affix bar).

Our model:
\[ y_i = \beta_0 + \beta_{ede}\,x_{i[ede]} + \beta_{ende}\,x_{i[ende]} + \beta_{ere}\,x_{i[ere]} + \beta_{lig}\,x_{i[lig]} + e_i \]

where each \(x_{i[\cdot]}\) is an indicator (dummy) variable for the affix of item \(i\), and bar is the reference level captured by the intercept \(\beta_0\).
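A sketch of the corresponding call (the table below summarizes a fit of this form; fit_lm is a name chosen here):

fit_lm = lm(LogRT ~ Affix, data = dan)
summary(fit_lm)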
Model summary (outcome: Log RT):

| Predictors | Estimates | CI | p |
|---|---|---|---|
| (Intercept) | 6.80 | 6.77 – 6.82 | <0.001 |
| Affix [ede] | -0.05 | -0.09 – -0.01 | 0.008 |
| Affix [ende] | 0.05 | 0.02 – 0.09 | 0.006 |
| Affix [ere] | 0.02 | -0.02 – 0.06 | 0.354 |
| Affix [lig] | -0.06 | -0.10 – -0.02 | 0.001 |
| Observations | 1040 | | |
| R² / R² adjusted | 0.045 / 0.041 | | |
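The pairwise comparisons below pair an ANOVA with Tukey's HSD; the Fit: line in the output gives the call, which we can reproduce as:

TukeyHSD(aov(LogRT ~ Affix, data = dan))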
Tukey multiple comparisons of means
95% family-wise confidence level
Fit: aov(formula = LogRT ~ Affix, data = dan)
$Affix
diff lwr upr p adj
ede-bar -0.05099340 -1.034311e-01 0.001444266 0.0612021
ende-bar 0.05413199 -5.440768e-05 0.108318388 0.0503757
ere-bar 0.01781355 -3.468447e-02 0.070311574 0.8863835
lig-bar -0.06136498 -1.139239e-01 -0.008806100 0.0127046
ende-ede 0.10512539 5.129924e-02 0.158951542 0.0000012
ere-ede 0.06880695 1.668084e-02 0.120933061 0.0029895
lig-ede -0.01037158 -6.255898e-02 0.041815820 0.9827775
ere-ende -0.03631844 -9.020339e-02 0.017566517 0.3500510
lig-ende -0.11549697 -1.694412e-01 -0.061552722 0.0000001
lig-ere -0.07917853 -1.314266e-01 -0.026930484 0.0003598
Now, the same model in brms:
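A sketch of the call behind this summary. Priors are left at the brms defaults (an assumption; the original priors are not shown), while chains, iterations, and warmup match the Draws: line:

library(brms)
fit1a = brm(LogRT ~ Affix,
            data = dan,
            family = gaussian(),
            chains = 4, iter = 2000, warmup = 1000)
summary(fit1a)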
Family: gaussian
Links: mu = identity; sigma = identity
Formula: LogRT ~ Affix
Data: dan (Number of observations: 1040)
Draws: 4 chains, each with iter = 2000; warmup = 1000; thin = 1;
total post-warmup draws = 4000
Population-Level Effects:
Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS
Intercept 6.80 0.01 6.77 6.82 1.00 2383 2828
Affixede -0.05 0.02 -0.09 -0.01 1.00 2808 2998
Affixende 0.05 0.02 0.01 0.09 1.00 2878 2880
Affixere 0.02 0.02 -0.02 0.06 1.00 2889 2780
Affixlig -0.06 0.02 -0.10 -0.02 1.00 2883 3171
Family Specific Parameters:
Estimate Est.Error l-95% CI u-95% CI Rhat Bulk_ESS Tail_ESS
sigma 0.20 0.00 0.19 0.21 1.00 4633 2965
Draws were sampled using sampling(NUTS). For each parameter, Bulk_ESS
and Tail_ESS are effective sample size measures, and Rhat is the potential
scale reduction factor on split chains (at convergence, Rhat = 1).
# Packages assumed for the wrangling and plotting below:
library(tidyverse)   # dplyr, tidyr, forcats, ggplot2
library(posterior)   # as_draws_df()
library(tidybayes)   # mean_qi(), stat_halfeye()
library(bayestestR)  # rope(), rope_range()

# Extract posterior draws as a tibble:
post1_raw = as_draws_df(fit1a) |>
as_tibble()
post1_raw = post1_raw |>
select(contains("b_")) |>
rename("bar*" = b_Intercept,
"ede" = b_Affixede,
"ere" = b_Affixere,
"ende" = b_Affixende,
"lig" = b_Affixlig)
post1 = post1_raw |>
pivot_longer(names_to = "Parameter",
values_to = "Estimate",
cols = `bar*`:lig)
post1_summary = post1 |>
group_by(Parameter) |>
mean_qi(Estimate) |>
ungroup()
ROPE1 = rope(fit1a) |>
as_tibble()
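For linear models, bayestestR's default ROPE spans ±0.1 standard deviations of the outcome; the exact limits used above can be checked with:

rope_range(fit1a)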
# Plot the posteriors of each affix effect relative to the intercept:
ggplot(data = post1 |> filter(Parameter != "bar*"),
aes(x = Estimate, y = Parameter)) +
annotate("rect", ymin = -Inf, ymax = +Inf,
xmin = ROPE1$ROPE_low,
xmax = ROPE1$ROPE_high,
alpha = 0.3,
fill = "gray90",
color = "white") +
stat_halfeye(fill = "#B6B7DB", .width = c(0.5, 0.95)) +
theme_classic(base_family = "Futura") +
coord_cartesian(xlim = c(-0.2, 0.2)) +
labs(y = NULL,
x = "Posterior distribution",
title = "Effects relative to intercept (affix **bar**)") +
theme(plot.title = ggtext::element_markdown()) +
geom_label(data = post1_summary |> filter(Parameter != "bar*"),
aes(x = Estimate, y = Parameter, label = Parameter),
family = "Futura",
# position = position_nudge(y = -0.25),
label.padding = unit(0.4, "lines")) +
theme(axis.text.y = element_blank(),
axis.ticks.y = element_blank()) +
geom_text(data = post1_summary |>
filter(Parameter != "bar*") |>
mutate(across(where(is_double), ~round(., 2))),
aes(label = glue::glue("{Estimate} [{.lower}, {.upper}]"), x = Inf),
hjust = "inward", vjust = -0.5, family = "Futura", color = "gray60") +
geom_vline(xintercept = 0, linetype = "dashed", color = "black")
post1_raw_mc = post1_raw |>
mutate(ede_ende = (`bar*` + ede) - (`bar*` + ende),
ede_ere = (`bar*` + ede) - (`bar*` + ere),
ede_lig = (`bar*` + ede) - (`bar*` + lig),
ere_lig = (`bar*` + ere) - (`bar*` + lig),
ere_ende = (`bar*` + ere) - (`bar*` + ende),
ende_lig = (`bar*` + ende) - (`bar*` + lig)) |>
select(-`bar*`) |>
rename("bar_ede" = ede,
"bar_ende" = ende,
"bar_lig" = lig,
"bar_ere" = ere)
# Now, wide-to-long transform:
post1_mc = post1_raw_mc |>
pivot_longer(names_to = "Comparison",
values_to = "Estimate",
cols = bar_ede:ende_lig)
# Create some point and interval summaries for our posteriors:
post1_summary_mc = post1_mc |>
group_by(Comparison) |>
mean_qi(Estimate) |>
ungroup()
ggplot(data = post1_mc,
aes(x = Estimate, y = fct_reorder(Comparison, Estimate))) +
annotate("rect", ymin = -Inf, ymax = +Inf,
xmin = ROPE1$ROPE_low,
xmax = ROPE1$ROPE_high,
alpha = 0.3,
fill = "gray90",
color = "white") +
geom_vline(xintercept = 0, linetype = "dashed", color = "black") +
stat_halfeye(fill = "#B6B7DB", .width = c(0.5, 0.95)) +
theme_classic(base_family = "Futura") +
coord_cartesian(xlim = c(-0.2, 0.4)) +
labs(y = NULL,
x = "Posterior distribution for each comparison",
title = "**Multiple comparisons** of posterior distributions",
subtitle = "All but **two** comparisons display a credible statistical difference",
caption = "Cf. ANOVA + TukeyHSD, where half of all comparisons had p > 0.05") +
theme(plot.title = ggtext::element_markdown(),
plot.subtitle = ggtext::element_markdown()) +
geom_label(data = post1_summary_mc,
aes(x = Estimate, y = Comparison, label = Comparison),
family = "Futura", size = 3,
# position = position_nudge(y = -0.3),
label.padding = unit(0.4, "lines")) +
theme(axis.text.y = element_blank(),
axis.ticks.y = element_blank()) +
geom_text(data = post1_summary_mc |>
mutate(across(where(is_double), ~round(., 2))),
aes(label = glue::glue("{Estimate} [{.lower}, {.upper}]"), x = Inf),
hjust = "inward", vjust = -0.5, family = "Futura", color = "gray60", size = 3)
With brms, we can use the same familiar syntax from lme4.

References

Bürkner, P.-C. (2018). Advanced Bayesian multilevel modeling with the R package brms. The R Journal, 10(1), 395–411. https://doi.org/10.32614/RJ-2018-017
Updating our posterior probability
If the same person tests positive a second time, our posterior from the first test becomes the new prior: \(P(D) = 0.0187\), so \(P(\neg D) = 0.9813\).
\[\begin{align*} P(D|+) &= \frac{P(+|D) \times P(D)}{{P(+)}} \\ {P(+)} &= P(+|D) \times P(D) + P(+|\neg D) \times P(\neg D) \\ &= 0.95 \times \boxed{0.0187} + 0.05 \times 0.9813 = \boxed{0.06683} \\ P(D|+) &= \frac{0.95 \times \boxed{0.0187}}{0.06683} \\ &\boxed{\approx 26.58\%} \end{align*}\]
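Reusing the hypothetical posterior_prob() helper from the first example:

posterior_prob(prior = 0.0187, sens = 0.95, fpr = 0.05)  # ~0.266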