Guilherme D. Garcia

Typst is a programming language designed for typesetting. There’s a great tutorial here and an introductory series on YouTube here. To migrate from \(\LaTeX\) in 2025, I had to spend some time playing around with the language to see if I’d be able to move all of my workflow (slides, articles, CV, etc.). I quickly discovered that it could do everything I do in \(\LaTeX\) and, crucially, much more (see here). That’s how the idea for this package was born: it is a collection of functions I often use in my teaching/research in phonology.

Last updated: February 2026

Manual

There’s a comprehensive manual for the package in PDF format here. The file constains numerous examples for all functions in phonokit.

Main features

IPA Module

Unlike \(\LaTeX\), Typst offers out-of-the box support for Unicode characters such as phonetic symbols. While this is great, I am already used to tipa in \(\LaTeX\), so my first goal was to have a function that emulated tipa as much as possible: it would be familiar, practical and quick. This is what the function #ipa() does. There are only minor differences between \textipa{} and #ipa(). You can also access a PDF version of the reference sheet here.

tipa-style input: use familiar \(\LaTeX\) tipa notation instead of hunting for Unicode symbols
Comprehensive symbol support: most IPA consonants, vowels, and other symbols from the tipa chart
Combining diacritics: Nasalized (\\~), devoiced (\\r), syllabic (\\v); the tie (\\t) is also available; primary stress ('), secondary stress (,), length (:)
Automatic character splitting: Type SE instead of S E for efficiency (spacing is necessary around characters using backslashes)
Charis SIL font is used for all transcriptions (https://software.sil.org/charis/download/), but you can choose your own font as well (see manual)

Reference sheet

You can find the complete reference sheet in the appendix of the manual, but you can also download the sheet as a standalone file here.

IPA Charts Module

I have used the great vowel package multiple times in \(\LaTeX\), but I don’t love its interface — see example in my \(\LaTeX\) tutorial for phonologists here. The #vowels() function in phonokit is simpler: it takes a string of vowels and plot them onto a vowel trapezoid. The trapezoid in Figure 1 was created with #vowels("english"), but the function takes a string of vowels, so you can customize your trapezoid as needed. The same applies to the function #consonants().

A similar function also exists for consonants: #consonants(). It returns an IPA table of pulmonic consonants given an input (string). For both #vowels() and #consonants(), you also have the option of using a language as an input (see list of available languages below).

Vowel charts: plot vowels on the IPA vowel trapezoid with accurate positioning
Consonant tables: display consonants in the pulmonic IPA consonant table
Language inventories: pre-defined inventories for some languages (English, Spanish, French, German, Italian, Portuguese, Japanese, Russian, Arabic)
Custom symbol sets: plot any combination of IPA symbols
Automatic positioning: symbols positioned according to phonetic properties (place, manner, voicing, frontness, height, roundedness)
Proper IPA formatting: voiceless/voiced pairs, grayed-out impossible articulations, minimal pair bullets for vowels
Scalable charts: adjust size to fit your document layout (scaling includes text as expected)

Figure 1: Vowel trapezoid for (Standard American) English.

As of version 0.4.5, you can also add arrows, shifted vowels and highlights to trapezoids. This makes the function #vowels() much more flexible. Figure 2 demonstrates how the arguments arrows and highlight can be used to display diphthongs in North American English. Code block 1 also shows how you can adjust the color of arrows and highlights independently. Finally, arrow lines can be dashed (arrow-style) and straight (curved: false).

Figure 2: Arrows can be easily added to trapezoids.

#vowels(
   "english",
   arrows: (
     ("a", "U"),
     ("a", "I"),
     ("e", "I"),
     ("O", "I"),
   ),
   arrow-color: blue.lighten(60%),
   curved: true,
   highlight: ("a", "e", "o", "O"),
   highlight-color: blue.lighten(80%),
 )
)

Code block 1: Code used to create a trapezoid with arrows for diphthongs.

Prosody Module

The functions #syllable(), #foot() and #word() help you create prosodic representations from strings. They adjust sizing automatically, but you can also use the scale argument.

Prosodic structure visualization: draw syllable structures with onset, nucleus, and coda
Flexible foot structure: use parentheses to mark explicit foot boundaries and stress mark to identify headedness (iambs, trochees)
Stress marking: mark stressed syllables with apostrophe '
Flexible alignment: left or right alignment for prosodic word heads

Figure 3: A geminate consonant in a traditional moraic representation.

Figure 3 is the result of #foot-mora("'pot.ta", coda: true), where coda: true indicates that codas project a mora. The function detects identical coda-onset sequences, so “pot.ta” triggers the representation for a geminate.

To create the representation for prosodic words, the function #word() is used. Figure 4 shows a simple PWd generated with the code #word("('po.Ra).('ma.pa)", foot: "R"), where foot: "R" indicates which foot is the main foot in the PWd (when more than on foot is present). Notice that feet are detected based on the use of parentheses in the input. Stress marks ' are used to determine foot headedness. All functions accept the same input used in the #ipa() function, which means that phonetic symbols are automatically detected.

Figure 4: A prosodic word assuming onset-rhyme representations for syllables.

All functions involving prosodic representations also have a scale argument. This is important because we often need to adjust the dize of a representation but the text itself may not scale appropriately (line width can also be tricky in these scenarios). The argument in question takes care of everything.

Autosegmental phonology module

Starting with version 0.3.0, phonokit also offers a function to create autosegmental representations, including features and tones. The example in Figure 5 is from Zsiga (2024). Read the manual to learn more about the function. In a nutshell, you can represent linking, delinking, floating tones, and highlighted tones. Thus, the most common processes involving features and tones are easy to represent with the #autoseg() function.

#autoseg(
  ("e", "b", "e"),
  features: ("L", "", "H"),
  spacing: 0.5,
  tone: true,
  gloss: [èbě],
)
#a-r // arrow
#autoseg(
  ("e", "b", "e"),
  features: ("L", "", "H"),
  links: ((0, 2),),
  spacing: 0.5,
  tone: true,
  gloss: [_pumpkin_],
)

Code block 2: Code used to generate an autosegmental representation.

#autoseg(
  ("p", "a", "S", "E", "i"),
  features: ("", "L", "", "H", "L"),
  tone: true,
  spacing: 0.5, // keep consistent
  delinks: ((1, 1),),
  float: (4,),
  baseline: 37%,
  gloss: [_delinking & floating tone_],
)
#autoseg(
  ("Z", "W", "p", "K", "u"),
  features: ("", "H", "", "", ""), // H at position 1, but will be repositioned
  tone: true,
  float: (1,), // Mark H as floating so it doesn't draw vertical stem
  multilinks: ((1, (1, 4)),), // H links to segments at positions 1 and 4
  spacing: 0.5,
  baseline: 37%,
  arrow: false,
  gloss: [_one-to-many relationships_],
)

#autoseg(
  ("m", "@", "a"),
  features: ("", "", ("H", "L")),
  links: (((2, 0), 1),), // From H (first tone at position 2) to segment 1
  tone: true,
  highlight: ((2, 0),), // Highlight the H tone
  baseline: 37%,
  spacing: 1.0,
  arrow: true,
  gloss: [_contour tone, linking & highlighting_],
)
#autoseg(
  ("m", "@", "a"),
  features: ("", "", ("H", "L")),
  links: (((2, 0), 1),), // From H (first tone at position 2) to segment 1
  tone: true,
  highlight: ((2, 0),), // Highlight the H tone
  baseline: 37%,
  spacing: 0.35,
  arrow: true,
  gloss: [_quick spacing adjustments_],
)

Code block 3: Code used to generate different scenarios

Multi-tier representations

Certain phonological structures have too many degrees of freedom for a single-purpose function to be enough. This is why the function #multi-tier() exists (introduced in version 0.4.0): it gives you the freedom to create a wide range of non-linear structures. It is much more flexible, but that comes with more complicated syntax and more arguments, by definition. Figure 7, adapted from Booij (2012), illustrates a scenario where we need more flexibility in a function to generate a multi-tier representation.

The function #multi-tier() automates part of the work. First, it’s based on a grid with fixed (but flexible) coordinates. Second, the function projects lines and links automatically (one per element added to its levels argument). The remaining links can be added with the links argument, and users can delete automatic links too, of course.

Figure 8: A grid helps the user visualize positions

One of the several arguments in #multi-tier() is show-grid, which prints a grid to help the user locate positions for elements and links — this is shown in Figure 8. If you inspect Code block 4, you will notice that the function automatically strings that are meant to be represented with Greek letters.

      #multi-tier(
        show-grid: true, // <- To help you see the grid
        levels: (
          ("", "", "", "", ("Adj", 3.5)),
          ("", "", "", "", ("Adj", 3.5)),
          ("", ("Af", 0.5), "", ("N", 2.5), "Af"),
          ("in", "ter", "na", "tion", "al"),
          ("sigma", "sigma", "sigma", "sigma", "sigma"),
          ("Sigma", "", "Sigma", "", ""),
          ("", "", "omega", "", ""),
        ),
        links: (
          ((0, 4), (2, 1)), // Adj -> Af
          ((1, 4), (2, 3)), // Adj -> N
          ((2, 1), (3, 0)), // Af -> in
          ((2, 3), (3, 2)), // N -> na
          ((5, 0), (4, 1)), // Ft -> Syl
          ((5, 2), (4, 3)), // Ft -> Syl
          ((6, 2), (5, 0)), // PWd -> Ft
          ((6, 2), (4, 4)), // PWd -> Ft
        ),
      )

Code block 4: Code used to generate the multi-tier representation above

Finally, Figure 9, adapted from Goad (2012), shows another example of how #multi-tier() can be used to generate a wide range of representations that require a higher degree of customization. You will find the code for the figure in the manual of the extension linked at the top of this page.

Constraint grammar module

The package includes a function to generate OT tableaux (see Figure 10), but it goes one step further and produces a MaxEnt tableau (Goldwater and Johnson 2003; Hayes and Wilson 2008) with the function #maxent(). Figure 11 illustrates a scenario where all candidates have a non-zero probability of being observed given a specific input \(x\). The column \(H(y)\) displays the Harmony score of each candidate \(y\), calculated as the weighted sum of all constraint violations. Next, the column \(e^{-H(y)}\) provides the unnormalized probability, which is the exponential of the negated Harmony score (this has also been called the MaxEnt score). Finally, the actual predicted probability is shown in column \(P(y|x)\), which is obtained by dividing the unnormalized value of a candidate by \(Z(x)\) (the sum of all unnormalized scores).

#tableau(
        input: "kraTa",
        candidates: ("kra.Ta", "ka.Ta", "ka.ra.Ta"),
        constraints: ("Max", "Dep", "*Complex"),
        violations: (
          ("", "", "*"),
          ("*!", "", ""),
          ("", "*!", ""),
        ),
        winner: 0, // <- Position of winning cand
        dashed-lines: (1,), // <- Note the comma
        shade: true, // <- true by default
      )

Code block 5: Code used to generate an OT tableau.

The function #maxent() calculates \(h_i\), \(e^{-h_i}\) and \(P_i\) automatically given the weights provided. Figure 11 lists the weights for the constraints in use at the top and prints probability bars at the right margin. These can be turned off with visualize: false (see Code block 6), but they are printed by default as this can help students quickly visualize probabilities when many candidates are evaluated. The function can also sort candidates by probability (sort: true) — see tableau at the bottom of Figure 11.

Figure 11: MaxEnt tableau with automatic calculation, optional visualization and sorting.

#maxent(
  input: "kraTa",
  candidates: ("[kra.Ta]", "[ka.Ta]", "[ka.ra.Ta]"),
  constraints: ("Max", "Dep", "*Complex"),
  weights: (2.5, 1.8, 1),
  violations: (
    (0, 0, 1),
    (1, 0, 0),
    (0, 1, 0),
  ),
  visualize: true,  // Show probability bars (default)
  // sort: true,       // Sort candidates by P
)

Code block 6: Code used to generate a MaxEnt tableau.

It is often useful to present a ranking using a Hasse diagram. These diagrams can be generated in phonokit using the #hasse() function. In a nutshell, the function takes tuples with \(n\) elements. In the simplest case, \(n = 1\), which produces a floating constraint. The example in Figure 12 shows a basic scenario The third element in the first tuple indicates the “stratum” in the diagram — this is especially important in more complex cases, which require better control over the vertical position of different constraints. Optional arguments exist to give the user more flexibility (e.g., scale and node-spacing).

#hasse(
        (
          ("*Complex", "Max", 0),
          ("*Complex", "Dep", 0),
          ("Onset", "Max", 0),
          ("Onset", "Dep", 0),
          ("Max", "NoCoda", 1),
          ("Dep", "Constraint[Feat]", 1, "dotted"),
        ),
        node-spacing: 3,
      )

Code block 7: Code used to generate a Hasse diagram.

Package Repository

You can download/fork the most up-to-date version of the package in its GitHub repository.

References

Booij, Geert. 2012. The Grammar of Words: An Introduction to Linguistic Morphology. 3rd ed. Oxford University Press.

Goad, Heather. 2012. “SC Clusters Are (Almost Always) Coda-Initial.” Linguistic Review 29 (3).

Goldwater, Sharon, and Mark Johnson. 2003. “Learning OT Constraint Rankings Using a Maximum Entropy Model.” In Proceedings of the Stockholm Workshop on Variation Within Optimality Theory, 111–20.

Hayes, Bruce, and Colin Wilson. 2008. “A Maximum Entropy Model of Phonotactics and Phonotactic Learning.” Linguistic Inquiry 39 (3): 379–440. https://doi.org/10.1162/ling.2008.39.3.379.

Zsiga, Elizabeth C. 2024. The Sounds of Language: An Introduction to Phonetics and Phonology. Chichester, UK: John Wiley & Sons.