Title: Improving the Robustness of Natural Language Processing to Dialects and Language Variants
Date: 11/19/2025
Time: 12-2PM EST (9-11AM PST)
Location: https://gatech.zoom.us/j/4263320954?pwd=MGtPdUhKd0RIYWdqNzU4VW5RSk5zdz09 (with a small in-person presence at Stanford University, Gates 415)
William Held
Machine Learning PhD Student
School of Interactive Computing in the College of Computing
Georgia Institute of Technology
Committee:
1. Diyi Yang
2. Mark Riedl
3. Larry Heck
4. Zsolt Kira
5. Percy Liang
Abstract: English, a global language spoken by billions across continents, is rich with variation. Although these variants and dialects have many speakers, most language technologies primarily serve Standard American English speakers, creating systematic barriers for other dialect communities. My research establishes empirical evidence for these disparities through novel controlled experiments and user experience studies spanning multiple English varieties. Building on these findings, I have developed computationally efficient adaptation techniques that improve dialect robustness without requiring task-specific annotations. Finally, I have examined how dialect performance evolves as models scale, using scaling laws to assess whether increased compute alone can close dialect gaps or whether targeted interventions remain necessary. These contributions both advance the theoretical understanding of language variation as a dimension of NLP performance and provide practical machine learning methods for building language technologies that serve English in all its forms.
