A talk on computer-assisted fieldwork
Building a Southern Kurdish dialect tree with PARSE.
Lucas Ardelean · University of Bamberg · University of Zurich · 2026
The classification puzzle
Why a "press the button" AI does not work here
A guiding principle
The machine narrows the search.
The linguist makes the judgment.
Locates likely regions in long recordings. Offers a candidate transcription.
Every accept, edit, or rejection is an explicit human action.
Models write candidates, not commitments. Acceptance is an explicit human action.
Computed result and human correction stored as separate layers. Auditable forever.
Where AI sits in the pipeline
Three models transcribe 2–5 h of audio per speaker — orthographic, Kurdish-script, and phoneme-level IPA.
AILinguist checks quality, picks the best repetition, splits multi-word responses, re-runs noisy outputs.
HumanUse verified anchors plus the transcript layers to find the right 85 target words in 530+.
AILexStat clusters forms using Levenshtein distance + sound correspondences learned from the data.
ComputationalJob 1 · Reading the audio
Job 2 · Review and decide
Job 3 · Anchoring
Job 4 · Adjudicating
PARSE — keeping the human in charge
Original audio never cut · Algorithm and human stored separately · LingPy + NEXUS export
Beyond the four jobs
Step 4, in plain English
Like a confidence interval —
but for trees.
Pick one "best" tree and report it as the answer.
Sample a probability-weighted ensemble of plausible trees.
"Kalhori–Khanaqini together: 87% of sampled trees." A probability, not a yes/no.
When the data is ambiguous, the method says so. Low support is information, not failure.
Why this method, for this data
From the engine, visually
Numbers like 0.9999, 0.5675 = posterior probability — the share of sampled trees containing that grouping.
What this delivers — and what it doesn't claim
For language assessment
The rater's judgment is the result.
Audit trails matter more than raw accuracy.
The same pattern — scout, second opinion, human-owned decision, full provenance — applies wherever expert judgment doesn't scale. Proficiency rating, error coding, dictation, oral-task transcription. The tool may not transfer; the discipline does.
PARSE — github.com/ArdeleanLucas/PARSE · MIT licensed
Thank you. Questions welcome.