"The notion that the language of one (Cummings) is considered legitimate due to its production by an autonomous will or intention into language and its subversions, and the other (Wernicke’s patient) not so because it is an effect of illness and therefore an unintended accident, reproduces a system of value contingent on the Western subject of colonialism-capitalism. The division of poetic versus aphasic (read: not-poetic) speech is layered with the racism, heteropatriarchy, and ableism of a colonial order that also continues to delegitimate other syntactical and linguistic deviations found in linguistic practices."
"...links concept labels from different conceptlists to concept sets. Each concept set is given a unique identifier, a unique label, and a human-readable definition. Concept sets are further structured by defining different relations between the concepts..."
'None of this is predicated on “trying not to misgender someone” or even “trying not to mess up pronouns accidentally and get yelled at.” Linguistic care work, like any care work truly based in principles of a loving community, cannot run on shame-based fuel. Avoiding shame and harm are only the barest, most basic bar to clear—they do not constitute showing affection. Failing to abuse someone isn’t the same as loving them.'
incredibly insightful review. "Their exact words, not just their paraphraseable meaning but their precise choices of phrasing, become full of comprehensible information about character, and this gives the characters themselves an unusual reality and presence. As in all good poetry, it is the language itself, and not just the plot and worldbuilding, that makes us care."
"a collaborative effort to improve how NLP handles complex morphology in the world’s languages. The goal of UniMorph is to annotate morphological data in a universal schema that allows an inflected word from any language to be defined by its lexical meaning, typically carried by the lemma, and by a rendering of its inflectional form in terms of a bundle of morphological features from our schema."
"a large set of speech samples from a variety of language backgrounds. Native and non-native speakers of English read the same paragraph and are carefully transcribed" (close IPA transcriptions)
"a challenge set for evaluating what language models (LMs) know about major grammatical phenomena in English" it warms my heart to see an ngram baseline in there, haha
well this looks fascinating. “How to do things with nonwords: communication, expression, and meaning” “Musical gestures in the typology of linguistic inferences” “Iconic modulation in spoken language: iconicity, intensification, or both?” etc