Allison's bookmarks (tagged text)

An interactive introduction to the terrific experience of rendering Arabic typography and its technical debt | La Vita Nouva

"An Arabic font is a small program. The text you store is its input, not its output. The word is performed fresh every time you look at it, like music from a score."

text typography poetics mol arabic

Saved 2026-07-10T18:01:49.038986Z

Love letter to catlangs - Ideas - Malleable Systems Forum

incredible thread on concatenative languages that goes to some unexpected places

programming language math linguistics text poetics

Saved 2026-04-14T18:44:04.714403Z

Unicode Visual Explorer

"A visual explorer for Unicode. Browse the character set, discover related glyphs, and learn more about the scripts, symbols, and shapes that make up the standard."

machinelearning text poetics mol unicode

Saved 2026-04-09T18:11:14.157626Z

❒ Tofu | CJK Han Character Comparison Tool

great tool for comparing how hanzi are rendered in different regions

text writing poetics mol unicode

Saved 2026-04-08T21:49:40.111922Z

Typotheque: A typeface in four dimensions

beautiful font, fun visualization

type typography interactivity visualization text poetics mol

Saved 2026-01-19T19:49:39.957603Z

ASCII characters are not pixels: a deep dive into ASCII rendering

really cool documentation of a project applying nearest-neighbor search to vectors representing spatial coverage of ascii characters for doing image to ascii rendering. great application of machine learning techniques, typography, experimental iteration

text typography poetics ascii art mol

Saved 2026-01-17T19:45:14.298916Z

Apple II graphics: More than you wanted to know

tired: "What would Jesus do?" wired: "How did Woz do this?"

retro graphics text poetics

Saved 2025-12-29T20:39:11.855084Z

Modal editing is a weird historical contingency we have through sheer happenstance • Buttondown

"I think the best explanation is that in a vacuum modal editing sounds like a bad idea. The mode is global state that users always have to know, which makes it dangerous. To use new modes well you have to memorize all of the keybindings, which makes it difficult." good post but i would point to things like DAWs and trackers as modal interfaces (there's a record mode and an edit mode)

text editor ui history programming

Saved 2025-10-22T19:01:47.790852Z

all text in nyc

"a search engine that finds text in New York City's Google Street View images. Search for any word or phrase to see where it appears across the city—in shop signs, graffiti, advertisements, and protest signs"

text poetics data datasets nyc geography

Saved 2025-09-01T19:25:45.877625Z

The Beautiful Dissociation of the Japanese Language - Aether Mug

"It's like reading in stereo, where sometimes the same message is conveyed to you in two different formats on separate channels, and sometimes two messages blend together as something new."

language linguistics japanese text poetics mol

Saved 2025-08-15T18:55:22.785704Z

Alternative Layout System

"This research rethinks paragraph formatting, inspired by techniques from Hebrew and Arabic manuscripts to challenge conventional layouts" with downloadable scripts for indesign. very cool

design typography text poetics manuscripts

Saved 2025-06-27T19:14:50.217113Z

AI copyediting: how Paperpal butchered my paper on AI-generated writing – Jill Walker Rettberg

"The irony here is that the paper that Paperpal butchered is about how generative AI normalises and homogenises our writing. Which is exactly what Paperpal was doing to my writing."

writing ai text poetics

Saved 2025-06-19T17:16:59.802373Z

🛠️ My Text Tools - Free, Online, Text Manipulation Tools

charming! hopefully it sticks around for a while and doesn't go all paywall on us

writing text poetics

Saved 2025-06-04T19:42:39.454748Z

Bare Metal Vi, boot into Vi without an OS! - Raymii.org

dream OS tbh

programming text

Saved 2025-01-04T22:18:29.586764Z

Yellow Silence: Miniature from the Silos Apocalypse (ca. 1100) — The Public Domain Review

"Here sonic absence is visualized, and it is yellow.... Auditory interruption gets transposed onto the textual plane, as the rectangle veils the ruled lines it floats above.... The effect becomes all the more palpable when we consider that the manuscript may have been read aloud."

text poetics mol silence medieval christianity

Saved 2024-11-15T20:21:43.119563Z

GRAIL Text Recognizer

very informative essay with good inline demonstrations

handwriting ai text writing interfaces interactivity

Saved 2024-10-20T00:08:56.539386Z

wordfreq/SUNSET.md at master · rspeer/wordfreq

"The field I know as 'natural language processing' is hard to find these days.... It's rare to see NLP research that doesn't have a dependency on closed data controlled by OpenAI and Google, two companies that I already despise. [...] [C]ollecting a whole lot of text in a lot of languages... used to be a pretty reasonable thing to do, and not the kind of thing someone would be likely to object to. Now, the text-slurping tools are mostly used for training generative AI, and people are quite rightly on the defensive. If someone is collecting all the text from your books, articles, Web site, or public posts, it's very likely because they are creating a plagiarism machine that will claim your words as its own." i feel this in my very bones

ai nlproc text

Saved 2024-09-18T14:46:21.798924Z

The Encyclopedia Project, or How to Know in the Age of AI - Public Books

"[W]hat is currently sold to us as “Artificial Intelligence”... is neither intelligent nor entirely artificial, yet it’s pumping the internet with automated content more quickly than you can fire an editorial office. No system predicated on these assumptions can hope to discern “misinformation” from “information”: both are reduced to equally weighted packets of content, merely seeking an optimization function in a free marketplace of ideas. And both are equally ingested into a great statistical machinery, which weighs only our inability to discern."

epistemology ai text language internet culture

Saved 2024-06-13T22:48:17.355961Z

Doing their hype for them • Buttondown

"The central claim of the tech companies selling LLMs is that any work that people do that results in text artifacts is just "text in-text out" and can therefore be replaced by their synthetic text-extruding machines. The best response to that claim is not "oh no, we can't keep up" but to take pride in one's work... and push back"

ai text poetics

Saved 2024-04-04T20:49:53.442054Z

Poetix – Post Position

Nick Montfort's poetics: "Writing very small-scale computational poems allows me to learn more about computing and its intersection with language and poetry. Not computing in the abstract, but computing as embodied in particular platforms, which are intentionally designed and have platform imaginaries and communities of use and practice surrounding them."

poetics poetry text language computation

Saved 2024-03-02T22:00:42.151663Z

Nomic Blog

"Open source, open data, open training code, fully reproducible and auditable text embedding model"

text machinelearning ai nlproc

Saved 2024-03-02T21:53:48.831044Z

Fax History

references, articles, publications on the history of fax machines

text telecommunications history technology

Saved 2024-03-02T21:26:55.314537Z

The "Menard-O-Tron" and Some Reflections on Creative Processes | Zach Whalen

I define "writing" as a heuristic or a way of making educated guesses, letter by letter, until a writing task is considered complete. Users can experience this with whichever file they choose to upload."

poetics text

Saved 2024-03-02T21:15:32.630217Z

New Words

"a speculative research project exploring the use of machine learning for the evolution of language. Large language models (LLM's) are fantastic at capturing our language as it currently is - but language is constantly evolving and adapting. Can machine learning help us create something truly new and unbounded by its training data?"

poetics machinelearning text language

Saved 2023-12-15T00:59:10.317819Z

dell-research-harvard/AmericanStories · Datasets at Hugging Face

"a collection of full article texts extracted from historical U.S. newspaper images [that] includes nearly 20 million scans from the public domain"

datasets corpora language text history

Saved 2023-09-13T18:51:55.137457Z

Artificial Intelligence - The Authors Guild

"We need to ensure that human creators are compensated, not just for the sake of the creators, but so our books and arts continue to reflect both our real and imagined experiences, open our minds, teach us new ways of thinking, and move us forward as a society, rather than rehash old ideas."

ai generative text policy labor

Saved 2023-07-20T20:10:38.966066Z

blackout engine

intuitive and fun blackout poetry interface, using snippets from project gutenberg

text poetry poetics interface interactivity

Saved 2023-07-20T16:55:30.347407Z

The LLMentalist Effect: how chat-based Large Language Models replicate the mechanisms of a psychic's con

"The chatbot’s answers sound extremely specific to the current context but are in fact statistically generic. The mathematical model behind the chatbot delivers a statistically plausible response to the question. The marks that find this convincing get pulled in." this is really good but i wish it approached the topic of psychics with a bit less bro-ey skepticism

divination text poetics ai conversation

Saved 2023-07-08T18:10:57.254256Z

Degenerative AI in education | code acts in education

"[E]ducational technology is overly dominated by psychological conceptions of individual learning... AI-based personalized learning systems [are] based on notions of mastery and... statistical measurement," reflecting an "assumption that human intelligence is an individual capacity, which can therefore be improved with technical solutions — like tutorbots — rather than something shaped by educational policies and institutions."

education pedagogy culture ai text

Saved 2023-07-06T17:53:25.130644Z

Laying Out a Print Book With CSS | Ian G McDowell's Blog

books design css text

Saved 2023-05-05T17:46:32Z

Priya22/project-dialogism-novel-corpus: The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.

(via data is plural): "every quotation from 22 novels, plus who speaks each line, who they’re addressing, the characters they mention, and more. With 35,000+ quotations, the corpus 'is by an order of magnitude the largest dataset of annotated quotations for literary texts in English.'"

data datasets corpora text

Saved 2023-02-01T20:16:10Z

Pixtura12 Medieval Pixel Font | OpenGameArt.org

"12px pixel art font that resembles textura/textualis writing from late medieval manuscripts and early printed books"

fonts type text pixelart

Saved 2022-11-11T16:55:47Z

Hysterically Real: ELDEN POEM by Daniel Scott Snelson PDF Paperback ...

"Snelson plays a wandering bard – misusing the system to produce the most unlikely of scrawls: small poems scattered across the game’s landscape. The book is a documentation of that performance in a prosody marked by the poetics of fandom. They are recorded here as movie captures, static images, and poetic texts, arranged in four parts spelling a newly coherent object."

videogames poetry poetics text landscape

Saved 2022-09-21T03:35:23Z

Atomic Activity Books

computer-generated books ("a non-human reading")

poetics generative text publishing

Saved 2022-05-22T14:01:40Z

Bjørn Karmann › Occlusion Grotesque

"... an experimental typeface that is carved into the bark of a tree. As the tree grows, it deforms the letters and outputs new design variations, that are captured annually"

design typography text mol

Saved 2022-05-13T22:22:25Z

Download the Atkinson Hyperlegible Font | Braille Institute

"Atkinson Hyperlegible font is named after Braille Institute founder, J. Robert Atkinson. What makes it different from traditional typography design is that it focuses on letterform distinction to increase character recognition, ultimately improving readability." looks good too imo

accessibility fonts typography text

Saved 2022-03-21T17:03:33Z

Some Georgian and Victorian Acrostic Puzzles: Precursors to Crosswords | MetaFilter

wordgames poetics language text

Saved 2022-03-07T17:02:56Z

Brendan Howell – Rustic Computing

generating from a markov model by hand

language languagemodels poetics text

Saved 2022-01-14T18:53:09Z

Alphabetical Order - 99% Invisible

"Alphabetical approaches upended accepted hierarchies"

text poetics ontology alphabet

Saved 2021-12-04T23:41:20Z

annagarbier/simple_dialogues: Simple dialogues converted into ridiculously detailed phonetic descriptions.

replaces dialogue with detailed phonetic descriptions of the dialogue. very cool

text phonetics poetics generative

Saved 2021-11-19T16:26:12Z

actionscoregenerator - Nathan Walker - Performance Artist

"a website that produces event scores for performance. The material objects, locations and activities within each score are based on the performance archives of Nathan Walker between 2009-2014 and work towards shuffling and redistributing the archival record to create an anarchive."

text language poetics generative fluxus

Saved 2021-11-19T16:20:45Z

NaNoGenMo Workshop

zach whalen's notes! very thoughtful!

text poetics novels nanogenmo generative

Saved 2021-10-21T18:55:48Z

The Book of Veles: How Jonas Bendiksen Hoodwinked the Photography Industry | Magnum Photos

'I started to ask myself the question – how long will it take before we start seeing “documentary photojournalism” that has no other basis in reality than the photographer’s fantasy and a powerful computer graphics card? Will we be able to tell the difference? How hard is it to do? How skilled will our own community of photographers and editors be in sniffing out what are deep fakes and what is real?'

photography syntheticmedia generative poetics text

Saved 2021-10-15T18:05:23Z

The Languages Which Almost Became CSS

"While these languages are obviously not in common use today, we find it fascinating to think about the world that might have been. Even more surprisingly, it happens that many of these other options include features which developers would love to see appear in CSS even today."

programming history layout design web text poetics

Saved 2021-09-21T22:04:06Z

Blabrecs

"BLABRECS is a rules modification for the wordgame SCRABBLE that swaps out the dictionary of real-if-obscure English words for a capricious artificial intelligence. In BLABRECS, real English words aren't allowed! Instead, you have to play nonsense words that sound like English to the AI. These nonsense words are called – you guessed it – BLABRECS."

machinelearning wordgames text poetics language games

Saved 2021-09-04T02:43:22Z

alphacep/vosk-api: Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

"Vosk is an offline open source speech recognition toolkit. [...] Vosk models are small (50 Mb) but provide continuous large vocabulary transcription, zero-latency response with streaming API, reconfigurable vocabulary and speaker identification." Bindings for various languages, "scales from small devices like Raspberry Pi or Android smartphone to big clusters."

speech nlproc text language poetics

Saved 2021-07-20T05:15:40Z

Alien Dreams: An Emerging Art Scene - ML@B Blog

"...the method here is quite different. DALL-E is trained end-to-end for the sole purpose of producing high quality images directly from language, whereas this CLIP method is more like a beautifully hacked together trick for using language to steer existing unconditional image generating models." good history of the emergence of CLIP art

machinelearning generative art text poetics

Saved 2021-07-09T22:13:31Z

The Sense of Neoism?! | Sofian Audry

"At the top of the machine, an LED panel endlessly regurgitates its own new neoist verses into the eyes of the audience, equally brainwashing humans, cyborgs, robots, and other technobiological systems. Anyone can directly hack into the system's artificial neural synapses by unplugging, replugging, and criss-crossing jack cables directly on the machine, thus deconstructing, reconstructing, and even destroying the generative capabilities of the system in real-time."

machinelearning text poetics generative glitch

Saved 2021-06-29T16:57:18Z

eyung/Singling: Java application for sonification of linguistic data.

"Application for the sonification of text which can be transformed according to various triggers and parameters to facilitate the learning and analysis of literacoustics, reading by listening."

text poetics sonification nlproc

Saved 2021-06-15T21:08:56Z

The Hater Box - ParseError

"Voluntarily provocative, The Hater Box transforms the principle of old split flap displays into a random generator of contestations, cold and impersonal." 2018 – Wood, motor, cardstock, print, 3D print

text poetics materialoflanguage pcomp

Saved 2021-05-31T21:13:23Z

Rainbow Zero by Spinfoam Games

"Rainbow Zero is a... toy? widget? thingy? that allows you to explore a part of the space defined by the GloVe word vectors."

games nlproc text poetics wordvectors

Saved 2021-05-28T16:56:45Z

Thoughts on DeepDaze, BigSleep, and Aleph2Image | Ryan Murdock’s Portfolio

some text to image stuff

machinelearning ekphrasis text poetics generative

Saved 2021-05-20T23:20:33Z

Scenescoop (Cristóbal Valenzuela)

similarity of images based on semantic similarity between automatic captions

machinelearning ekphrasis text poetics

Saved 2021-05-20T21:31:39Z

ryankiros/neural-storyteller: A recurrent neural network for generating little stories about images

"a recurrent neural network that generates little stories about images"

machinelearning ekphrasis text poetics

Saved 2021-05-20T21:29:29Z

TextOCR

"TextOCR provides ~1M high quality word annotations on TextVQA images allowing application of end-to-end reasoning on downstream tasks such as visual question answering or image captioning."

text ocr datasets language poetics machinelearning

Saved 2021-05-17T21:47:42Z

Machines à écrire - CD-ROM - GALLIMARD - Site Gallimard

okay I had no idea this existed: an interactive CD-ROM of various Oulipo texts

french literature poetics poetry text interactivity

Saved 2021-05-06T17:48:04Z

artnet

char-rnn trained on ansi artwork

ascii ansi poetics text machinelearning languagemodels

Saved 2021-04-22T19:33:28Z

syntax-tree/unist: Universal Syntax Tree used by @unifiedjs

common format for representing syntax trees of html, markdown, etc.

programming plaintext_markup text

Saved 2021-03-25T19:49:02Z

Download the C4 dataset! · Discussion #5056 · allenai/allennlp

allennlp's version of the c4 dataset

machinelearning text datasets poetics

Saved 2021-03-22T22:16:57Z

Applied Language Technology - YouTube

spacy 3 tutorials

nlproc python spacy text

Saved 2021-03-01T23:30:10Z

Onfim Wuz Here: On the Unlikely Art of a Medieval Russian Boy ‹ Literary Hub

"Icons are not primitive or rudimentary attempts to duplicate the physical world; they are nuanced and complex attempts to embody the spiritual world."

art literature poetry poetics text writing history religion

Saved 2021-02-17T22:06:33Z

Foldable Words | bit-player

"How many words can we form by making folds in the straw-paper slogan? I could not have answered that question in 1967. I couldn’t have even asked it. But times change. Enumerating all the foldable messages now strikes me as an obvious thing to do when presented with the straw wrapper. Furthermore, I have the computational means to do it—although the project was not quite as easy as I expected."

text poetics

Saved 2021-02-15T18:48:41Z

nipunsadvilkar/pySBD: 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.

"a rule-based sentence boundary detection that works out-of-the-box"

nlproc programming text

Saved 2021-01-16T18:42:36Z

Gendered Characterizations

"graphs the usage of words (whether in description or dialogue) over time, distinguishing that usage both by the gender of the fictional characters the terms are associated with, and by the gender of the authors who used them"

text dh poetics gender literature

Saved 2021-01-16T00:01:07Z

Alt-Text as Poetry

"Alt-text is an essential part of web accessibility. It is often overlooked or understood through the lens of compliance, as an unwelcome burden to be met with minimum effort. How can we instead approach alt-text thoughtfully and creatively?" (presented at wordhack dec 2020)

accessibility web text poetics poetry

Saved 2020-12-18T01:25:18Z

Shapecatcher: Draw the Unicode character you want!

unicode text poetics

Saved 2020-12-10T23:16:24Z

Karrik

"Karrik is an open source typeface designed by Jean-Baptiste Morizot and Lucas Le Bihan. This font was originally commissioned by ‘Cercle’ magazine for their 2020 issue—dedicated to the topic of ghosts. The design started in March 2019 and ended in October of the same year. [...] Karrik is rooted in vernacular typography. The weight disadjustments, the lack of optical corrections, the uneven width of the letters are some of the features of early sans serif typefaces that inspired us..." (hey plus it's open source!)

fonts typography text poetics

Saved 2020-12-04T20:27:10Z

Soulcraft Typeface on Behance

nice bold sans serif with variable features

fonts typography text

Saved 2020-12-04T20:06:01Z

Piazzolla Type System

lovely ofl serif with many features

typography fonts text

Saved 2020-12-04T17:26:13Z

UniMorph: Schema and datasets for universal morphological annotation

"a collaborative effort to improve how NLP handles complex morphology in the world’s languages. The goal of UniMorph is to annotate morphological data in a universal schema that allows an inflected word from any language to be defined by its lexical meaning, typically carried by the lemma, and by a rendering of its inflectional form in terms of a bundle of morphological features from our schema."

nlproc text programming poetics linguistics morphology

Saved 2020-12-01T00:16:02Z

AI Generated Pokemon Sprites with GPT-2

cool détournement of gpt2

generative text graphics pokemon

Saved 2020-11-17T15:41:59Z

Leveraging Machine Learning to Fuel New Discoveries with the arXiv Dataset | arXiv.org blog

text poetics datasets corpora

Saved 2020-08-19T13:45:05Z

Jurafsky & Martin chapter on constituency grammars

language linguistics syntax poetics text

Saved 2020-08-19T13:20:29Z

Scientists rename human genes to stop Microsoft Excel from misreading them as dates - The Verge

bglkjawbflablfhbawefjh

language text programming poetics biology

Saved 2020-08-07T19:15:31Z

EPIC-KITCHENS Dataset

"The extended largest dataset in first-person (egocentric) vision; multi-faceted, audio-visual, non-scripted recordings in native environments - i.e. the wearers' homes, capturing all daily activities in the kitchen over multiple days. Annotations are collected using a novel 'Pause-and-Talk' narration interface."

datasets cooking cv text corpus

Saved 2020-06-29T16:32:34Z

htrc/htrc-feature-reader: Tools for working with HTRC Feature Extraction files

python interface for the HTRC Extracted Features dataset

python programming text corpora datasets

Saved 2020-06-24T19:27:27Z

kchapelier/stochastemes: Second entry for PROCJAM 2015

generates (via markov chain) and speaks made-up words from various corpora

spelling poetics poetry generative text

Saved 2020-06-24T19:21:58Z

StereoSet

"StereoSet is a dataset that measures stereotype bias in language models. StereoSet consists of 17,000 sentences that measures model preferences across gender, race, religion, and profession."

machinelearning nlproc text poetics culture

Saved 2020-06-17T20:08:01Z

OPEN SCORES. How to Program the Commons. Exhibition catalogue – creating commons

"The exhibition OPEN SCORES brought together a series of practices through which artists articulate their specific forms of digital commons. From online archives to digital tools/ infrastructure and educational formats, the projects envision a (post-)digital culture in which notions of collaboration, free access to knowledge, sustainable use of shared resources, and data privacy are central. For the exhibition, each of the projects created a unique score to present their practice."

text poetics creativecommons

Saved 2020-06-17T16:51:20Z

LUCAS LAROCHELLE — Queering The Map

mapping text poetics lgbtq

Saved 2020-05-15T17:08:09Z

Make Me a Hanzi | Free, open-source Chinese character data

"[a] dictionary and graphical data for over 9000 of the most common simplified and traditional Chinese characters. Among other things, this data includes stroke-order vector graphics for all these characters." (via gábor ugray's !!con 2020 talk)

writing text chinese poetics mol datasets

Saved 2020-05-09T17:21:45Z

whipson/PoKi-Poems-by-Kids: PoKi: A Large Dataset of Poems by Children

"freely available for research with the condition that the research be used for the benefit of children"

text datasets corpora poetry poetics

Saved 2020-04-29T15:37:13Z

Advanced NLP with spaCy · A free online course

should make sure I know all this stuff

nlp python tutorial text poetics

Saved 2020-03-23T21:24:15Z

Chinese WeChat Users Are Sharing A Censored Post About COVID-19 By Filling It With Emojis And Writing It In Other Languages

"[T]o avoid the censorship, people have converted parts of the interview into Morse code, filled it up with emojis, or translated it into fictional languages like Sindarin from The Lord of the Rings or Klingon from Star Trek. In one particularly creative example, someone inserted it into the iconic opening crawl of Star Wars."

language text poetics politics censorship china

Saved 2020-03-14T18:14:50Z

New York Apartment

sam and tega. very good

text poetics poetry DigitalHumanities politics realestate

Saved 2020-03-10T22:22:45Z

Grey Rabbit

'...very boring stories that did not even satisfy my youngest children... I tried these stories on my very small children but after some minutes they grew very irritable, because nothing actually happened. This shows that even small children of three can measure entropy'

poetry poetics generative text

Saved 2020-03-08T20:08:01Z

Face

"Face lets you edit both the text and the font it is rendered in. In text mode you can type and edit text normally. Press escape to enter font mode, where you can select a character to edit. Any changes to a character are visible immediately."

text interface writing poetics

Saved 2020-03-02T20:46:30Z

CCMatrix: A billion-scale bitext data set for training translation models

"CCMatrix is the largest data set of high-quality, web-based bitexts for training translation models. With more than 4.5 billion parallel sentences in 576 language pairs pulled from snapshots of the CommonCrawl public data set, CCMatrix is more than 50 times larger than the WikiMatrix corpus that we shared last year."

nlproc translation text poetics datasets

Saved 2020-02-10T22:19:38Z

Kyle Booten — Tentacular

"Conclusion: We hypothesized that radical swings in affective posture would make the writer more emotionally flexible. Likewise, we hypothesized that attempting to discern the emotional valences of a machine learning model derived from achingly sensitive Tumblr posts would make the writer more empathetic. Unfortunately, no conclusions could be drawn from a single poem."

poetics text machinelearning sentiment poetry

Saved 2020-02-07T22:55:50Z

GANfield - YouTube

important

gan machinelearning comics text poetics

Saved 2020-02-05T22:22:19Z

Evennia

python-based mush/moo thing! could be fun to play around with

python games text poetics

Saved 2020-02-05T15:47:07Z

ranjit :music_mouse:: "La kiki mise à nu par ses boubas, même - Duchamp …" - Friend Camp

sesame street on phonaesthetics

text poetics phonetics phonology nonsense

Saved 2020-01-20T20:52:50Z

Variable Fonts

"A simple resource for finding and trying variable fonts"

typography design text mol

Saved 2020-01-09T22:35:41Z

The Mechanical Muse | The New Yorker

ranjit!

poetry poetics text machinelearning

Saved 2020-01-08T02:50:50Z

文言 / wenyan‑lang

"an esoteric programming language that closely follows the grammar and tone of classical Chinese literature. Moreover, the alphabet of wenyan contains only traditional Chinese characters and 「」 quotes, so it is guaranteed to be readable by ancient Chinese people." (from one of Golan Levin's students)

programming chinese language text poetics

Saved 2020-01-01T16:55:09Z

mhagiwara/github-typo-corpus: GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors

"a large-scale dataset of misspellings and grammatical errors along with their corrections harvested from GitHub. It contains more than 350k edits and 65M characters in more than 15 languages, making it the largest dataset of misspellings to date."

datasets language text poetics

Saved 2019-12-11T16:06:06Z

🦄🤝🦄 Encoder-decoders in Transformers: a hybrid pre-trained architecture for seq2seq

this looks promising

machinelearning nlproc text language

Saved 2019-12-10T19:04:02Z

alexwarstadt/blimp: The Benchmark of Linguistic Minimal Pairs

"a challenge set for evaluating what language models (LMs) know about major grammatical phenomena in English" it warms my heart to see an ngram baseline in there, haha

language linguistics text nlproc

Saved 2019-12-09T22:04:59Z

OpenMoji

"OpenMoji is an open source project of 53 students and 2 professors of the HfG Schwäbisch Gmünd and external contributors" (CC BY-SA 4.0)

text typography emoji svg

Saved 2019-12-08T22:55:28Z

Teaching AI Feminism and Making Art

"I run datasets of iconic feminist texts through a simple textRNN, generating new feminists texts in the legendary words of bell hooks, Simone De Beauvoir, Betty Friedan and Audre Lorde. Some are funny. Some are poetic. Some make no sense at all and some are way too real. Information about the model and settings can be found under each post."

poetics text machinelearning language generative feminism theory

Saved 2019-12-06T16:57:07Z

Semantic Specialization of Distributional Representation Models

another tutorial from emnlp-19

nlproc language semantics poetics text

Saved 2019-12-01T21:34:55Z

Measuring gender imbalances in reporting on the creative industries

language text nlproc dataviz

Saved 2019-11-22T16:43:29Z

Google AI Blog: Announcing Two New Natural Language Dialog Datasets

"In the movie-oriented CCPE dataset, individuals posing as a user speak into a microphone and the audio is played directly to the person posing as a digital assistant. The “assistant” types out their response, which is in turn played to the user via text-to-speech. [...] The Taskmaster-1 dataset makes use of both the methodology described above as well as a one-person, written technique to increase the corpus size and speaker diversity—about 7.7k written “self-dialog” entries and ~5.5k 2-person, spoken dialogs. For written dialogs, we engaged people to create the full conversation themselves based on scenarios outlined for each task, thereby playing roles of both the user and assistant."

datasets nlproc text conversation poetics

Saved 2019-11-06T14:00:05Z

Lists of Note

"On October 19th of 1955, Pulitzer Prize-winning poet, Marianne Moore, was approached by a Mr. Robert Young of the Ford Motor Company and asked to assist them in naming a new series of cars."

advertising onomastics poetry poetics text

Saved 2019-11-04T19:46:17Z

Learn how to make BERT smaller and faster

"ways to make huge models like BERT smaller and faster": quantization, pruning, distillation

machinelearning nlproc text poetics

Saved 2019-10-29T03:19:43Z

How you’re feeling when machine learning might help - Quartz AI Studio

"We’ll never be able to read all of these documents. What’s unique about this text compared to all the rest? My eyes sting from searching these images for the same thing. We need to find more records like these in a huge pile of data. I could really use a heads-up before this happens again. (Post to come.)" I *reeeeeally* appreciate approaches to ml like this that start with problems to be solved (instead of just taking for granted that ai/ml is useful)

programming machinelearning journalism text

Saved 2019-10-28T21:31:30Z

Bit Tripper

"Create a unique bitfont from a vast space of glyphs generated by a neural network." yacht/counterpoint

mol text typography machinelearning

Saved 2019-10-28T16:31:25Z

Joel Simon on Twitter: "New work in my Dimension of Dialogue series :) Two neural nets learn to communicate through their own emergent visual language. The resulting alphabet is a product of their adversarial and cooperative relationship. Here set in clay

"Two neural nets learn to communicate through their own emergent visual language."

text language machinelearning poetics mol

Saved 2019-10-28T16:27:05Z

Vectoglyph | Nicolas Boillot | Fluate.net

gpt-2 on svg for generated emoji and letterforms

typography mol emoji text poetics

Saved 2019-10-28T16:24:33Z

Type@Cooper: The Hershey Fonts

a "talk about the history and environment of Hershey’s creation, and touch on the current state of resurrection."

typography text poetics

Saved 2019-10-22T15:16:38Z

Universal Dependencies

"Universal Dependencies (UD) is a framework for consistent annotation of grammar (parts of speech, morphological features, and syntactic dependencies) across different human languages. UD is an open community effort with over 200 contributors producing more than 100 treebanks in over 70 languages."

datasets nlproc language text poetics

Saved 2019-10-15T14:11:54Z

Text Rendering Hates You

"there are no consistent right answers, everything is way more important than you think, and everything affects everything else"

text typography

Saved 2019-10-07T16:29:17Z

Memex #001

"The father/daughter team of Trevor F. Smith and Sparks Webb have gathered all available documentation about the memex and carefully fabricated the Memex #001 to match Dr. Bush's specifications."

computation hypertext history text poetics

Saved 2019-09-20T16:20:34Z

Analog Science Fiction Magazine

"a free, non-commercial project with the goal of preserving selected paper-based cultural artifacts for future generations of readers, in the form of cover images in JPG format, and, where available, complete cover-to-cover scans in PDF format"—in this case, a bunch of old Analog magazines

scifi text

Saved 2019-09-13T21:45:58Z

Table of Contents for Chapter One. Hypertext and Critical Theory

I assume this is the full text of chapter one of the landow book?

books hypertext text poetics

Saved 2019-09-12T20:18:15Z

ColoredConventions.org

text datasets dh

Saved 2019-09-09T21:52:54Z

Other Orders

"Recommendation engines like the ones powering the endless feeds on Twitter, Facebook and YouTube, are designed to maximize ad revenue, and therefore to keep you online for as long as possible. In doing so they promote the most reactionary content on their platforms. Yet, these recommendation systems are nothing more than sorting mechanisms. Other Orders provides an alternate set of sorts, optimized for other outcomes."

text poetics programming nlproc

Saved 2019-09-06T19:26:43Z

Jenny Holzer Hits Her Mark in a Major, Largely Unnoticed Retrospective

"Artists aim differently than sharpshooters. They are not typically trying to take something out, but to draw something out. The mark Holzer hits in this case is the mark in the most cave-drawing sense: the effort to leave (or find) a trace of something that is not an opinion, but a register of some kind, certifying a lived experience. There may be no such thing as a permanent record, but the fact that the Washington Post contributor found Holzer’s work dangerous is a sign in and of itself that it has achieved one of its goals: it has carved a deep enough mark to leave a strong impression (for that writer, a menacing one). That’s the most any language or other kind of mark-making can hope to accomplish."

art politics language text poetics

Saved 2019-09-03T15:52:22Z

ProseMirror

well this looks like a dream come true?

text interface poetics editor

Saved 2019-08-03T15:27:58Z

Sam Roxas-Chua 姚 | Everything is here, everything.

asemic writing art text poetics

Saved 2019-07-23T16:53:52Z

The UN Security Council Debates - Harvard Dataverse

"[A] dataset of UN Security Council debates between January 1995 and December 2017... split into distinct speeches" with metadata on "the speaker, the speaker's nation or affiliation, and the speaker's role in the meeting" and "the topic of the meeting." 65393 speeches extracted from 4958 meeting protocols (!). via data is plural

text datasets politics

Saved 2019-07-17T21:13:50Z

The Annotated Transformer

nlproc machinelearning text poetics

Saved 2019-07-09T19:21:54Z

BPEmb

"a collection of pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE) and trained on Wikipedia. Its intended use is as input for neural models in natural language processing"

nlproc text poetics machinelearning

Saved 2019-06-30T20:07:12Z

trees are harlequins, words are harlequins — the transformer ... “explained”?

"The Transformer is nothing more than an architecture where the core functional unit is attention. You stack attention layers on top of attention layers, just like you would do with CNN or RNN layers."

algorithms ai nlproc text poetics

Saved 2019-06-28T17:42:33Z

VSCodium - Open Source Binaries of VSCode

"a community-driven, freely-licensed binary distribution of Microsoft’s editor VSCode"

editor text opensource

Saved 2019-06-26T19:47:03Z

Wiktionary:Frequency lists - Wiktionary

wiktionary word frequency lists

language data text poetics

Saved 2019-06-13T19:08:24Z

This page is a truly naked, brutalist html quine

amazing

html css programming layout text

Saved 2019-06-05T21:45:38Z

Markdown.css - make HTML look like plain-text

css stylesheet that displays html as markdown. brilliant

css html web programming text layout plaintext_markup

Saved 2019-06-05T21:42:29Z

Science Fiction Writer Robert J. Sawyer: WordStar Under Windows

down the rabbit hole of fantasy/sf authors using wordstar

text writing interfaces retro dos

Saved 2019-06-01T18:57:16Z

You elected them to write new laws. They’re letting corporations do it instead. – Center for Public Integrity

legislation business culture text poetics

Saved 2019-04-12T01:09:33Z

GPT-2 Neural Network Poetry - Gwern.net

uses my poetry corpus! though points out some shortcomings.

poetry generation gpt2 poetics text

Saved 2019-03-28T20:45:26Z

Building my first keyboard (and you can too) – Sasha Solomon – Medium

detailed explanation of an unbelievably bad-ass build, fuck

text writing typing interface pcomp

Saved 2019-03-27T18:00:50Z

What we learned from getting our autocomplete tested for accessibility - Accessibility in government

text interface accessibility

Saved 2019-03-22T13:53:41Z

We Should Talk by Jack Schlesinger, Jordan Jones-Brewster, Nobo B, ceschiii, kat, carolmertz

games playme narrative dialog text poetics

Saved 2019-03-20T20:41:43Z

Tsvetshop: Home

"Yulia Tsvetkov's research group at Language Technologies Institute of Carnegie Mellon University. Our work focuses on natural language processing, particularly cross-lingual approaches, low-resource settings, and social good."

language poetics text machinelearning nlproc

Saved 2019-03-18T18:58:08Z

Heather Dewey-Hagborg | Unlanguage

"In this interactive installation participants enter the first word that comes to their mind in one of two input terminals in any language. These words are then the seed of a generative process that develops a poem, bifurcating and mutating, merging languages, poetic styles, sense and nonsense. Poems overlap and degrade over time, eventually fading away. Phonetics are remapped to a new alphabet of sound referencing the body and incidental noises, creating a unique expression for each word and making literal the arbitrariness of the language. This installation was projected on a massive scale covering the walls and ceiling and filling the hall of the old imperial castle in Poznan, Poland. This video shows a demonstration of the generated poetry."

text language poetry poetics materialoflanguage

Saved 2019-03-12T20:37:17Z

gpt-2-poetry

kyle mcdonald's take

poetry text poetics nlproc machinelearning gpt2

Saved 2019-03-08T18:14:58Z

Catching Unicorns with GLTR

"a visually forensic tool to detect text that was automatically generated from large language models"

gpt2 text data machinelearning

Saved 2019-03-08T18:12:11Z

languagemodeling.pptx - languagemodeling.pdf

dan jurafsky intro lecture

computational linguistics language text data

Saved 2019-02-26T16:54:52Z

Google's Natural Questions

nlproc datasets text

Saved 2019-01-28T23:36:40Z

Nancy by Olivia Jaimes for January 20, 2019 - GoComics

observation on how avant-garde experimentation requires the materiality of the medium as a frame, or something

theory twitter comics text

Saved 2019-01-20T18:20:31Z

The CRPG Addict: Game 316: Caverns of Mordia (1980)

from what crpg addict calls the "establishing era," this reminds me a bit of 80 Days

games rpg text narrative

Saved 2019-01-17T21:35:30Z

Reading postmortems

"I love reading postmortems. They're educational, but unlike most educational docs, they tell an entertaining story."

programming text poetics narrative devops

Saved 2019-01-10T18:48:11Z

The LaTeX fetish (Or: Don’t write in LaTeX! It’s just for typesetting) – Daniel Allington

text writing plaintext_markup markdown

Saved 2019-01-09T19:17:18Z

Society for Textual Scholarship

"an international organization of scholars working in textual studies, editing and editorial theory, electronic textualities, and issues of textual culture across a wide variety of disciplines."

academia text

Saved 2019-01-06T04:02:04Z