I mean, that’s a good thing. No one’s saying we shouldn’t think about it. And news publishers, brands, government agencies, nonprofits and others who provide utility to the public on social media should certainly keep accessibility in mind.
Edited because auto-correct mis-corrected accessibility. Very funny, auto-correct.
Just so long as the conversation doesn’t turn into “won’t someone think of the software developers” - I don’t have a dog in the fight. Not a developer - don’t use the twit machine.
It’s not actually a different font, all the bold/cursive/italic/etc text is available in the standard unicode tables, you just need to know the correct code for them. So to a machine reader will instead of seeing an alphanumeric character will instead see the unicode reference.
Although… seeing as all the emojis that twitter can put out are also specified in unicode I wonder how this software handles those.
Ah that makes sense. So if it is standard unicode, then one would think that would be super easy to transcribe to text. “Bold U” = “U”, “Black Letter E” =“E”, etc
AFAIK, screen-readers read out the full Unicode name for the emoji. Every time. So becomes “Reversed Hand With Middle Finger Extended” to a screen reader. Something to consider the next time you decide to write a clapping hands sentence . (or as screen readers see it, “write Clapping Hands Sign a Clapping Hands Sign clapping Clapping Hands Sign hands Clapping Hands Sign sentence Clapping Hands Sign.”)
And the less said about ASCII-art memes, the better.
The thing is, this isn’t really something I think is on screen-readers to fix. Not on their own, anyway. They’re doing exactly what they’re supposed to do: read the text as written. These “fancy” characters are deep in the Unicode extended character set for mathematical symbols, and were never intended to be used as replacements for standard Latin characters in everyday writing. They’re not just bold or italic letters, they have their own semantic meanings, which is why VoiceOver is treating them this way to begin with (also, for the record, VoiceOver’s reading speed in this person’s tweet has been cranked up pretty high, which I suspect most of its day-to-day users are used to, but it does also strongly emphasize just how unintelligible these symbols can make things).
We have actual semantic tags built into HTML to do things like indicate emphasis in ways that screen readers are totally fine with, but Twitter—and most other social media platforms—don’t provide any means of inserting them into the text of a message. Add a text-formatting bar to the tweet field so you can actually bold and italicize normal ASCII/extended characters, and this problem mostly solves itself.
The original post clarifies downthread. They aren’t a clever font, they are using nonstandard characters from the mathematic symbol tables that look like a font. Twitter and the reader are both behaving reasonably, but the people who do that aren’t considering the full audience of their tweets.
The screen reader is doing the correct job. Those aren’t “fonts” they’re special symbols, as for mathematics, that are being “mis-used” for their visual similarity to alternate fonts. So the screen-reader is simply revealing the truth of the text.
While this does create a big problem for those who rely on assistive technologies, I imagine this also creates a problem for those who perform NLP and sentiment analysis on tweets?
I can’t agree with your analogy here. This seems to me to be more like their being a perfectly good ramp, but some of the users are making their contact point* up on the roof which is reachable by ladder. These characters are not a font. The use of them in that way is a hack.
That’s the thing – these aren’t different fonts. These are all special Unicode characters and/or transformations.
This text: “𝖍𝖊𝖑𝖑𝖔” isn’t a special font. It’s all made up of characters in the “Mathematical Alphanumeric Symbols” block:
For a screenreader to know this means “hello”, it needs to be able to know how to take these things that may look like letters (but technically aren’t) into letters. There’s hundreds of thousands of possible code points that can be interpreted as letters.