Comfortably Numb: I use image recognition to write better alt text 🤷‍♂️ bsky.app/profile/numb...

this is kinda the thing: using vision classifiers doesn't seem bad to me at all?

yeah I naïvely thought that nobody would care about using a vision model, since image classifiers seem non controversial to me? I wouldn’t dare touch a diffusion model since I agree those suck in every conceivable way. anyway clearly people don’t see it that way

I use image recognition to write better alt text 🤷‍♂️ bsky.app/profile/numb...

Confession time: Those alt-text started as ChatGPT image recognition results. I occasionally use it to create alt-text, when I want to capture all details I otherwise would miss. Usually it requires some minimal editing, but otherwise it's very good. The only good AI use case in my opinion.

Tedious translation and summarization tasks that only need to be 90%-99% accurate are a real sweet spot, and it's a surprisingly wide space of tasks.

Post