Multimodal Browser AI with Transformers.js for Images and Speech Most browser AI tutorials cover text because it is a natural starting point, but the applications people actually want to build are rarely text-only. Published: 2026-06-10