While v2.1.6 is a benchmark for stability, the technology continues to evolve. Announcements in early 2026 point to a deepened partnership between Adobe and , leading to a new generation of on-device models. These models deliver near-cloud accuracy at high speed, processing one hour of audio in about 55 seconds, and support the latest hardware such as Mac M5 and NVIDIA RTX GPUs.
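To put the quoted figure in perspective, here is a rough back-of-the-envelope calculation. It assumes processing time scales linearly with audio length, which the announcement does not confirm; the function names are illustrative and not part of any Adobe API.

```python
def realtime_factor(audio_seconds: float, processing_seconds: float) -> float:
    """Seconds of audio processed per second of wall-clock time."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


def estimate_processing_seconds(audio_seconds: float, factor: float) -> float:
    """Estimated wall-clock time, ASSUMING throughput scales linearly."""
    if factor <= 0:
        raise ValueError("factor must be positive")
    return audio_seconds / factor


# Figure quoted above: one hour of audio in about 55 seconds.
factor = realtime_factor(3600, 55)
print(f"{factor:.1f}x real time")  # 65.5x real time

# Under the linear assumption, a 10-minute clip would take about:
print(f"{estimate_processing_seconds(600, factor):.1f} s")  # 9.2 s
```

Real-world times will vary with hardware, language, and audio quality, so treat the result as an order-of-magnitude estimate.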

Making videos accessible with captions not only helps viewers with hearing impairments but also boosts search engine optimization (SEO) and engagement on platforms like YouTube and TikTok. v2.1.6 makes this process effortless.

Double-click any word in the Transcript window to fix typos.

Once transcription completes, review the result in the Text panel for accuracy. You can make corrections directly in the panel.

Once your transcript is accurate, click the captions (CC) icon at the top of the Text panel.
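For readers curious about what caption data looks like outside the editor, here is a minimal sketch of the SubRip (SRT) layout, a plain-text caption format that platforms such as YouTube accept. The segment data and function names are hypothetical; this is not how Premiere Pro generates captions internally.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments) -> str:
    """Build SRT text from (start_seconds, end_seconds, text) tuples."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text}\n"
        )
    # Caption blocks are separated by a blank line.
    return "\n".join(blocks)


# Hypothetical transcript segments, for illustration only.
example = [
    (0.0, 2.5, "Welcome back to the channel."),
    (2.5, 5.0, "Today we are adding captions."),
]
print(to_srt(example))
```

Each block carries a sequence number, a start and end time, and the caption text, so a correction made in the transcript changes the text line of the matching block.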

Newer versions support offline transcription once the language packs are downloaded locally, so a constant internet connection is no longer required.

On machines with Intel Core i9 or Apple Silicon (M1/M2) processors, transcription can be up to 3 times faster than on standard hardware.