AudioTextify

AudioTextify: Turning Audio Into Usable Content

AudioTextify Extension

Meetings, interviews, lectures, and video content generate a significant volume of spoken information that most people cannot practically use after the fact. AudioTextify was built to close that gap, transcribing spoken content and turning it into something you can search, summarize, translate, and repurpose.

Built as a Chrome extension, it works where the content already is: no download, no upload, no workflow change required.

What We Built

AI Transcription Engine

Accurate speech-to-text for audio and video content playing in the browser
Multi-speaker recognition that labels who said what
99 language support, covering both transcription and translation
Real-time processing with low-latency output

Content Generation Layer

Automatic summarization: key points extracted without manual review
Blog and article generation from transcript source material
Social media content drafts derived from the core ideas
Meeting summary generation with action item extraction

Chrome Extension Architecture

Sidebar-based UI that stays available without interrupting the main content
Minimal permissions model: only accesses audio from the active tab
Works across YouTube, recorded meetings, podcasts, and browser-based video
Sync across devices for users with multiple workstations

The Engineering Approach

A Chrome extension that processes audio in real time has a tight constraint: it cannot meaningfully delay the user's experience. We built the processing pipeline to run asynchronously: the extension captures audio, batches it for transcription, and updates the transcript progressively without blocking the video playback or requiring the user to wait for a complete file before seeing results.

For translation, we implemented a two-pass approach: fast machine translation for immediate output, followed by a refinement pass that improved accuracy for technical and domain-specific vocabulary.

My Role

Led product design and Chrome extension development end-to-end
Designed the transcription interface and content generation workflow
Built the audio processing and AI pipeline architecture
Managed Chrome Web Store submission and compliance
Optimized for performance within browser extension constraints

AudioTextify: Turning Audio Into Usable Content

What We Built

The Engineering Approach

My Role

More projects

Niggle.ai

Albin

Customer Health & Churn Alert