Voice systems
Real-time conversational AI
Work on the timing and dynamics of spoken interaction: when a system should speak, listen, backchannel, or hold back so it feels responsive rather than reactive.
ABOUT
John Brown works on real-time conversational AI for voice systems: turn-taking, backchannel behavior, and affect-aware decisions under hard on-device timing constraints.
PRACTICE DOSSIER
John works on the timing and dynamics of spoken interaction: not only what was said, but when a system should respond, listen, backchannel, or stay out of the way.
The work spans the full path from PyTorch/MLX research to Swift/CoreML on-device deployment, with full-stack and polyglot tooling around it in Python, Swift, TypeScript, and Bash.
A long background in music production, synthesis, and signal processing shapes how he thinks about timing, texture, and dynamics. The photography and cinematography, guitar and synthesis threads stay here as supporting context for that same constraint-driven practice.
CAPABILITY MAP
The through-line is real-time constraint: research, software, agent infrastructure, and technical direction all have to survive contact with production.
Voice systems
Work on the timing and dynamics of spoken interaction: when a system should speak, listen, backchannel, or hold back so it feels responsive rather than reactive.
Deployment path
Research prototypes in PyTorch/MLX have to survive export, Swift/CoreML deployment, streaming state, causal models, fixed hop budgets, and real-time audio loops.
Full-stack systems
Python, Swift, TypeScript, and Bash show up around the core work: research harnesses, local runtimes, frontends, protocol surfaces, and operational tooling.
Agent tools
Protocol servers, model-provider abstractions, agent gateways, lifecycle tooling, and wave-structured delivery with clear ownership boundaries.
Technical direction
ADRs, staged releases, acceptance gates, interface contracts, and roadmaps keep ambiguous research work legible enough to ship and maintain.
PUBLIC RECORD
Music, photography, and art stay on the site because they show the same concern with timing, texture, tools, and constraints from another angle.
Images
Camera work, surface studies, and visual systems track how light, texture, framing, and repetition become a record rather than a feed.
Sound
Music practice connects rhythm, synthesis, guitar, hardware routing, and SoundCloud sketches to the same timing instincts used in voice systems.
Notes
Technical notes, process records, and public entries make the work easier to inspect, critique, revisit, and hand to another person.
STACK
A lightweight static stack keeps the site inspectable, versioned, and easy to deploy.
Operating rule
Public pages should be specific enough that a stranger can tell what is finished, what is a note, and what constraints shaped it.