Voice AI Built for the Real World

Applied research at the intersection of voice, safety, and intelligent systems.

See What We're Building

Safety & Performance

Compliance Ready

Privacy by Design

On-Device STT/TTS

No cloud audio streaming

<850ms TTFA

Time to first audio

20-Layer Safety

Multi-stage content filter

Our Research Pillars

< 850ms TTFA

Voice Pipeline Architecture

End-to-end investigation of low-latency STT → safety → LLM → TTS architectures. We optimize every millisecond from utterance to response.

20-Layer

Safety Filter Design

Multi-layer content safety systems with real-time input and output filtering, designed to operate with zero latency using pre-computed fallback responses.

Flagship Product

Memo Kids

Voice AI your kids can trust

The first COPPA-compliant voice AI companion built for Pre-K to Grade 5. Natural conversation, on-device safety filtering, and curriculum-aligned intelligence.

✓ COPPA ✓ On-Device < 850ms
Learn more
Architecture

The Intelligent Voice Pipeline

01

On-Device Speech-to-Text

Speech is captured and transcribed locally using native platform APIs on both iPhone and Android. Audio never leaves the device. This eliminates cloud audio streaming latency and ensures COPPA compliance from the first byte.

02

Safety Filter

Transcribed text is sent to a backend safety filter that evaluates content safety in real time. Harmful, off-topic, or personally identifying content is intercepted before reaching the LLM. If unsafe input is detected, a pre-recorded safe redirect audio response plays instantly.

03

LLM Response Generation

Safe prompts are routed to a large language model for fast, age-appropriate, contextually rich responses. The system maintains conversational context across multiple turns, ensuring natural back-and-forth dialogue with educational relevance.

04

On-Device Text-to-Speech

Responses are synthesized into natural speech on-device using native platform APIs. Character voices bring personality to each response, making learning feel like a conversation with a friend. On-device synthesis means zero cloud costs and instant playback.