Mohamed Diomande — New York

Research, built
into systems.

I do machine learning research on speech, writing systems, and autonomous agents, and I build the infrastructure that puts the research to work. The papers live next to the systems they describe.

Capabilities

What I arrive with

01

Machine learning research

Speech recognition for low-resource languages. Writing systems as model infrastructure: tokenization, script visibility, phonemically interpretable evaluation. Transfer learning and CTC decoders trained under real data constraints. Reward modeling over agent trajectories.

02

Agent systems engineering

Multi-machine orchestration: a personal mesh of Macs, a Windows GPU node, and cloud workers coordinated through deterministic supervisors, message streams, and cron-governed loops. Trajectory scoring, typed skill libraries, persistent memory systems, autonomous multi-hour build sessions.

03

Real-time perception and graphics

Pose estimation pipelines from commodity cameras and depth sensors. Audio-reactive and body-reactive generative visuals in Unity and TouchDesigner. Multi-iPhone camera meshes with on-device capture orchestration. Gaussian splatting on consumer hardware.

04

Product engineering

iOS with SwiftUI, shipped through TestFlight. On-device inference: whisper.cpp, quantization, Apple Neural Engine serving. Web with Next.js. The discipline of carrying a model from training run to a phone in someone's hand.

05

Operating physical businesses

Specialty coffee service in New York under Buf Barista. Plant-based milk distribution under Koatji. Field-tested in inventory, events, sales routes, and the unforgiving feedback loop of selling something real.