Projects

A collection of things I've built

This is the hands-on side of what I do: model evaluation, interpreters, search, pricing tools, mobile apps, and whatever else sounds too interesting to ignore.

Machine Learning & AI

RedTeam Llama

Python / PyTorch / HuggingFace

Adversarial jailbreaking framework for evaluating LLM safety, with modular experimentation and supervised fine-tuning that improved ethical alignment in Llama-3.2.

BON Gemma Jailbreak

Python / PyTorch / HuggingFace

Recreated Anthropic's Best-of-N jailbreaking work on Gemma-3-1b and ran large-scale adversarial prompt experiments up to N = 5000.

NanoGPT Philosopher

Python / PyTorch / Transformers

Trained a NanoGPT model from scratch on more than 50,000 pages of Immanuel Kant, as a hands-on way to learn model internals by building one end to end.

Hearsay

Python / Selenium / ElevenLabs / Web Extension

DubHacks AI '23 runner-up. A browser extension for voice-command navigation and text-to-speech, built to make web accessibility feel more agentic.

Systems Programming

MonkeyLang VM

Rust / Systems Programming / Assembly

Built a high-performance language VM in Rust, improved interpreter speed with bytecode optimizations, and backed it with solid unit test coverage.

Husky Hold 'Em

Go / Docker / Python

Poker tournament platform for bots, with a Go interface, Python-simulated gameplay logic, and Dockerized participant execution.

uDub Search

Python / PHP / NLP / Collaborative Filtering

Search and recommendation system for University of Washington subreddit posts, with improvements to ranking quality across a few thousand documents.

Slate

PostgreSQL / AWS / React

Production management software for small film productions and creative teams.