Posts

Thoughts on web development, AI, and building things that matter.

#ml/ai (15) #open-source (10) #frontend (6) #random (6) #linguistics (4)

3/5/2026 · 4 min read

Introducing agentpane

A local web UI for AI coding agents. Multi-pane, multi-session, streaming — run Claude Code and Codex side by side from your browser.

#ml/ai #open-source #frontend

2/25/2026 · 4 min read

Introducing helm

A typed TypeScript framework for AI agents. Replace dozens of tools with two — search and execute — and sandbox LLM-generated code with granular permissions.

#ml/ai #open-source #frontend

8/25/2025 · 12 min read

Introducing tokka-bench

A comprehensive evaluation framework for comparing tokenizers across human and programming languages.

#ml/ai #linguistics #open-source

7/1/2025 · 3 min read

Dialects for Humans: Sounding Distinct from LLMs

Humans are developing new linguistic patterns to distinguish themselves from AI-generated content, and the rate of change will accelerate.

#ml/ai #linguistics

7/16/2024 · 1 min read

Using HuggingFace Datasets Offline

How to save a HuggingFace dataset to disk and use it offline.

#ml/ai

7/16/2024 · 1 min read

Tips #1

Markdown detection in Google Docs, swiping between tabs in Brave Browser for iOS, and running TypeScript files from the command line.

#random #ml/ai

11/7/2023 · 5 min read

Rebuilding Alpaca with the Hugging Face Trainer Class

Fine-tuning Llama-2-7B on the Alpaca dataset with the Hugging Face Trainer.

#ml/ai #open-source

10/16/2023 · 2 min read

Introducing gom: GPU Monitoring across Containers

I published `gom`, a CLI tool for monitoring GPU usage across Docker containers.

#ml/ai #open-source

9/11/2023 · 4 min read

Enroot on Slurm for Distributed ML: Part 2

How to use Enroot on Slurm for containerized multi-node training.

#ml/ai

9/8/2023 · 2 min read

Enroot on Slurm for Distributed ML: Part 1

How to use Enroot on Slurm for containerized multi-node training.

#ml/ai

9/8/2023 · 2 min read

Quick & Helpful Slurm Commands

A quick guide to using Slurm for distributed machine learning.

#ml/ai

9/8/2023 · 2 min read

Setting Up Docker for Machine Learning

The Dockerfile I use to set up my machine learning environment.

#ml/ai

8/29/2023 · 3 min read

Accelerate vs. DeepSpeed vs. FSDP

Which one should you use for distributed training?

#ml/ai

8/23/2023 · 2 min read

LLMs Will Never Be Able to Do (Complicated) Math

Since contemporary LLM architectures lack recursion, they're fundamentally incapable of doing some math operations.

#ml/ai

Ben Gubler
