Ben Gubler
Posts

Thoughts on web development, AI, and building things that matter.

3 min read

Dialects for Humans: Sounding Distinct from LLMs

Humans are developing new linguistic patterns to distinguish themselves from AI-generated content, and the rate of change will accelerate.

#ml/ai #linguistics
1 min read

Using HuggingFace Datasets Offline

How to save a HuggingFace dataset to disk and use it offline

#ml/ai
1 min read

Tips #1

Markdown detection in Google Docs, swiping between tabs in Brave Browser for iOS, and running TypeScript files from the command line.

#random #ml/ai
5 min read

Rebuilding Alpaca with the Hugging Face Trainer Class

Fine-tuning Llama-2-7B using the Alpaca dataset and Hugging Face Trainer

#ml/ai #open-source
2 min read

Introducing gom: GPU Monitoring across Containers

I published `gom`, a CLI tool for monitoring GPU usage across Docker containers.

#ml/ai #open-source
4 min read

Enroot on Slurm for Distributed ML: Part 2

How to use Enroot on Slurm for containerized multi-node training.

#ml/ai
2 min read

Enroot on Slurm for Distributed ML: Part 1

How to use Enroot on Slurm for containerized multi-node training.

#ml/ai
2 min read

Quick & Helpful Slurm Commands

A quick guide to using Slurm for distributed machine learning.

#ml/ai
2 min read

Setting Up Docker for Machine Learning

The Dockerfile I use to set up my machine learning environment.

#ml/ai
3 min read

Accelerate vs. DeepSpeed vs. FSDP

Which one should you use for distributed training?

#ml/ai
2 min read

LLMs Will Never Be Able to Do (Complicated) Math

Since contemporary LLM architectures lack recursion, they're fundamentally incapable of doing some math operations.

#ml/ai