
5 posts tagged with "training"


Running ML Experiments with Runpod and Transformer Lab

5 min read

For many looking to experiment with machine learning, the biggest barrier to entry is access to hardware. GPUs are expensive, hard to find, and even harder to share across a team. Big cloud hosting providers have complex interfaces, pricing models, and try to lock you into their ecosystem and tooling.


Using Runpod with Transformer Lab changes that. Now you can spin up GPU-backed experiments quickly from the comfort of your own system.

Transformer Lab Goes Beyond Images: Introducing Text Diffusion Model Support

4 min read

🎉 Transformer Lab just expanded beyond image diffusion! We're thrilled to announce text diffusion model support so you can train, evaluate, and interact with cutting-edge text diffusion architectures like BERT, Dream, and LLaDA directly in Transformer Lab.

What's included in this release

  • 🚀 Text Diffusion Server for interactive generation with BERT, Dream, and LLaDA models
  • 🏋️ Text Diffusion Trainer for fine-tuning with masked-language and diffusion-style alignment workflows
  • 📊 Text Diffusion Evaluator for benchmarking with the EleutherAI LM Evaluation Harness

Fine Tuning a Python Code Completion Model

7 min read

This post details our journey to fine-tune SmolLM-135M, a compact language model, for Python code completion.

We chose SmolLM-135M for its small size, which allows for rapid iteration. Instead of full fine-tuning, we employed LoRA (Low-Rank Adaptation), a technique that freezes the base model and introduces small trainable "adapter" matrices into the transformer layers. This strikes a good balance between parameter efficiency and solid results on the downstream task (code completion).
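
For readers curious what this looks like in code, here is a minimal sketch of attaching LoRA adapters with the Hugging Face peft library (Transformer Lab handles this for you). The rank, alpha, and target modules shown are illustrative assumptions, not our exact settings.

```python
# Minimal LoRA sketch using the Hugging Face peft library.
# Rank, alpha, and target modules below are illustrative assumptions.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained("HuggingFaceTB/SmolLM-135M")

lora_config = LoraConfig(
    r=8,                                   # rank of the low-rank adapter matrices
    lora_alpha=16,                         # scaling applied to the adapter output
    target_modules=["q_proj", "v_proj"],   # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(base_model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```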

Transformer Lab handled the training, evaluation, and inference, abstracting away much of the underlying complexity. We used the flytech/python-codes-25k dataset, consisting of 25,000 Python code snippets, without any specific pre-processing. Our training setup used a constant learning rate, a batch size of 4, and an NVIDIA RTX 4060 GPU.
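
As a rough sketch of that setup outside Transformer Lab, the Hugging Face datasets and Trainer configuration would look roughly like the following. Only the batch size and the constant learning-rate schedule come from our runs; the remaining values (learning rate, epochs, precision) are assumptions.

```python
# Sketch of the training configuration described above. Batch size 4 and the
# constant learning-rate schedule match the post; other values are assumptions.
from datasets import load_dataset
from transformers import TrainingArguments

dataset = load_dataset("flytech/python-codes-25k", split="train")

training_args = TrainingArguments(
    output_dir="smollm-135m-python-lora",
    per_device_train_batch_size=4,   # batch size used in our runs
    learning_rate=2e-4,              # assumed value; held constant throughout
    lr_scheduler_type="constant",    # no warmup or decay
    num_train_epochs=1,              # assumed; training duration varied per run
    logging_steps=50,
    fp16=True,                       # fits comfortably on an RTX 4060
)
```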

The Iterative Fine-tuning Process: Nine Runs to Success

The core of this project was iterative refinement of the LoRA hyperparameters and training duration. We tracked the training loss and also ran qualitative assessments of the generated code (our "vibe check") to judge its syntactic correctness and logical coherence. Combining quantitative and qualitative feedback proved crucial in guiding our parameter adjustments.
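
The qualitative side of that loop is easy to reproduce: prompt the fine-tuned model with a partial function and read the completion. The sketch below assumes a hypothetical local checkpoint path with the adapters merged in, and made-up generation settings.

```python
# A minimal "vibe check": prompt the fine-tuned model and inspect the output.
# The local checkpoint path and generation settings are assumptions.
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained("./smollm-135m-python-lora-merged")
tokenizer = AutoTokenizer.from_pretrained("./smollm-135m-python-lora-merged")

prompt = "def fibonacci(n):\n    "
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```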