Skip to main content

Release Notes

October 2025

October 3, 2025

Standard Input Support

Send input directly to running commands in your devboxes, enabling interactive workflows and real-time command control. Learn more →

Repository Inspection Upgrades

Inspect repositories without building them first, allowing faster iteration and more flexible workflows when working with Repo Connect. Learn more →

October 1, 2025

Custom Benchmarks

Create private, specialized benchmarks tailored to your proprietary codebase and business logic. Test AI agents against your specific requirements in isolated, scalable environments.

September 2025

September 30, 2025

Suspend and Resume Devboxes

Pause your devboxes to preserve disk state while stopping compute costs, then resume exactly where you left off. Perfect for managing long-running development sessions efficiently. Learn more →

Required Secrets for Scenario Runs

Specify required secrets when running scenarios, ensuring benchmark executions have access to necessary credentials without manual intervention.

September 26, 2025

Snapshot Speed Improvements

Significantly faster snapshot creation and launch times; quicker devbox setup and more efficient state management workflows.

September 15, 2025

WebSocket Support

Connect to your devboxes via WebSocket for real-time, bidirectional communication, enabling live terminal experiences and interactive debugging. Learn more →

September 5, 2025

Streaming Command Output

Stream stdout and stderr in real-time as commands execute, providing immediate visibility into long-running processes like builds and tests. Learn more →

Enhanced Command Execution

Improved command execution API with optimistic execution, automatic fallback handling, and better reliability for both short and long-running commands. Learn more →

August 2025

August 27, 2025

Object Store

Store and manage files, datasets, and resources that can be shared across devboxes or mounted during creation. Upload training data, configuration files, model weights, and other assets for streamlined devbox setup. Learn more →

August 14, 2025

Public Blueprints

Access and use community-maintained blueprint templates for common development environments, eliminating the need to build from scratch for standard configurations. Learn more →

Overlaybd Support

Faster image pulling and reduced storage overhead with overlaybd technology, improving devbox launch times and resource efficiency.

July 30, 2025

New Public Benchmarks

BigCodeBench - Evaluate AI agents on complex code generation tasks with this comprehensive benchmark for assessing programming capabilities.

July 25, 2025

New Public Benchmarks

Expanded our public benchmark library with six new evaluation suites:
  • LiveCodeBench - Real-world coding challenges from recent competitive programming
  • DS1000 - Data science problem-solving across diverse domains
  • CruxEval - Code understanding and reasoning evaluation
  • LiveSWEBench - Contemporary software engineering tasks
  • SWEReBench - Software engineering reliability assessment
  • R2E Gym - Repository-to-execution testing suite
I