Release Notes
October 2025
October 3, 2025
Standard Input Support
Send input directly to running commands in your devboxes, enabling interactive workflows and real-time command control. Learn more →Repository Inspection Upgrades
Inspect repositories without building them first, allowing faster iteration and more flexible workflows when working with Repo Connect. Learn more →October 1, 2025
Custom Benchmarks
Create private, specialized benchmarks tailored to your proprietary codebase and business logic. Test AI agents against your specific requirements in isolated, scalable environments.September 2025
September 30, 2025
Suspend and Resume Devboxes
Pause your devboxes to preserve disk state while stopping compute costs, then resume exactly where you left off. Perfect for managing long-running development sessions efficiently. Learn more →Required Secrets for Scenario Runs
Specify required secrets when running scenarios, ensuring benchmark executions have access to necessary credentials without manual intervention.September 26, 2025
Snapshot Speed Improvements
Significantly faster snapshot creation and launch times; quicker devbox setup and more efficient state management workflows.September 15, 2025
WebSocket Support
Connect to your devboxes via WebSocket for real-time, bidirectional communication, enabling live terminal experiences and interactive debugging. Learn more →September 5, 2025
Streaming Command Output
Stream stdout and stderr in real-time as commands execute, providing immediate visibility into long-running processes like builds and tests. Learn more →Enhanced Command Execution
Improved command execution API with optimistic execution, automatic fallback handling, and better reliability for both short and long-running commands. Learn more →August 2025
August 27, 2025
Object Store
Store and manage files, datasets, and resources that can be shared across devboxes or mounted during creation. Upload training data, configuration files, model weights, and other assets for streamlined devbox setup. Learn more →August 14, 2025
Public Blueprints
Access and use community-maintained blueprint templates for common development environments, eliminating the need to build from scratch for standard configurations. Learn more →Overlaybd Support
Faster image pulling and reduced storage overhead with overlaybd technology, improving devbox launch times and resource efficiency.July 30, 2025
New Public Benchmarks
BigCodeBench - Evaluate AI agents on complex code generation tasks with this comprehensive benchmark for assessing programming capabilities.July 25, 2025
New Public Benchmarks
Expanded our public benchmark library with six new evaluation suites:- LiveCodeBench - Real-world coding challenges from recent competitive programming
- DS1000 - Data science problem-solving across diverse domains
- CruxEval - Code understanding and reasoning evaluation
- LiveSWEBench - Contemporary software engineering tasks
- SWEReBench - Software engineering reliability assessment
- R2E Gym - Repository-to-execution testing suite