
October 3, 2025

Standard Input Support
Platform · Feature
Send input directly to running commands in your devboxes, enabling interactive workflows and real-time command control.
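As a local illustration of what sending input to a running command means, here is a minimal sketch. It does not use the platform's API; it relies only on Python's standard `subprocess` module, and the helper name `run_with_input` and the upper-casing child program are hypothetical stand-ins for a command running in a devbox.

```python
import subprocess
import sys

# Hypothetical child program: echoes each stdin line back in upper case.
CHILD_CODE = (
    "import sys\n"
    "for line in iter(sys.stdin.readline, ''):\n"
    "    print(line.strip().upper(), flush=True)\n"
)


def run_with_input(lines):
    """Start a child process, feed it lines one at a time, and collect replies."""
    replies = []
    with subprocess.Popen(
        [sys.executable, "-c", CHILD_CODE],
        stdin=subprocess.PIPE,
        stdout=subprocess.PIPE,
        text=True,
    ) as proc:
        for line in lines:
            # Write to the still-running process, then read its response.
            proc.stdin.write(line + "\n")
            proc.stdin.flush()
            replies.append(proc.stdout.readline().strip())
        # Closing stdin signals end of input so the child can exit.
        proc.stdin.close()
    return replies


if __name__ == "__main__":
    print(run_with_input(["hello", "world"]))
```

The process stays alive between writes, which is the property that makes interactive prompts and REPL-style tools usable.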
Repository Inspection Upgrades
Platform · Improvement
Inspect repositories without building them first, allowing faster iteration and more flexible workflows when working with Repo Connect.

October 1, 2025

Custom Benchmarks
Platform · Feature
Create private, specialized benchmarks tailored to your proprietary codebase and business logic. Test AI agents against your specific requirements in isolated, scalable environments.

September 30, 2025

Suspend and Resume Devboxes
Platform · Feature
Pause your devboxes to preserve disk state while stopping compute costs, then resume exactly where you left off. Perfect for managing long-running development sessions efficiently.
Required Secrets for Scenario Runs
Platform · Feature
Specify required secrets when running scenarios, ensuring benchmark executions have access to necessary credentials without manual intervention.

September 26, 2025

Snapshot Speed Improvements
Platform · Improvement
Significantly faster snapshot creation and launch times, resulting in quicker devbox setup and more efficient state management workflows.

September 15, 2025

WebSocket Support
Platform · Feature
Connect to your devboxes via WebSocket for real-time, bidirectional communication, enabling live terminal experiences and interactive debugging.

September 5, 2025

Streaming Command Output
Platform · Feature
Stream stdout and stderr in real time as commands execute, providing immediate visibility into long-running processes like builds and tests.
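To make the streaming idea concrete, the sketch below shows one way to surface stdout and stderr line by line while a process is still running. It is a local illustration built on Python's standard library, not the platform's streaming API; the function name `stream_command` and the `(stream_name, line)` event shape are assumptions chosen for the example.

```python
import queue
import subprocess
import sys
import threading


def stream_command(argv):
    """Run argv and yield (stream_name, line) pairs as the process emits them."""
    proc = subprocess.Popen(
        argv, stdout=subprocess.PIPE, stderr=subprocess.PIPE, text=True
    )
    events = queue.Queue()

    def pump(name, pipe):
        # Forward each line as soon as it is readable, then mark the stream closed.
        for line in iter(pipe.readline, ""):
            events.put((name, line.rstrip("\n")))
        pipe.close()
        events.put((name, None))

    for name, pipe in (("stdout", proc.stdout), ("stderr", proc.stderr)):
        threading.Thread(target=pump, args=(name, pipe), daemon=True).start()

    open_streams = 2
    while open_streams:
        name, line = events.get()
        if line is None:
            open_streams -= 1
        else:
            yield name, line
    proc.wait()


if __name__ == "__main__":
    demo = "import sys; print('building', flush=True); print('warning', file=sys.stderr, flush=True)"
    for name, line in stream_command([sys.executable, "-c", demo]):
        print(f"[{name}] {line}")
```

One reader thread per pipe avoids the deadlock that can occur when one pipe fills up while the caller is blocked reading the other.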
Enhanced Command Execution
Platform · Improvement
Improved command execution API with optimistic execution, automatic fallback handling, and better reliability for both short and long-running commands.

August 27, 2025

Object Store
Platform · Feature
Store and manage files, datasets, and resources that can be shared across devboxes or mounted during creation. Upload training data, configuration files, model weights, and other assets for streamlined devbox setup.

August 14, 2025

Public Blueprints
Platform · Feature
Access and use community-maintained blueprint templates for common development environments, eliminating the need to build from scratch for standard configurations.
Overlaybd Support
Platform · Improvement
Faster image pulling and reduced storage overhead with overlaybd technology, improving devbox launch times and resource efficiency.

July 30, 2025

BigCodeBench
Platform · Feature
Evaluate AI agents on complex code generation tasks with this comprehensive benchmark for assessing programming capabilities.

July 25, 2025

Benchmark Suite
Platform · Feature
New suite of code and software engineering benchmarks:
  • LiveCodeBench: Real-world coding challenges from recent competitive programming contests
  • DS1000: Data science problem-solving across diverse domains
  • CruxEval: Code understanding and reasoning evaluation
  • LiveSWEBench: Contemporary software engineering tasks
  • SWEReBench: Software engineering reliability assessment
  • R2E Gym: Repository-to-execution testing suite