Status
Show experiment dashboard with results, active loops, and progress
What Is Status?
The Status skill is an integral component of the Claude Code ecosystem, specifically designed to provide a comprehensive experiment dashboard for the autoresearch-agent. It delivers real-time visibility into experiment results, active automation loops, and overall progress across all tracked experiments. By consolidating this information into a unified, easily accessible interface, Status empowers developers and researchers to monitor, audit, and export experiment data with minimal overhead.
Why Use Status?
In modern research and engineering workflows, running multiple experiments in parallel is common—but tracking their outcomes, progress, and automation status can quickly become unwieldy. The Status skill addresses this challenge by:
- Centralizing Experiment Data: Consolidates experiment logs, loop states, and results into a single dashboard, reducing context switching and manual tracking.
- Real-Time Monitoring: Instantly displays which experiments are running, scheduled, or completed, minimizing human error and missed updates.
- Actionable Insights: Presents key metrics such as the number of runs, best results, and recent changes, enabling data-driven decisions and rapid iteration.
- Automation Transparency: Clearly indicates active experiment loops, including interval schedules and unique identifiers, fostering reproducibility and operational clarity.
- Flexible Exporting: Supports exporting dashboards in Markdown or CSV formats for reporting, sharing, or further analysis.
How to Get Started
The Status skill is invoked via the /ar:status command, which can be tailored to various scopes and output formats. To begin:
-
Install and Configure:
- Ensure the autoresearch-agent and its dependencies are installed per the official documentation.
- Place experiments in the expected directory structure, e.g.,
.autoresearch/{domain}/{experiment}.
-
Basic Usage:
- To display the full experiment dashboard:
/ar:status - For a detailed view of a single experiment:
/ar:status engineering/api-speed - To filter by domain:
/ar:status --domain engineering
- To display the full experiment dashboard:
-
Exporting Results:
- To export the dashboard as Markdown:
/ar:status --format markdown --output results.md - For CSV export:
/ar:status --format csv --output results.csv
- To export the dashboard as Markdown:
-
Direct Script Usage:
- Underlying these commands is a Python script. Direct calls can be made for advanced integration:
python {skill_path}/scripts/log_results.py --dashboard python {skill_path}/scripts/log_results.py --experiment engineering/api-speed python {skill_path}/scripts/log_results.py --domain engineering
- Underlying these commands is a Python script. Direct calls can be made for advanced integration:
Key Features
1. Comprehensive
Dashboard
The core function of Status is to display a dashboard summarizing all experiments, their results, and loop status. For example:
DOMAIN EXPERIMENT RUNS KEPT BEST CHANGE STATUS LOOP
engineering api-speed 38 7 95.2% +1.2% active every 1h (cron ID: 1234, started: 2024-05-01)
engineering data-cleaning 12 2 88.0% -0.5% stopped -This overview provides immediate insight into experiment health and automation status.
2. Single Experiment and Domain
View
Focus on details at granularity you need:
- Single Experiment:
Shows run logs and, if present, the active loop configuration (from
/ar:status engineering/api-speedloop.json). - Domain View:
Aggregates results for all experiments in the domain.
/ar:status --domain engineering
3. Active Loop
Detection
Status automatically checks for the presence of .autoresearch/{domain}/{experiment}/loop.json. If found, it parses and displays the loop interval, cron ID, and start date, making it easy to see which experiments are automated and their schedules.
4. Export
Capabilities
Easily export dashboards:
- Markdown: For documentation, sharing in wikis, or reports.
- CSV: For data analysis or integration with spreadsheets and BI tools.
Example export command:
/ar:status --format csv --output results.csv5. Scriptable
Integration
Use the underlying Python scripts for custom workflows or automation:
python skills/status/scripts/log_results.py --dashboard --format csv --output results.csvBest Practices
- Regularly Monitor the Dashboard: Integrate
/ar:statusinto your daily workflow or CI/CD pipelines to maintain situational awareness of ongoing experiments. - Automate Exports: Schedule periodic exports of the dashboard for archiving, compliance, or reporting purposes.
- Leverage Domain Filtering: Use the
--domainoption to focus on relevant subsets of experiments, especially in large projects. - Check Active Loops: Always review the loop status before making manual changes to experiments to avoid conflicts or duplicated runs.
- Version Control Outputs: Store exported CSV or Markdown files in version control for traceability and historical analysis.
Important Notes
- Access Control: Ensure appropriate permissions are set for the
.autoresearchdirectory and experiment data to protect sensitive results. - Data Consistency: Only modify experiment data and loop configurations via approved workflows to prevent dashboard inconsistencies.
- Output Accuracy: The accuracy of the dashboard is dependent on proper log and loop management. Corrupted or missing files can lead to incomplete or misleading displays.
- Skill Updates: As the Status skill evolves, periodically check the official repository for updates, bug fixes, and new features.
- Resource Utilization: Running frequent or numerous automated loops may impact system resources. Monitor usage and adjust intervals as needed.
The Status skill streamlines experiment monitoring and reporting, making it an essential tool for data-driven research and engineering teams.
More Skills You Might Like
Explore similar skills to enhance your workflow
Storyboard
Create a six-frame storyboard that shows a user's journey from problem to solution. Use when you need a fast narrative for alignment, concept
Rfdiffusion
Generate novel protein structures with RFDiffusion generative modeling
Document Release
Updates README, ARCHITECTURE, and CONTRIBUTING docs by cross-referencing post-ship diffs
Structured Autonomy Plan
structured-autonomy-plan skill for programming & development
Ra Qm Skills
12 regulatory & QM agent skills and plugins for Claude Code, Codex, Gemini CLI, Cursor, OpenClaw. ISO 13485 QMS, MDR 2017/745, FDA 510(k)/PMA, ISO 270
Analyzing Malware Persistence with Autoruns
Use Sysinternals Autoruns to systematically identify and analyze malware persistence mechanisms across registry