hi, i'm rob

swym/scout

Rust 100.0%

scout

Autonomous strategy search agent for the swym backtesting platform.

Runs a loop: asks Claude to generate trading strategies → submits backtests to swym → evaluates results → feeds learnings back → repeats. Promising strategies are automatically validated on out-of-sample data to filter overfitting.

Quick start

export ANTHROPIC_API_KEY="sk-ant-..."

cargo run -- \
  --swym-url https://dev.swym.hanzalova.internal/api/v1 \
  --max-iterations 50 \
  --instruments binance_spot:BTCUSDC,binance_spot:ETHUSDC,binance_spot:SOLUSDC \
  --backtest-from 2025-01-01T00:00:00Z \
  --backtest-to 2025-10-01T00:00:00Z \
  --oos-from 2025-10-01T00:00:00Z \
  --oos-to 2026-03-01T00:00:00Z

How it works

Coverage check — verifies candle data exists for all instruments and finds common available intervals.
Strategy generation — sends the DSL schema + prior results to Claude, which produces a new strategy JSON each iteration.
In-sample backtest — submits the strategy against all instruments for the training period. Evaluates Sharpe ratio, profit factor, win rate, net PnL.
Out-of-sample validation — if any instrument shows Sharpe > threshold with enough trades, the strategy is re-tested on held-out data. Only strategies that pass both phases are saved as "validated".
Learning loop — all results (including failures) are fed back to Claude so it can learn from what works and what doesn't. The conversation is trimmed to avoid context exhaustion while the full results history is passed as structured text.

Configuration

All options are available as CLI flags and environment variables:

Flag	Env	Default	Description
`--swym-url`	`SWYM_API_URL`	`https://dev.swym.hanzalova.internal/api/v1`	Swym API base URL
`--anthropic-key`	`ANTHROPIC_API_KEY`	required	Anthropic API key
`--model`	`CLAUDE_MODEL`	`claude-sonnet-4-20250514`	Claude model
`--max-iterations`		`50`	Maximum search iterations
`--min-sharpe`		`1.0`	Minimum Sharpe for "promising"
`--min-trades`		`10`	Minimum trades for significance
`--instruments`		BTC,ETH,SOL vs USDC	Comma-separated exchange:SYMBOL
`--backtest-from`		`2025-01-01`	In-sample start
`--backtest-to`		`2025-10-01`	In-sample end
`--oos-from`		`2025-10-01`	Out-of-sample start
`--oos-to`		`2026-03-01`	Out-of-sample end
`--initial-balance`		`10000`	Starting USDC balance
`--fees-percent`		`0.001`	Fee per trade (0.1%)
`--output-dir`		`./scout-results`	Where to save strategies and reports

Output

scout-results/
├── strategy_001.json      # Every strategy attempted
├── strategy_002.json
├── ...
├── validated_017.json     # Strategies that passed OOS validation
├── validated_031.json     # (includes in-sample + OOS metrics)
└── best_strategy.json     # Highest avg Sharpe across instruments

Tips

Start with Sonnet (claude-sonnet-4-20250514) for cost efficiency during exploration. Switch to Opus for refinement of promising strategies.
50 iterations is a reasonable starting point. The agent typically finds interesting patterns within 20-30 iterations if they exist.
Watch the logs — the per-iteration summaries show you what the agent is learning in real time.
Adjust dates to match your actual candle coverage. The agent checks coverage at startup and will fail fast if data is missing.
The OOS validation threshold is intentionally relaxed (70% of in-sample Sharpe, half the trade count) because out-of-sample degradation is expected. Strategies that maintain edge through this filter are genuinely interesting.

23 activities

grenade pushed 1 commit to swym/scout:main

11fe79e docs: add CLAUDE.md for future Claude Code instances

thursday, march 12, 2026 — 03:38:53 utc

grenade pushed 1 commit to swym/scout:main

fcb9a2f chore: attempt dedupe guidance in prompt

wednesday, march 11, 2026 — 16:15:31 utc

grenade pushed 2 commits to swym/scout:main

75c95f7 feat: add triple-Supertrend consensus flip as strategy family 7
6601da2 feat: add reverse flag and symmetric short support to DSL

tuesday, march 10, 2026 — 16:40:32 utc

grenade pushed 1 commit to swym/scout:main

8de3ae5 Add Binance Futures support (long and short)

tuesday, march 10, 2026 — 16:17:03 utc

grenade pushed 1 commit to swym/scout:main

a435d3a Define concrete 'promising' threshold and enforce indicator diversity in ledger-informed prompt

tuesday, march 10, 2026 — 12:22:01 utc

grenade pushed 1 commit to swym/scout:main

b476199 Fix ledger context being overridden by prescriptive initial prompt

tuesday, march 10, 2026 — 12:00:51 utc

grenade pushed 1 commit to swym/scout:main

d76d3b9 Use write_all for ledger entries to improve concurrent-write safety

tuesday, march 10, 2026 — 11:13:05 utc

grenade pushed 1 commit to swym/scout:main

0945c94 Add --ledger-file arg for explicit ledger path control

tuesday, march 10, 2026 — 11:12:14 utc

grenade pushed 1 commit to swym/scout:main

a0316be Add cross-run learning via run ledger and compare endpoint

tuesday, march 10, 2026 — 11:05:46 utc

grenade pushed 1 commit to swym/scout:main

609d645 docs: cross-run learnings plan

tuesday, march 10, 2026 — 11:04:23 utc

grenade pushed 1 commit to swym/scout:main

6692bdb Prompt: fix method vs kind confusion causing 11/15 validation failures

tuesday, march 10, 2026 — 10:25:08 utc

grenade pushed 1 commit to swym/scout:main

36689e3 Prompt: fix field+offset kind omission and add interval guidance

tuesday, march 10, 2026 — 10:14:15 utc

grenade pushed 11 commits to swym/scout:main

87d31f8 Use flat result_summary fields from swym patch 8fb410311
3892ab3 fix: parse actual result_summary structure (backtest_metadata + assets)
8589675 fix: ValidationError.path optional, correct position_quantity usage in prompts
ee260ea fix: parse flat result_summary structure per updated API doc
3f8d4de feat: add declarative SizingMethod types from upstream schema

tuesday, march 10, 2026 — 09:43:11 utc

grenade pushed 2 commits to swym/scout:main

51e452b feat: discover max_output_tokens from server at startup
89f7ba6 feat: model-family-aware token budgets and prompt style

monday, march 9, 2026 — 16:46:49 utc

grenade pushed 2 commits to swym/scout:main

6f4f864 fix: increase max_tokens to 8192 for R1 reasoning overhead
185cb45 fix: strip R1 think blocks before JSON extraction

monday, march 9, 2026 — 16:35:25 utc

grenade pushed 1 commit to swym/scout:main

b947f48 feat: client-side validation, cycling detection, quantity prompt fix

monday, march 9, 2026 — 15:59:37 utc

grenade pushed 1 commit to swym/scout:main

e27aaba feat(agent): improve LLM feedback loop and convergence detection

monday, march 9, 2026 — 15:45:02 utc

grenade pushed 3 commits to swym/scout:main

fb1145a fix(swym): parse result_summary from actual API response structure
c7a2d65 fix(prompts): forbid dynamic quantity expressions, require plain decimal string
292c101 docs(prompts): add DSL expression kind reference and three working examples

monday, march 9, 2026 — 12:22:37 utc

grenade pushed 1 commit to swym/scout:main

fc9b7e0 feat(agent): add strategy quality introspection

monday, march 9, 2026 — 10:58:58 utc

grenade pushed 1 commit to swym/scout:main

deb28f6 chore: local defaults

monday, march 9, 2026 — 10:24:36 utc

grenade pushed 1 commit to swym/scout:main

b7aa458 feat(claude): add configurable API base URL via --anthropic-url

monday, march 9, 2026 — 08:29:00 utc

grenade pushed to swym/scout:main

monday, march 9, 2026 — 08:18:19 utc

grenade created swym/scout

monday, march 9, 2026 — 08:16:20 utc