ChatGPT as My $100 Trader: Results Inside
- Timeframe: 8 weeks (paper trading)
- Starting balance: $100.00
- Ending balance: $103.40
- Net return: +3.4%
- Benchmark (SPY over same period): +3.9% (approx.)
- Trades placed: 12 (entries/exits)
- Win rate: 58%
- Max drawdown: -2.8%
- Commissions: $0 (assumed), slippage est. 0.05%/trade
- Approach: ChatGPT suggested ideas and rules; I executed them in a paper account
Ground rules (a.k.a. the guardrails)
- Account: Paper trading with fractional shares.
- Instruments: Large, liquid U.S. stocks and ETFs only (e.g., SPY, QQQ, AAPL, MSFT, NVDA, IWM, XLE, GLD).
- No leverage, no options, no crypto, no day trades.
- Max positions at once: 3.
- Position sizing: ≤40% of portfolio per position.
- Per-trade risk: Target ≤2% of portfolio using stop-loss or mental stop.
- Rebalancing cadence: Check once daily after market close; act at the next day’s open.
- Data hygiene: Verify all quotes and news outside ChatGPT before acting.
- Costs: Assume $0 commissions; include a small slippage assumption.
- “Build a weekly checklist: macro notes, key earnings, sector strength/weakness, and one sentence per position: ‘Why we hold.’”
- “Explain your rationale like you’re writing a trading journal. What’s the thesis? What would prove it wrong?”
- Structure: Turned vague hunches into explicit rules.
- Restraint: Kept me from overtrading tiny swings.
- Journaling: Clear hypotheses and ‘exit if wrong’ lines.
Where it struggled (and how I compensated)
- Real-time data: ChatGPT can reference concepts, not live quotes. I verified prices elsewhere.
- Specificity: It’s good at frameworks, not timing the last 0.5%. I used end-of-day triggers only.
- Hallucinated facts: It occasionally misremembered indicator values. I pasted verified numbers into the prompt.
- Theme: Keep it simple—broad market and mega-cap tech strength.
- Positions: SPY (35%), QQQ (25%). 40% cash buffer.
- Result: Modest lift (+1.1%) as large-cap momentum continued.
- ChatGPT flagged small-cap weakness (IWM) and suggested a cautious probe in energy (XLE) as a diversifier, keeping tight risk.
- I rotated 15% into XLE, cut QQQ to 15%.
- Result: Whipsaw. Energy dipped; QQQ rebounded. Portfolio briefly fell to -0.7% MTD. Max drawdown hit -2.8% on a stop-out.
Weeks 5–6: Letting winners work
- Back to strength: re-added QQQ to 25% and opened a small NVDA slice (10%) with a tight trailing stop, plus SPY core (35%).
- Result: Trend did its job. A couple of green weeks pushed the account to +3.9% from start before a small giveback.
Weeks 7–8: Defensive exits and tidy-up
- Raised stops to protect gains. Trimmed NVDA on a wobble; rotated 10% into GLD as a hedge while keeping SPY as the anchor.
- Final tally: +3.4%, just shy of SPY over the same window.
- Starting value: $100.00
- Ending value: $103.40
- Total return: +3.4%
- Best day: +0.9%
- Worst day: -0.8%
- Max drawdown: -2.8%
- Trades: 12 total (entries + exits)
- Winners: 7, Losers: 5
- Average winner: +3.1%
- Average loser: -1.9%
- Turnover (8 weeks): ~2.1x starting capital
- SPY over the same period: +3.9% (approx.)
- The takeaway: The “AI + rules” approach kept risk contained and results respectable, but it didn’t beat a simple index hold over this short window. That’s fine; the point was process.
What I learned (so you don’t pay the tuition)
- Automate the data fetch: Paste a compact daily snapshot (prices, 20/50/200-day MAs, RSI) so ChatGPT isn’t guessing.
- Predefine exits in dollars: “Risk $1.50 to make $3.00” from a $100 base clarifies sizing.
- Fewer rotations: Hold winners longer; make the benchmark earn its replacements.
- Add a simple regime filter: Only lean risk-on when price > 50-day MA on the index.
The exact prompts that worked best for me
- “Write a trading journal entry for today: What changed? Why keep or exit each position? What’s the single biggest risk tomorrow?”
- “If the benchmark (SPY) outperforms my mix by >1% for 2 consecutive weeks, propose a simpler allocation with fewer positions.”
Replicate this yourself (paper trading edition)
- Open a paper trading account that supports fractional shares.
- Pick a small, liquid watchlist (5–10 tickers).
- Decide your rules up front (position limits, stop policy, rebalance cadence).
- Execute at the next open; log entries, exits, and emotions.
- Compare weekly against SPY. If your process underperforms consistently, simplify.