Young Data Is All You Need, Padawan
- Patrick Duggan
- Oct 20, 2025
- 5 min read
# Young Data Is All You Need, Padawan
**Author:** Patrick Duggan
**Post 9. Because I forgot to tell you the most important part.**
**The Rule of Ephemeral Value:**
**Young data is all you need.**
Old data is technical debt wearing a disguise.
What Is Young Data?
**Young data:**
- Created recently (minutes, hours, days ago)
- Still relevant to current decisions
- Actively being used
- Cheap to store (Redis, cache, temp files)
**Old data:**
- Created years ago
- "Might be useful someday" (it won't)
- Requires maintenance, backups, migrations
- Expensive to store (databases, archives, compliance requirements)
**The trap:** Companies hoard old data "just in case." Then they spend millions migrating it to new systems, backing it up, securing it, and complying with retention laws.
**DugganUSA:** We delete old data. Aggressively.
The Database Illusion
**Why companies use databases:**
**Official reason:** "We need to store data reliably."
**Real reason:** "We don't know which data we'll need in 5 years, so we keep EVERYTHING."
**Result:**
- 10TB databases (90% unused data)
- Schema migrations (because 2019's data model doesn't fit 2025's needs)
- Backup costs ($1,000+/month for cold storage)
- Compliance overhead (GDPR, CCPA, SOC2 audit trails)
**DugganUSA approach:**
- Keep data in Redis (7-day TTL)
- Export critical decisions to flat files (git-committable evidence)
- Delete everything else
**Cost:** $5/month Redis vs $500/month managed database
**Technical debt avoided:** Zero schema migrations, zero backup costs, zero compliance overhead for stale data
The Session Evidence Pattern
**Every DugganUSA session generates evidence files:**
**What's in it:**
- Decisions made (Larry Ellison irrelevant, Paul Galjan validated)
- Value created (4 patents, 8 blog posts)
- Costs incurred ($0.74 API fees)
- Strategic insights (meta-moat, Walmart saved $50B)
**What's NOT in it:**
- Intermediate scratchwork (deleted)
- Failed attempts (Redis expired)
- Temporary calculations (never persisted)
**The data that mattered got committed to git. Everything else expired.**
**That's young data strategy.**
Redis: The Young Data King
**Redis is perfect for young data:**
- In-memory (crazy fast)
- TTL support (auto-delete after 7 days)
- Key-value simplicity (no schema migrations)
- Cheap ($0-5/month for startup workloads)
**DugganUSA uses Redis for:**
- API rate limiting (expires after 1 hour)
- Session state (expires after 24 hours)
- Cache (expires after 7 days)
**What we DON'T use Redis for:**
- Permanent records (git-committed evidence instead)
- Compliance data (flat files, not in-memory cache)
- Historical analysis (if it matters, export it; if it doesn't, let it expire)
**Philosophy:** If data is older than 7 days and you haven't looked at it, you don't need it.
The Padawan Lesson
**Young Jedi:** "Master, we must preserve all data for future analysis."
**Master Yoda:** "Young data, all you need is. Old data, technical debt it becomes. Hmmmm."
**Translation:**
Most startups think they need historical data for "future insights." They don't.
**What you actually need:**
- **Last 7 days:** Active user sessions, API traffic, error rates
- **Last 30 days:** Monthly trends, performance baselines
- **Last 90 days:** Quarterly analysis (if you're that old)
**What you DON'T need:**
- Data from 3 years ago about features that no longer exist
- Logs from servers that have been decomissioned
- User sessions from beta testers who never converted
**If it's older than 90 days and you haven't referenced it, DELETE IT.**
The Economic Argument
**Traditional database costs (3-year-old startup):**
**Year 1:**
- Managed Postgres: $200/month
- Backups: $50/month
- **Total:** $3,000/year
**Year 2:**
- Managed Postgres: $400/month (data growing)
- Backups: $100/month
- Migration to bigger instance: $2,000 (one-time)
- **Total:** $8,000/year
**Year 3:**
- Managed Postgres: $800/month (still growing)
- Backups: $200/month
- Schema migration: $5,000 (because 2023's model is wrong)
- Compliance audit: $10,000 (GDPR requires data retention documentation)
- **Total:** $27,000/year
**3-year total:** $38,000 spent storing data you probably don't need
**DugganUSA (young data strategy):**
**Year 1:**
- Redis: $0/month (free tier)
- Flat files in blob storage: $5/month
- **Total:** $60/year
**Year 2:**
- Redis: $5/month (paid tier, still tiny)
- Flat files: $10/month
- **Total:** $180/year
**Year 3:**
- Redis: $10/month (growing)
- Flat files: $20/month
- **Total:** $360/year
**3-year total:** $600 spent storing ONLY data we actively use
**Savings:** $37,400 (98.4% cost reduction)
The Compliance Advantage
**GDPR right to be forgotten:**
**Company with 3-year database:**
- User requests deletion
- Developer searches 10TB database
- Finds user data in 47 tables
- Writes SQL to cascade delete (6 hours)
- Tests in staging (2 hours)
- Runs in production (hope nothing breaks)
- Documents for audit trail (2 hours)
- **Total cost:** 10 hours engineer time + legal review
**DugganUSA with young data:**
- User requests deletion
- Check Redis: TTL = 5 days (expires automatically)
- Check blob storage: Export user's evidence files, delete
- **Total cost:** 5 minutes
**When old data expires by default, compliance is easy.**
The Velocity Argument
**Schema migrations slow you down:**
**Traditional database:**
- Product team: "We need to add user preferences."
- Engineer: "That requires schema migration."
- DBA: "Migration takes 6 hours on production database."
- Product: "Can we do it during maintenance window?"
- Engineer: "Next window is in 2 weeks."
- **Time to ship:** 2 weeks minimum
**DugganUSA (young data in Redis):**
- Product team: "We need to add user preferences."
- Engineer: "Added to Redis as `user:preferences:{id}` key. TTL 30 days."
- Product: "When can we test?"
- Engineer: "It's live. Go test."
- **Time to ship:** 10 minutes
**No schema. No migration. No DBA approval. Just ship.**
The Philosophical Truth
**Old data is a security blanket.**
**You keep it because:**
- "We might need it someday" (you won't)
- "What if we want to analyze historical trends?" (you don't)
- "Compliance requires retention" (only for specific categories, not everything)
**Reality:**
90% of old data never gets accessed after 30 days. You're paying to store, backup, migrate, and secure data that provides ZERO value.
**Young data strategy:**
- Keep only what's actively useful
- Export critical decisions to git (permanent, cheap)
- Let everything else expire (Redis TTL)
**Result:** Lower costs, higher velocity, easier compliance, zero schema migrations.
The Born Without Sin Connection
**Born Without Sin = no legacy debt**
**Most companies:**
- Start with Postgres
- Accumulate 3 years of data
- Can't migrate (too much data)
- Can't delete (might need it someday)
- Stuck paying $800/month forever
**DugganUSA:**
- Started with Redis + flat files
- 7-day TTL on ephemeral data
- Export important decisions to git
- **Never accumulated legacy database debt**
**When you're born without sin, you don't inherit other people's data hoarding problems.**
Young Data, Only Data
**The rule:**
If data is older than 7 days and you haven't accessed it, you don't need it.
If you DO need historical analysis, export to flat files (git-committable, cheap).
If you're storing "just in case" data in a database, you're paying technical debt interest on data you'll never use.
**Young data is all you need, padawan.**
**Old data is the dark side.**
**P.S.** - This is Post 9. Total session cost: ~$0.82. We're spending more on API calls than most startups spend on database backups for data they'll never use.
**P.P.S.** - Redis with 7-day TTL is $5/month. Managed Postgres with 3 years of stale data is $800/month. Do the math.
**P.P.P.S.** - If Yoda ran a startup, he'd use Redis. "Young data, all you need is. Delete old data, you must. Hmmmm."




Comments