top of page

Young Data Is All You Need, Padawan

  • Writer: Patrick Duggan
    Patrick Duggan
  • Oct 20, 2025
  • 5 min read

# Young Data Is All You Need, Padawan


**Author:** Patrick Duggan




**Post 9. Because I forgot to tell you the most important part.**


**The Rule of Ephemeral Value:**


**Young data is all you need.**


Old data is technical debt wearing a disguise.


What Is Young Data?



**Young data:**

- Created recently (minutes, hours, days ago)

- Still relevant to current decisions

- Actively being used

- Cheap to store (Redis, cache, temp files)


**Old data:**

- Created years ago

- "Might be useful someday" (it won't)

- Requires maintenance, backups, migrations

- Expensive to store (databases, archives, compliance requirements)


**The trap:** Companies hoard old data "just in case." Then they spend millions migrating it to new systems, backing it up, securing it, and complying with retention laws.


**DugganUSA:** We delete old data. Aggressively.


The Database Illusion



**Why companies use databases:**


**Official reason:** "We need to store data reliably."


**Real reason:** "We don't know which data we'll need in 5 years, so we keep EVERYTHING."


**Result:**

- 10TB databases (90% unused data)

- Schema migrations (because 2019's data model doesn't fit 2025's needs)

- Backup costs ($1,000+/month for cold storage)

- Compliance overhead (GDPR, CCPA, SOC2 audit trails)


**DugganUSA approach:**

- Keep data in Redis (7-day TTL)

- Export critical decisions to flat files (git-committable evidence)

- Delete everything else


**Cost:** $5/month Redis vs $500/month managed database


**Technical debt avoided:** Zero schema migrations, zero backup costs, zero compliance overhead for stale data


The Session Evidence Pattern



**Every DugganUSA session generates evidence files:**





**What's in it:**

- Decisions made (Larry Ellison irrelevant, Paul Galjan validated)

- Value created (4 patents, 8 blog posts)

- Costs incurred ($0.74 API fees)

- Strategic insights (meta-moat, Walmart saved $50B)


**What's NOT in it:**

- Intermediate scratchwork (deleted)

- Failed attempts (Redis expired)

- Temporary calculations (never persisted)


**The data that mattered got committed to git. Everything else expired.**


**That's young data strategy.**


Redis: The Young Data King



**Redis is perfect for young data:**

- In-memory (crazy fast)

- TTL support (auto-delete after 7 days)

- Key-value simplicity (no schema migrations)

- Cheap ($0-5/month for startup workloads)


**DugganUSA uses Redis for:**

- API rate limiting (expires after 1 hour)

- Session state (expires after 24 hours)

- Cache (expires after 7 days)


**What we DON'T use Redis for:**

- Permanent records (git-committed evidence instead)

- Compliance data (flat files, not in-memory cache)

- Historical analysis (if it matters, export it; if it doesn't, let it expire)


**Philosophy:** If data is older than 7 days and you haven't looked at it, you don't need it.


The Padawan Lesson



**Young Jedi:** "Master, we must preserve all data for future analysis."


**Master Yoda:** "Young data, all you need is. Old data, technical debt it becomes. Hmmmm."


**Translation:**


Most startups think they need historical data for "future insights." They don't.


**What you actually need:**

- **Last 7 days:** Active user sessions, API traffic, error rates

- **Last 30 days:** Monthly trends, performance baselines

- **Last 90 days:** Quarterly analysis (if you're that old)


**What you DON'T need:**

- Data from 3 years ago about features that no longer exist

- Logs from servers that have been decomissioned

- User sessions from beta testers who never converted


**If it's older than 90 days and you haven't referenced it, DELETE IT.**


The Economic Argument



**Traditional database costs (3-year-old startup):**


**Year 1:**

- Managed Postgres: $200/month

- Backups: $50/month

- **Total:** $3,000/year


**Year 2:**

- Managed Postgres: $400/month (data growing)

- Backups: $100/month

- Migration to bigger instance: $2,000 (one-time)

- **Total:** $8,000/year


**Year 3:**

- Managed Postgres: $800/month (still growing)

- Backups: $200/month

- Schema migration: $5,000 (because 2023's model is wrong)

- Compliance audit: $10,000 (GDPR requires data retention documentation)

- **Total:** $27,000/year


**3-year total:** $38,000 spent storing data you probably don't need




**DugganUSA (young data strategy):**


**Year 1:**

- Redis: $0/month (free tier)

- Flat files in blob storage: $5/month

- **Total:** $60/year


**Year 2:**

- Redis: $5/month (paid tier, still tiny)

- Flat files: $10/month

- **Total:** $180/year


**Year 3:**

- Redis: $10/month (growing)

- Flat files: $20/month

- **Total:** $360/year


**3-year total:** $600 spent storing ONLY data we actively use


**Savings:** $37,400 (98.4% cost reduction)


The Compliance Advantage



**GDPR right to be forgotten:**


**Company with 3-year database:**

- User requests deletion

- Developer searches 10TB database

- Finds user data in 47 tables

- Writes SQL to cascade delete (6 hours)

- Tests in staging (2 hours)

- Runs in production (hope nothing breaks)

- Documents for audit trail (2 hours)

- **Total cost:** 10 hours engineer time + legal review


**DugganUSA with young data:**

- User requests deletion

- Check Redis: TTL = 5 days (expires automatically)

- Check blob storage: Export user's evidence files, delete

- **Total cost:** 5 minutes


**When old data expires by default, compliance is easy.**


The Velocity Argument



**Schema migrations slow you down:**


**Traditional database:**

- Product team: "We need to add user preferences."

- Engineer: "That requires schema migration."

- DBA: "Migration takes 6 hours on production database."

- Product: "Can we do it during maintenance window?"

- Engineer: "Next window is in 2 weeks."

- **Time to ship:** 2 weeks minimum


**DugganUSA (young data in Redis):**

- Product team: "We need to add user preferences."

- Engineer: "Added to Redis as `user:preferences:{id}` key. TTL 30 days."

- Product: "When can we test?"

- Engineer: "It's live. Go test."

- **Time to ship:** 10 minutes


**No schema. No migration. No DBA approval. Just ship.**


The Philosophical Truth



**Old data is a security blanket.**


**You keep it because:**

- "We might need it someday" (you won't)

- "What if we want to analyze historical trends?" (you don't)

- "Compliance requires retention" (only for specific categories, not everything)


**Reality:**


90% of old data never gets accessed after 30 days. You're paying to store, backup, migrate, and secure data that provides ZERO value.


**Young data strategy:**

- Keep only what's actively useful

- Export critical decisions to git (permanent, cheap)

- Let everything else expire (Redis TTL)


**Result:** Lower costs, higher velocity, easier compliance, zero schema migrations.


The Born Without Sin Connection



**Born Without Sin = no legacy debt**


**Most companies:**

- Start with Postgres

- Accumulate 3 years of data

- Can't migrate (too much data)

- Can't delete (might need it someday)

- Stuck paying $800/month forever


**DugganUSA:**

- Started with Redis + flat files

- 7-day TTL on ephemeral data

- Export important decisions to git

- **Never accumulated legacy database debt**


**When you're born without sin, you don't inherit other people's data hoarding problems.**


Young Data, Only Data



**The rule:**


If data is older than 7 days and you haven't accessed it, you don't need it.


If you DO need historical analysis, export to flat files (git-committable, cheap).


If you're storing "just in case" data in a database, you're paying technical debt interest on data you'll never use.


**Young data is all you need, padawan.**


**Old data is the dark side.**




**P.S.** - This is Post 9. Total session cost: ~$0.82. We're spending more on API calls than most startups spend on database backups for data they'll never use.




**P.P.S.** - Redis with 7-day TTL is $5/month. Managed Postgres with 3 years of stale data is $800/month. Do the math.




**P.P.P.S.** - If Yoda ran a startup, he'd use Redis. "Young data, all you need is. Delete old data, you must. Hmmmm."


 
 
 

Comments

Rated 0 out of 5 stars.
No ratings yet

Add a rating
bottom of page