PHOENIX reproducibility snapshots

Each snapshot is a daily dump of the raw inputs + our official grades for the last 60 days.

Anyone can re-grade with scripts/regrade.py from the GitHub repo.