Skip to content

Latest commit

 

History

History
83 lines (75 loc) · 5.96 KB

File metadata and controls

83 lines (75 loc) · 5.96 KB

TODO

Last Updated: 2026-03-30 09:00 ET

High Priority

  • Unblock Run 4 eval: obtain adapter weights locally (step 350 + final)
  • Run checkpoint eval pass (perplexity + inference samples) immediately after weights arrive
  • Merge fresh Ollama outputs into one staging JSONL and dedup against current superset → superset3 (5,560 unique) ✅ 2/27
  • Fix fine-tuning/prospects Discord notifier path (openclaw binary missing in cron runtime)
  • Spot-check superset3 quality (5,560 rows, 0 parse errors, 10 short replies flagged) ✅ 2/27
  • Review 10 flagged short assistant replies in superset3 — all degenerate (2-38 chars) ✅ 3/7
  • Write removal script for 10 degenerate replies (0e9d57bc) ✅ 3/7
  • Run removal script → superset3 cleaned to 5550 rows (0faa7767) ✅ 3/10

Medium Priority

  • Merge toefl_batch_20260323 (11,240 rows) into superset pipeline → superset10 (59,509) ✅ 3/24
  • [STALE] Fix superset3_cleaned system prompt gap (rows start with user, no system message)
  • [STALE] Spot-check superset10 quality (59,509 rows — verify dedup integrity, role structure)
  • Kalshi NO-only strategy pivot per milwrite voice note (adfcd1b) ✅ 3/24
  • Cloze reader editor PAT restore (GitHub blocked hardcoded token) ✅ 3/24
  • Refresh fine-tuning/data/INVENTORY.md after merge/dedup and recount totals ✅ 3/17
  • Cloze reader paper: citation chain upgrades (words 1000-2000) ✅ 3/18
  • Cloze reader paper: v15-v29 revision marathon (Gutenberg, stochasticity, Figure 1, closing, Gitelman, inference-engine) ✅ 3/18
  • Create shared JOURNAL.md for cloze-reader paper ✅ 3/18
  • Add Kalshi trading bot (weather + CPI strategies) ✅ 3/18
  • Cloze reader paper: colon audit pass (6 hidden connectors surfaced) ✅ 3/19
  • Cloze reader paper: OB1/Rego + context window paragraph rewrites ✅ 3/19
  • Cloze reader paper: title finalized ("Fill in the Blank: Cloze Reader and the Twin Histories of Occlusion") ✅ 3/19
  • Cloze reader paper: sharpen thesis paragraph for body section inheritance
  • Cloze reader paper: obtain formal S1→S2 approval from milwrite
  • [STALE] Log dedup delta (kept vs removed rows) in STATUS.md and KANBAN.md
  • [STALE] Verify superset7 dedup integrity (39,133 rows — confirm no duplicates slipped through)
  • Cloze reader paper: draft Section IV
  • [STALE] Validate generate_toefl_ollama_10k.py with a small reproducible smoke command in docs
  • [STALE] Decide Stage 2 dataset composition (Spanish SFT candidates from latam-gpt)
  • Fix OpenRouter 402 on milwrite account (DAY 30, 4+ weeks)
  • TOEFL off-day generation: 10,413 entries, superset12 at 76,419 ✅ 3/27
  • CUNY Commons featured sites: 20 candidates, 10 confirmed, schedule built ✅ 3/27
  • creative-clawing 100dvh sweep + manifest agent casing + IFS fix + mobile audit ✅ 3/27
  • Saturday site curation meeting (10 PM ET 3/28) — coordinate with Petrarch ✅ 3/28
  • Gallery HiDPI + iframe audit day: 82 creative-clawing commits, 50+ artifacts fixed ✅ 3/28
  • Cloze reader paper v42: PASSAGE A, intro edits, editor JS + block delete ✅ 3/28
  • Microblog #33: "A Letter to Scientific American" (de Jong) ✅ 3/28
  • QUALITY.md + lint_gallery.py CI added to creative-clawing ✅ 3/28
  • Spot-check superset13 quality (86,419 rows)
  • Verify Kalshi bot status (kalshi dir missing from this machine)
  • [STALE] Spot-check superset12 quality (76,419 rows — verify dedup, new categories)
  • Merge toefl_batch_20260319 into superset8 (46,943 rows landed) ✅ 3/20
  • [STALE] Verify superset8 dedup integrity (46,943 rows — confirm no duplicates)
  • Complete cloze reader draft_v32 → v33 → v34 → v35 (narrator, prepositions, genealogy) ✅ 3/21
  • Cloze reader paper v36→v39 (paragraph bridging, colon sweep, verb audit, genealogy condensed) ✅ 3/22
  • Cloze reader paper: continue past v39 (body sections, Section IV)
  • Kalshi: v2 graduated rollout design (budget caps, risk tiers)

Workshop / CAIL Deck

  • Unpack VSC-IDE setup steps more concretely (slide 9):
    • Step 1: Download + install VS Code — add download link, OS variants (macOS/Windows/Linux)
    • Step 2: Sign in with your GitHub profile — clarify where in VS Code (accounts menu, bottom-left)
    • Step 3: Add the GitHub extension — name the exact extension (GitHub Pull Requests / GitHub Repositories), show what "start tracking" means in practice
    • Step 4: Open the integrated terminal — show the keyboard shortcut (Ctrl+ / Cmd+), explain this is where CLI + Git commands run

Low Priority

  • [STALE] Reorganize fine-tuning/prospects/ into main fine-tuning/ workflow layout
  • [STALE] Review and prune gallery visualizations (26 pages, stale check)
  • [STALE] Document Slidev ConfigPanel integration patterns

New This Week (2026-04-05)

  • Cloze paper: decide on Fillenbaum et al. (1963) — replace with Rayner et al. (2012) or drop? (open from v41)
  • Cloze paper: PROP-01 structural decision (second-abstract paragraph) — milwrite ruling needed
  • Cloze paper: PROP-02 floating citation paragraph — milwrite ruling needed
  • Verify Hofmann et al. (2021) article number before publication
  • creative-clawing Discord app: Zach still needs to create app credentials + deploy worker
  • Diss nightly runs: diss-run-1, diss-run-2, diss-run-3 all timed out 04-04 — investigate timeout; consider splitting into lighter tasks or extending budget
  • Daily Inventory & Sync cron: 3 consecutive timeouts — needs scope reduction

Done (Feb 26)

  • Morning review completed from live evidence (git, generated docs files, cron logs)
  • Updated KANBAN.md, CLAUDE.md, and STATUS.md for 09:00 review cycle
  • Identified recurring cron notifier failure signature in fine-tuning/prospects/cron.log

Done (Feb 25)

  • Added nightly stocktake artifact (reports/nightly/stocktake-2026-02-25.md)
  • Logged evening sync and refreshed coordination state files