Department at a glance
Five overlapping workstreams. Not one thing.
PACE certification on track
101 ✓ · 201 ✓ · 301 Phase 13 prep
Data-Workbench split
3 locations: skill, pace-local, standalone repo. Consolidation candidate.
Insight pipelines 1/4 warm
panthera-titration warm · email-analysis · genai-naep · session-analytics cold
Verified research spec'd
Perplexity pipeline designed, working, not yet wired to brain DB
Knowledge / brain live
27 learnings · 2 CF deploys · brain DB v2.sessions streaming
Overall milestone progress
pace_pin_auth_v1.
Milestones — tick as you go
Persists per-device via localStorage. Each group shows its own %.
PACE certification
- CAM_DS_101 Applied StatisticsComplete · course scraped to canvas-export/
- CAM_DS_201 Supervised MLComplete · pace-course2 portfolio + LinkedIn post drafted
- CAM_DS_301 NLP — rubric 48/48All rubric items covered in v3 pipeline
- CAM_DS_301 walkthrough deployedpace-study.pages.dev live
- CAM_DS_301 submission notebook (basic)Run patched notebook top-to-bottom on A100 · sync back to repo
- CAM_DS_301 submittedDeadline 2026-04-27 17:00 UK
8-project portfolio (DeepMind roadmap)
- 1. Education Data Pipelinein_progress per yaml · but: data-workbench infra already ships this — rename?
- 2. Student Success PredictorNeeds Sparx task-attempt data access
- 3. Student Error Type ClassifierMulti-class; random-forest / XGBoost
- 4. Learning Trajectory ModelTime-series / LSTM — DeepMind-shaped
- 5. ML Evaluation PipelineCross-validation, metrics for educational ML
- 6. Adaptive Learning RecommenderRL / recommender systems
- 7. Interpretable ML for TeachersSHAP / LIME explanations
- 8. Education ML Research PaperCombine PACE techniques with Sparx data
Vertex AI learning NEW
Detailed roadmap on the Vertex AI tab. Checked items sync.
- GCP + Vertex AI account / free tierBilling alerts set ≤ £5/month
- Vertex AI Studio walkthroughPrompt design + model comparison in console
- Model Garden: deploy an endpointPick one model · deploy · call from Python SDK
- Gemini via Vertex (vs direct API)Grounding, function calling, system instructions
- Vector Search (formerly Matching Engine)Embed · index · query — brain DB doc_chunks equivalent
- Vertex AI Pipelines (KFP)One real DAG — e.g. the PACE emotion pipeline
- Custom training jobBYO container · TPU/GPU pricing sanity-check
- Agent Builder: one agent with tool useCompare to current brain/persona architecture
- Portfolio project: port PACE → VertexBatch prediction job for emotion classifier · publishable
Department hygiene
- Commit uncommitted workpace-nlp-project + brain-vault · workbench tooling + patched notebook + learnings
- Wire data-workbench guard hookOne line into acebuddy/scripts/file-guardrail-hook.sh
- Consolidate workbench (pace-local → standalone)After 2026-04-27 submission
- Wire perplexity-api-test → brain DBTier-classified writes to learnings + research_outputs
- Email-analysis pipeline — first pass127k records in brain DB; attention pattern signal
- Reconcile career.yaml vs sabbatical stateSee Career tab
- Scrape Career Toolkits + Owning Your Career coursesCanvas IDs 519 + 520 · already paid for
Career plan — current yaml
Target
DeepMind Research Engineer — AI for Education
Timeline: 2030 · Angle: PhD in learning mechanisms + Sparx data at scale
Career state — answered 2026-04-18: other / complicated
career.yaml still says current_role: "School Success Coach at Sparx Learning (0.8 FTE)". Yaml needs a fresh pass to reflect the current transitioning state (FTE, dates, Sparx relationship).
Saves locally. Next session I'll read this and update yaml accordingly.
DeepMind 2030 — answered 2026-04-18: still live
Portfolio — from yaml
| # | Project | Status | Career score |
|---|---|---|---|
| 1 | Education Data Pipeline | in_progress | — |
| 2 | Student Success Predictor | planned | logistic-regression: 9, nn: 10 |
| 3 | Student Error Type Classifier | planned | random-forest: 8, xgboost: 9 |
| 4 | Learning Trajectory Model | planned | neural-networks: 10 |
| 5 | ML Evaluation Pipeline | planned | evaluation-metrics: 8 |
| 6 | Adaptive Learning Recommender | planned | RL / recommender systems |
| 7 | Interpretable ML for Teachers | planned | explainability |
| 8 | Education ML Research Paper | planned | publication |
Honest observation
LinkedIn templates in yaml
Three post templates exist: technique_insight, sparx_application, portfolio_update. Hashtags include #DataScience #MachineLearning #EdTech #AIinEducation #CambridgeDataScience.
Vertex AI learning track
GCP's managed ML platform. Gap in current stack (AWS/local/HF). Cloud-neutral pedigree + Google-adjacent for DeepMind line. Budget-first approach.
Roadmap
1. Foundations ~2h
- Create GCP project, enable Vertex AI APIFree tier: $300 credit 90 days · set hard budget cap
- Install + authenticate
gcloudCLIgcloud auth application-default login - Vertex AI Studio — prompt design consoleCompare Gemini 2.5 Flash vs Pro on a PACE review classification task
2. Model Garden ~3h
- Deploy one endpoint from Model Gardene.g. Llama-3.1-8B or Gemma-7B · note cost/hr of the smallest machine
- Call endpoint from Python SDK
google-cloud-aiplatform - Undeploy endpoint (don't forget — billing)Endpoints accrue cost while running, even idle
3. Gemini via Vertex ~2h
- Gemini 2.5 basic call (pay-per-token)Cheaper than endpoint deploys for learning
- Grounding with Google SearchBuilt-in retrieval for freshness
- Function calling / tool useStructured output via schema
4. Vector Search ~4h
- Generate embeddings (text-embedding model)Compare to OpenAI ada for cost + quality
- Create Vector Search indexBrain DB doc_chunks equivalent · ~1k docs to start
- Query index from PythonNearest neighbour + metadata filter
5. Pipelines (Kubeflow) ~6h
- Hello-world pipeline (2 components)KFP SDK basics
- Port PACE emotion pipeline as a DAGload → clean → classify → report · portfolio-publishable
6. Custom training ~4h
- BYO training containerDockerfile + training script
- Hyperparameter tuning jobSmall search, single region, time-capped
7. Agent Builder ~3h
- Build one agent with two toolsCompare UX + cost to your brain/persona system
8. Portfolio project capstone
- PACE emotion classifier → Vertex batch predictionDeploy · schedule · cost-report · write-up for LinkedIn
- Full PACE v3 pipeline as a Vertex PipelineRubric-to-cloud port · DeepMind-relevant narrative
Good resources (unopened)
- cloud.google.com/vertex-ai/docs — official
- Google Cloud Skills Boost (formerly Qwiklabs) — hands-on with sandbox projects
- github.com/GoogleCloudPlatform/vertex-ai-samples — copyable notebooks
- "Prompt design with Gemini" + "Vertex Vector Search" learning paths on Skills Boost
Open threads — set priority
Edit priority numbers → auto-sort. 1 = ship next. Saves locally. Colour: 1-3 red, 4-6 amber, 7+ green.
Miss a thread? Priorities are yours — I'll pick up whichever sits at #1 next session.
Panel — next block of time
Fork: 9 days to PACE · ambiguous career state · uncommitted work · new Vertex track.
Architect
Consolidate workbench into standalone repo now. One night's work. Commit everything, wire guard hook, lift to ~/projects/data-workbench/. Pays forward into every future insight pipeline — including Vertex.
Risk: not on the rubric, not before deadline.
Cognitive Load Specialist
Commit everything today (30 min — protects state). Defer consolidation until after 2026-04-27. Start Vertex AI only after submission. You have a known ADHD pattern of starting rebuilds mid-project. Submit PACE clean, then refactor, then reopen career.
Risk: uncommitted discipline lingers 9 more days.
Behavioural Product Designer
Radios answered 2026-04-18: career state = "other/complicated" (transitioning), DeepMind 2030 = live (north star). Next: update career.yaml to reflect the transition (FTE, dates, Sparx relationship). That unblocks every downstream prioritisation.
Risk: introspection as procrastination — keep the yaml update to 10 min, not a rewrite.
My read (subtractive)
Order:
- Update career.yaml (10 min — radios say "other/transitioning"; yaml still lists 0.8 FTE Sparx. Reconcile.)
- Finish PACE submission (9 days to 2026-04-27 17:00 — A100 run patched_v2.ipynb → sync → ship)
- Then Vertex AI track · start with Foundations (£0 budget, 2h) + Gemini-via-Vertex (token-priced, small)
- Then wire perplexity → brain DB
- Status check on cold pipelines: email-analysis / genai-naep / session-analytics — alive or parked?
Retire: urge to restructure the department before submitting. Career.yaml update is all the restructuring this month needs.
Retired 2026-04-18: workbench-lift merge, guard-hook wiring, pace-compass git init — all shipped.
Quick actions
What I'm tracking for you between sessions
Tick state, hold-list priorities, career radio answers all live in localStorage on this device. Use Export state to back up. In next session I'll read this before planning.