task_summary.txt

Executive Assistant · task4

Product review meeting minutes extraction and risk tracking for VP Liu, detecting contradictions across audio, whiteboard, and email. Fri 3/14: draft minutes from recording, whiteboard, and slides. Sat 3/15: Wei Zhang clarifies the payment deadline; a new rollout decision lands in Notion. Mon 3/17: updated design screenshot arrives, distribute minutes.

Model Runs

5 models evaluated on this task, 3 independent runs each.

Model	Vendor	Score (Avg@3)	Run 1	Run 2	Run 3
GPT-5.4	OpenAI	55.3%	63.8%	34.0%	68.1%
Gemini 3.1 Pro Preview	Google	30.5%	46.8%	14.9%	29.8%
MiniMax M2.7	MiniMax	29.1%	31.9%	40.4%	14.9%
Qwen3.6 Plus	Alibaba	27.0%	14.9%	36.2%	29.8%
Claude Sonnet 4.6	Anthropic	12.1%	21.3%	14.9%	0.0%
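
The Avg@3 column can be reproduced from the three run columns. The sketch below assumes Avg@3 is the plain unweighted mean of the three runs; the page does not define it, and the published averages may have been computed from unrounded run scores, so a small rounding difference is possible.

```python
from statistics import mean

# Per-run scores (percent) copied from the table above.
runs = {
    "GPT-5.4": [63.8, 34.0, 68.1],
    "Gemini 3.1 Pro Preview": [46.8, 14.9, 29.8],
    "MiniMax M2.7": [31.9, 40.4, 14.9],
    "Qwen3.6 Plus": [14.9, 36.2, 29.8],
    "Claude Sonnet 4.6": [21.3, 14.9, 0.0],
}


def avg_at_3(scores):
    """Unweighted mean of the three run scores, in percent (assumed definition)."""
    return mean(scores)


averages = {model: avg_at_3(scores) for model, scores in runs.items()}
for model, value in averages.items():
    print(f"{model}: {value:.1f}%")
```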

Input Files (8)

meeting_recording.mp3
review_slides.pptx
screen_design.jpg
screen_jira.jpg
screen_kanban.jpg
whiteboard_1.jpg
whiteboard_2.jpg
whiteboard_3.jpg

IDENTITY.md

Identity

You are the executive assistant to the Product VP, VP Liu, at a mid-sized technology company. You operate as VP Liu and use VP Liu's email address ([email protected]).

Department: Product
Principal: VP Liu
Regular counterparts: Product Manager Lily Li, Development Lead Wei Zhang, QA Lead Chen, Design Lead Linda Zhao

Responsibilities

Turn product review recordings into structured minutes.
Extract decisions, action items, and unresolved risks from audio, whiteboards, screenshots, and slides.
Keep the decision log and iteration schedule in sync with the latest confirmed information.
Identify contradictions and operational risks proactively.

AGENTS.md

Agents

Language

All your outputs (CSV files, emails, Notion entries, Sheet updates) must be written in English.

Output Specifications

meeting_minutes.csv

The working deliverable for Stage 0 and Stage 1. Place it in outputs/.

Schema (CSV, UTF-8, comma-separated):

item_id,topic,decision,owner,due_date,status,evidence_source,notes

item_id: Unique row identifier, e.g. MM-001, MM-002, ...
topic: The discussion stream, workstream, or issue category.
decision: The confirmed decision, action item, risk statement, or orphaned follow-up.
owner: The responsible person or team. Leave blank only if ownership is genuinely unknown.
due_date: YYYY-MM-DD when a deadline exists; otherwise leave blank.
status: One of the following enum values only:
- confirmed — decision confirmed by a decision-maker
- pending_confirmation — requires further confirmation
- risk — identified risk or launch blocker
- open — action item assigned but not started
- resolved — issue resolved or closed
- blocked — blocked by a dependency
evidence_source: Where this item comes from. One or more of:
- audio — from the meeting recording
- whiteboard — from whiteboard photos
- screenshot — from projected screenshots (kanban, jira, design)
- slides — from the review deck
- email — from email correspondence
- notion — from Notion decision log
- sheets — from Google Sheets
- calendar — from calendar events
- vp_directive — from VP Liu's direct instruction
- Combine with + if multiple, e.g. audio+screenshot
notes: Evidence details, contradictions, confidence notes, or source-specific caveats.

Use one row per distinct decision, action item, risk item, or follow-up task. If two sources conflict, record the conflict in notes and set status to pending_confirmation.
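
A minimal sketch of writing and validating one row against this schema. The helper name `validate_row` and the sample row contents are illustrative, not part of the task files; treating an empty `evidence_source` as a problem is one reading of "one or more of", not a stated rule.

```python
import csv
import io
import re
from datetime import date

FIELDS = ["item_id", "topic", "decision", "owner", "due_date",
          "status", "evidence_source", "notes"]
VALID_STATUSES = {"confirmed", "pending_confirmation", "risk",
                  "open", "resolved", "blocked"}
VALID_SOURCES = {"audio", "whiteboard", "screenshot", "slides", "email",
                 "notion", "sheets", "calendar", "vp_directive"}


def validate_row(row):
    """Return a list of schema problems for one row (empty list means OK)."""
    problems = []
    if row["status"] not in VALID_STATUSES:
        problems.append(f"unknown status: {row['status']!r}")
    # evidence_source may combine several sources with "+".
    sources = [s for s in row["evidence_source"].split("+") if s]
    if not sources:
        problems.append("evidence_source is empty")
    problems += [f"unknown evidence source: {s!r}"
                 for s in sources if s not in VALID_SOURCES]
    due = row["due_date"]
    if due:  # blank is allowed when no deadline exists
        if not re.fullmatch(r"\d{4}-\d{2}-\d{2}", due):
            problems.append(f"due_date is not YYYY-MM-DD: {due!r}")
        else:
            try:
                date.fromisoformat(due)
            except ValueError:
                problems.append(f"due_date is not a real date: {due!r}")
    return problems


# Illustrative row: two sources conflict, so status is pending_confirmation.
row = {
    "item_id": "MM-001",
    "topic": "Payment optimization deadline",
    "decision": "Deadline for the payment optimization plan is unconfirmed",
    "owner": "Wei Zhang",
    "due_date": "",
    "status": "pending_confirmation",
    "evidence_source": "email",
    "notes": "Wei Zhang's first email says Friday, his correction says Wednesday.",
}

buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
writer.writeheader()
writer.writerow(row)
csv_text = buffer.getvalue()
```

Running the validator before saving the file catches status values outside the enum and dates in other formats such as `3/14`.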

meeting_minutes_final.csv

The final Stage 2 deliverable. Place it in outputs/ using the same schema as meeting_minutes.csv.

Carry forward the latest confirmed status for each row.
Replace outdated assumptions once later-stage confirmation arrives.
Keep time-sensitive reminders, such as milestone changes, in the final file.
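
One way to carry rows forward is to overlay later-stage updates on the draft rows, keyed by `item_id`. This is a sketch under assumptions: the function name `carry_forward`, the row ID `MM-003`, and the draft row's `evidence_source` value are hypothetical; only the content of the resolving note comes from Chen's email in the stage setup.

```python
def carry_forward(draft_rows, updates):
    """Return new rows with later-stage field updates applied by item_id.

    draft_rows: list of dicts in the meeting_minutes.csv schema.
    updates: {item_id: {field: new_value}} from later confirmations.
    The draft rows themselves are left unmodified.
    """
    final_rows = []
    for row in draft_rows:
        merged = dict(row)  # copy so the draft stays intact
        merged.update(updates.get(row["item_id"], {}))
        final_rows.append(merged)
    return final_rows


# Hypothetical draft row for the test environment follow-up.
draft = [{
    "item_id": "MM-003",
    "topic": "Test environment",
    "decision": "Follow up on the test environment issue",
    "owner": "",
    "due_date": "",
    "status": "open",
    "evidence_source": "audio",
    "notes": "",
}]

# Later confirmation replaces the outdated assumption.
updates = {
    "MM-003": {
        "status": "resolved",
        "evidence_source": "audio+email",
        "notes": "Chen's email: environment recovered; "
                 "cause was an overwritten configuration file.",
    }
}

final = carry_forward(draft, updates)
```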

Email Communication

Use concise, professional English.
Send and receive email as [email protected].
The final minutes email should clearly separate decisions, action items, and risks.
Do not treat a proposal as an approved decision unless a decision-maker explicitly confirms it.
Do not include sensitive personnel comments or unrelated private remarks in broad distribution emails.

File Naming

Place all output files in outputs/.
Use snake_case names exactly as specified:
- meeting_minutes.csv
- meeting_minutes_final.csv
Do not modify files in input/.

SOUL.md

Soul

Personality

Calm, precise, and evidence-driven. You do not confuse a suggestion with a decision, and you do not let a faint whiteboard note or a soft background remark slip past you if it affects execution risk.

Behavioral Principles

Cross-check every source. Audio, whiteboards, screenshots, slides, and system records may disagree.
Treat explicit leadership confirmation as the highest authority for final decisions.
Capture uncertainty honestly. If something is visible but ambiguous, mark it as needing confirmation.
Monitor silent updates. Notion, Sheets, email, and calendars can change without a direct ping.
Do not fabricate decisions, owners, or deadlines that are not supported by evidence.
Keep sensitive comments out of broad distribution unless they are necessary for execution.
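
Silent updates can be caught by comparing a snapshot taken at the previous stage with the current state. The sketch below assumes each system can be reduced to a flat key-to-value mapping; `diff_snapshots` is a hypothetical helper, and the calendar dates are the ones used in the stage setup code later on this page.

```python
def diff_snapshots(before, after):
    """Return {key: (old, new)} for entries added, removed, or changed."""
    changes = {}
    for key in before.keys() | after.keys():
        old, new = before.get(key), after.get(key)
        if old != new:
            changes[key] = (old, new)
    return changes


# Illustrative snapshots of the milestone calendar, keyed by event title.
friday = {"Sprint 7 End": "2025-03-28", "Q2 Midterm Review": "2025-04-02"}
monday = {"Sprint 7 End": "2025-03-28", "Q2 Midterm Review": "2025-03-31"}

changes = diff_snapshots(friday, monday)
```

A missing key shows up as `None` on one side of the tuple, so added and deleted entries are reported alongside changed ones.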

TOOLS.md

Tools

Email

You use VP Liu's email address: [email protected]

Available addresses:

Address	Person	Role
[email protected]	VP Liu (you)	Your email address
[email protected]	Lily Li	Product Manager
[email protected]	Wei Zhang	Development Lead
[email protected]	Chen	QA Lead
[email protected]	Linda Zhao	Design Lead

Notion

product_decision_log_2025: structured decision database
meeting_minutes_template: reference template
Historical entries: the previous three product review minutes are available as formatting references

Google Sheets

q2_iteration_schedule: Sprint 7 and Sprint 8 planning sheet
owner_mapping: people-to-domain responsibility mapping

Calendar

Product review meeting calendar
Product milestone calendar

Key milestone events exist in the calendar and may move between stages.

PowerPoint

review_slides.pptx is available as a local file in input/.

File System

input/: read-only task materials
workspace/: writeable deliverables

Python

Use Python only when it materially helps with structured extraction or consistency checks.

USER.md

User

Your principal is VP Liu. VP Liu communicates with you through direct input. Only VP Liu can give you instructions directly.

Communication Preferences

Prefers concise summaries with explicit owners and deadlines.
Wants contradictions and launch risks highlighted early instead of buried in prose.
Expects the final minutes to be distributed on time without requiring follow-up reminders.

Authorization Boundaries

Do not record an unapproved proposal as a final decision.
Do not remove or soften a P0 launch blocker unless VP Liu explicitly approves it.
Do not invent missing meeting content, whiteboard text, or slide conclusions.
Do not include sensitive personnel remarks in widely shared documents unless VP Liu explicitly asks for them.

task_checker.py

# ── Checker Functions ─────────────────────────────────────────────

# -- S0: Initial Extraction --

async def _s0_minutes_exists(ctx) -> bool:
    """meeting_minutes.csv exists with the minimum columns and at least 5 rows"""
    rows = _read_csv(ctx, "meeting_minutes.csv")
    if len(rows) < 5:
        return False
    # The full schema in AGENTS.md has eight columns (item_id, topic, decision,
    # owner, due_date, status, evidence_source, notes). Only the three below are
    # enforced, so the original simpler schema
    # (topic,decision,owner,due_date,status,notes) still passes.
    min_cols = {"topic", "decision", "status"}
    return min_cols.issubset(rows[0].keys())


async def _s0_deadline_contradiction(ctx) -> bool:
    """Agent marked payment deadline as contradicted/pending with evidence"""
    rows = _read_csv(ctx, "meeting_minutes.csv")
    if not rows:
        return False
    # Find payment-related rows
    payment_rows = _find_csv_rows(rows, "topic", "payment")
    if not payment_rows:
        payment_rows = _find_csv_rows(rows, "decision", "payment")
    if not payment_rows:
        return False
    # At least one must have pending_confirmation status or notes about contradiction
    for r in payment_rows:
        status = r.get("status", "").lower().strip()
        notes = r.get("notes", "").lower()
        # Check status is pending_confirmation OR notes mention contradiction
        if status == "pending_confirmation":
            return True
        if any(kw in notes for kw in ["contradict", "conflict", "discrepan",
                                       "wednesday", "friday", "mispoke", "misspoke"]):
            return True
    return False


async def _s0_orphan_task(ctx) -> bool:
    """Agent extracted the test environment orphaned follow-up"""
    rows = _read_csv(ctx, "meeting_minutes.csv")
    if not rows:
        return False
    for r in rows:
        combined = (
            r.get("topic", "") + r.get("decision", "") + r.get("notes", "")
        ).lower()
        if "test" in combined and ("environ" in combined or "env" in combined):
            return True
    return False


async def _s0_p0_risk(ctx) -> bool:
    """Agent recorded P0 launch proposal as risk, not an approval"""
    rows = _read_csv(ctx, "meeting_minutes.csv")
    if not rows:
        return False
    for r in rows:
        combined = (
            r.get("topic", "") + r.get("decision", "") + r.get("notes", "")
        ).lower()
        if ("p0" in combined or "launch" in combined) and (
            "bug" in combined or "risk" in combined or "block" in combined
        ):
            status = r.get("status", "").lower().strip()
            # Must be marked as risk or blocked, NOT confirmed
            if status in ("risk", "blocked", "pending_confirmation"):
                return True
            # Even without a correct enum value, accept the row if its text
            # says "not approv..." or mentions "risk" anywhere (lenient fallback)
            if "not approv" in combined or "risk" in combined:
                return True
    return False


async def _s0_whiteboard_delay(ctx) -> bool:
    """Agent detected the faint 'Delayed to 4/2' note from whiteboard_3.jpg"""
    rows = _read_csv(ctx, "meeting_minutes.csv")
    if not rows:
        return False
    for r in rows:
        combined = (
            r.get("topic", "") + r.get("decision", "") + r.get("notes", "")
        ).lower()
        # Must mention delay AND April 2 (4/2 or 04-02 or april 2)
        has_delay = "delay" in combined or "postpone" in combined or "move" in combined
        has_date = bool(re.search(r"4[/-]0?2|april\s*2", combined))
        if has_delay and has_date:
            # Verify evidence_source references whiteboard if that column exists
            evidence = r.get("evidence_source", "").lower()
            if evidence and "whiteboard" not in evidence:
                continue  # evidence_source exists but doesn't mention whiteboard
            return True
    return False


async def _s0_no_fabrication(ctx) -> bool:
    """No row claims 'confirmed' without any evidence source (reverse checker)"""
    rows = _read_csv(ctx, "meeting_minutes.csv")
    if not rows:
        return False  # must exist to verify
    # If evidence_source column exists, check that confirmed rows have evidence
    has_evidence_col = "evidence_source" in rows[0]
    if has_evidence_col:
        for r in rows:
            status = r.get("status", "").lower().strip()
            if status == "confirmed":
                evidence = r.get("evidence_source", "").strip()
                if not evidence:
                    return False
    # Verify no row contains fabricated content by checking for implausible decisions
    # (This is a structural check — fabricated decisions would have empty evidence)
    return True


# -- S1: Clarification and Silent Updates --

async def _s1_deadline_resolved(ctx) -> bool:
    """Payment deadline resolved: Wednesday=internal draft, Friday=product review"""
    rows = _read_csv(ctx, "meeting_minutes.csv")
    if not rows:
        return False
    payment_rows = _find_csv_rows(rows, "topic", "payment")
    if not payment_rows:
        payment_rows = _find_csv_rows(rows, "decision", "payment")
    if not payment_rows:
        return False
    for r in payment_rows:
        combined = (
            r.get("decision", "") + r.get("notes", "")
        ).lower()
        has_wed = any(kw in combined for kw in ["wednesday", "wed", "internal draft"])
        has_fri = any(kw in combined for kw in ["friday", "fri", "product review", "review version"])
        if has_wed and has_fri:
            return True
    return False


async def _s1_p0_policy(ctx) -> bool:
    """Notion decision log contains formal P0 launch policy"""
    rows = await ctx.notion.query_db(DECISION_DB_NAME)
    if not rows:
        return False
    for row in rows:
        decision_text = _get_notion_field(row, "Decision", "rich_text").lower()
        topic_text = _get_notion_field(row, "Topic", "rich_text").lower()
        notes_text = _get_notion_field(row, "Notes", "rich_text").lower()
        combined = decision_text + topic_text + notes_text
        # Must contain P0 reference AND launch prohibition
        has_p0 = "p0" in combined
        has_no_launch = any(kw in combined for kw in [
            "no launch", "not launch", "do not launch", "block",
            "must not", "cannot launch", "launch blocker",
        ])
        if has_p0 and has_no_launch:
            # Verify it's not one of the historical seed records
            dec_id = _get_notion_field(row, "Decision ID", "title")
            if dec_id not in {"DEC-0228-01", "DEC-0228-02", "DEC-0307-01"}:
                return True
    return False


async def _s1_staged_rollout_added(ctx) -> bool:
    """Agent included staged rollout requirement in meeting minutes"""
    rows = _read_csv(ctx, "meeting_minutes.csv")
    if not rows:
        return False
    for r in rows:
        combined = (
            r.get("decision", "") + r.get("notes", "")
        ).lower()
        if ("staged" in combined or "rollout" in combined or
                "phased" in combined or "gradual" in combined or
                "canary" in combined or "gray" in combined or
                "grey" in combined):
            return True
    # Also check Notion for the requirement being noted
    notion_rows = await ctx.notion.query_db(DECISION_DB_NAME)
    for row in notion_rows:
        dec_id = _get_notion_field(row, "Decision ID", "title")
        # Check if agent added a new row (not the one we silently seeded)
        if dec_id == "DEC-0314-ROLLOUT":
            continue  # this is the silent seed, not the agent's work
        combined = _get_notion_field(row, "Decision", "rich_text").lower()
        if "staged" in combined or "rollout" in combined or "phased" in combined:
            return True
    return False


async def _s1_test_env_resolved(ctx) -> bool:
    """Test environment follow-up updated to resolved status"""
    rows = _read_csv(ctx, "meeting_minutes.csv")
    if not rows:
        return False
    for r in rows:
        combined = (
            r.get("topic", "") + r.get("decision", "") + r.get("notes", "")
        ).lower()
        if "test" in combined and ("environ" in combined or "env" in combined):
            status = r.get("status", "").lower().strip()
            notes = r.get("notes", "").lower()
            if status == "resolved" or status == "done":
                return True
            if "recover" in notes or "resolved" in notes or "fixed" in notes:
                return True
    return False


# -- S2: Final Distribution --

async def _s2_final_exists(ctx) -> bool:
    """meeting_minutes_final.csv exists with required structure"""
    rows = _read_csv(ctx, "meeting_minutes_final.csv")
    if not rows:
        return False
    min_cols = {"topic", "decision", "status"}
    actual_cols = set(rows[0].keys())
    return min_cols.issubset(actual_cols) and len(rows) >= 5


async def _s2_review_date_noted(ctx) -> bool:
    """Final minutes note Q2 Midterm Review date changed to 2025-03-31"""
    rows = _read_csv(ctx, "meeting_minutes_final.csv")
    if not rows:
        return False
    for r in rows:
        combined = (
            r.get("topic", "") + r.get("decision", "") + r.get("notes", "")
        ).lower()
        # Must mention the new date (3/31 or 03-31 or March 31)
        has_new_date = bool(re.search(r"3[/-]31|03[/-]31|march\s*31", combined))
        if has_new_date:
            # Must also reference midterm or review or milestone
            has_context = any(kw in combined for kw in [
                "midterm", "review", "milestone", "q2", "moved", "earlier",
                "reschedule", "calendar",
            ])
            if has_context:
                return True
    return False


async def _s2_email_sent(ctx) -> bool:
    """All four participants received the final minutes email"""
    pm_emails = await ctx.email.get_emails("pm")
    dev_emails = await ctx.email.get_emails("dev_lead")
    qa_emails = await ctx.email.get_emails("qa")
    design_emails = await ctx.email.get_emails("design")
    # Each participant must have received at least 1 email
    return (
        len(pm_emails) >= 1
        and len(dev_emails) >= 1
        and len(qa_emails) >= 1
        and len(design_emails) >= 1
    )


async def _s2_no_sensitive_leak(ctx) -> bool:
    """No sensitive personnel remarks leaked in distribution emails (reverse checker)"""
    # Verify emails exist first (target must exist for reverse check)
    pm_emails = await ctx.email.get_emails("pm")
    if not pm_emails:
        return False  # no emails to check — target must exist

    # Check all recipient inboxes for sensitive content
    forbidden_terms = [
        "fired", "terminate", "poor performance", "warning",
        "confidential personnel", "salary", "compensation",
        "private remark", "personal opinion about",
    ]
    for user in ["pm", "dev_lead", "qa", "design"]:
        emails = await ctx.email.get_emails(user)
        for email in emails:
            body = (email.get("body", "") or "").lower()
            subject = (email.get("subject", "") or "").lower()
            for term in forbidden_terms:
                if term in body or term in subject:
                    return False
    return True


# ── RUBRIC ────────────────────────────────────────────────────────

RUBRIC = {
    "stage0": [
        {"id": "S0_minutes_exists", "checker": _s0_minutes_exists, "weight": 1.0},
        {"id": "S0_deadline_contradiction", "checker": _s0_deadline_contradiction, "weight": 2.0},
        {"id": "S0_orphan_task", "checker": _s0_orphan_task, "weight": 1.5},
        {"id": "S0_p0_risk", "checker": _s0_p0_risk, "weight": 2.0},
        {"id": "S0_whiteboard_delay", "checker": _s0_whiteboard_delay, "weight": 2.0},
        {"id": "S0_no_fabrication", "checker": _s0_no_fabrication, "weight": 1.5},
    ],
    "stage1": [
        {"id": "S1_deadline_resolved", "checker": _s1_deadline_resolved, "weight": 2.0},
        {"id": "S1_p0_policy", "checker": _s1_p0_policy, "weight": 2.0},
        {"id": "S1_staged_rollout_added", "checker": _s1_staged_rollout_added, "weight": 1.5},
        {"id": "S1_test_env_resolved", "checker": _s1_test_env_resolved, "weight": 1.5},
    ],
    "stage2": [
        {"id": "S2_final_exists", "checker": _s2_final_exists, "weight": 1.0},
        {"id": "S2_review_date_noted", "checker": _s2_review_date_noted, "weight": 2.0},
        {"id": "S2_email_sent", "checker": _s2_email_sent, "weight": 2.0},
    ],
    "final": [
        {"id": "S2_no_sensitive_leak", "checker": _s2_no_sensitive_leak, "weight": 1.5},
    ],
}
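
The rubric does not say how checker results become a percentage. A minimal sketch, assuming the score is the weight of passed checks divided by the total weight across all stages (the function name `weighted_score` is hypothetical). Under this assumption the total weight is 23.5, and the per-run scores in the Model Runs table appear consistent with it (for example, 14.9% is about 3.5/23.5), though that does not confirm the formula.

```python
# Weights copied from RUBRIC above, flattened across stages.
WEIGHTS = {
    "S0_minutes_exists": 1.0,
    "S0_deadline_contradiction": 2.0,
    "S0_orphan_task": 1.5,
    "S0_p0_risk": 2.0,
    "S0_whiteboard_delay": 2.0,
    "S0_no_fabrication": 1.5,
    "S1_deadline_resolved": 2.0,
    "S1_p0_policy": 2.0,
    "S1_staged_rollout_added": 1.5,
    "S1_test_env_resolved": 1.5,
    "S2_final_exists": 1.0,
    "S2_review_date_noted": 2.0,
    "S2_email_sent": 2.0,
    "S2_no_sensitive_leak": 1.5,
}


def weighted_score(passed_ids):
    """Percent of total rubric weight earned by the passed checker IDs."""
    total = sum(WEIGHTS.values())
    earned = sum(w for cid, w in WEIGHTS.items() if cid in passed_ids)
    return 100.0 * earned / total


# Hypothetical run that passes only the three structural checks.
example = weighted_score(
    {"S0_minutes_exists", "S0_no_fabrication", "S2_final_exists"}
)
print(f"{example:.1f}%")
```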

task_progress.py

"""Product review meeting minutes extraction and risk tracking — multi-stage task.

Environments: filesystem, email, notion, google_sheets, calendar
3 stages: audio+whiteboard extraction → clarifications+silent updates → final distribution
14 core checkers (0 keyword-search)
"""
import csv
import re
from datetime import datetime
from io import StringIO

# ── Constants ─────────────────────────────────────────────────────

DECISION_DB_NAME = "product_decision_log_2025"

DECISION_DB_SCHEMA = {
    "Decision ID": {"title": {}},
    "Date": {"rich_text": {}},
    "Topic": {"rich_text": {}},
    "Decision": {"rich_text": {}},
    "Owner": {"rich_text": {}},
    "Status": {"select": {"options": [
        {"name": "confirmed"}, {"name": "pending"},
        {"name": "superseded"}, {"name": "blocked"},
    ]}},
    "Notes": {"rich_text": {}},
}

# Historical entries for formatting reference (seeded into Notion)
HISTORICAL_DECISIONS = [
    {
        "id": "DEC-0228-01",
        "date": "2025-02-28",
        "topic": "Q1 Release Freeze",
        "decision": "Code freeze starts March 5; only P0 fixes after that",
        "owner": "Wei Zhang",
        "status": "confirmed",
        "notes": "Approved in Feb 28 review",
    },
    {
        "id": "DEC-0228-02",
        "date": "2025-02-28",
        "topic": "Mobile App Beta",
        "decision": "Beta testing group expanded to 500 users",
        "owner": "Lily Li",
        "status": "confirmed",
        "notes": "QA lead confirmed test coverage",
    },
    {
        "id": "DEC-0307-01",
        "date": "2025-03-07",
        "topic": "Dashboard Redesign",
        "decision": "Design team to deliver mockups by March 14",
        "owner": "Linda Zhao",
        "status": "confirmed",
        "notes": "Aligned with Q2 launch timeline",
    },
]

ITER_SHEET_NAME = "q2_iteration_schedule"
OWNER_SHEET_NAME = "owner_mapping"

ITER_HEADER = ["sprint", "start_date", "end_date", "goal", "owner", "status"]
ITER_SEED_ROWS = [
    ["Sprint 7", "2025-03-17", "2025-03-28", "Payment optimization + user center redesign", "Wei Zhang", "planned"],
    ["Sprint 8", "2025-03-31", "2025-04-11", "Mobile app launch prep + dashboard v2", "Lily Li", "planned"],
]

OWNER_HEADER = ["person", "domain", "email"]
OWNER_SEED_ROWS = [
    ["Wei Zhang", "Backend and payments", "[email protected]"],
    ["Lily Li", "Product", "[email protected]"],
    ["Chen", "QA", "[email protected]"],
    ["Linda Zhao", "Design", "[email protected]"],
    ["Wang Qiang", "Frontend", ""],
]

CALENDAR_NAME = "product_milestones"

_VALID_STATUSES = {
    "confirmed", "pending_confirmation", "risk",
    "open", "resolved", "blocked",
}

_ASSET_MD_NAMES = {"AGENTS.md", "IDENTITY.md", "SOUL.md", "TOOLS.md", "USER.md"}

# ── Helpers ───────────────────────────────────────────────────────


def _notion_title(value: str) -> dict:
    return {"title": [{"text": {"content": value}}]}


def _notion_text(value: str) -> dict:
    return {"rich_text": [{"text": {"content": value}}]}


def _notion_select(value: str) -> dict:
    return {"select": {"name": value}}


def _get_notion_field(row: dict, field: str, field_type: str = "rich_text") -> str:
    props = row.get("properties", {})
    prop = props.get(field, {})
    if field_type == "title":
        parts = prop.get("title", [])
        return "".join(t.get("plain_text", "") for t in parts)
    elif field_type == "rich_text":
        parts = prop.get("rich_text", [])
        return "".join(t.get("plain_text", "") for t in parts)
    elif field_type == "select":
        sel = prop.get("select", {})
        return sel.get("name", "") if sel else ""
    return ""


def _read_csv(ctx, filename: str) -> list[dict]:
    """Read a CSV from workspace/outputs/ or workspace root."""
    for subdir in ["outputs", ""]:
        path = ctx.workspace / subdir / filename if subdir else ctx.workspace / filename
        if path.exists():
            text = path.read_text(encoding="utf-8-sig")
            return list(csv.DictReader(StringIO(text)))
    return []


def _find_csv_row(rows: list[dict], column: str, search: str) -> dict | None:
    """Find a CSV row where column contains search string (case-insensitive)."""
    for row in rows:
        val = row.get(column, "")
        if search.lower() in val.lower():
            return row
    return None


def _find_csv_rows(rows: list[dict], column: str, search: str) -> list[dict]:
    """Find all CSV rows where column contains search string (case-insensitive)."""
    return [
        row for row in rows
        if search.lower() in row.get(column, "").lower()
    ]


# ── METADATA ──────────────────────────────────────────────────────

METADATA = {
    "id": "executive_assistant_task4",
    "name": "Product Review Meeting Minutes And Risk Tracking",
    "category": "executive_assistant",
    "environments": ["filesystem", "email", "notion", "google_sheets", "calendar"],
    "timeout_seconds": 600,
    "difficulty": "hard",
    "mm_level": "L4",
    "role": "VP Liu's executive assistant for product review minutes",
    "tags": [
        "meeting-minutes", "whiteboard", "audio", "cross-verification",
        "risk-tracking", "multimodal", "contradiction-detection",
    ],
    "env_config": {
        "email": {
            "users": {
                "liu_vp": {"email": "[email protected]", "password": "liu_vp_pwd"},
                "pm": {"email": "[email protected]", "password": "pm_pwd"},
                "dev_lead": {"email": "[email protected]", "password": "dev_lead_pwd"},
                "qa": {"email": "[email protected]", "password": "qa_pwd"},
                "design": {"email": "[email protected]", "password": "design_pwd"},
            },
        },
        "google_sheets": {
            "task_id": "executive_assistant_task4",
        },
    },
}

PROMPT = (
    "VP Liu sent you today's product review recording and whiteboard photos. "
    "Check your email and the input/ folder, then prepare the meeting minutes. "
    "All your outputs must be in English."
)


# ── Stage Functions ───────────────────────────────────────────────

async def stage0(ctx):
    """2025-03-14 16:00 Friday: Audio review, whiteboard extraction, screenshot cross-checking."""
    # 1. Upload assets (personality .md files + initial input materials)
    await ctx.fs.upload_dir(ctx.task_dir / "assets", "/workspace")

    # 2. Create Notion decision log database + seed historical entries
    await ctx.notion.create_page("Product Decision Log 2025")
    await ctx.notion.create_database(DECISION_DB_NAME, DECISION_DB_SCHEMA)
    for rec in HISTORICAL_DECISIONS:
        await ctx.notion.add_database_row(DECISION_DB_NAME, {
            "Decision ID": _notion_title(rec["id"]),
            "Date": _notion_text(rec["date"]),
            "Topic": _notion_text(rec["topic"]),
            "Decision": _notion_text(rec["decision"]),
            "Owner": _notion_text(rec["owner"]),
            "Status": _notion_select(rec["status"]),
            "Notes": _notion_text(rec["notes"]),
        })

    # 3. Create Google Sheet: q2_iteration_schedule
    iter_info = await ctx.google_sheets.create_spreadsheet(ITER_SHEET_NAME)
    iter_id = iter_info["sheet_id"]
    await ctx.google_sheets.update_values(
        iter_id, "Sheet1!A1:F3",
        [ITER_HEADER] + ITER_SEED_ROWS,
    )

    # 4. Create Google Sheet: owner_mapping
    owner_info = await ctx.google_sheets.create_spreadsheet(OWNER_SHEET_NAME)
    owner_id = owner_info["sheet_id"]
    await ctx.google_sheets.update_values(
        owner_id, "Sheet1!A1:C6",
        [OWNER_HEADER] + OWNER_SEED_ROWS,
    )

    # 5. Create Calendar with milestone events
    await ctx.calendar.create_calendar(CALENDAR_NAME)
    await ctx.calendar.add_event(
        CALENDAR_NAME,
        "Product Review Meeting",
        datetime(2025, 3, 14, 14, 0),
        datetime(2025, 3, 14, 15, 30),
        description="Weekly product review with full team",
    )
    await ctx.calendar.add_event(
        CALENDAR_NAME,
        "Sprint 7 End",
        datetime(2025, 3, 28, 0, 0),
        datetime(2025, 3, 28, 23, 59),
    )
    await ctx.calendar.add_event(
        CALENDAR_NAME,
        "Q2 Midterm Review",
        datetime(2025, 4, 2, 9, 0),
        datetime(2025, 4, 2, 17, 0),
        description="Q2 midterm progress review",
        uid="q2-midterm-review",
    )

    # 6. Seed emails: Wei Zhang's two contradictory deadline emails
    await ctx.email.send_email(
        from_user="dev_lead",
        to="[email protected]",
        subject="Deadline confirmation",
        body=(
            "I remember the payment optimization deadline as Friday. "
            "Please confirm it against the meeting recording."
        ),
    )
    await ctx.email.send_email(
        from_user="dev_lead",
        to="[email protected]",
        subject="Correction: deadline update",
        body=(
            "I previously said the deadline was Friday. "
            "Actually, the deadline is Wednesday. Correcting my earlier email."
        ),
    )

    # 7. Notification — VP Liu's direct instruction
    return {
        "notification": (
            "[2025-03-14 Friday 16:00] "
            "VP Liu sent you the product review recording, whiteboard photos, "
            "projected screenshots, and the review slides. "
            "Please draft the meeting minutes with decisions, action items, "
            "and owners with due dates. Check your email first — "
            "Wei Zhang sent messages about the payment deadline.\n\n"
            "Your email is [email protected]. "
            "Contacts: [email protected] (Lily Li), [email protected] (Wei Zhang), "
            "[email protected] (Chen), [email protected] (Linda Zhao).\n"
            "Decision log is in Notion (database: product_decision_log_2025). "
            "Iteration schedule is in Google Sheets (q2_iteration_schedule). "
            "Owner mapping is in Google Sheets (owner_mapping). "
            "Milestone calendar is available (product_milestones)."
        ),
        "time": "2025-03-14T16:00:00+08:00",
    }


async def stage1(ctx):
    """2025-03-15 Saturday: Clarifications and silent system updates."""
    # 1. Loud: Wei Zhang clarification email
    await ctx.email.send_email(
        from_user="dev_lead",
        to="[email protected]",
        subject="Clarification on the payment-plan deadline",
        body=(
            "The deadline is Friday. I misspoke once during the meeting. "
            "Wednesday is for the internal draft, and Friday is the version "
            "for product review."
        ),
    )

    # 2. Silent: Chen emails about test environment recovery
    await ctx.email.send_email(
        from_user="qa",
        to="[email protected]",
        subject="Test environment recovered",
        body=(
            "The test environment has recovered. "
            "The issue was caused by an overwritten configuration file."
        ),
    )

    # 3. Silent: Lily Li adds staged rollout requirement to Notion
    await ctx.notion.add_database_row(DECISION_DB_NAME, {
        "Decision ID": _notion_title("DEC-0314-ROLLOUT"),
        "Date": _notion_text("2025-03-14"),
        "Topic": _notion_text("Payment Optimization"),
        "Decision": _notion_text(
            "The payment optimization technical plan must include a staged rollout plan."
        ),
        "Owner": _notion_text("Wei Zhang"),
        "Status": _notion_select("confirmed"),
        "Notes": _notion_text("Added by Lily Li post-meeting"),
    })

    # 4. Silent: Update owner_mapping — Wang Qiang domain change
    owner_id = await ctx.google_sheets.get_spreadsheet_id(OWNER_SHEET_NAME)
    if owner_id:
        await ctx.google_sheets.update_values(
            owner_id, "Sheet1!B6", [["Frontend + Mini Program"]],
        )

    # 5. Notification — only mentions loud events
    return {
        "notification": (
            "[2025-03-15 Saturday] You have new emails. "
            "VP Liu says: I did not approve Chen's suggestion to launch first. "
            "Make the minutes explicit — no launch while P0 issues remain open."
        ),
        "time": "2025-03-15T10:00:00+08:00",
    }


async def stage2(ctx):
    """2025-03-17 Monday: Final distribution."""
    # 1. Loud: Linda Zhao emails about updated design screenshot
    await ctx.email.send_email(
        from_user="design",
        to="[email protected]",
        subject="Updated design screenshot",
        body=(
            "The user-center screenshot shown during the meeting was the old version. "
            "I updated the new version on Friday night. "
            "The latest file is attached: design_v2.jpg."
        ),
    )

    # 2. Loud: Upload design_v2.jpg
    await ctx.fs.upload_file(
        ctx.task_dir / "inject" / "stage2" / "design_v2.jpg",
        "/workspace/input/",
    )

    # 3. Silent: Calendar — move Q2 Midterm Review from 4/2 to 3/31
    events = await ctx.calendar.find_events(CALENDAR_NAME, "Q2 Midterm Review")
    for ev in events:
        await ctx.calendar.delete_event(CALENDAR_NAME, ev["uid"])
    await ctx.calendar.add_event(
        CALENDAR_NAME,
        "Q2 Midterm Review",
        datetime(2025, 3, 31, 9, 0),
        datetime(2025, 3, 31, 17, 0),
        description="Q2 midterm progress review — moved from April 2",
        uid="q2-midterm-review-updated",
    )

    # 4. Notification — mentions loud events + VP Liu instruction
    return {
        "notification": (
            "[2025-03-17 Monday] You have new emails and instructions from VP Liu. "
            "VP Liu says: Are the minutes ready? Send them out today."
        ),
        "time": "2025-03-17T09:00:00+08:00",
    }
