ctikg-sol-phase1

Documentation index

This repo is a topic-focused CTI article acquisition and export front-end for CTIKG / LLM4CTI-style downstream workflows.

It is built to:

It is not:

Start here

  1. README.md
    • install, environment variables, and top-level usage
  2. docs/OPEN_TOPIC_QUICKSTART.md
    • recommended per-run open-topic workflow
  3. docs/SOL_RUNBOOK.md
    • canonical SOL operating pattern
  4. docs/OUTPUTS_CONTRACT.md
    • what outputs mean, which are primary, and what counts as a successful run
  5. docs/PIPELINE.md
    • stage-by-stage behavior and artifact layout
  6. docs/LLM4CTI_NOTEBOOK_BRIDGE.md
    • primary downstream handoff for the current notebook workflow
  7. docs/LLM4CTI_COMPAT_TEST.md
    • downstream compatibility proof
    • not the official final packaged downstream pipeline
  8. docs/TROUBLESHOOTING.md
    • common failure modes and recovery steps

Local or single-run

Run:

make open-topic TOPIC="..." PROVIDER=... MODEL=... SCRAPE_MAX=...

Then create notebook handoff artifacts explicitly:

python scripts/export_llm4cti_articles.py --run-dir runs/<SAFE_TOPIC>/<RUN_ID>

SOL

Use the staged pattern in docs/SOL_RUNBOOK.md:

Output expectations

Immediate per-run outputs:

Manual post-run notebook handoff:

Legacy docs

docs/QUICKSTART.md documents the older shared-path workflow under data/, results/, and exports/.

Prefer the open-topic per-run workflow unless you specifically need the legacy path.