# all of this runs without one outbound network call # 1. -weight: 500;">install (single binary; can be staged from an internal mirror)
ingero check # local capability sanity check # 2. capture (writes to a local SQLite DB)
ingero trace --duration 5m --out /var/lib/ingero/run.db # 3. query in place
ingero query /var/lib/ingero/run.db \ "SELECT * FROM cuda_events WHERE duration_ns > 1000000 LIMIT 20" # 4. (optional) pull DB through an approved transfer channel for offline review
sha256sum /var/lib/ingero/run.db
# all of this runs without one outbound network call # 1. -weight: 500;">install (single binary; can be staged from an internal mirror)
ingero check # local capability sanity check # 2. capture (writes to a local SQLite DB)
ingero trace --duration 5m --out /var/lib/ingero/run.db # 3. query in place
ingero query /var/lib/ingero/run.db \ "SELECT * FROM cuda_events WHERE duration_ns > 1000000 LIMIT 20" # 4. (optional) pull DB through an approved transfer channel for offline review
sha256sum /var/lib/ingero/run.db
# all of this runs without one outbound network call # 1. -weight: 500;">install (single binary; can be staged from an internal mirror)
ingero check # local capability sanity check # 2. capture (writes to a local SQLite DB)
ingero trace --duration 5m --out /var/lib/ingero/run.db # 3. query in place
ingero query /var/lib/ingero/run.db \ "SELECT * FROM cuda_events WHERE duration_ns > 1000000 LIMIT 20" # 4. (optional) pull DB through an approved transfer channel for offline review
sha256sum /var/lib/ingero/run.db - one kernel, zero sidecars – why a single host-side binary fits this constraint better than per-pod agents.
- counting privileged processes on a real GPU host – audit-side companion: how many host-level agents are actually running.
- read-only kernel telemetry as MCP tools – how the same local DB is queryable by an internal AI assistant.