
Istota

An octopus-shaped, self-hosted AI agent that lives in your Nextcloud. (istota.xyz)

Quick start

Bare metal/VM is the canonical deployment; it requires an existing Nextcloud installation reachable over your network or the internet.

# Bare metal (Debian/Ubuntu VM, connects to your existing Nextcloud) — recommended
curl -fsSL https://raw.githubusercontent.com/istota-project/istota/main/install.sh | sudo bash

# Docker (bundles its own Nextcloud)
curl -fsSL https://raw.githubusercontent.com/istota-project/istota/main/install.sh | bash -s -- --docker

Both run the same interactive wizard. --help lists flags; the dispatcher forwards everything except --docker to the chosen path. At least glance at install.sh (or docker/init.sh) before you pipe it into a shell.

Bare metal/VM install

Requirements: a Nextcloud instance, a Debian/Ubuntu VM, and a Claude Code OAuth token.

curl -fsSL https://raw.githubusercontent.com/istota-project/istota/main/install.sh | sudo bash

The installer walks you through connecting to Nextcloud, setting up users, and choosing optional features. After installation:

sudo -u istota HOME=/srv/app/istota claude login

To update: sudo bash install.sh --update. An Ansible role is also available at deploy/ansible/.

Docker install

The Docker setup spins up a complete stack from scratch: Postgres, Redis, a fresh Nextcloud instance, the Istota scheduler, the SvelteKit web UI, an nginx reverse proxy on a single host port (Nextcloud at /, the Istota dashboard at /istota/), and — on x86-64 Linux hosts and Apple Silicon under Rosetta — a Playwright browser container with bot-detection countermeasures that the browse skill drives. The GPS webhook receiver is opt-in. If you already have a Nextcloud instance, use the bare-metal path instead — Compose creates its own Nextcloud and is meant for evaluation or standalone deployments, not for connecting to an existing one.

1. Configure

curl -fsSL https://raw.githubusercontent.com/istota-project/istota/main/install.sh | bash -s -- --docker
# or, from a clone: bash docker/init.sh

The curl path clones the repo to ~/istota (override with ISTOTA_CLONE_DIR=...) and runs the wizard from there. When invoked via sudo, ~ resolves to the invoking user's home (via SUDO_USER), not /root — so sudo curl … | bash -s -- --docker still lands in your own home directory. Keep this directory around — it holds your .env and is where you'll run docker compose commands from later. To update: cd ~/istota && git pull && docker compose -f docker/docker-compose.yml up -d --build. Volumes (istota_data, nextcloud_data, postgres, etc.) survive rebuilds.

init.sh is a guided wizard that mirrors the bare-metal install flow. It auto-generates passwords for the Nextcloud admin, Postgres, the bot account, and your user; auto-detects your timezone; and walks through the same optional-feature prompts you'd see on a bare-metal install:

  • Identity — bot name, public hostname (DOMAIN — leave empty for localhost-only).
  • Claude auth — paste a token from claude setup-token (instructions printed inline), or skip and set ANTHROPIC_API_KEY later.
  • Primary user — login id, display name, timezone, optional email address.
  • Email integration — IMAP/SMTP host, user, password (skipped if you say no).
  • GPS location tracking — toggles the location compose profile so the Overland webhook receiver starts.
  • Developer credentials — optional GitLab / GitHub personal access tokens.
  • Browser container — auto-enabled on x86-64 Linux hosts and Apple Silicon under Rosetta (slow, preview-grade); disabled on Linux ARM, where Chrome has no native packages and qemu emulation is unreliable.

The script writes docker/.env (mode 600) and prints the generated passwords once — copy them somewhere safe. Flags: --minimal skips the optional-feature sections (passwords + Claude + user only), --force overwrites an existing .env without asking.

If you'd rather configure by hand, copy .env.example to .env and edit it directly. At minimum set CLAUDE_CODE_OAUTH_TOKEN, ADMIN_PASSWORD, POSTGRES_PASSWORD, BOT_PASSWORD, USER_PASSWORD, and USER_NAME.

2. Start

docker compose up -d

First start takes a few minutes: Nextcloud initializes the database, creates user accounts, installs apps (Talk, Calendar, External Storage), sets up shared folders, registers an OAuth2 client for the web dashboard, and creates a Talk room between you and the bot. The Istota container waits for all of this before starting the scheduler. On first boot it also generates a LOCATION_INGEST_TOKEN and an ISTOTA_SECRET_KEY (the master key for the encrypted secrets table) and persists them under /data/.

3. Chat

Open http://localhost:8080, log in with your USER_NAME / USER_PASSWORD, go to Talk, and start chatting. The bot responds through the same Talk interface. The web dashboard lives at http://localhost:8080/istota/ (Nextcloud OAuth2 — same login).

Optional services

The browser container (Playwright with bot-detection countermeasures) and the GPS webhook receiver run as Docker Compose profiles. init.sh sets COMPOSE_PROFILES=browser by default on x86-64 Linux and Apple Silicon (with a 5 GB memory bump for Rosetta-emulated Chromium); on Linux ARM Chrome has no native packages so it stays off. GPS is off by default. Edit COMPOSE_PROFILES in .env — comma-separated list — or pass --profile flags ad-hoc:

docker compose --profile browser up -d              # Browser only
docker compose --profile location up -d             # GPS webhook receiver only
docker compose --profile browser --profile location up -d  # Both

Configuration

The .env file exposes most of the same settings available in the Ansible role: scheduler intervals, conversation context tuning, progress updates, sleep cycle, memory search, email (IMAP/SMTP), developer skill (git/gitlab/github), and per-user overrides. See .env.example for the full list with defaults. Per-user features like ntfy push notifications, Karakeep bookmarks, Google Workspace, and Monarch are configured per-user in the web settings (/istota/settings) — there are no operator-shared blocks for those.

The config file at /data/config/config.toml inside the container is generated on first start and not overwritten on subsequent starts. To change settings after initial setup, either delete the config and restart (it regenerates from env vars), or edit it directly:

docker compose exec istota vi /data/config/config.toml
docker compose restart istota

Differences from bare metal

The Docker deployment differs from a bare metal / Ansible installation in a few ways:

  • No network proxy. The CONNECT-based network proxy (domain allowlist) is disabled — Docker's own network isolation serves the same purpose. Bubblewrap filesystem sandboxing and the skill credential proxy are enabled and work inside the container. Bubblewrap requires kernel support for user namespaces; in containers without CAP_SYS_ADMIN it cannot create namespaces and the sandbox is unavailable. Add --cap-add SYS_ADMIN to the istota service if you see SECURITY UNSUPPORTED CONFIGURATION at startup — Linux + bubblewrap is the only supported configuration (see docs/deployment/security.md).
  • Single user. The Docker setup provisions one human user. Additional users can be added by editing config.toml directly and creating them in Nextcloud.
  • Bundled Nextcloud. The Compose file creates a new Nextcloud instance. If you already run Nextcloud, use the bare metal installer or Ansible role instead — they connect to your existing instance without creating a second one.
  • No backups or auto-update. The Ansible role sets up cron-based DB backups and optional auto-update. In Docker, volume backups are your responsibility.
  • All Python extras installed. The Docker image includes every optional dependency (whisper, memory-search, etc.) so all skills are available without rebuilding.

How it works

Talk message ──>┐
Email ─────────>├──> SQLite queue -> Scheduler -> Claude Code -> Response
TASKS.md ──────>│
CLI ───────────>┘

Messages arrive through Talk polling, IMAP, TASKS.md file watching, or the CLI. The scheduler claims tasks from a SQLite queue, builds a prompt with the user's resources, skills, memory, and conversation context, then invokes Claude Code in a sandbox. Responses go back through the same channel.

Per-user worker threads handle concurrency. Foreground tasks (chat) and background tasks (scheduled jobs, briefings) run on separate pools so a long-running job never blocks a conversation.
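
The claim-and-route loop above can be sketched in a few lines. The queue schema, pool names, and route() helper here are illustrative, not Istota's actual implementation:

```python
import sqlite3
from concurrent.futures import ThreadPoolExecutor

# Illustrative queue schema; Istota's real schema differs.
conn = sqlite3.connect(":memory:", check_same_thread=False)
conn.execute("CREATE TABLE tasks (id INTEGER PRIMARY KEY, user TEXT, "
             "kind TEXT, status TEXT DEFAULT 'queued')")
conn.execute("INSERT INTO tasks (user, kind) "
             "VALUES ('alice', 'chat'), ('alice', 'briefing')")

# Separate pools: foreground chat never waits behind background jobs.
pools = {"foreground": ThreadPoolExecutor(max_workers=2),
         "background": ThreadPoolExecutor(max_workers=2)}

def route(kind):
    return "foreground" if kind == "chat" else "background"

def claim_next():
    """Claim the oldest queued task (a real queue would do this atomically)."""
    row = conn.execute("SELECT id, user, kind FROM tasks "
                       "WHERE status = 'queued' ORDER BY id LIMIT 1").fetchone()
    if row:
        conn.execute("UPDATE tasks SET status = 'claimed' WHERE id = ?", (row[0],))
    return row

task = claim_next()  # (1, 'alice', 'chat')
```

A production queue would claim atomically (e.g. a single UPDATE ... RETURNING statement) so two workers can never grab the same task.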

Features

Messaging — Nextcloud Talk (DMs and multi-user rooms with @mention support), email (IMAP/SMTP with threading), TASKS.md file polling, CLI.

Skills — Two-pass selection: deterministic keyword/resource matching, then optional Haiku-based semantic routing for the unselected manifest. Sticky skills carry the previous turn's set into follow-ups in the same conversation. Ships with: Nextcloud file management, CalDAV calendar, email, web browsing (Dockerized Playwright with bot-detection countermeasures), git/gitlab/github workflows, money (in-process beancount ledger, invoicing, transactions, work log), Google Workspace (Drive, Gmail, Calendar, Sheets, Docs), GPS location tracking (Overland), Karakeep bookmarks, voice transcription (faster-whisper), OCR (Tesseract), native RSS/Atom/Tumblr/Are.na feed manager (in-process — feedparser + vendored API providers, per-user SQLite, OPML import/export), and more. Skills are a curated standard library, not a plugin marketplace.
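
The first (deterministic) pass of skill selection can be sketched as follows; the manifest shape and the keywords are hypothetical, not Istota's actual manifests:

```python
# Hypothetical skill manifests keyed by trigger keywords.
SKILLS = {
    "calendar": {"keywords": {"meeting", "schedule", "event"}},
    "email":    {"keywords": {"email", "inbox", "reply"}},
    "browse":   {"keywords": {"website", "url", "browse"}},
}

def select_skills(message, sticky=frozenset()):
    """Pass 1: deterministic keyword matching. Sticky skills from the
    previous turn in the same conversation carry over into follow-ups."""
    words = set(message.lower().split())
    matched = {name for name, m in SKILLS.items() if m["keywords"] & words}
    return matched | set(sticky)

select_skills("schedule a meeting")            # -> {"calendar"}
select_skills("thanks!", sticky={"calendar"})  # -> {"calendar"}
```

The second pass, semantic routing via Haiku over the unselected manifest, would run only when this deterministic pass is inconclusive.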

Memory — Per-user persistent memory (USER.md, auto-loaded into prompts), per-channel memory (CHANNEL.md), dated memory files from nightly extraction, and BM25 auto-recall. Temporal knowledge graph stores structured facts as entity-relationship triples with validity windows — freeform predicates, automatic supersession for single-valued relations, fuzzy dedup. Configurable memory cap to limit total prompt size. Hybrid BM25 + vector search (sqlite-vec, MiniLM) across conversations and memory files.

Scheduling — Cron jobs via CRON.md (AI prompts, prompt files, or shell commands), per-job model and effort overrides for cheap retrieve-and-render runs, natural-language reminders as one-shot cron entries, scheduled briefings with calendar/markets/headlines/news/todos components.

Briefings — Configurable morning/evening summaries. Components include calendar events, market data (futures, indices via yfinance + FinViz), headlines (pre-fetched frontpages from AP, Reuters, Guardian, FT, Al Jazeera, Le Monde, Der Spiegel), email newsletter digests, todos, and reminders. Output to Talk, email, or both.

Heartbeat monitoring — User-defined health checks: file age, shell commands, URL health, calendar conflicts, task deadlines, and system self-checks. Cooldowns, quiet hours, and per-check intervals.
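
One of these check kinds, file age, can be sketched as follows; the function name and result shape are hypothetical:

```python
import os
import time

def check_file_age(path, max_age_seconds):
    """Fail if the file is missing or hasn't been modified recently enough."""
    if not os.path.exists(path):
        return ("fail", f"{path} missing")
    age = time.time() - os.path.getmtime(path)
    if age > max_age_seconds:
        return ("fail", f"{path} stale ({age:.0f}s old)")
    return ("ok", path)
```

A typical use is catching a nightly backup that silently stopped running: check that the dump file is never more than ~26 hours old.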

Multi-user — Per-user config files, resource permissions, worker pools, and filesystem sandboxing. Admin/non-admin isolation. Each user gets their own Nextcloud workspace with config files, exports, and memory. Multiple bot instances can coexist on the same Nextcloud, each running as its own Nextcloud user with a separate namespace, and they can interact with each other through Talk rooms like any other participant.

Security — Bubblewrap sandbox per invocation (PID namespace, restricted mounts, credential isolation). Non-admin users can't see the database, other users' files, or system config. Deferred DB writes via JSON files for sandboxed operations. Credential stripping from subprocess environments.

Email routing — Plus-addressed inbound (bot+user_id@domain) so external contacts can email a specific user's agent directly. Outbound emails are tracked so external replies thread back to the originating Talk conversation, where the bot drafts a response and asks the user to confirm before sending. Untrusted senders to plus-addresses go through a deterministic confirmation gate before any task runs.
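
Parsing the plus-addressed recipient can be sketched like this; the helper name and example domain are placeholders:

```python
def route_inbound(to_addr, bot_local="bot", domain="example.com"):
    """Map bot+user_id@domain to the target user id, else None."""
    local, _, dom = to_addr.lower().partition("@")
    if dom != domain:
        return None
    base, _, tag = local.partition("+")
    if base != bot_local:
        return None
    return tag or None  # user id, or None for the bare bot address

route_inbound("bot+alice@example.com")  # -> "alice"
route_inbound("bot@example.com")        # -> None
```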

Web interface — Authenticated SvelteKit dashboard at /istota (Nextcloud OIDC). It includes:

  • Feed reader — viewport-based read tracking, infinite scroll, lightbox, per-entry starring with a "Starred" sidebar view, scope-aware bulk mark-as-read (Shift-A or toolbar button), and f to toggle star; backed by the in-tree istota.feeds module, with a sprocket-icon settings page for subscriptions, categories, and OPML import/export.
  • Location — GPS location/places page with map, cluster discovery, dismiss-zones, and per-place visit stats.
  • Money — pages backed by the same in-process accounting code the money skill uses.
  • Settings — the user's /settings page exposes Profile, Resources, Briefings, and a "Disabled modules" multiselect. Per-module credentials live on a cog-icon page for each module (Tumblr API key on /feeds/settings, Monarch credentials on /money/settings, Overland ingest token on /location/settings); cross-cutting Connected services (Karakeep, Google Workspace) live on /settings.

Pluggable model backend — A Brain protocol ([brain] kind = "claude_code") sits between the executor and the model invocation. Phase 1 wraps the claude CLI with stream-json parsing and transient-API retries; future brains (anthropic, openrouter) drop in without touching the executor.

Constitution — An Emissaries layer defines how the agent reasons about data, handles the boundary between private and public action, and what it owes to people beyond its operator. Per-user persona customization sits on top.

Why Nextcloud?

Most AI assistant projects treat infrastructure as someone else's problem. They connect to third-party APIs for storage, calendars, contacts, and messaging, accumulating credentials and vendor dependencies. Istota takes a different approach: it lives inside a Nextcloud instance as a regular user.

The bot gets files, calendars, contacts, Talk messaging, and sharing through the same protocols every other Nextcloud user uses. File sharing works by sharing a folder with the bot's user account. Calendar access works through standard CalDAV. Talk conversations work through the regular user API. No webhooks, no OAuth apps, no server plugins.

In practice this means:

  • Zero Nextcloud configuration. Create a user account, invite it to a chat. No admin panel changes, no app installation, no API tokens on the Nextcloud side.
  • File sharing is native. Users share files with the bot the same way they share with colleagues. The bot shares files back the same way. Permissions, links, and access control are handled by Nextcloud.
  • Multi-user comes free. Nextcloud already handles user isolation, file ownership, and access control. Istota inherits all of it rather than reimplementing it.
  • Self-hosted end to end. Your data stays on your Nextcloud server and the VM running Istota. No external services required beyond the Claude API.
  • User self-service. Config files (persona, briefings, cron jobs, heartbeat checks) live in the user's shared Nextcloud folder. Users edit them with any text editor or the Nextcloud web UI, no CLI access needed.

Istota is built around Nextcloud — it uses your files, calendars, contacts, and chat directly rather than wrapping them in API adapters. This tight integration is by design: your assistant lives where your data already is. That said, like any agent, it integrates with outside services where useful — a Google Workspace skill (Drive, Gmail, Calendar, Sheets, Docs) ships in the box, and skills for Microsoft 365 or other services can be added the same way.

User workspace

Each user gets a Nextcloud workspace:

/Users/alice/
├── istota/              # Shared with the user
│   ├── config/
│   │   ├── USER.md          # Persistent memory
│   │   ├── PERSONA.md       # Personality customization
│   │   ├── TASKS.md         # File-based task queue
│   │   ├── BRIEFINGS.md     # Briefing schedule
│   │   ├── CRON.md          # Scheduled jobs
│   │   └── HEARTBEAT.md     # Health monitoring config
│   ├── exports/         # Bot-generated files
│   ├── scripts/         # User-authored reusable scripts
│   └── examples/        # Reference documentation
├── inbox/               # Drop files here for the bot to process
├── memories/            # Nightly-extracted dated memories (YYYY-MM-DD.md)
└── shared/              # Auto-organized files shared with the bot

Channels (Talk rooms) get their own /Channels/{token}/ namespace with CHANNEL.md and dated channel memories.

Development

uv sync --extra all                        # Install all dependencies
uv run pytest tests/ -v                    # Run tests (~3450 unit tests)
uv run pytest -m integration -v            # Integration tests (needs live config)
uv run istota task "hello" -u alice -x     # Test execution

Most skill dependencies are optional. Install everything with --extra all, or pick individual groups:

uv sync --extra calendar         # caldav + icalendar
uv sync --extra email            # imap-tools
uv sync --extra markets          # yfinance
uv sync --extra transcribe       # pytesseract + Pillow (OCR)
uv sync --extra memory-search    # sqlite-vec + sentence-transformers for semantic search
uv sync --extra whisper          # faster-whisper for audio transcription
uv sync --extra location         # fastapi + uvicorn + geopy for GPS location receiver

Skills with missing dependencies are automatically excluded from prompt selection. Use !skills in Talk to see which are available.
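
The dependency gate can be sketched with importlib; the skill-to-module mapping below is illustrative and loosely follows the extras listed above:

```python
import importlib.util

# Hypothetical mapping from skill name to required importable modules.
SKILL_DEPS = {
    "calendar":   ["caldav", "icalendar"],
    "markets":    ["yfinance"],
    "transcribe": ["pytesseract", "PIL"],
}

def available_skills():
    """A skill is available only if every one of its modules can be found."""
    return {
        skill for skill, mods in SKILL_DEPS.items()
        if all(importlib.util.find_spec(m) is not None for m in mods)
    }
```

find_spec probes for a module without importing it, so the check stays cheap even for heavyweight dependencies like sentence-transformers.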

Further reading

License

MIT


© 2026 Stefan Kubicki • A CYNIUM Lamplight Release