Hermes — Overview

[hermes reference]
updated: 2026-06-25

Omair's always-on personal AI agent on the Oracle VPS. One-pager. Full detail → Hermes — Reference.

What it is

A self-hosted AI assistant you talk to over WhatsApp self-chat (text or voice, in and out), running 24/7 on oraclevps — with a web dashboard, scheduled automations, Google/ClickUp/web tools, an Obsidian knowledge vault, full self-monitoring, and a self-improvement loop. Remembers you via mem0.

What it can do

  • Chat over WhatsApp — text or voice in; replies in text or voice (Gemini TTS, voice Kore).
  • Capture via note:/todo:/idea:/... or a voice note → saved to your Obsidian Inbox.md (voice transcribed: Groq → Deepgram → Gemini → local Whisper).
  • Morning brief (11 AM PKT) — weather, prayer, calendar, Google + ClickUp tasks, news → WhatsApp + email + Discord.
  • Dedicated Discord feeds — #briefings, #alerts, #status, #news (tech + AI, 2×/day), #audit (every action), #seggy-said / #yasar-ahmad (creator updates), #trg-watch, #health.
  • Self-monitoring — watchdogs probe every dependency every 5 min (services, LLMs, disk, dashboard uptime) and push to your phone (ntfy) + Discord. Healthchecks.io catches total outages.
  • Self-improvement (Dojo) — daily 6AM cycle that analyzes session history, finds weaknesses, and patches its own skills automatically.
  • Langfuse observability — every LLM call traced (tokens, cost, latency, tool invocations) → cloud.langfuse.com.
  • Creator books → Obsidian — auto-follows Seggy Said & Yasar Ahmad, turning their clips into a categorized, tappable reading library (synced to phone + published at vault.omair.pages.dev).
  • Session logging — every interactive session auto-logged to Obsidian with stats, tools used, key exchanges, and wikilinks.
  • Google + ClickUp + web — Calendar, Gmail, Drive, Docs, Sheets, Slides, Tasks, Contacts; ClickUp; Tavily search, Context7 docs, Composio (1,000+ SaaS apps).
  • Syncthing + Git backup — Obsidian vault synced to phone in real-time and committed to GitHub hourly for time-machine history.
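
The capture bullet above can be sketched as a small prefix router. This is a minimal illustration under stated assumptions, not the actual Capture plugin: the function names parse_capture and append_to_inbox and the inbox line format are hypothetical, and only the three prefixes named in this overview are included (the rest are elided in the source).

```python
from datetime import datetime
from pathlib import Path
from typing import Optional, Tuple

# Prefixes named in the overview; the source elides the rest ("...").
CAPTURE_PREFIXES = ("note", "todo", "idea")


def parse_capture(message: str) -> Optional[Tuple[str, str]]:
    """Return (kind, body) if the message starts with a capture prefix, else None."""
    head, sep, body = message.partition(":")
    kind = head.strip().lower()
    if sep and kind in CAPTURE_PREFIXES and body.strip():
        return kind, body.strip()
    return None


def append_to_inbox(inbox: Path, kind: str, body: str, now: datetime) -> str:
    """Append one entry to the inbox file (hypothetical line format) and return it."""
    line = f"- [{kind}] {now:%Y-%m-%d %H:%M} {body}"
    with inbox.open("a", encoding="utf-8") as fh:
        fh.write(line + "\n")
    return line
```

A message such as "todo: renew domain" would be parsed into ("todo", "renew domain") and appended to Inbox.md; anything without a known prefix falls through to normal chat handling.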

Installed systems

| System | What | Section |
|---|---|---|
| Hermes Agent | Core agent + gateway (WhatsApp, Discord, Email, ntfy) | Hermes — Reference#2 |
| GLM-5.2 + 7-deep fallback | LLM brain (all endpoints green) | Hermes — Reference#3 |
| Groq + Deepgram + Gemini STT | Voice transcription (off-CPU) | Hermes — Reference#4 |
| Gemini TTS | Voice replies (voice Kore) | Hermes — Reference#4 |
| mem0 | External memory provider | Hermes — Reference#6 |
| Langfuse | LLM call observability + tracing | Hermes — Reference#12 |
| Hermes Dojo | Daily self-improvement cycle | Hermes — Reference#13 |
| Fallback Watchdog | Model degradation alerting | Hermes — Reference#14 |
| Hermes Audit | Every tool call logged to Discord + file | Hermes — Reference#11 |
| 6 MCP servers | Context7, Google Suite, Local Services, DocIntel, ClickUp, Composio | Hermes — Reference#9 |
| 6 plugins | Audit, Capture, Disk-Cleanup, Langfuse, Security, Tavily | Hermes — Reference#11 |
| ~60 skills | 14 categories (devops, creative, research, mlops, etc.) | Hermes — Reference#18 |
| Creator pipeline | Seggy Said + Yasar Ahmad book generation | Hermes — Reference#15 |
| Cloudflare Pages dashboard | omair.pages.dev (auto-refreshing) | Hermes — Reference#16 |
| Obsidian vault + web viewer | vault.omair.pages.dev (auto-rebuilding) | Hermes — Reference#17 |
| Syncthing + Git | Vault sync to phone + hourly GitHub backup | Hermes — Reference#17 |
| Watchdog stack | 5-min health probes, healthchecks.io dead-man's-switch | Hermes — Reference#21 |
| TRG Watch | Stock monitoring (9AM/6PM PKT + Jul 1 escalation) | Hermes — Reference#8 |
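
The watchdog stack listed above (5-minute probes plus a dead-man's-switch) can be sketched as one probe cycle. This is an illustrative outline under stated assumptions, not the installed watchdog: the names disk_ok, http_ok, run_probes and watchdog_cycle, the 10% free-disk threshold, and the alert message format are all hypothetical. The alert and heartbeat callables stand in for the ntfy/Discord push and the healthchecks.io ping.

```python
import shutil
import urllib.request
from typing import Callable, Dict, List

Probe = Callable[[], bool]


def disk_ok(path: str = "/", min_free_ratio: float = 0.10) -> bool:
    """True if at least min_free_ratio of the disk is free (threshold is an assumption)."""
    usage = shutil.disk_usage(path)
    return usage.free / usage.total >= min_free_ratio


def http_ok(url: str, timeout: float = 5.0) -> bool:
    """True if the URL answers with a 2xx status within the timeout."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return 200 <= resp.status < 300
    except OSError:
        return False


def run_probes(probes: Dict[str, Probe]) -> List[str]:
    """Run every probe and return the names that failed; a raising probe counts as failed."""
    failed = []
    for name, probe in probes.items():
        try:
            ok = probe()
        except Exception:
            ok = False
        if not ok:
            failed.append(name)
    return failed


def watchdog_cycle(
    probes: Dict[str, Probe],
    alert: Callable[[str], None],
    heartbeat: Callable[[], None],
) -> List[str]:
    """One cycle: alert on failures, then ping the dead-man's-switch."""
    failed = run_probes(probes)
    if failed:
        alert("Hermes watchdog: FAILED " + ", ".join(failed))
    # The heartbeat fires every cycle; if the whole host dies, the pings stop
    # and the external service raises the alarm instead.
    heartbeat()
    return failed
```

Run from a 5-minute cron entry, the heartbeat is what lets an external service notice a total outage: individual failures are pushed by the watchdog itself, while silence is detected from outside.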

Headlines

  • Brain: GLM-5.2 (z.ai) + a 7-deep fallback chain (glm-5.1 → 5-turbo → 4.7 → 4.5-air → Gemini → Cerebras → Groq). All endpoints verified green 2026-06-25.
  • Voice: STT via Groq (working) + Deepgram + Gemini. TTS via Gemini (voice Kore).
  • Everything lives under ~/.hermes/; reach the dashboard with ssh -L 9119:localhost:9119 oraclevps, then open http://localhost:9119.
  • Vault: ~/obsidian-vault/ synced via Syncthing + hourly git; web viewer at vault.omair.pages.dev.
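
The fallback chain in the Brain headline can be sketched as a simple ordered retry. This is a minimal sketch, assuming a caller-supplied call_model(model, prompt) function; the model identifier strings, complete_with_fallback, and AllModelsFailed are hypothetical names for illustration, and only the order comes from this overview.

```python
from typing import Callable, List, Sequence, Tuple

# Order taken from the overview; the identifier strings themselves are illustrative.
FALLBACK_CHAIN: List[str] = [
    "glm-5.2",       # primary
    "glm-5.1",
    "glm-5-turbo",
    "glm-4.7",
    "glm-4.5-air",
    "gemini",
    "cerebras",
    "groq",
]


class AllModelsFailed(RuntimeError):
    """Raised when every model in the chain errors out."""


def complete_with_fallback(
    prompt: str,
    call_model: Callable[[str, str], str],
    chain: Sequence[str] = FALLBACK_CHAIN,
) -> Tuple[str, str]:
    """Try each model in order; return (model_used, reply) from the first success."""
    errors = []
    for model in chain:
        try:
            return model, call_model(model, prompt)
        except Exception as exc:  # any provider error moves on to the next model
            errors.append(f"{model}: {exc}")
    raise AllModelsFailed("; ".join(errors))
```

Returning the model name alongside the reply makes degradation visible: a caller can compare it with the first entry of the chain and raise an alert when a fallback answered, which is roughly the role the Fallback Watchdog plays here.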

Architecture Diagram (High-Level)

graph TB
    %% User platforms
    Omair["👤 Omair<br/>Text + Voice"]
    WA["💬 WhatsApp<br/>Bridge :3000"]
    DC["🎮 Discord<br/>11 channels"]
    EM["📧 Email<br/>IMAP/Gmail"]
    NT["🔔 ntfy<br/>Push alerts"]

    %% Core
    GW["🜂 Hermes Gateway<br/>Agent core · Session DB · Hooks"]
    LLM["🧠 LLM: GLM-5.2 (z.ai)<br/>→ 5.1 → 5-turbo → 4.7<br/>→ 4.5-air → Gemini → Cerebras → Groq"]

    %% MCP
    MCP["🔌 MCP Servers (6)<br/>Google Suite · ClickUp · Composio<br/>Context7 · Local Svc · DocIntel"]

    %% Memory
    MEM["💾 Memory<br/>mem0 · MEMORY.md · USER.md<br/>Session DB (SQLite)"]

    %% Plugins
    PLG["🧩 Plugins (6)<br/>Audit · Capture · Langfuse<br/>Security · Cleanup · Tavily"]

    %% Scheduled
    CRON["⏰ Scheduled Jobs<br/>10 crontab + 9 Hermes cron<br/>Brief · Watchdog · TRG · Dojo"]

    %% Voice
    VOICE["🎙️ Voice Pipeline<br/>STT: Groq→Deepgram→Gemini→Whisper<br/>TTS: Gemini (Kore)"]

    %% Outputs
    VAULT["📚 Obsidian Vault<br/>Syncthing→Phone · Git→GitHub<br/>vault.omair.pages.dev"]
    CF["🌐 Cloudflare Pages<br/>Dashboard · Books · Transcripts<br/>omair.pages.dev"]
    MON["🛡️ Monitoring<br/>Watchdog · Langfuse · Audit<br/>→ ntfy + Discord alerts"]

    %% External
    ZAI["z.ai / Gemini / Cerebras / Groq"]
    GAPI["Google APIs"]
    CUP["ClickUp · Composio"]

    %% Connections
    Omair --> WA & DC & EM & NT
    WA & DC & EM --> GW
    NT -.->|alerts| Omair
    GW <-->|reasoning| LLM
    LLM -->|API| ZAI
    GW -->|tools| MCP
    MCP --> GAPI & CUP
    GW -->|recall/store| MEM
    GW -.->|hooks| PLG
    GW <-->|STT/TTS| VOICE
    CRON -->|trigger| GW
    CRON -->|daily notes| VAULT
    GW -->|deploy| CF
    CRON -->|monitor| MON
    MON -.->|fail/recover| NT & DC

    classDef user fill:#083344,stroke:#22d3ee,color:#fff
    classDef core fill:#064e3b,stroke:#34d399,color:#fff
    classDef mcp fill:#4c1d95,stroke:#a78bfa,color:#fff
    classDef sched fill:#78350f,stroke:#fbbf24,color:#fff
    classDef plg fill:#881336,stroke:#fb7185,color:#fff
    classDef ext fill:#1e293b,stroke:#94a3b8,color:#fff
    classDef out fill:#083344,stroke:#22d3ee,color:#fff

    class Omair,WA,DC,EM,NT user
    class GW,LLM core
    class MCP,MEM mcp
    class CRON sched
    class PLG,MON plg
    class ZAI,GAPI,CUP ext
    class VAULT,CF out
    class VOICE out