Initial commit: homelab infrastructure wiki
- Full Obsidian vault content - Host configs (ice, grizzley, ubuntu, proxmox, truenas, panda, hyte) - Media stack documentation - Traefik HA setup - Automation scripts - Bachelor party planning
This commit is contained in:
52
homelab/concepts/ai-applications.md
Normal file
52
homelab/concepts/ai-applications.md
Normal file
@@ -0,0 +1,52 @@
|
||||
---
|
||||
title: AI Applications Pipeline
|
||||
created: 2026-04-28
|
||||
updated: 2026-04-28
|
||||
type: concept
|
||||
tags: [concept, ai, services]
|
||||
sources: [../../homelab/architecture.md]
|
||||
---
|
||||
|
||||
# AI Applications Pipeline
|
||||
|
||||
Local AI/ML stack running on ubuntu with GPU acceleration (GTX 1080 8GB), plus AI-powered applications that use LLM inference.
|
||||
|
||||
## Core AI Infrastructure
|
||||
|
||||
| Service | URL | Purpose |
|
||||
|---------|-----|---------|
|
||||
| Ollama | localhost:11434 | Local LLM inference (GPU via GTX 1080) |
|
||||
| Qdrant | ubuntu:6333 | Vector database for OpenCode cluster memory |
|
||||
| Faster Whisper Server | — | Speech-to-text (Whisper) |
|
||||
|
||||
## AI Applications (7 containers)
|
||||
|
||||
| Application | Description |
|
||||
|-------------|-------------|
|
||||
| AI Job Pipeline (backend + frontend) | AI task orchestration |
|
||||
| AI Alert Aggregator (backend + frontend + postgres) | Alert intelligence |
|
||||
| AI Media Intelligence (backend) | Media analysis |
|
||||
| AI Subscriptions | Subscription management |
|
||||
| Homelab Inventory (backend) | Infrastructure inventory |
|
||||
|
||||
## Immich ML
|
||||
|
||||
| Component | Description |
|
||||
|-----------|-------------|
|
||||
| Immich Server | Photo/video management |
|
||||
| Immich ML | Machine learning on GPU |
|
||||
| Immich Postgres | Dedicated PostgreSQL (pgvecto-rs extension) |
|
||||
| Immich Redis | Caching |
|
||||
|
||||
## OpenCode Embeddings
|
||||
|
||||
OpenCode instances across the cluster use:
|
||||
- **Ollama** — generating embeddings for vector memory
|
||||
- **Qdrant** — storing shared vector memory across OpenCode cluster
|
||||
|
||||
## Related
|
||||
|
||||
- [[opencode-cluster]] — OpenCode cluster using this AI infrastructure
|
||||
- [[ubuntu]] — Hosts GPU (GTX 1080) and all AI services
|
||||
- [[jellyfin]] — Media server with AI features
|
||||
- [[../../homelab/docs/ai-applications.md]] — AI applications documentation
|
||||
Reference in New Issue
Block a user