How Sound Suite Works

From raw court PDFs to AI-powered intelligence in four automated steps.

Step 1

Monitor Your Case Folders

Point Sound Suite at your case directories. The built-in file watcher monitors for new PDFs in real-time. When a new document arrives, it's automatically queued for processing.

Technical Details

  • Configurable watch paths via WATCH_PATHS
  • Recursive directory monitoring with chokidar
  • SHA-256 hash deduplication prevents reprocessing
  • Creates Document records with QUEUED status
Cases/
Smith-v-Jones/
complaint.pdf INDEXED
motion-to-dismiss.pdf PROCESSING
deposition-smith.pdf QUEUED
Processing Pipeline
Text extraction (42 pages)
Density analysis complete
OCR processing (3 scanned pages)
Exhibit extraction
Text chunking
Step 2

Automated Processing

Each document flows through an intelligent processing pipeline: text extraction, page-by-page density analysis, OCR for scanned content, and exhibit image extraction. All processing happens locally.

Technical Details

  • Job queue with configurable concurrency (p-queue)
  • Automatic retry with exponential backoff
  • Status: QUEUED → PROCESSING → INDEXED
  • Job logging for full audit trail
Step 3

AI Indexing

Processed text is split into semantic chunks and converted to vector embeddings — numerical representations that capture meaning, not just keywords. Vectors are stored in LanceDB for millisecond-fast similarity search.

Embedding Providers

Local

Transformers — fully offline

OpenAI

High-quality via API

Anthropic

Claude-powered

Ollama

Local LLM-based

Vector Space Visualization

Contracts Depositions Motions Orders
What obligations does the contract impose on the defendant?

Based on the Service Agreement (Exhibit A), the defendant has the following obligations:

  • Monthly reporting due by the 15th (Section 4.2)
  • Maintain insurance coverage of $1M (Section 7.1)
  • Non-compete for 24 months post-termination (Section 9.3)

Sources: exhibit-a.pdf (p.3, 7, 12)

Step 4

Search & Analyze

Now your documents are alive. Use any of 14 MCP analysis tools through your preferred AI assistant — ask questions in natural language, find contradictions, extract timelines, screen for privileged content, and more.

Compatible AI Assistants

  • Claude (Anthropic)
  • ChatGPT (OpenAI)
  • Any MCP-compatible AI client
Beyond Search

Draft Documents with AI Assistance

Sound Suite doesn't just analyze your case files — it helps you write. The built-in draft editor connects your indexed documents to the writing process.

Auto-Suggest

As you type, Sound Suite searches your indexed case documents and suggests relevant text — citations, contract clauses, deposition excerpts, dates with sources.

You type: "The contract requires..."

...monthly reporting by the 15th (Section 4.2, Exhibit A p.3) and maintenance of $1M insurance coverage (Section 7.1)...

Tab to accept · Esc to dismiss

Search Speed Presets

Simple Medium Ultra

Balance speed vs. depth. Ultra searches more documents with larger context windows.

AI Chat Panel

A side panel where you ask questions about your case while writing. The AI searches your indexed documents and returns sourced answers you can insert directly.

AI Chat

"What did the plaintiff claim about the delivery date?"

Plaintiff states delivery was promised by March 1 (Complaint, para. 14) but actually occurred April 12 (Exhibit B, p.2). The 42-day delay is referenced in three filings.

Sources: complaint.pdf (p.5), exhibit-b.pdf (p.2), reply-brief.pdf (p.8)

Full Legal Editor

Not a toy editor — a production-ready tool built for legal document workflows.

  • Import/export Word (.docx)
  • PDF export with headers/footers
  • Track changes with accept/reject
  • Version history with compare
  • Format Painter & heading nav
  • Font sizes in points (legal standard)
  • Document minimap

Architecture

Everything runs on your machine. Here's how the pieces fit together.

Your Machine

Case Folders

PDFs on disk

Job Queue

p-queue + retry

Ingestion Pipeline

OCR + extraction + chunking

FileWatcher (chokidar)

SQLite

Prisma ORM

LanceDB

Vector embeddings

MCP Server

14 analysis tools · port 3001

Next.js Dashboard

port 3000

AI Clients

Claude, ChatGPT, etc.

GPU Acceleration

Sidecar: Local AI via Docker

Sound Suite includes a GPU sidecar — a self-contained process that orchestrates Docker containers running local AI models. It handles embeddings, OCR, reranking, and completions without any cloud API keys.

The sidecar auto-detects your GPU, pulls the right Docker images, and manages VRAM allocation across containers. It communicates with the main server via WebSocket (with HTTP polling fallback), so it works across network boundaries and NAT.

How It Works

  1. 1 Download & launch — The sidecar ships as a tarball or runs via Docker itself. On startup it detects Node.js 18+ and GPU availability.
  2. 2 Auto-provision — Pulls Ollama and vLLM images, creates containers for each role (embedding, completion, OCR, reranker), checks VRAM before starting.
  3. 3 Connect to server — Opens a WebSocket to the main Sound Suite instance. The server dispatches embedding, OCR, and reranking jobs to the sidecar.
  4. 4 Mode switching — Swaps between "indexing" mode (embedding + OCR containers) and "searching" mode (embedding + reranker + completion) to fit within available VRAM.

The sidecar runs on port 8098 with its own dashboard for monitoring container health and GPU utilization.

Learn More About the Sidecar

Docker Containers

Embedding

ollama · qwen3-embedding:0.6b

1.2 GB

Completion

ollama · qwen3.5:9b

10 GB

OCR

ollama · olmocr2:7b-q8

8 GB

Reranker

vLLM · Qwen3-Reranker-8B

7 GB

Sidecar Communication

Main Server

:3000

WebSocket
HTTP poll

GPU Sidecar

:8098

Primary: WebSocket (:3002) · Fallback: HTTP polling · Heartbeat every 5s

VRAM-Aware Mode Switching

Indexing Mode

  • Embedding container
  • OCR container

Searching Mode

  • Embedding container
  • Reranker container
  • Completion container

Containers are stopped and started as needed to stay within available VRAM.

What Makes It Different

Traditional Cloud Solutions Sound Suite
Documents uploaded to third-party servers All processing on your local machine
Monthly subscription fees Source-available, free for pro se use
Vendor lock-in Your data in standard formats
Internet required Works fully offline
Limited AI analysis tools 14 specialized MCP tools
Generic document processing Built for legal documents
Complex enterprise setup Install and run in minutes

Ready to See It In Action?

Download Sound Suite and index your first case in under 5 minutes.