# Cloud-Native Architecture Migration
LegalEase has undergone a significant architectural transformation, moving from a fully self-hosted Docker Compose stack to a cloud-native architecture powered by Firebase and Google Cloud services.
## Why We Migrated
The original self-hosted architecture (FastAPI, Celery, PostgreSQL, MinIO, Ollama, WhisperX) served as an excellent proof of concept, but presented challenges for production deployment:
- **Hardware requirements**: GPU-accelerated transcription required significant on-premises infrastructure
- **Model limitations**: Local models (Llama 3.1, WhisperX) couldn't match frontier model quality for legal document analysis
- **Operational complexity**: Managing 8+ Docker containers, Celery workers, and GPU drivers created a significant operational burden
- **Scaling constraints**: Horizontal scaling required complex orchestration setup
## The New Architecture
LegalEase now runs on a modular cloud-native stack:
| Component | Before | After |
|---|---|---|
| Frontend | Nuxt 4 | Nuxt 4 (unchanged) |
| Backend API | FastAPI + Celery | Firebase Cloud Functions + Genkit |
| Database | PostgreSQL | Cloud Firestore |
| File Storage | MinIO | Firebase Storage |
| Transcription | WhisperX / Whisper API | Gemini 2.5 Flash / Google Speech-to-Text (Chirp 3) |
| Summarization | Ollama (Llama 3.1) | Gemini 2.5 Flash |
| Vector Search | Qdrant (self-hosted) | Qdrant Cloud |
| Auth | Stub/None | Firebase Authentication |
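To make the new backend shape from the table above concrete, here is a minimal sketch of a Genkit flow deployed as a callable Cloud Function. The flow name, schemas, and prompt are illustrative rather than LegalEase's actual code, and the exact model-reference helpers vary by Genkit version; this assumes the `@genkit-ai/googleai` plugin with a `GEMINI_API_KEY` in the environment.

```ts
import { genkit, z } from 'genkit';
import { googleAI } from '@genkit-ai/googleai';
import { onCallGenkit } from 'firebase-functions/https';

// Initialize Genkit with the Google AI plugin (reads GEMINI_API_KEY from the environment).
const ai = genkit({
  plugins: [googleAI()],
  model: googleAI.model('gemini-2.5-flash'),
});

// A flow is a typed, traceable unit of AI work.
const summarizeDocument = ai.defineFlow(
  {
    name: 'summarizeDocument',
    inputSchema: z.object({ text: z.string() }),
    outputSchema: z.object({ summary: z.string() }),
  },
  async ({ text }) => {
    const { text: summary } = await ai.generate(
      `Summarize this legal document in plain language:\n\n${text}`,
    );
    return { summary };
  },
);

// Expose the flow to the Nuxt frontend as a callable Cloud Function.
export const summarize = onCallGenkit(summarizeDocument);
```

Roughly speaking, one flow takes the place of what used to be a FastAPI route plus a Celery task: it is both the HTTP surface and the unit of background AI work.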
## Key Benefits
### Frontier Model Quality
- Gemini 2.5 Flash provides superior transcription with automatic speaker name inference (sketched after this list)
- Better summarization, entity extraction, and key moment identification
- Multi-modal capabilities for future document analysis features
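To illustrate the speaker-inference point, here is a hedged sketch of transcription through Genkit's multimodal `generate` call, which accepts audio as a media part alongside text instructions. The output schema, prompt wording, and URL handling are assumptions for illustration, not the production pipeline.

```ts
import { genkit, z } from 'genkit';
import { googleAI } from '@genkit-ai/googleai';

const ai = genkit({ plugins: [googleAI()], model: googleAI.model('gemini-2.5-flash') });

// Hypothetical output shape: one entry per utterance, with an inferred speaker name.
const TranscriptSchema = z.object({
  segments: z.array(z.object({ speaker: z.string(), text: z.string() })),
});

export async function transcribe(audioUrl: string) {
  const { output } = await ai.generate({
    prompt: [
      // The audio file rides along as a media part in the same prompt.
      { media: { url: audioUrl, contentType: 'audio/mpeg' } },
      { text: "Transcribe this recording, infer each speaker's name from context, and label every utterance." },
    ],
    // Constrain the response to the structured transcript schema above.
    output: { schema: TranscriptSchema },
  });
  return output;
}
```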
### Simplified Setup
- Local development requires only a Gemini API key and the Firebase emulators (see the wiring sketch after this list)
- No GPU required for development or testing
- Single `mise run dev:local` command starts everything
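For reference, wiring the app into the emulator suite with the modular Firebase SDK looks roughly like this; the ports are the emulators' documented defaults, and the config values are placeholders.

```ts
import { initializeApp } from 'firebase/app';
import { getAuth, connectAuthEmulator } from 'firebase/auth';
import { getFirestore, connectFirestoreEmulator } from 'firebase/firestore';
import { getStorage, connectStorageEmulator } from 'firebase/storage';

// Placeholder config; a "demo-" project ID keeps the emulators fully offline.
const app = initializeApp({ projectId: 'demo-legalease', apiKey: 'demo' });

if (process.env.NODE_ENV === 'development') {
  // Default emulator ports: Auth 9099, Firestore 8080, Storage 9199.
  connectAuthEmulator(getAuth(app), 'http://localhost:9099');
  connectFirestoreEmulator(getFirestore(app), 'localhost', 8080);
  connectStorageEmulator(getStorage(app), 'localhost', 9199);
}
```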
### Multi-Agent Ready
- Genkit-based flows support future multi-agent orchestration
- Provider abstraction allows swapping AI backends without code changes (see the sketch after this list)
- Ready for Claude, GPT-4, or other frontier models
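The abstraction itself can be as small as an interface plus a config-driven factory. Everything below (names, env var, methods) is hypothetical, shown only to convey the shape that makes backends swappable:

```ts
// Hypothetical contract: anything that can turn audio into a transcript.
interface TranscriptionProvider {
  transcribe(audioUrl: string): Promise<string>;
}

class GeminiTranscriber implements TranscriptionProvider {
  async transcribe(audioUrl: string): Promise<string> {
    // ...call Gemini via Genkit, as sketched earlier...
    return `gemini transcript of ${audioUrl}`;
  }
}

class ChirpTranscriber implements TranscriptionProvider {
  async transcribe(audioUrl: string): Promise<string> {
    // ...call Google Speech-to-Text (Chirp 3)...
    return `chirp transcript of ${audioUrl}`;
  }
}

// Swapping backends becomes a configuration change, not a code change.
export function makeTranscriber(
  name = process.env.TRANSCRIPTION_PROVIDER,
): TranscriptionProvider {
  return name === 'speech-to-text' ? new ChirpTranscriber() : new GeminiTranscriber();
}
```

A Claude or GPT-4 backend would simply be another implementation of the same interface.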
### Still Self-Hostable
- Firebase emulators provide full offline development capability
- Qdrant can run locally or in the cloud (see the client sketch after this list)
- Architecture designed for eventual Kubernetes and AWS deployment options
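Because the official JS client is constructed the same way in both cases, moving Qdrant between local and cloud is a connection-string change. A sketch, assuming the `@qdrant/js-client-rest` package:

```ts
import { QdrantClient } from '@qdrant/js-client-rest';

// Self-hosted: a local container on Qdrant's default REST port.
const local = new QdrantClient({ url: 'http://localhost:6333' });

// Qdrant Cloud: the same client, pointed at the managed cluster.
const cloud = new QdrantClient({
  url: process.env.QDRANT_URL,     // e.g. the cluster URL from the Qdrant Cloud console
  apiKey: process.env.QDRANT_API_KEY,
});
```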
## Future Roadmap
We're committed to avoiding vendor lock-in:
- **Kubernetes deployment**: Helm charts for self-hosted production deployments
- **AWS alternative**: S3, DynamoDB, and Lambda equivalents planned
- **Provider flexibility**: Additional transcription/AI providers (OpenAI, Anthropic) in progress
The modular provider pattern means you can mix and match services based on your requirements, compliance needs, or cost constraints.
## Migration Notes
If you were running the previous Docker-based LegalEase:
- Export your PostgreSQL data before migrating
- The new architecture uses different data models optimized for Firestore (see the migration sketch after this list)
- Transcripts and documents need to be re-processed through the new pipeline
- User authentication is now handled by Firebase Auth (Google sign-in supported)
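To give a sense of what the re-shaping involves, here is a minimal, hypothetical sketch of moving rows from the old PostgreSQL schema into Firestore documents using the `pg` and `firebase-admin` packages. Table, column, and collection names are invented for illustration:

```ts
import { Client } from 'pg';
import { initializeApp } from 'firebase-admin/app';
import { getFirestore } from 'firebase-admin/firestore';

// Assumes DATABASE_URL points at the old database and
// GOOGLE_APPLICATION_CREDENTIALS is set for the Admin SDK.
const pg = new Client({ connectionString: process.env.DATABASE_URL });
const db = getFirestore(initializeApp());

async function migrateCases(): Promise<void> {
  await pg.connect();
  // Read flat relational rows from the old schema...
  const { rows } = await pg.query('SELECT id, title, created_at FROM cases');
  for (const row of rows) {
    // ...and write each one as a denormalized Firestore document.
    await db.collection('cases').doc(String(row.id)).set({
      title: row.title,
      createdAt: row.created_at,
    });
  }
  await pg.end();
}

migrateCases().catch(console.error);
```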
The documentation has been fully updated to reflect the new architecture. See the Installation Guide for the new setup process.